10,000 Matching Annotations
  1. Oct 2025
    1. Miscellany, Portugal, Early 14th century

      Biblioteca Central da Universidade de Brasília

      BCE UnB Ms.02

      o material comprado da víuva do Serafim da Silva Neto corresponde a três manuscritos medievais: o Livro das Aves, o Flos Sanctorum e o Diálogos de São Gregório. Entendo que você procura informações sobre o Livro das Aves

      Otros títulos para esta coleccion: Formas variantes do título (R) - A: Manuscrito 02 Formas variantes do título (R) - A: Ms.02 OBR/BCE/UNB Formas variantes do título (R) - A: De bestiis et aliis rebus Formas variantes do título (R) - A: De avibus Formas variantes do título (R) - A: História natural das aves Formas variantes do título (R) - A: História sobrenatural das aves Formas variantes do título (R) - A: De natura avibus Formas variantes do título (R) - A: Aviarium https://consulta.bce.unb.br/acervo/1032302

      https://unbbr-my.sharepoint.com/:b:/g/personal/obrasraras_bce_unb_br/ES7w2PD1QXpJm_UwEVRVa0gB8itRW6rhHHEiJAXbJBE-Rg

      https://static.wikitide.net/intercriaturaswiki/d/d8/BCE_UnB_Ms.02.pdf

      Supposedly: https://bdce.unb.br/manuscritos-medievais/?perpage=12&view_mode=records&paged=1&order=ASC&orderby=date&fetch_only_meta=42803%2C43100%2C42618%2C42658%2C68387%2C121587%2C42638%2C87997%2C42622%2C42652%2C42648%2C42910&fetch_only=thumbnail (NOT ACCESIBLE OUTSIDE INTRANET BNE)

    1. Ms. Serafim da Silva Neto

      Biblioteca Central da Universidade de Brasília

      BCE UnB Ms.02

      o material comprado da víuva do Serafim da Silva Neto corresponde a três manuscritos medievais: o Livro das Aves, o Flos Sanctorum e o Diálogos de São Gregório. Entendo que você procura informações sobre o Livro das Aves

      Otros títulos para esta coleccion: Formas variantes do título (R) - A: Manuscrito 02 Formas variantes do título (R) - A: Ms.02 OBR/BCE/UNB Formas variantes do título (R) - A: De bestiis et aliis rebus Formas variantes do título (R) - A: De avibus Formas variantes do título (R) - A: História natural das aves Formas variantes do título (R) - A: História sobrenatural das aves Formas variantes do título (R) - A: De natura avibus Formas variantes do título (R) - A: Aviarium https://consulta.bce.unb.br/acervo/1032302

      https://unbbr-my.sharepoint.com/:b:/g/personal/obrasraras_bce_unb_br/ES7w2PD1QXpJm_UwEVRVa0gB8itRW6rhHHEiJAXbJBE-Rg

      https://static.wikitide.net/intercriaturaswiki/d/d8/BCE_UnB_Ms.02.pdf

      Supposedly: https://bdce.unb.br/manuscritos-medievais/?perpage=12&view_mode=records&paged=1&order=ASC&orderby=date&fetch_only_meta=42803%2C43100%2C42618%2C42658%2C68387%2C121587%2C42638%2C87997%2C42622%2C42652%2C42648%2C42910&fetch_only=thumbnail (NOT ACCESIBLE OUTSIDE INTRANET BNE)

    1. W badaniu wzięło udział 70 dzieci i młodzieży w wieku od 6 do 17 lat z kliniczną diagnozą ADHD. Zostali oni losowo przydzieleni do grupy interwencyjnej lub aktywnej grupy kontrolnej. Interwencja była niezwykle krótka – składała się z pojedynczej, 10-minutowej sesji MBI, która obejmowała trzy ćwiczenia: (a) ćwiczenie oddechowe, (b) skanowanie ciała oraz (c) ćwiczenie uwagi. Jako główny wskaźnik fizjologiczny mierzono kontrolę wagalną serca (CVC), ocenianą za pomocą standardowych miar HRV w domenie czasu (RMSSD) i częstotliwości (HF-HRV).   Główne wyniki tego badania są niezwykle istotne. Stwierdzono mały, ale statystycznie istotny efekt interwencji na wskaźniki CVC (wielkość efektu Cohena d=0.37). Dokładniejsza analiza miary RMSSD wykazała istotny statystycznie efekt warunku (p=0.049) oraz czasu (p=0.012), co wskazuje na niewielki, ale mierzalny wzrost aktywności nerwu błędnego w grupie MBI bezpośrednio po interwencji. Jednocześnie, ta pojedyncza, krótka sesja nie miała istotnego wpływu na obiektywne miary uwagi (mierzone testem CPT) ani na subiektywnie oceniany nastrój.   Wyniki te należy interpretować nie jako dowód na skuteczność kliniczną jednorazowej, 10-minutowej sesji, ale jako niezwykle ważny dowód na istnienie mechanizmu (proof-of-concept). Fakt, że tak minimalna interwencja była w stanie wywołać mierzalną zmianę fizjologiczną, dowodzi, że autonomiczny układ nerwowy u dzieci z ADHD jest natychmiastowo i mierzalnie podatny na modulację poprzez odgórne, wolicjonalne kierowanie uwagi na doznania cielesne. To potężny argument za tym, że ścieżka "uwaga → interocepcja → regulacja wagalna" jest w tej populacji funkcjonalna i może stanowić obiecujący cel terapeutyczny w dłuższej perspektywie.

      Już pojedyńcza interwencja z body scanem pomogła zwiększyć HRV u młodych dorosłych. Co pokazuje, że istnieje jakiś mechanizm przez który HRV może wzrastac.

    2. Ponadto, dowody z badań na populacjach ogólnych dostarczają ważnego kontekstu. Badania porównujące MBI z biofeedbackiem HRV i ćwiczeniami fizycznymi u osób z podwyższonym poziomem stresu wykazały podobną skuteczność wszystkich trzech metod w poprawie funkcji poznawczych i samopoczucia. Sugeruje to istnienie wspólnych, leżących u podstaw mechanizmów, takich jak świadoma regulacja oddechu i skupienie uwagi na sygnałach z ciała. Wiele badań obserwacyjnych i eksperymentalnych w populacjach ogólnych konsekwentnie wykazuje, że praktyka mindfulness prowadzi do ostrego (w trakcie sesji) i przewlekłego (długoterminowego) wzrostu wskaźników HRV. Należy jednak odnotować, że metaanaliza z 2021 roku, syntetyzująca wyniki wielu badań, stwierdziła, że dowody na to, iż MBI prowadzi do poprawy HRV w porównaniu z grupami kontrolnymi, są wciąż niewystarczające, głównie z powodu wysokiej heterogeniczności metodologicznej istniejących badań.

      Mindfulness, bofeedback HRV i ćwiczenia fizyczne mogą mieć jakaś wspólna ukrytą zmienną, która podnosi HRV

    3. Synteza danych z funkcjonalnego rezonansu magnetycznego (fMRI) i pomiarów fizjologicznych pozwala na zbudowanie spójnego modelu neurobiologicznego ASMR. Badania fMRI pokazują, że podczas mrowienia ASMR dochodzi do znacznej aktywacji w regionach mózgu związanych z nagrodą (jądro półleżące) oraz pobudzeniem emocjonalnym i interocepcją (przednia kora zakrętu obręczy, wyspa). 1 Wzór ten jest podobny do obserwowanego podczas dreszczy wywołanych muzyką (frisson), ale odróżnia się skutkiem fizjologicznym – ASMR prowadzi do redukcji tętna, podczas gdy frisson może je podnosić. 2

      Charakterystyka ASMR

    4. Późniejsze, jeszcze bardziej rygorystyczne metaanalizy, potwierdziły ten sceptyczny pogląd. Badanie Boxum i wsp. z 2024 roku, przeprowadzone na dużej, połączonej kohorcie (N=417), nie znalazło żadnego związku między podtypami EEG (w tym wysokim TBR) a nasileniem objawów behawioralnych ADHD, co ostatecznie podważyło jego wartość diagnostyczną. Co więcej, analiza wykazała, że podwyższona moc w paśmie theta może w niektórych przypadkach wynikać z artefaktu metodologicznego – wolniejszej indywidualnej częstotliwości szczytowej alfa (iAF), która "przesuwa się" do zdefiniowanego na sztywno pasma theta.

      U osób z ADHD Theta może "przesówać się" i być mierzona w zakresie "Alpha"

    5. Oprócz HEP, inne markery EEG również dostarczają wglądu w neuronalne mechanizmy uwagi interoceptywnej. Świadome kierowanie uwagi na doznania cielesne, jak ma to miejsce w medytacji czy skanowaniu ciała, jest związane z systematycznymi zmianami w pasmach częstotliwości. Przeglądy wskazują na wzrost mocy w paśmie theta w okolicach czołowych, co jest związane ze wzmożoną kontrolą uwagi, oraz na zmiany w mocy i koherencji pasma alfa, które odgrywa rolę w filtrowaniu informacji sensorycznych. Praktyki uważności mogą również prowadzić do zmian w asymetrii proporcji fal alfa-beta w obszarach czołowych, co jest wiązane z regulacją nastroju i emocji. Z kolei w ASD, obserwowana zwiększona zmienność sygnału EEG i niższa koherencja fazowa w paśmie alfa mogą wskazywać na niestabilność sieci neuronalnych, co może dodatkowo utrudniać utrzymanie skupienia na subtelnych sygnałach interoceptywnych.

      Zmiany w pasmach EEG w ADHD

    6. W przeciwieństwie do niejednoznacznych wyników w ASD, dowody dotyczące interocepcji w ADHD są znacznie bardziej spójne. Przegląd systematyczny obejmujący 17 artykułów naukowych jednoznacznie sugeruje, że dokładność interoceptywna jest obniżona u osób z ADHD. Badania eksperymentalne potwierdzają te wnioski, wykazując, że dorośli z ADHD uzyskują istotnie gorsze wyniki w zadaniach liczenia uderzeń serca w porównaniu do grup kontrolnych.   Co istotne, stopień obniżenia dokładności interoceptywnej jest negatywnie skorelowany z nasileniem kluczowych objawów ADHD, takich jak nieuwaga, hiperaktywność/impulsywność, dysfunkcje wykonawcze oraz dysregulacja emocjonalna. Sugeruje to, że trudności w precyzyjnym odczytywaniu sygnałów płynących z ciała mogą bezpośrednio przyczyniać się do problemów z samoregulacją, które stanowią rdzeń tego zaburzenia. Jeśli jednostka ma osłabioną zdolność do monitorowania swojego stanu fizjologicznego, jej zdolność do efektywnej regulacji pobudzenia i zachowań impulsywnych jest również upośledzona.

      Dokładność interoceptywna w ADHD wyraźnie obniżona i związana z objawami ADHD (wszystkimi)

    7. Szczególnie interesujące są wspólne markery neuronalne, które mogą wskazywać na wspólne mechanizmy patofizjologiczne. Metaanaliza badań EEG nad monitorowaniem własnego działania (performance monitoring) dostarcza takich dowodów. Zarówno osoby z ASD, jak i z ADHD, w porównaniu do grup kontrolnych, wykazują zredukowaną amplitudę negatywności związanej z błędem (Error-Related Negativity, ERN) – potencjału wywołanego, który pojawia się natychmiast po popełnieniu błędu. Co więcej, w grupie z ADHD zaobserwowano również zredukowaną amplitudę późniejszego komponentu, pozytywności związanej z błędem (Error Positivity, Pe).   Zredukowana amplituda ERN, wspólna dla obu zaburzeń, wskazuje na istnienie wspólnego deficytu na poziomie neuronalnym w zakresie automatycznego wykrywania i sygnalizowania błędów. Mechanizm ten jest kluczowy dla adaptacyjnego zachowania i uczenia się. Ponieważ komponenty ERN i Pe są generowane między innymi przez przednią korę zakrętu obręczy (anterior cingulate cortex, ACC), która jest kluczowym ośrodkiem nie tylko dla monitorowania działania, ale także dla samoregulacji, przetwarzania interoceptywnego i regulacji emocjonalnej , wspólna dysfunkcja w obwodach ACC może stanowić fundamentalny mechanizm leżący u podstaw trudności w samoregulacji obserwowanych w AuADHD. Deficyt ten może być neurofizjologicznym pomostem łączącym trudności poznawcze (monitorowanie błędów) z deficytami afektywnymi i interoceptywnymi, stanowiąc potencjalny cel dla interwencji transdiagnostycznych.

      Zmienione wykrywanie błędu (i refleksja nad nim)

    Annotators

    1. Reviewer #3 (Public review):

      Summary:

      The authors establish a behavioral task to explore effort discounting in C. elegans. By using bacterial food that takes longer to consume, the authors show that for equivalent effort, as measured by pumping rate, animals obtain less food, as measured by fat deposition.

      The authors formalize the task by applying a neuroeconomic decision making model that includes, value, effort, and discounting. They use this to estimate the discounting C. elegans apply based on ingestion effort by using a population level 2-choice T-maze.

      They then analyze the behavioral dynamics of individual animals transitioning between on-food and off-food states. Harder to ingest bacteria led to increased food patch leaving.

      Finally, they examined a set of mutants defective in different aspects of dopamine signaling, as dopamine plays a key role in discounting in vertebrates and regulates certain aspects of C. elegans foraging.

      In their response to the first set of reviews, the authors take care to ensure their task is analogous to at least some of those used in mammals and make changes to the text to better clarify some of their conclusions. My view is the same--that this is an interesting paper for methodological and scientific reasons that brings an important theoretical framework to bear on C. elegans foraging behavior. While I think the mutant results are somewhat unsatisfying, this is not the principal contribution of the work.

      Strengths:

      The behavioral experiments and neuroeconomic analysis framework are compelling and interesting and make a significant contribution to the field. While these foraging behaviors have been extensively studied, few include clearly articulated theoretical models to be tested.

      Demonstrating that C. elegans effort discounting fits model predictions and has stable indifference points is important for establishing these tasks as a model for decision making.

      Weaknesses:

      The dopamine experiments are harder to interpret. The authors point out the perplexing lack of an effect of dat-1 and cat-2. dop-3 leads to general indifference. I am not sure this is the expected result if the argument is a parallel functional role to discounting in vertebrates. dop-3 causes a range of locomotor phenotypes and may affect feeding (reduced fat storage), and thus there may be a general defect in the ability to perform the task rather than anything specific to discounting.

      That said, some of the other DA mutants also have locomotor defects and do not differ from N2. But there is no clear result here-my concern is that global mutants in such a critical pathway exhibit such pleiotropy that it's difficult to conclude there is a clear and specific role for DA in effort discounting. This would require more targeted or cell-specific approaches. The authors state these experiments are outside the scope of the current study, and that at minimum their results implicate dopamine signaling in some form. I tend to agree but still think locomotion defects of DA mutants complicate this question.

      Meanwhile, there are other pathways known to affect responses to food and patch leaving decisions-5HT, PDF, tyramine, etc. in their response the authors state they focus on dopamine because of its role in discounting behavior in mammals.

    2. Author response:

      The following is the authors’ response to the original reviews.

      Reviewer #1(Public Reviews):

      Summary: 

      Here, Millet et al. consider whether the nematode C. elegans 'discounts' the value of reward due to effort in a manner similar to that shown in other species, including rodents and humans. They designed a T-maze effort choice paradigm inspired by previous literature, but manipulated how effortful the food is to consume.C. elegans worms were sensitive to this novel manipulation, exhibiting effort-discountinglike behaviour that could be shaped by varying the density of food at each alternative in order to calculate an indifference point. This discounting-like behaviour was related to worms' rates of patch leaving, which differed between the low and high effort patches in isolation. The authors also found a potential relationship to dopamine signalling, and also that this discounting behaviour was not specific to lab-based strains of C. elegans

      Strengths: 

      The question is well-motivated, and the approach taken here is novel. The authors are careful in their approach to altering and testing the properties of the effortful, elongated bacteria. Similarly, they go to some effort to understand what exactly is driving behavioural choices in this context, both through the application of simple standard models of effort discounting and a kinetic analysis of patch leaving. The comparisons to various dopamine mutants further extend the translational potential of their findings. I also appreciate the comparison to natural isolate strains, as the question of whether this behaviour may be driven by some sort of strain-specific adaptation to the environment is not regularly addressed in mammalian counterparts. The manuscript is well-written, and the figures are clear and comprehensible. 

      Weaknesses: 

      Discounting is typically defined as the alteration of a subjective value by effort (or time, risk, etc.), which is then used to guide future decision-making. By adapting the standard t-maze task for C. elegans as a patch-leaving paradigm, the authors observe behaviour strongly consistent with discounting models, but that is likely driven by a different process, in particular by an online estimate of the type of food in the current patch, which then influences patch-leaving dynamics (Figure 3). This is fundamentally different from decision-making strategies relating to effort that have been described in the rodent and human literatures. 

      We agree that in our study worms are likely making an on-line estimate of food quality in the current patch, but we wish to point out that rodents and humans also use on-line estimates in some significant effort-discounting paradigms. With respect to rodents, we call attention to effort discounting studies involving the widely used progressive ratio task (references in Discussion). In this task, animals can either lever-press for a preferred food or consume a less preferred food that is freely available nearby. However, the number of lever presses required to obtain preferred food increases as a function of the cumulative number of lever presses until the effort-cost of obtaining preferred food becomes too high and the animal switches to a freely available food. In essence, the lever and the freely available food are patches and the animal decides whether or not to leave the “lever” patch. It seems inescapable that the progressive ratio task involves an on-line assessment of the cost/benefit relationship associated with lever pressing. With respect to humans, one highly cited study (reference in Discussion) presented participants with a series of virtual apple trees. They could see how many apples are in the current tree and how much effort (squeezing a handgrip) is required to gather them. Their task was to decide whether or not to gather apples from that tree based on the perceived cost and benefit. Thus, on-line estimation is a common strategy used by animals and humans as shown in the effort discounting literature. We now make this point in the Discussion section titled A model of effort-discounting like behavior.

      Similarly, the calculation of indifference points at the group instead of at the individual level also suggests a different underlying process and limits the translational potential of their findings. The authors do not discuss the implications of these differences or why they chose not to attempt a more analogous trial-based experiment.  

      It is not clear to us why changing the read-out –– from the individual level to the population level –– necessarily suggests that a different biological mechanism is at work. In our view, there is one mechanism and it can be seen from different perspectives (e.g., individual vs population). Furthermore, the analogous trial-based experiment, as we understand it, would be to record behavior one worm at a time in the T-maze. This design is not practical because it entails recording a large number of single worms in the T-maze for 60 min each. 

      In the case of both the dopamine and natural isolate experiments, the data are very noisy despite large (relative to other C. elegans experiments) sample sizes. In the dopamine experiment, disruption of dop1, dop-2, and cat-2 had no statistically significant effect. There do not appear to be any corrections for multiple comparisons, and the single significant comparison, for dop-3, had a small effect size. 

      An ANOVA followed by a Dunnett test was used to test differences between groups in Fig. 4 and 5. The Dunnett test is a multiple comparison test comparing experimental groups to a single control group. It is used to minimize type I error while maintaining statistical power and does not require further correction for multiple comparisons. We have clarified the use of the Dunnett test in the statistical table.  The effect size for dop-3 is 0.5 (Cohen’s d), which is typically interpreted as a medium, not small, effect size.(e.g. Cohen, Psychological Bulletin, 1992, Vol. 112. No. 1,155-159). 

      More detailed behavioural analyses on both these and the wild isolate strains, for example by applying their kinetic analysis, would likely give greater insight as to what is driving these inconsistent effects. 

      More detailed behavioral analysis could reveal why we observe a difference in effort discounting in some strains and not others. However, it is not obvious what type of behavioral analysis would be needed to differentiate between pleiotropic effects of the mutations/natural isolates and more specific effects on effort discounting. A simple kinetic analysis in particular may not be enough to reveal relevant differences between mutants/natural isolates. For this reason, we think that such experiments may be better suited for future follow up studies.

      Reviewer #2 (Public Reviews)

      Summary: 

      Millet et al. show that C. elegans systematically prefers easy-to-eat bacteria but will switch its choice when harder-to-eat bacteria are offered at higher densities, producing indifference points that fit standard economic discounting models. Detailed kinetic analysis reveals that this bias arises from unchanged patch-entry rates but significantly elevated exit rates on effortful food, and dop-3 mutants lose the preference altogether, implicating dopamine in effort sensitivity. These findings extend effortdiscounting behavior to a simple nematode, pushing the phylogenetic boundary of economic costbenefit decision-making. 

      Strengths: 

      (1) Extends the well-characterized concept of effort discounting into C. elegans , setting a new phylogenetic boundary and opening invertebrate genetics to economic-behavior studies. 

      (2) Elegant use of cephalexin-elongated bacteria to manipulate "effort" without altering nutritional or olfactory cues, yielding clear preference reversals and reproducible indifference points. 

      (3) Application of standard discounting models to predict novel indifference points is both rigorous and quantitatively satisfying, reinforcing the interpretation of worm behavior in economic terms. 

      (4) The three-state patch-model cleanly separates entry and exit dynamics, showing that increased leaving rates-rather than altered re-entry-drive choice biases. 

      (5) Investigates the role of dopamine in this behavior to try to establish shared mechanisms with vertebrates. 

      (6) Demonstration of discounting in wild strain (solid evidence). 

      Weaknesses: 

      (1) The kinetic model omits rich trajectory details-such as turning angles or hazard functions-that could distinguish a bona fide roaming transition from other exit behaviors. 

      The overarching goal of present paper was to develop a simple model for effort discounting in a small, genetically tractable organism.  Accordingly,  we focused on quantitative assays that are easy to implement and analyze. The patch-leaving assay and its associated kinetic analysis are one such assay. To keep things simple in this assay, we counted the number of  transitions between the three states shown in Fig. 3A. We chose not to analyze the data in terms of turning angles or hazard functions because the metrics we developed seemed sufficient. Finally, we note that there are new modeling data showing that the presumptive transitions into the roaming state can be explained in terms of a one-state stochastic model in which there is no discrete roaming state (Elife. 2025 Jul 30;14:RP104972. doi:

      10.7554/eLife.104972.PMID: 40736321).

      (2) Only dop-3 shows an effect, and the statistical validity of this result is questionable. It is not clear if the authors corrected for multiple comparisons, and the effect size is quite small and noisy, given the large number of worms tested. Other mutants do not show effects. Given these two concerns, the role of dopamine in C. elegans effort discounting was unconvincing. 

      An ANOVA followed by a Dunnett test was used to test statistical significance in figures 4 and 5 (see above for a discussion of these tests). We believe this approach is rigorous, and the use of these tests is statistically valid. We note that the effect size for this comparison was medium.

      (3) With only five wild isolates tested (and variable data quality), it's hard to conclude that effort discounting isn't a lab-strain artifact or how broadly it varies in natural populations. 

      The fact that four of the five natural isolates tested display levels of effort discounting similar to N2 (only one natural isolate does not display effort discounting) argues against effort discounting being a laboratory adaption.  We have nevertheless weakened the claim regarding natural isolates. We now say effort discounting-like behavior may not be an adaptation to the laboratory environment.  

      (4) Detailed analysis of behavior beyond preference indices would strengthen the dopamine link and the claim of effort discounting in wild strains. 

      Going beyond preference in the behavioral analysis might or might not reveal new phenotypes that strengthen the link with dopamine. At present, however, we think such experiments are beyond the scope of the paper.

      (5) A few mechanistic statements (e.g., tying satiety exclusively to nutrient signals) would benefit from explicit citations or brief clarifications for non-worm specialists. 

      We are unable to identify a mechanistic statement tying satiety to nutrient signals in our manuscript.

      Reviewer #3 (Public Reviews)

      Summary: 

      The authors establish a behavioral task to explore effort discounting in C. eleganss . By using bacterial food that takes longer to consume, the authors show that, for equivalent effort, as measured by pumping rate, they obtain less food, as measured by fat deposition. The authors formalize the task by applying a formal neuroeconomic decision-making model that includes value, effort, and discounting. They use this to estimate the discounting that C. elegans applies based on ingestion effort by using a population-level 2-choice T-maze. They then analyze the behavioral dynamics of individual animals transitioning between on-food and off-food states. Harder to ingest bacteria led to increased food patch leaving. Finally, they examined a set of mutants defective in different aspects of dopamine signaling, as dopamine plays a key role in discounting in vertebrates and regulates certain aspects of C. elegans foraging. 

      Strengths: 

      The behavioral experiments and neuroeconomic analysis framework are compelling, interesting, and make a significant contribution to the field. While these foraging behaviors have been extensively studied, few include clearly articulated theoretical models to be tested. 

      Demonstrating that C. elegans effort discounting fits model predictions and has stable indifference points is important for establishing these tasks as a model for decision making. 

      Weaknesses: 

      The dopamine experiments are harder to interpret. The authors point out the perplexing lack of an effect of dat-1 and cat-2. dop-3 leads to general indifference. I am not sure this is the expected result if the argument is a parallel functional role to discounting in vertebrates. dop-3 causes a range of locomotor phenotypes and may affect feeding (reduced fat storage), and thus, there may be a general defect in the ability to perform the task rather than anything specific to discounting.

      That said, some of the other DA mutants also have locomotor defects and do not differ from N2. But there is no clear result here - my concern is that global mutants in such a critical pathway exhibit such pleiotropy that it's difficult to conclude there is a clear and specific role for DA in effort discounting. This would require more targeted or cell-specific approaches. 

      We agree with the reviewer that the results of the dopamine experiments are puzzling and getting a better understanding of the role of dopamine in effort-discounting will require more sensitive assays and different experimental approaches (e.g. cell-specific rescues). However, as mentioned by the reviewer, all the mutations tested have some pleiotropic effects, yet only dop-3 displays a defect in effort discounting. This, in our opinion, points to a specific role of dop-3 in effort-discounting in C. elegans. This point is now made in the Discussion in the section titled Role of dopamine signaling in effort discountinglike behavior.

      Meanwhile, there are other pathways known to affect responses to food and patch leaving decisions: serotonin, pigment-dispersing factor, tyramine, etc. The paper would have benefited from a clarification about why these were not considered as promising candidates to test (in addition to or instead of dopamine). 

      We focused on DA because of its well-established effect on effort discounting in rodents.

      Testing other pathways is a goal for future research.

      Reviewer #1 (Recommendations for the authors):

      The current results are more a reframing of data gathered from a patch-leaving paradigm, but described in the form of economic choice modelling in which discounting is one possible explanation. One more parsimonious explanation that worms estimate in real-time some rate of reward and leave the patch at some threshold, consistent with canonical foraging models, previous experiments in C. elegans, and the authors' own data (Figure 3). Therefore, I am wary about some of the claims made in this manuscript, such as 'decision-making strategies based on effort-cost trade-offs are evolutionarily conserved'. 

      These points are now addressed in the Discussion in a revised section titled A model of effortdiscounting like behavior. (i) We now call attention to the fact that our T-maze assay is a patch-leaving foraging paradigm. (ii) We now propose a revised model in which “worms make an on-line assessment of food value in the current patch which in turn alters patch-leaving dynamics, increasing the exit rates from cephalexin-treated patches as shown in Figure 3.” (iii) We now provide evidence from the rodent and human literature that the strategy of on-line assessment of reward value may be evolutionarily conserved in the case of a class of effort discounting tasks whose solution requires on-line assessments. 

      If the reason the authors chose to do a patch-leaving style task rather than a traditional t-maze is because C. elegans is unable to retain the sort of information necessary to make such simultaneous decisions - e.g., if pre-training on the two options isn't possible - then this in itself suggests that mechanisms underlying these decisions in worms and mammals are unlikely to be the same. I mention this because I would like to suggest to the authors an alternative interpretation: that patch foraging is actually 'the' canonical computation that translates across species. This would, in fact, be nicely consistent with some other recent modelling work in humans, e.g., https://www.biorxiv.org/content/10.1101/2025.05.06.652482v1

      Please see the previous response.

      Reviewer #2 (Recommendations for the authors):

      Can you provide a picture of the regular and CEPH bacteria? 

      Done (see Figure 1––figure supplement 1).

      Reviewer #3 (Recommendations for the authors):

      I would recommend testing representative mutants in other pathways in the choice task. If possible, more targeted experiments with dop-3, including either cell-specific KOs or rescues, would very much strengthen this aspect of the paper. 

      While valuable, these experiments are out of scope for the present study.

    1. Author response:

      Public Reviews:

      Reviewer #1 (Public review):

      Summary:

      Bansal et al. present a study on the fundamental blood and nectar feeding behaviors of the critical disease vector, Anopheles stephensi. The study encompasses not just the fundamental changes in blood feeding behaviors of the crucially understudied vector, but then uses a transcriptomic approach to identify candidate neuromodulation pathways which influence blood feeding behavior in this mosquito species. The authors then provide evidence through RNAi knockdown of candidate pathways that the neuromodulators sNPF and Rya modulate feeding either via their physiological activity in the brain alone or through joint physiological activity along the brain-gut axis (but critically not the gut alone). Overall, I found this study to be built on tractable, well-designed behavioral experiments.

      Their study begins with a well-structured experiment to assess how the feeding behaviors of A. stephensi change over the course of its life history and in response to its age, mating, and oviposition status. The authors are careful and validate their experimental paradigm in the more well-studied Ae. aegypti, and are able to recapitulate the results of prior studies, which show that mating is a prerequisite for blood feeding behaviors in Ae. aegypt. Here they find A. Stephensi, like other Anopheline mosquitoes, has a more nuanced regulation of its blood and nectar feeding behaviors.

      The authors then go on to show in a Y-maze olfactometer that ,to some degree, changes in blood feeding status depend on behavioral modulation to host cues, and this is not likely to be a simple change to the biting behaviors alone. I was especially struck by the swap in valence of the host cues for the blood-fed and mated individuals, which had not yet oviposited. This indicates that there is a change in behavior that is not simply desensitization to host cues while navigating in flight, but something much more exciting is happening.

      The authors then use a transcriptomic approach to identify candidate genes in the blood-feeding stages of the mosquito's life cycle to identify a list of 9 candidates that have a role in regulating the host-seeking status of A. stephensi. Then, through investigations of gene knockdown of candidates, they identify the dual action of RYa and sNPF and candidate neuromodulators of host-seeking in this species. Overall, I found the experiments to be well-designed. I found the molecular approach to be sound. While I do not think the molecular approach is necessarily an all-encompassing mechanism identification (owing mostly to the fact that genetic resources are not yet available in A. stephensi as they are in other dipteran models), I think it sets up a rich line of research questions for the neurobiology of mosquito behavioral plasticity and comparative evolution of neuromodulator action.

      We appreciate the reviewer’s detailed summary of our work. We thank them for their positive comments and agree with them on the shortcomings of our approach.

      Strengths:

      I am especially impressed by the authors' attention to small details in the course of this article. As I read and evaluated this article, I continued to think about how many crucial details could potentially have been missed if this had not been the approach. The attention to detail paid off in spades and allowed the authors to carefully tease apart molecular candidates of blood-seeking stages. The authors' top-down approach to identifying RYamide and sNPF starting from first principles behavioral experiments is especially comprehensive. The results from both the behavioral and molecular target studies will have broad implications for the vectorial capacity of this species and comparative evolution of neural circuit modulation.

      We really appreciate that the reviewer has recognised the attention to detail we have tried to put, thank you!

      Weaknesses:

      There are a few elements of data visualizations and methodological reporting that I found confusing on a first few read-throughs. Figure 1F, for example, was initially confusing as it made it seem as though there were multiple 2-choice assays for each of the conditions. I would recommend removing the "X" marker from the x-axis to indicate the mosquitoes did not feed from either nectar, blood, or neither in order to make it clear that there was one assay in which mosquitoes had access to both food sources, and the data quantify if they took both meals, one meal, or no meals.

      We thank the reviewer for flagging the schematic in figure 1F. As suggested, we have removed the “X” markers from the x-axis and revised the axis label from “choice of food” to “choice made” to better reflect what food the mosquitoes chose in the assay. For clarity, we have now also plotted the same data as stacked graphs at the bottom of Fig. 1F, which clearly shows the proportion of mosquitoes fed on each particular choice. We avoid the stacked graph as the sole representation of this data, as it does not capture the variability in the data.

      I would also like to know more about how the authors achieved tissue-specific knockdown for RNAi experiments. I think this is an intriguing methodology, but I could not figure out from the methods why injections either had whole-body or abdomen-specific knockdown.

      The tissue-specific knockdown (abdomen only or abdomen+head) emerged from initial standardisations where we were unable to achieve knockdown in the head unless we used higher concentrations of dsRNA and did the injections in older females. We realised that this gave us the opportunity to isolate the neuronal contribution of these neuropeptides in the phenotype produced. Further optimisations revealed that injecting dsRNA into 0-10h old females produced abdomen-specific knockdowns without affecting head expression, whereas injections into 4 days old females resulted in knockdowns in both tissues. Moreover, head knockdowns in older females required higher dsRNA concentrations, with knockdown efficiency correlating with the amount injected. In contrast, abdominal knockdowns in younger females could be achieved even with lower dsRNA amounts.

      We have mentioned the knockdown conditions- time of injection and the amount dsRNA injected- for tissue-specific knockdowns in methods but realise now that it does not explain this well enough. We have now edited it to state our methodology more clearly (see lines 932-948).

      I also found some interpretations of the transcriptomic to be overly broad for what transcriptomes can actually tell us about the organism's state. For example, the authors mention, "Interestingly, we found that  after a blood meal, glucose is neither spent nor stored, and that the female brain goes into a state of metabolic 'sugar rest', while actively processing proteins (Figure S2B, S3)".

      This would require a physiological measurement to actually know. It certainly suggests that there are changes in carbohydrate metabolism, but there are too many alternative interpretations to make this broad claim from transcriptomic data alone.

      We thank the reviewer for pointing this out and agree with them. We have now edited our statement to read:

      “Instead, our data suggests altered carbohydrate metabolism  after a blood meal, with the female brain potentially entering a state of metabolic 'sugar rest' while actively processing proteins (Figure S2B, S3). However, physiological measurements of carbohydrate and protein metabolism will be required to confirm whether glucose is indeed neither spent nor stored during this period.” See lines 271-277.

      Reviewer #2 (Public review):

      Summary:

      In this manuscript, Bansal et al examine and characterize feeding behaviour in Anopheles stephensi mosquitoes. While sharing some similarities to the well-studied Aedes aegypti mosquito, the authors demonstrate that mated females, but not unmated (virgin) females, exhibit suppression in their bloodfeeding behaviour. Using brain transcriptomic analysis comparing sugar-fed, blood-fed, and starved mosquitoes, several candidate genes potentially responsible for influencing blood-feeding behaviour were identified, including two neuropeptides (short NPF and RYamide) that are known to modulate feeding behaviour in other mosquito species. Using molecular tools, including in situ hybridization, the authors map the distribution of cells producing these neuropeptides in the nervous system and in the gut. Further, by implementing systemic RNA interference (RNAi), the study suggests that both neuropeptides appear to promote blood-feeding (but do not impact sugar feeding), although the impact was observed only  after both neuropeptide genes underwent knockdown.

      Strengths and/or weaknesses:

      Overall, the manuscript was well-written; however, the authors should review carefully, as some sections would benefit from restructuring to improve clarity. Some statements need to be rectified as they are factually inaccurate.

      Below are specific concerns and clarifications needed in the opinion of this reviewer:

      (1) What does "central brains" refer to in abstract and in other sections of the manuscript (including methods and results)? This term is ambiguous, and the authors should more clearly define what specific components of the central nervous system was/were used in their study.

      Central brain, or mid brain, is a commonly used term to refer to brain structures/neuropils without the optic lobes (For example: https://www.nature.com/articles/s41586-024-07686-5). In this study we have focused our analysis on the central brain circuits involved in modulating blood-feeding behaviour and have therefore excluded the optic lobes. As optic lobes account for nearly half of all the neurons in the mosquito brain (https://pmc.ncbi.nlm.nih.gov/articles/PMC8121336/), including them would have disproportionately skewed our transcriptomic data toward visual processing pathways.

      We have indicated this in figure 3A and in the methods (see lines 800-801, 812). We have now also clarified it in the results section for neuro-transcriptomics to avoid confusion (see lines 236-237).

      (2) The abstract states that two neuropeptides, sNPF and RYamide are working together, but no evidence is summarized for the latter in this section.

      We thank the reviewer for pointing this out. We have now added a statement “This occurs in the context of the action of RYa in the brain” to end of the abstract, for a complete summary of our proposed model.

      (3) Figure 1

      Panel A: This should include mating events in the reproductive cycle to demonstrate differences in the feeding behavior of Ae. aegypti.

      Our data suggest that mating can occur at any time between eclosion and oviposition in An. stephensi and between eclosion and blood feeding in Ae. aegypti. Adding these into (already busy) 1A, would cloud the purpose of the schematic, which is to indicate the time points used in the behavioural assays and transcriptomics.

      Panel F: In treatments where insects were not provided either blood or sugar, how is it that some females and males had fed? Also, it is unclear why the y-axis label is % fed when the caption indicates this is a choice assay. Also, it is interesting that sugar-starved females did not increase sugar intake. Is there any explanation for this (was it expected)?

      We apologise for the confusion. The experiment is indeed a choice assay in which sugar-starved or sugar-sated females, co-housed with males, were provided simultaneous access to both blood and sugar, and were assessed for the choice made (indicated on the x-axis): both blood and sugar, blood only, sugar only, or neither. The x-axis indicates the choice made by the mosquitoes, not the choice provided in the assay, and the y-axis indicates the percentage of males or females that made each particular choice. We have now removed the “X” markers from the x-axis and revised the axis label from “choice of food” to “choice made” to better reflect what food the mosquitoes chose to take.

      In this assay, we scored females only for the presence or absence of each meal type (blood or sugar) and are therefore unable to comment on whether sugar-starved females consumed more sugar than sugarsated females. However, when sugar-starved, a higher proportion of females consumed both blood and sugar, while fewer fed on blood alone.

      For clarity, we have now also plotted the same data as stacked graphs at the bottom of Fig. 1F, which clearly shows the proportion of mosquitoes fed on each particular choice. We avoid the stacked graph as the sole representation of this data as it does not capture the variability in the data.

      (4) Figure 3

      In the neurotranscriptome analysis of the (central) brain involving the two types of comparisons, can the authors clarify what "excluded in males" refers to? Does this imply that only genes not expressed in males were considered in the analysis? If so, what about co-expressed genes that have a specific function in female feeding behaviour?

      This is indeed correct. We reasoned that since blood feeding is exclusive to females, we should focus our analysis on genes that were specifically upregulated in them. As the reviewer points out, it is very likely that genes commonly upregulated in males and females may also promote blood feeding and we will miss out on any such candidates based on our selection criteria.

      (5) Figure 4

      The authors state that there is more efficient knockdown in the head of unfed females; however, this is not accurate since they only get knockdown in unfed animals, and no evidence of any knockdown in fed animals (panel D). This point should be revised in the results test as well.

      Perhaps we do not understand the reviewer’s point or there has been a misunderstanding. In figure 4D, we show that while there is more robust gene knockdown in unfed females, blood-fed females also showed modest but measurable knockdowns ranging from 5-40% for RYamide and 2-21% for sNPF.

      Relatedly, blood-feeding is decreased when both neuropeptide transcripts are targeted compared to uninjected (panel C) but not compared to dsGFP injected (panel E). Why is this the case if authors showed earlier in this figure (panel B) that dsGFP does not impact blood feeding?

      We realise this concern stems from our representation of the data. Since we had earlier determined that dsGFP-injected females fed similarly to uninjected females (fig 4B), we used these controls interchangeably in subsequent experiments. To avoid confusion, we have now only used the label ‘control’ in figure 4 (and supplementary figure S9) and specified which control was used for each experiment in the legend.

      In addition to this, we wanted to clarify that fig 4C and 4E are independent experiments. 4C is the behaviour corresponding to when the neuropeptides were knocked down in both heads and abdomens.

      4E is the behaviour corresponding to when the neuropeptides were knocked down in only the abdomens. We have now added a schematic in the plots to make this clearer.

      In addition, do the uninjected and dsGFP-injected relative mRNA expression data reflect combined RYa and sNPF levels? Why is there no variation in these data,…

      In these qPCRs, we calculated relative mRNA expression using the delta-delta Ct method (see line 975). For each neuropeptide its respective control was used. For simplicity, we combined the RYa and sNPF control data into a single representation. The value of this control is invariant because this method sets the control baseline to a value of 1.

      …and how do transcript levels of RYa and sNPF compare in the brain versus the abdomen (the presentation of data doesn't make this relationship clear).

      The reviewer is correct in pointing out that we have not clarified this relationship in our current presentation. While we have not performed absolute mRNA quantifications, we extracted relative mRNA levels from qPCR data of 96h old unmanipulated control females. We observed that both sNPF and RYa transcripts are expressed at much lower levels in the abdomens, as compared to those in the heads, as shown in the graphs inserted below.

      Author response image 1.

      (6) As an overall comment, the figure captions are far too long and include redundant text presented in the methods and results sections.

      We thank the reviewer for flagging this and have now edited the legends to remove redundancy.

      (7) Criteria used for identifying neuropeptides promoting blood-feeding: statement that reads "all neuropeptides, since these are known to regulate feeding behaviours". This is not accurate since not all neuropeptides govern feeding behaviors, while certainly a subset do play a role.

      We agree with the reviewer that not all neuropeptides regulate feeding behaviours. Our statement refers to the screening approach we used: in our shortlist of candidates, we chose to validate all neuropeptides.

      (8) In the section beginning with "Two neuropeptides - sNPF and RYa - showed about 25% and 40% reduced mRNA levels...", the authors state that there was no change in blood-feeding and later state the opposite. The wording should be clarified as it is unclear.

      Thank you for pointing this out. We were referring to an unchanged proportion of the blood fed females. We have now edited the text to the following:

      “Two neuropeptides - sNPF and RYa - showed about 25% and 40% reduced mRNA levels in the heads but the proportion of females that took blood meals remained unchanged”. See lines 338-340.

      (9) Just before the conclusions section, the statement that "neuropeptide receptors are often ligand promiscuous" is unjustified. Indeed, many studies have shown in heterologous systems that high concentrations of structurally related peptides, which are not physiologically relevant, might cross-react and activate a receptor belonging to a different peptide family; however, the natural ligand is often many times more potent (in most cases, orders of magnitude) than structurally related peptides. This is certainly the case for various RYamide and sNPF receptors characterized in various insect species.

      We agree with the reviewer and apologise for the mistake. We have now removed the statement.

      (10) Methods

      In the dsRNA-mediated gene knockdown section, the authors could more clearly describe how much dsRNA was injected per target. At the moment, the reader must carry out calculations based on the concentrations provided and the injected volume range provided later in this section.

      We have now edited the section to reflect the amount of dsRNA injected per target. Please see lines 921-931.

      It is also unclear how tissue-specific knockdown was achieved by performing injection on different days/times. The authors need to explain/support, and justify how temporal differences in injection lead to changes in tissue-specific expression. Does the blood-brain barrier limit knockdown in the brain instead, while leaving expression in the peripheral organs susceptible?

      To achieve tissue-specific knockdowns of sNPF and RYa, we optimised both the time of injection as well as the dsRNA concentration to be injected. Injecting dsRNA into 0-10h females produced abdomen specific knockdowns without affecting head expression, whereas injections into 96h old females resulted in knockdowns in both tissues. Head knockdowns in older females required higher dsRNA concentrations, with knockdown efficiency correlating with the amount injected. In contrast, abdominal knockdowns in younger females could be achieved even with lower dsRNA amounts, reflecting the lower baseline expression of sNPF in abdomens compared to heads and the age-dependent increase in head expression (as confirmed by qPCR). It is possible that the blood-brain barrier also limits the dsRNA entering the brain, thereby requiring higher amounts to be injected for head knockdowns.

      We have now edited this section to state our methodology more clearly (see lines 932-948).

      For example, in Figure 4, the data support that knockdown in the head/brain is only effective in unfed animals compared to uninjected animals, while there is no evidence of knockdown in the brain relative to dsGFP-injected animals. Comparatively, evidence appears to show stronger evidence of abdominal knockdown mostly for the RYa transcript (>90%) while still significantly for the sNPF transcript (>60%).

      As we explained earlier, this concern likely stems from our representation of the data. Since we had earlier determined that dsGFP-injected females fed similarly to uninjected females (fig 4B), we used these controls interchangeably in subsequent experiments. To avoid confusion, we have now only used the label ‘control’ in figure 4 (and supplementary figure S9) and specified which control was used for each experiment in the legend.

      In addition to this, we wanted to clarify that fig 4C and 4E are independent experiments. 4C is the behaviour corresponding to when the neuropeptides were knocked down in both heads and abdomens. 4E is the behaviour corresponding to when the neuropeptides were knocked down in only the abdomen. We have now added a schematic in the plots to make this clearer.

      Reviewer #3 (Public review):

      Summary:

      This manuscript investigates the regulation of host-seeking behavior in Anopheles stephensi females across different life stages and mating states. Through transcriptomic profiling, the authors identify differential gene expression between "blood-hungry" and "blood-sated" states. Two neuropeptides, sNPF and RYamide, are highlighted as potential mediators of host-seeking behavior. RNAi knockdown of these peptides alters host-seeking activity, and their expression is anatomically mapped in the mosquito brain (sNPF and RYamide) and midgut (sNPF only).

      Strengths:

      (1) The study addresses an important question in mosquito biology, with relevance to vector control and disease transmission.

      (2) Transcriptomic profiling is used to uncover gene expression changes linked to behavioral states.

      (3) The identification of sNPF and RYamide as candidate regulators provides a clear focus for downstream mechanistic work.

      (3) RNAi experiments demonstrate that these neuropeptides are necessary for normal host-seeking behavior.

      (4) Anatomical localization of neuropeptide expression adds depth to the functional findings.

      Weaknesses:

      (1) The title implies that the neuropeptides promote host-seeking, but sufficiency is not demonstrated (for example, with peptide injection or overexpression experiments).

      Demonstrating sufficiency would require injecting sNPF peptide or its agonist. To date, no small-molecule agonists (or antagonists) that selectively mimic sNPF or RYa neuropeptides have been identified in insects. An NPY analogue, TM30335, has been reported to activate the Aedes aegypti NPY-like receptor 7 (NPYLR7; Duvall et al., 2019), which is also activated by sNPF peptides at higher doses (Liesch et al., 2013). Unfortunately, the compound is no longer available because its manufacturer, 7TM Pharma, has ceased operations. Synthesising the peptides is a possibility that we will explore in the future.

      (2) The proposed model regarding central versus peripheral (gut) peptide action is inconsistently presented and lacks strong experimental support.

      The best way to address this would be to conduct tissue-specific manipulations, the tools for which are not available in this species. Our approach to achieve head+abdomen and abdomen only knockdown was the closest we could get to achieving tissue specificity and allowed us to confirm that knockdown in the head was necessary for the phenotype. However, as the reviewer points out, this did not allow us to rule out any involvement of the abdomen. This point has been addressed in lines 364-371.

      (3) Some conclusions appear premature based on the current data and would benefit from additional functional validation.

      The most definitive way of demonstrating necessity of sNPF and RYa in blood feeding would be to generate mutant lines. While we are pursuing this line of experiments, they lie beyond the scope of a revision. In its absence, we relied on the knockdown of the genes using dsRNA. We would like to posit that despite only partial knockdown, mosquitoes do display defects in blood-feeding behaviour, without affecting sugar-feeding. We think this reflects the importance of sNPF in promoting blood feeding.

      Recommendations for the authors:

      Reviewer #1 (Recommendations for the authors):

      Overall, I found this manuscript to be well-prepared, visually the figures are great and clearly were carefully thought out and curated, and the research is impacwul. It was a wonderful read from start to finish. I have the following recommendations:

      Thank you very much, we are very pleased to hear that you enjoyed reading our manuscript!

      (1) For future manuscripts, it would make things significantly easier on the reviewer side to submit a format that uses line numbers.

      We sincerely apologise for the oversight. We have now incorporated line numbers in the revised manuscript.

      (2) There are a few statements in the text that I think may need clarification or might be outside the bounds of what was actually studied here. For example, in the introduction "However, mating is dispensable in Anophelines even under conditions of nutritional satiety". I am uncertain what is meant by this statement - please clarify.

      We apologise for the lack of clarity in the statement and have now deleted it since we felt it was not necessary.

      (3) Typo/Grammatical minutiae:

      a) A small idiosyncrasy of using hyphens in compound words should also be fixed throughout. Typically, you don't hyphenate if the words are being used as a noun, as in the case: e.g. "Age affects blood feeding.". However, you would hyphenate if the two words are used as a compound adjective "Age affects blood-feeding behavior". This may not be an all-inclusive list, but here are some examples where hyphens need to either be removed or added. Some examples:

      "Nutritional state also influences other internal state outputs on blood-feeding": blood-feeding -> blood feeding

      "... the modulation of blood-feeding": blood-feeding -> blood feeding

      "For example, whether virgin females take blood-meals...": blood-meals -> blood meals

      ".... how internal and external cues shape meal-choice"-> meal choice

      "blood-meal" is often used throughout the text, but is correctly "blood meal" in the figures.

      There are many more examples throughout.

      We apologise for these errors and appreciate the reviewer’s keen eye. We have now fixed them throughout the manuscript.

      b) Figure 1 Caption has a typo: "co-housed males were accessed for sugar-feeding" should be "co-housed males were assessed for sugar feeding"

      We apologise for the typo and thank the reviewer for spotting it. We have now corrected this.

      c) It would be helpful in some other figure captions to more clearly label which statement is relevant to which part of the text. For example, in Figure 4's caption.

      "C,D. Blood-feeding and sugar-feeding behaviour of females when both RYa and sNPF are knocked down in the head (C). Relative mRNA expressions of RYa and sNPF in the heads of dsRYa+dssNPF - injected blood-fed and unfed females, as compared to that in uninjected females, analysed via qPCR (D)."

      I found re-referencing C and D at the end of their statements makes it look as thought C precedes the "Relative mRNA expression" and on a first read through, I thought the figure captions were backwards. I'd recommend reformating here and throughout consistently to only have the figure letter precede its relevant caption information, e.g.:

      "C. Blood-feeding and sugar-feeding behaviour of females when both RYa and sNPF are knocked down in the head. D. Relative mRNA expressions of RYa and sNPF in the heads of dsRYa+dssNPF - injected bloodfed and unfed females, as compared to that in uninjected females, analysed via qPCR."

      We have now edited the legends as suggested.

      Reviewer #2 (Recommendations for the authors):

      Separately from the clarifications and limitations listed above, the authors could strengthen their study and the conclusions drawn if they could rescue the behavioural phenotype observed following knockdown of sNPF and RYamide. This could be achieved by injection of either sNPF or RYa peptide independently or combined following knockdown to validate the role of these peptides in promoting blood-feeding in An. stephensi. Additionally, the apparent (but unclear) regionalized (or tissue-specific) knockdown of sNPF and RYamide transcripts could be visualized and verified by implementing HCR in situ hyb in knockdown animals (or immunohistochemistry using antibodies specific for these two neuropeptides).

      In a follow up of this work, we are generating mutants and peptides for these candidates and are planning to conduct exactly the experiments the reviewer suggests.

      Reviewer #3 (Recommendations for the authors):

      The loss-of-function data suggest necessity but not sufficiency. Synthetic peptide injection in non-host seeking (blood-fed mated or juvenile) mosquitoes would provide direct evidence for peptide-induced behavioral activation. The lack of these experiments weakens the central claim of the paper that these neuropeptides directly promote blood feeding.

      As noted above, we plan to synthesise the peptide to test rescue in a mutant background and sufficiency.

      Some of the claims about knockdown efficiency and interpretation are conflicting; the authors dismiss Hairy and Prp as candidates due to 30-35% knockdown, yet base major conclusions on sNPF and RYamide knockdowns with comparable efficiencies (25-40%). This inconsistency should be addressed, or the justification for different thresholds should be clearly stated.

      We have not defined any specific knockdown efficacy thresholds in the manuscript, as these can vary considerably between genes, and in some cases, even modest reductions can be sufficient to produce detectable phenotypes. For example, knockdown efficiencies of even as low as about 25% - 40% gave us observable phenotypes for sNPF and RYa RNAi (Figure S9B-G).

      No such phenotypes were observed for Hairy (30%) or Prp (35%) knockdowns. Either these genes are not involved in blood feeding, or the knockdown was not sufficient for these specific genes to induce phenotypes. We cannot distinguish between these scenarios.

      The observation that knockdown animals take smaller blood meals is interesting and could reflect a downstream effect of altered host-seeking or an independent physiological change. The relationship between meal size and host-seeking behavior should be clarified.

      We agree with the reviewer that the reduced meal size observed in sNPF and RYa knockdown animals could result from their inability to seek a host or due to an independent effect on blood meal intake. Unfortunately, we did not measure host-seeking in these animals. We plan to distinguish between these possibilities using mutants in future work.

      Several figures are difficult to interpret due to cluttered labeling and poorly distinguishable color schemes. Simplifying these and improving contrast (especially for co-housed vs. virgin conditions) would enhance readability.

      We regret that the reviewer found the figures difficult to follow. We have now revised our annotations throughout the manuscript for enhanced readability. For example, “D1<sup>B</sup>” is now “D1<sup>PBM</sup>” (post-bloodmeal) and “D1<sup>O</sup>” is now “D1<sup>PO</sup>” (post-oviposition). Wherever mated females were used, we have now appended “(m)” to the annotations and consistently depicted these females with striped abdomens in all the schematics. We believe these changes will improve clarity and readability.

      The manuscript does not clearly justify the use of whole-brain RNA sequencing to identify peptides involved in metabolic or peripheral processes. Given that anticipatory feeding signals are often peripheral, the logic for brain transcriptomics should be explained.

      The reviewer is correct in pointing out that feeding signals could also emerge from peripheral tissues. Signals from these tissues – in response to both changing nutritional and reproductive states – are then integrated by the central brain to modulate feeding choices. For example, in Drosophila, increased protein intake is mediated by central brain circuitry including those in the SEZ and central complex (Munch et al., 2022; Liu et al., 2017; Goldschmidt et al., 2023). In the context of mating, male-derived sex peptide further increases protein feeding by acting on a dedicated central brain circuitry (Walker et al., 2015). We, therefore focused on the central brain for our studies.

      The proposed model suggests brain-derived peptides initiate feeding, while gut peptides provide feedback. However, gut-specific knockdowns had no effect, undermining this hypothesis. Conversely, the authors also suggest abdominal involvement based on RNAi results. These contradictions need to be resolved into a consistent model.

      We thank the reviewer for raising this point and recognise their concern. Our reasons for invoking an involvement of the gut were two-fold:

      (1) We find increased sNPF transcript expression in the entero-endocrine cells of the midgut in blood-hungry females, which returns to baseline  after a blood-meal (Fig. 4L, M).

      (2) While the abdomen-only knockdowns did not affect blood feeding, every effective head knockdown that affected blood feeding also abolished abdominal transcript levels (Fig. S9C, F). (Achieving a head-only reduction proved impossible because (i) systemic dsRNA delivery inevitably reaches the abdomen and (ii) abdominal expression of both peptides is low, leaving little dynamic range for selective manipulation.) Consequently, we can only conclude the following: 1) that brain expression is required for the behaviour, 2) that we cannot exclude a contributory role for gut-derived sNPF. We have discussed this in lines 364-371.

      The identification of candidate receptors is promising, but the manuscript would be significantly strengthened by testing whether receptor knockdowns phenocopy peptide knockdowns. Without this, it is difficult to conclude that the identified receptors mediate the behavioral effects.

      We agree that functional validation of the receptors would strengthen the evidence for sNPF and RYa_mediated control of blood feeding in _An. stephensi. We selected these receptors based on sequence homology. A possibility remains that sNPF neuropeptides activate more than one receptor, each modulating a distinct circuit, as shown in the case of Drosophila Tachykinin (https://pmc.ncbi.nlm.nih.gov/articles/PMC10184743/). This will mean a systematic characterisation and knockdown of each of them to confirm their role. We are planning these experiments in the future.

      The authors compared the percentage changes in sugar-fed and blood-fed animals under sugar-sated or sugar-starved conditions. Figure 1F should reflect what was discussed in the results.

      Perhaps this concern stems from our representation of the data in figure 1F? We have now edited the xaxis and revised its label from “choice of food” to “choice made” to better reflect what food the mosquitoes chose to take.

      For clarity, we have now also plotted the same data as stacked graphs at the bottom of Fig. 1F, which clearly shows the proportion of mosquitoes fed on each particular choice. We avoid the stacked graph as the sole representation of this data because it does not capture the variability in the data.

      Minor issues:

      (1) The authors used mosquitoes with belly stripes to indicate mated females. To be consistent, the post-oviposition females should also have belly stripes.

      We thank the reviewer for pointing this out. We have now edited all the figures as suggested.

      (2) In the first paragraph on the right column of the second page, the authors state, "Since females took blood-meals regardless of their prior sugar-feeding status and only sugar-feeding was selectively suppressed by prior sugar access." Just because the well-fed animals ate less than the starved animals does not mean their feeding behavior was suppressed.

      Perhaps there has been a misunderstanding in the experimental setup of figure 1F, probably stemming from our data representation. The experiment is a choice assay in which sugar-starved or sugar-sated females, co-housed with males, were provided simultaneous access to both blood and sugar, and were assessed for the choice made (indicated on the x-axis): both blood and sugar, blood only, sugar only, or neither. We scored females only for the presence or absence of each meal type (blood or sugar) and did not quantify the amount consumed.

      (3) The figure legend for Figure 1A and the naming convention for different experimental groups are difficult to follow. A simplified or consistently abbreviated scheme would help readers navigate the figures and text.

      We regret that the reviewer found the figure difficult to follow. We have now revised our annotations throughout the manuscript for enhanced readability. For example, “D1<sup>B</sup>” is now “D1<sup>PBM</sup>” (post-bloodmeal) and “D1<sup>O</sup>” is now “D1<sup>PO</sup>” (post-oviposition).

      (4) In the last paragraph of the Y-maze olfactory assay for host-seeking behaviour in An. stephensi in Methods, the authors state, "When testing blood-fed females, aged-matched sugar-fed females (bloodhungry) were included as positive controls where ever possible, with satisfactory results." The authors should explicitly describe what the criteria are for "satisfactory results".

      We apologise for the lack of clarity. We have now edited the statement to read:

      “When testing blood-fed females, age-matched sugar-fed females (blood-hungry) were included wherever possible as positive controls. These females consistently showed attraction to host cues, as expected.” See lines 786-790.

      (5) In the first paragraph of the dsRNA-mediated gene knockdown section in Methods, dsRNA against GFP is used as a negative control for the injection itself, but not for the potential off-target effect.

      We agree with the reviewer that dsGFP injections act as controls only for injection-related behavioural changes, and not for off-target effects of RNAi. We have now corrected the statement. See lines 919-920.

      To control for off-target effects, we could have designed multiple dsRNAs targeting different parts of a given gene. We regret not including these controls for potential off-target effects of dsRNAs injected.

      (6) References numbers 48, 89, and 90 are not complete citations.

      We thank the reviewer for spotting these. We have now corrected these citations.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We appreciated the positive, detailed and helpful feedback from all three reviewers.

      Reviewer 1.

      Minor comments.

      1. In the introduction, on page 2, the authors seem a little confused about the Plk1 Polo-box domain - text as written: "...kinase domain linked to tandem Polo-box domains (PBD)", and cite a review paper. Actually, there is only a single Polo-box domain in these kinases, which contains both Polo-boxes and a bit of the upstream linker region. The "PBD" terminology denotes his 2-Polo-box +linker structure. Perhaps it would be better here to cite the PBD structure (Elia et al., Cell, 2002) as a primary citation here.

      Response: Thank you for finding this error, the text has been updated and the new citation included within the text on line 65.

      1. Similarly, the line "...during the G2/M transition following successful DNA damage repair" cites the Seki et al paper, but those findings are shown in the Macurek et al paper, not the Seki et al paper.

      _Response: _Thank you for finding this error, the new citation included within the text on line 69.

      1. Using the model of the ternary complex as shown in Figure 1B, deletion constructs of Bora missing regions within the disordered loops, but still retaining the residues that bind the PBD, FW pocket and Aurora A, can be modeled and tested to see if such deletions can improve the ipTM scores and binding affinity.

      Response: ____AlphaFold3 modelling was attempted with shorter regions of Bora to see the effect on the ipTM scores. Unfortunately, when Bora was reduced to shorter sequences, such as 18-88 or 18-45 modelled with 68-120, the models became inconsistent and of a low quality. Models were also created including the short region of Bora surrounding Ser252 that interacts with the polo box domain as well as Bora 18-120, but this had minimal effect on the calculated iPTM scores.

      1. On page 5, "S112A" within the sentence "Unexpectedly, the F56A/W58A Bora was less efficiently phosphorylated on S112A (Supplementary Figure S11, F compared to H and Supplementary Table S4)." This should be "S112".

      Response: ____Thank you for spotting this, the error has been corrected.

      1. In the assays shown in Figure 2D, the presence of excess F56AW58A Bora that remained unphosphorylated on S112 may complicate the interpretation of the results. Can the authors show that the S112-phosphorylated F56AW68A Bora is predominantly bound to Aurora A in such a mixture, perhaps by NMR using labelled pS112 F56AW58A Bora and unlabeled S112 F56AW58A Bora?

      _Response: _15N13C labelled of Bora 18-120 F56A W58A was produced and assigned. We then phosphorylated a sample using ERK2, tracking with NMR, and when the reaction had progressed to a 50:50 mixture of pSer112 and Ser112 (based on peak intensities) the kinase activity was quenched by addition of EDTA to sequester Mg2+. This produced a solution containing both pS112 and unphosphorylated S112 Bora species with marker peaks in HSQC spectra that could be used to directly compare Aurora-binding to the two species. Aurora-A was introduced to the sample and the peak intensities were monitored. Although both species are affected, there is much greater peak loss from the pS112 related peaks than those for unphosphorylated S112. This indicates that Aurora-A still preferentially binds pS112 Bora over S112 Bora when the F56A W58A mutation is present. This data has been included in Supplementary Figure S11.

      1. Please expand Figure 3A to better show the FW pocket-forming residues on Plk1.

      Response: ____Figure 3 has been amended to reduce the size of the sequence alignments so that 3A could be made slightly larger.

      1. It would be helpful to label the peaks in the mass spectra in Fig. S11 with the phospho-species that they correspond to.

      Response: ____This information has been added to the mass spectra in Fig. S11 (now supplementary Figure S14) to make them easier to view.

      1. In the last paragraph on page 7, "see we" in the sentence "As well as a decrease in intensity around pSer112 in Bora, see we an overall effect with decreased intensity across most of the Bora sequence." Should be corrected to "we see".

      Response: ____Thank you for spotting this, the error has been corrected.

      1. While not required, it would be helpful if binding or Bora to Aurora A after Erk2 phosphorylation could be shown using fluorescence polarization or ITC to lend additional support to the NMR data for S112 and S59 phosphorylation and for CEP192 and TPX2 competition.

      Response: ____This question has been partially answered in previous work by Tavernier et al. (2021), who showed improved binding of Aurora-A to Bora after Erk phosphorylation (by SPR), and they used labelled-TPX2 for a series of competition FP assays in that and the recent parallel study (Pillan et al. 2025).

      We made initial efforts to perform additional FP assays using longer sections of Bora with different phosphorylation states but without success (perhaps due to the multisite-binding nature of the Bora–Aurora interaction, and difficulties with directly expressing phosphorylated Bora). The revised manuscript now includes some additional NMR data to show improved Bora–Aurora-A interaction after phosphorylation at Ser59 (Supplementary Figure S12).

      1. The Aurora A phosphorylation motif has been further defined beyond that reported by the Pinna lab in 2005. Notably, the Ser-59 sequence on Bora (F-R-W-S-I), has, in addition to dominant selection for AR in the -2 position, both favorable -1 (W) and +1 (I) positions based on peptide library measurements (Alexander et al., Science Signaling 2011), further arguing that it may be an excellent Aurora A phosphorylation site.

      Response: ____Thank you for highlighting this publication and how it further reinforces the likelihood of Ser59 being an effective substrate for Aurora-A, this should have been included in the original manuscript. This citation has now been included.

      1. Have the authors tried to model the Drosophila melanogaster Aurora A-Bora-Polo complex to see if the Asn substitution of Bora Ser59, and the expected loss of the interactions between Bora pSer59 and Plk1 Arg59 and Aurora A Arg205 are compensated by other features?

      Response: ____A ternary complex between the Drosophila melanogaster orthologues was modelled using AlphaFold3 (Uniprot code PLK1 (Q9VVR2 72-165), Aurora-A kinase (Q9VGF9) 151-411 and PLK1 (P52304 21-280)). This model was analysed using PDBe PISA to identify potential interactions between the three proteins, focusing on residues that are not conserved between the human and Drosophila sequences. From this model a potential salt bridge was identified between Drosophila Bora Lys120 and PLK1 Glu93 that would not occur in the human ternary complex given Lys120 is replaced with an asparagine. This could be an alternative (kinase-independent) method for improved Bora-PLK1 interaction. When comparing the Bora:Aurora-A side of the predicted interface and focusing on the short region of Bora in between Aurora-A and PLK1, there were no clear differences seen in the residues predicted to bind to Aurora-A. This modelling has been included in Supplementary Figure S10 C and D.

      1. Given the relevance of the recent publication from Zhu et al. to this study, the authors may want to comment on, or test, the relative importance of PKA and Aurora A as a potential kinase for Bora S59. While those authors argue that PKA phosphorylates Bora on Ser-59, one could easily imagine a model in which either PKA or Aurora A could initially phosphorylate that site followed by a propagation step after initial Aurora A activation, in which Aurora A phosphorylation of Bora Ser-59 is the dominant process.

      Response: ____A brief discussion of this recent publication has been added to the discussion, highlighting the similarities between the two publications and the importance of pSer59, as well as suggesting that in cellulo this modification could be achieved via more than one pathway. We also include some additional NMR data to show improved Bora–Aurora-A interaction after phosphorylation at Ser59 (Supplementary Figure S12).

      Reviewer 2.

      Minor comments.

      Page 5: '... a K82R PLK1 mutant was used to increase the stability of the protein' - It is not clear how this mutation confers increased stability of the protein. The authors do not show any data to support this. Isn't the PLK1 K82R an ATP-binding-deficient, kinase-inactive mutant?

      Response: ____Thank you for spotting this, the text has been updated to clarify that this version of PLK1 was used as it is acting as a substrate in the in vitro assay as we didn’t want to see any PLK1 activity within this assay.

      All panels showing the Alphabridge diagram - it would be helpful if pictorial definitions of the colour codes were provided with corresponding score ranges (in addition to the description in the figure legend).

      Response:____The AlphaBridge images have been updated to include details about the plDDT scores each of the different colours refer to.

      Fig 2B - The Fluorescence anisotropy assay curves do not reach a plateau. Though the effect of mutation on binding affinity is pretty clear, if possible, I suggest including more data points at higher concentrations and estimating apparent Kd values.

      __Response:____The direct binding assay was repeated with a higher concentration of PLK1 in order to try and see a top plateau. This was successful and has been included in Figure 2B (shown in black). The measured Kd was 24 ± 3 µM. __

      The cartoon representation of the structures and molecular interfaces - better to avoid shadows, as they compromise the clarity of the figures, particularly the ones where side chains are shown in stick representation.

      Response:____The structural images have been remade to remove the shadows and improve the clarity of the images.

      It is important to discuss how the parallel studies by Verza et al. and Pillan et al. complement this study, highlighting similarities and differences.

      Response:____References to these two publications and details on the similarities and differences seen are now included in the discussion.

      Reviewer 3.

      Major comments

      It would be helpful to measure the level of pThr210 PLK1 in some experiments and graph the data. The current presentation is Fig. 2D-E is qualitative rather than quantitative.

      Response:____Graphs displaying the levels of pThr210 produced in the assay are now shown in Supplementary Figure S4.

      Have the authors measured the binding affinity of the F/W mutant Bora for PLK1 using the assay in Fig. 2B? Likewise, for Fig. 7 the S59 mutant could be tested to see if it affects PLK1 binding or activation.

      Response:____The direct binding assay has been repeated with the use of a FAM-Bora peptide that incorporates the F56A W58A mutation which shows reduced binding (Figure 2B, shown in blue). A version of the Bora peptide phosphorylated on Ser59 was also tested in the direct binding assay and this shows a similar affinity for PLK1 to the wild-type sequence (Figure 2B, shown in red compared to the wild-type shown in black).

      It would be helpful if measurements of pThr210 PLK1 for all conditions were shown in the graph Fig. 7F.

      Response:____This graph has been updated to include the levels of phosphorylation seen for PLK1 in all of the conditions tested.

      Minor comments

      I found Figure S1B easier to understand than Fig S1A and Fig 1A-B. Some of the supplemental data Fig. S1C-E could be moved to a revised Figure 1, dropping the current Fig. 1A-B. Can the interaction plots (Fig. S1C-D) be rotated to have the same original at the top and order of proteins (i.e. Bora > Aurora A > {plus minus} PLK1 depending on the plot).

      Response:____Figure 1 and S1 have been rearranged to hopefully make them easier to understand, with all AlphaFold3 models of the full-length sequences kept in the supplementary figure and the focus in 1B just on the truncated model. The AlphaBridge plots have been rotated as suggested.

      Figure 3F. Typo "Strongyl" not "Strongly".

      Response:____Thank you for spotting this, this has been corrected in the updated manuscript.

      Figure 3 could be supplemental material.

      Response:__Thank you for your suggestion, but we have decided to keep this as a main figure.

      Fig. 7E. Run a positive control reaction +ERK2 on the second gel to allow direct comparison of pThr210 across all the conditions tested.

      Response:____These samples have been rerun on the same membrane and the levels of phosphorylation have been quantified and included in Figure 7F.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary.

      Miles and co-workers have carried out a careful and high-quality study of the activation mechanisms of the mitotic kinase PLK1. Multiple proteins have been implicated in PLK1 activation and localisation as cell enter and pass through mitosis. Initial activation of PLK1 is promoted by a complex of Bora with another kinase Aurora A. Later in mitosis, this activated PLK1 associates with mitotic spindle and centrosome proteins regulating different aspects of mitosis and cytokinesis. In this study, Miles et al. extend previous work on this question by proposing and testing detailed models for Bora/Aurora A-mediated activation of PLK1 to elucidate the mechanism of this reaction.

      Using the latest Alphafold they generate a series of models of the PLK1/Bora/Aurora A complex to home in on the key regions mediating interactions of the three proteins. This approach suggests an arrangement where the first ~120 amino acids of Bora wrap Aurora A and create an interaction surface for the N-terminal kinase domain of PLK1. This orients Thr210 in PLK1 towards Aurora A creating a situation likely favourable for phosphorylation, although has the authors discuss there are some caveats to this. A further prediction of the modelling helps explain the requirement for Bora phosphorylation to promote the interaction with Aurora A. This data is presented in Fig. 1 and Fig. S1-S3.

      In the subsequent figures the details of this model are tested using biochemical assays and structural biology methods to validate key predictions. First the PLK1 interaction with Bora was shown to require the conserved F/W motif of Bora and a conserved pocket close to R106 on PLK1 (Fig. 2 and 3). In reconstituted PLK1 activation assays the F/W motif mutant Bora showed greatly attenuated pThr210 phosphorylation. This reaction also required phosphorylation of Bora at S112, presumably due to the interaction with Aurora A. An R106A mutant PLK1 showed reduced binding to Bora and reduced kinase activation. This data is clear and provides compelling support for the model.

      Using NMR the authors then investigate the interaction between Bora and Aurora A, and more specifically the requirement for Bora phosphorylation at Ser112. The NMR data in Fig. 4 and Fig. 6 provide good support for the Alphafold model. A helpful comparison with known Aurora A binding proteins is also shown to highlight the way CEP192, TPX2 and TACC3 contact a series of conserved pockets on the surface of Aurora A which are common to the Bora interaction. S59 phosphorylation by Aurora A is also shown to play an important role in contacting PLK1 and is required for pThr210 phosphorylation.

      In summary, the authors have made valuable progress in working out details of the PLK1 activation mechanism, that extends previous work in the field.

      Major comments.

      It would be helpful to measure the level of pThr210 PLK1 in some experiments and graph the data. The current presentation is Fig. 2D-E is qualitative rather than quantitative.

      Have the authors measured the binding affinity of the F/W mutant Bora for PLK1 using the assay in Fig. 2B? Likewise, for Fig. 7 the S59 mutant could be tested to see if it affects PLK1 binding or activation.

      It would be helpful if measurements of pThr210 PLK1 for all conditions were shown in the graph Fig. 7F.

      Minor comments.

      I found Figure S1B easier to understand than Fig S1A and Fig 1A-B. Some of the supplemental data Fig. S1C-E could be moved to a revised Figure 1, dropping the current Fig. 1A-B. Can the interaction plots (Fig. S1C-D) be rotated to have the same original at the top and order of proteins (i.e. Bora > Aurora A > {plus minus} PLK1 depending on the plot). Figure 3F. Typo "Strongyl" not "Strongly". Figure 3 could be supplemental material. Fig. 7E. Run a positive control reaction +ERK2 on the second gel to allow direct comparison of pThr210 across all the conditions tested.

      Significance

      Timely and orchestrated activation of multiple mitotic protein kinases is crucial for the alignment and segregation of chromosomes, and for the process of cell division. In this study the authors explore how activation of the mitotic kinase PLK1 is triggered by another mitotic kinase Aurora A, and the role played by a scaffold protein Bora.

      Strengths: Detailed analysis of mechanism using biochemical and structural approaches.

      Limitations: The study is focussed on the biochemical and structural mechanisms rather than the cellular outcomes. Some data would benefit from additional quantitative measurement.

      Relevance: Cancer and cell biology due to the role of Aurora A in many cancers.

      Reviewer expertise: Biochemistry, molecular and cell biology.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Miles et al. used a combination of AlphaFold modeling, biochemical assays of mutant constructs and NMR spectroscopy to model the ternary complex of Aurora A, Bora and Plk1, and elucidate how Bora can act as a molecular bridge that facilitates the phosphorylation of the activation loop Thr210 within Plk1 by Aurora A. Their studies identified an interaction between residues 52-73 within Bora and the 'FW' pocket on the N-terminal lobe of Plk1, which binds Phe56 and Trp58 of Bora. Additionally, Ser59 of Bora was identified as a good Aurora A substrate using a Bora peptide array, and pSer59 was predicted to form bridging interactions with Aurora Arg205 and Plk1 Arg59. This was supported by NMR and biochemical assays. In addition, the authors validate that phosphorylation of Ser-112 on Bora enhances stabilization of the Aurora A-Bora complex Overall, the model revealed novel details of the interactions within the Aurora A-Bora-Plk1 ternary complex that are supported by the biochemical and NMR data. The work will be of significant interest to basic scientists whose work involves protein kinase signaling, cell division/mitosis, signal transduction, and cancer biology. We recommend publication of this manuscript with the following minor changes and additions.

      1. In the introduction, on page 2, the authors seem a little confused about the Plk1 Polo-box domain - text as written: "...kinase domain linked to tandem Polo-box domains (PBD)", and cite a review paper. Actually, there is only a single Polo-box domain in these kinases, which contains both Polo-boxes and a bit of the upstream linker region. The "PBD" terminology denotes his 2-Polo-box +linker structure. Perhaps it would be better here to cite the PBD structure (Elia et al., Cell, 2002) as a primary citation here.
      2. Similarly, the line "...during the G2/M transition following successful DNA damage repair" cites the Seki et al paper, but those findings are shown in the Macurek et al paper, not the Seki et al paper.
      3. Using the model of the ternary complex as shown in Figure 1B, deletion constructs of Bora missing regions within the disordered loops, but still retaining the residues that bind the PBD, FW pocket and Aurora A, can be modeled and tested to see if such deletions can improve the ipTM scores and binding affinity.
      4. On page 5, "S112A" within the sentence "Unexpectedly, the F56A/W58A Bora was less efficiently phosphorylated on S112A (Supplementary Figure S11, F compared to H and Supplementary Table S4)." This should be "S112".
      5. In the assays shown in Figure 2D, the presence of excess F56AW58A Bora that remained unphosphorylated on S112 may complicate the interpretation of the results. Can the authors show that the S112-phosphorylated F56AW68A Bora is predominantly bound to Aurora A in such a mixture, perhaps by NMR using labelled pS112 F56AW58A Bora and unlabeled S112 F56AW58A Bora?
      6. Please expand Figure 3A to better show the FW pocket-forming residues on Plk1.
      7. It would be helpful to label the peaks in the mass spectra in Fig. S11 with the phospho-species that they correspond to.
      8. In the last paragraph on page 7, "see we" in the sentence "As well as a decrease in intensity around pSer112 in Bora, see we an overall effect with decreased intensity across most of the Bora sequence." Should be corrected to "we see".
      9. While not required, it would be helpful if binding or Bora to Aurora A after Erk2 phosphorylation could be shown using fluorescence polarization or ITC to lend additional support to the NMR data for S112 and S59 phosphorylation and for CEP192 and TPX2 competition.
      10. The Aurora A phosphorylation motif has been further defined beyond that reported by the Pinna lab in 2005. Notably, the Ser-59 sequence on Bora (F-R-W-S-I), has, in addition to dominant selection for AR in the -2 position, both favorable -1 (W) and +1 (I) positions based on peptide library measurements (Alexander et al., Science Signaling 2011), further arguing that it may be an excellent Aurora A phosphorylation site.
      11. Have the authors tried to model the Drosophila melanogaster Aurora A-Bora-Polo complex to see if the Asn substitution of Bora Ser59, and the expected loss of the interactions between Bora pSer59 and Plk1 Arg59 and Aurora A Arg205 are compensated by other features?
      12. Given the relevance of the recent publication from Zhu et al. in https://doi.org/10.1038/s41467-025-63352-y to this study, the authors may want to comment on, or test, the relative importance of PKA and Aurora A as a potential kinase for Bora S59. While those authors argue that PKA phosphorylates Bora on Ser-59, one could easily imagine a model in which either PKA or Aurora A could initially phosphorylate that site followed by a propagation step after initial Aurora A activation, in which Aurora A phosphorylation of Bora Ser-59 is the dominant process.

      -Dan Lim and Michael Yaffe

      Significance

      The work is well done and clearly presented.

    1. Reviewer #1 (Public review):

      In this study, the authors investigate LFP responses to methionine in the olfactory system of the Xenopus tadpole. They show that this response is local to the glomerular layer, arises ipsilaterally, and is blocked by pharmacological blockade of AMPA and NMDA receptors, with little modulation during blockade of GABA-A receptors. They then show that this response is translently enlarged following transection of the contralateral olfactory nerve, but not the optic lobe nerve. Measurement of ROS- a marker of inflammation- was not affected by contralateral nerve transection, and LFP expansion was not affected by pharmacological blockade of ROS production. Imaging biased towards presynaptic terminals suggests that the enlargement of the LFP has a presynaptic component. A D2 antagonist increases the LFP size and variability in intact tadpoles, while a GABA-B antagonist does not. On this basis, the authors conclude that the increase driven by contralateral nerve transection is due to DA signaling.

      Overall, I found the array of techniques and approaches applied in this study to be creatively and effectively employed. However, several of the conclusions made in the Discussion are too strong, given the evidence presented. For example, the authors state that "The observed potentiation was not related to inflammatory mediators associated to inury, because it was caused by a release of the inhibition made by D2 dopamine receptor present in OSN axon terminals." This statement is too strong - the authors have shown that D2 receptors are sufficient to cause an increase in LFP, but not that they are required for the potentiation evoked by nerve transection. The right experiment here would be to get rid of the D2 receptors prior to transection and show that the potentiation is now abolished. In addition, the authors have not shown any data localizing D2 receptors to OSN axon terminals.

      Similarly, the authors state, "the onset of LFP changes detected in glomeruli is determined by glutamate release from OSNs." Again, the authors have shown that blockade of AMPA/NMDA receptors decreases the LFP, and that uncaging of glutamate can evoke small negative deflections, but not that the intact signal arises from glutamate release from OSNs. The conclusions about the in vivo contribution of this contralateral pathway are also rather speculative. Acute silencing of one hemisphere would likely provide more insight into the moment-to-moment contributions of bilateral signals to those recorded in one hemisphere.

    1. Reviewer #1 (Public review):

      In this study, the authors aim to elucidate both how Pavlovian biases affect instrumental learning from childhood to adulthood, as well as how reward outcomes during learning influence incidental memory. While prior work has investigated both of these questions, findings have been mixed. The authors aim to contribute additional evidence to clarify the nature of developmental changes in these processes. Through a well-validated affective learning task and a large age-continuous sample of participants, the authors reveal that adolescents outperform children and adults when Pavlovian biases and instrumental learning are aligned, but that learning performance does not vary by age when they are misaligned. They also show that younger participants show greater memory sensitivity for images presented alongside rewards.

      The manuscript has notable strengths. The task was carefully designed and modified with a clever, developmentally appropriate cover story, and the large sample size (N = 174) means their study was better powered than many comparable developmental learning studies. The addition of the memory measure adds a novel component to the design. The authors transparently report their somewhat confusing findings.

      The manuscript also has weaknesses, which I describe in detail below.

      It was not entirely clear to me what central question the researchers aimed to address. They note that prior studies using a very similar learning task design have reported inconsistent findings, but they do not propose a reason for why these inconsistent findings may emerge nor do they test a plausible cause of them (in contrast, for example, Raab et al. 2024 explicitly tested the idea that developmental changes in inferences about controllability may explain age-related change in Pavlovian influences on learning). While the authors test a sample of participants that is very large compared to many developmental studies of reinforcement learning, this sample is much smaller than two prior developmental studies that have used the same learning task (and which the authors cite - Betts et al., 2020; Moutoussis et al., 2018). Thus, the overall goal seems to be to add an additional ~170 subjects of data to the existing literature, which isn't problematic per se, but doesn't do much to advance our theoretical understanding of learning across development. They happen to find a pattern of results that differs from all three prior studies, and it is not clear how to interpret this.

      Along those lines, the authors extend prior work by adding a memory manipulation to the task, in which trial-unique images were presented alongside reward outcomes. It was not clear to me whether the authors see the learning and memory questions as fundamentally connected or as two separate research questions that this paradigm allows them to address. The manuscript would potentially be more impactful if the authors integrated their discussion of these two ideas more. Did they have any a priori hypotheses about how Pavlovian biases may affect the encoding of incidentally presented images? Could heightened reward sensitivity explain both changes in learning and changes in memory? It was also not clear to me why the authors hypothesized that younger participants would demonstrate the greatest effects of reward on memory, when most of the introduction seems to suggest they might hypothesize an adolescent peak in both learning and memory.

      As stated above, while the task methods seemed sound, some of the analytic decisions are potentially problematic and/or require greater justification for the results of the study to be interpretable.

      Firstly, it is problematic not to include random participant slopes in the regression models. Not accounting for individual variation in the effects of interest may inflate Type I errors. I would suggest that the authors start with the maximal model, or follow the same model selection procedure they did to select the fixed effects to include for the random effects as well.

      Secondly, the central learning finding - that adolescents demonstrate enhanced learning in Pavlovian-congruent conditions only - is interesting, but it is unclear why this is the case or how much should be made of this finding. The authors show that adolescents outperform others in the Pavlovian-congruent conditions but not the Pavlovian-incongruent conditions. However, this conclusion is made by analyzing the two conditions separately; they do not directly compare the strength of the adolescent peak across these conditions, which would be needed to draw this strong conclusion. Given that no prior study using the same learning design has found this, the authors should ensure that their evidence for it is strong before drawing firm conclusions.

      It was also not clear to me whether any of the RL models that the authors fit could potentially explain this pattern. Presumably, they need an algorithmic mechanism in which the Pavlovian bias is enhanced when it is rewarded. This seems potentially feasible to implement and could help explain the condition-specific performance boosts.

      I also have major concerns about the computational model-fitting results. While the authors seemingly follow a sound approach, the majority of the fitted lapse rates (Figure S10) are near 1. This suggests that for most participants, the best-fitting model is one in which choices are random. This may be why the authors do not observe age-related change in model parameters: for these subjects, the other parameter values are essentially meaningless since they contribute to the learned value estimate, which gets multiplied by a near-0 weight in the choice function. It is important that the authors clarify what is going on here. Is it the case that most of these subjects truly choose at random? It does seem from Figure 2A that there is extensive variability in performance. It might be helpful if the authors re-analyze their data, excluding participants who show no evidence of learning or of reward-seeking behavior. Alternatively, are there other biases that are not being accounted for (e.g., choice perseveration) that may contribute to the high lapse rates?

      Parameter recovery also looks poor, particularly for gain & loss sensitivity, the lapse rate, and the Pavlovian bias - several parameters of interest. As noted above, this may be due to the fact that many of the simulations were conducted with lapse rates sampled from the empirical distribution. It would be helpful for the authors to a.) plot separately parameter recoverability for high and low lapse rates and b.) report the recoverability correlation for each parameter separately.

      Finally, many of the analytic decisions made regarding the memory analyses were confusing and merit further justification.

      (1) First, it seems as though the authors only analyze memory data from trials where participants "could gain a reward". Does this mean only half of the memory trials were included in the analyses? What about memory as a function of whether participants made a "correct" response? Or a correct x reward interaction effect?

      (2) The RPE analysis overcomes this issue by including all trials, but the trial-wise RPEs are potentially not informative given the lapse rate issue described above.

      (3) The authors exclude correct guesses but include incorrect guesses. Is this common practice in the memory literature? It seems like this could introduce some bias into the results, especially if there are age-related changes in meta-memory.

      (4) Participants provided a continuum of confidence ratings, but the authors computed d' by discretizing memory into 'correct' or 'incorrect'. A more sensitive approach could compute memory ROC curves taking into account the full confidence data (e.g., Brady et al., 2020).

      (5) The learning and memory tradeoff idea is interesting, but it was not clear to me what variables went into that regression model.

    1. Reviewer #1 (Public review):

      Summary:

      This study examines how different parts of the brain's reward system regulate eating behavior. The authors focus on the medial shell of the nucleus accumbens, a region known to influence pleasure and motivation. They find that nerve cells in the front (rostral) portion of this region are inhibited during eating, and when artificially activated, they reduce food intake. In contrast, similar cells at the back (caudal) are excited during eating but do not suppress feeding. The team also identifies a molecular marker, Stard5, that selectively labels the rostral hotspot and enables new genetic tools to study it. These findings clarify how specific circuits in the brain control hedonic feeding, providing new entry points to understand and potentially treat conditions such as overeating and obesity.

      Strengths:

      (1) Conceptual advance: The work convincingly establishes a rostro-caudal gradient within the medNAcSh, clarifying earlier pharmacological studies with modern circuit-level and genetic approaches.

      (2) Methodological rigor: The combination of fiber photometry, optogenetics, CRISPR-Cas9 genetic engineering, histology, FISH, scRNA-seq, and novel mouse genetics adds robustness, with complementary approaches converging on the central claim.

      (3) Innovation: The generation of a Stard5-Flp line is a valuable resource that will enable precise interrogation of the rostral hotspot in future studies.

      (4) Specificity of findings: The dissociation between appetitive and aversive conditions strengthens the interpretation that the observed gradient is restricted to feeding.

      Weaknesses and points for clarification

      (1) Role of D2-SPNs: Since D1 and D2 pathways often show opposing roles in feeding, testing, or discussing D2-SPN contributions would provide an important control and context. Since the claim is that Stard5 is expressed in both D1- and D2MSNs, it seems to contradict the exclusive role of D1R MSNs in authorizing food intake.

      (2) Behavioral analyses:

      a) In Figure 2, group differences in consumption appear uneven; additional analyses (e.g., lick counts across blocks and session totals) would strengthen interpretation.

      b) The design and contribution of aversive assays to the main conclusions remain somewhat unclear and could be better justified.

      c) The scope of behavior is mainly limited to consumption; testing related domains (motivation, reward valuation, and extinction) could broaden the significance.

      (3) Molecular profiling:

      a) Stard5 expression is present in both D1- and D2-SPNs; comparisons to bulk calcium signals and quantification of percentages across rostral and caudal cells would be helpful. The authors should establish whether these cells also express SerpinB2, an established marker of LH projecting neurons.

      b) Verification of the Stard5-2A-Flp line (specificity, overlap with immunomarkers) should be documented more thoroughly.

      c) The molecular analysis is restricted to a small set of genes; broader spatial transcriptomics could uncover additional candidate markers. See also above.

    1. Reviewer #2 (Public review):

      Summary:

      This study analyzes muscle interactions in post-stroke patients undergoing rehabilitation, using information-theoretic and network analysis tools applied to sEMG signals with task performance measurements. The authors identified patterns of muscle interaction that correlate well with therapeutic measures and could potentially be used to stratify patients and better evaluate the effectiveness of rehabilitation.

      However, I found that the Methods and Materials section, as it stands, lacks sufficient detail and clarity for me to fully understand and evaluate the quality of the method. Below, I outline my main points of concern, which I hope the authors will address in a revision to improve the quality of the Methods section. I would also like to note that the methods appear to be largely based on a previous paper by the authors (O'Reilly & Delis, 2024), but I was unable to resolve my questions after consulting that work.

      I understand the general procedure of the method to be: (1) defining a connectivity matrix, (2) refining that matrix using network analysis methods, and (3) applying a lower-dimensional decomposition to the refined matrix, which defines the sub-component of muscle interaction. However, there are a few steps not fully explained in the text.

      (1) The muscle network is defined as the connectivity matrix A. Is each entry in A defined by the co-information? Is this quantity estimated for each time point of the sEMG signal and task variable? Given that there are only 10 repetitions of the measurement for each task, I do not fully understand how this is sufficient for estimating a quantity involving mutual information.

      In the previous paper (O'Reilly & Delis, 2024), the authors initially defined the co-information (Equation 1.3) but then referred to mutual information (MI) in the subsequent text, which I found confusing. In addition, while the matrix A is symmetrical, it should not be orthogonal (the authors wrote AᵀA = I) unless some additional constraint was imposed?

      (2) The authors should clarify what the following statement means: "Where a muscle interaction was determined to be net redundant/synergistic, their corresponding network edge in the other muscle network was set to zero."

      (3) It should be clarified what the 'm' values are in Equation 1.1. Are these the co-information values after the sparsification and applying the Louvain algorithm to the matrix 'A'? Furthermore, since each task will yield a different co-information value, how is the information from different tasks (r) being combined here?

      (4) In general, I recommend improving the clarity of the Methods section, particularly by being more precise in defining the quantities that are being calculated. For example, the adjacency matrix should be defined clearly using co-information at the beginning, and explain how it is changed/used throughout the rest of the section.

      (5) In the previous paper (O'Reilly & Delis, 2024), the authors applied a tensor decomposition to the interaction matrix and extracted both the spatial and temporal factors. In the current work, the authors simply concatenated the temporal signals and only chose to extract the spatial mode instead. The authors should clarify this choice.

    1. sur cette édition,

      "d'éditer précisément la petite pièce en deux scènes de La Conqueste de B**, précisément sur son privilège et son prologue" ? (pure suggestion, tu n'es bien sûr pas obligé de suivre)

  2. milenio-nudos.github.io milenio-nudos.github.io
    1. (ulfert-blank_assessing_2022?) suggests to work with a unified construct denominated Digital Self-efficacy (hereinafter DSE) to reach a high-level research on this issue. Considering the gaps and inconsistencies in previous measurements, (ulfert-blank_assessing_2022?) points out that DSE construct have to

      Creo que debemos hacer una distinción mejor para que se entienda esto (de partida ya estamos usando la abreviación DSE antes de esta parte). Como nos referimos a las anteriores escalas como mediciones de DSE y hasta el momento no eran escalas que en estricto rigor medían la DSE creo que lleva a la confusión. Diría autoeficacia asociada a la tecnología hasta este punto del paper, así quedaría algo más claro, ya que las anteriores escalas no miden lo mismo.

    1. ART TEST FEVER

      chrome-extension://bjfhmglciegochdpefhhlphglcehbmek/pdfjs/web/viewer.html?file=file%3A%2F%2F%2FUsers%2Fprestontaylor%2FDownloads%2F10.1515_9781400881321-005.pdf

    Annotators

    1. Necesario entender que el mismo testo no trata de dar una solución a un problema, más bien como en sí mismo propone; Es más importante diferenciar entre las propuestas y soluciones, de entre ellas, ¿Cual es la más acertada y de menos rechazo?

    2. El autor constantemente menciona la importancia de cuestionar y analizar de manera consiente la información que encontramos del tema. Y enfatiza lo repetitivo o cíclico que esto puede ser, pero, me gustaría que indagara en el proceso de formular esas cuestiones que ayudarán al investigador a centrarse en el tema.

    1. Author response:

      The following is the authors’ response to the original reviews

      Public Reviews

      Reviewer #1 (Public review):

      Summary:

      The authors performed genome assemblies for two Fagaceae species and collected transcriptome data from four natural tree species every month over two years. They identified seasonal gene expression patterns and further analyzed species-specific differences.

      Strengths:

      The study of gene expression patterns in natural environments, as opposed to controlled chambers, is gaining increasing attention. The authors collected RNA-seq data monthly for two years from four tree species and analyzed seasonal expression patterns. The data are novel. The authors could revise the manuscript to emphasize seasonal expression patterns in three species (with one additional species having more limited data). Furthermore, the chromosome-scale genome assemblies for the two Fagaceae species represent valuable resources, although the authors did not cite existing assemblies from closely related species.

      Thank you for your careful assessment of our manuscript.

      Weaknesses:

      Comment; The study design has a fundamental flaw regarding the evaluation of genetic or evolutionary effects. As a basic principle in biology, phenotypes, including gene expression levels, are influenced by genetics, environmental factors, and their interaction. This principle is well-established in quantitative genetics.

      In this study, the four species were sampled from three different sites (see Materials and Methods, lines 543-546), and additionally, two species were sampled from 2019-2021, while the other two were sampled from 2021-2023 (see Figure S2). This critical detail should be clearly described in the Results and Materials and Methods. Due to these variations in sampling sites and periods, environmental conditions are not uniform across species.

      Even in studies conducted in natural environments, there are ways to design experiments that allow genetic effects to be evaluated. For example, by studying co-occurring species, or through transplant experiments, or in common gardens. To illustrate the issue, imagine an experiment where clones of a single species were sampled from three sites and two time periods, similar to the current design. RNA-seq analysis would likely detect differences that could qualitatively resemble those reported in this manuscript.

      One example is in line 197, where genus-specific expression patterns are mentioned. While it may be true that the authors' conclusions (e.g., winter synchronization, phylogenetic constraints) reflect real biological trends, these conclusions are also predictable even without empirical data, and the current dataset does not provide quantitative support.

      If the authors can present a valid method to disentangle genetic and environmental effects from their dataset, that would significantly strengthen the manuscript. However, I do not believe the current study design is suitable for this purpose.

      Unless these issues are addressed, the use of the term "evolution" is inappropriate in this context. The title should be revised, and the result sections starting from "Peak months distribution..." should be either removed or fundamentally revised. The entire Discussion section, which is based on evolutionary interpretation, should be deleted in its current form.

      If the authors still wish to explore genetic or evolutionary analyses, the pair of L. edulis and L. glaber, which were sampled at the same site and over the same period, might be used to analyze "seasonal gene expression divergence in relation to sequence divergence." Nevertheless, the manuscript would benefit from focusing on seasonal expression patterns without framing the study in evolutionary terms.

      We sincerely thank the reviewer for the detailed and thoughtful comments. We fully recognize the importance of carefully distinguishing genetic and environmental contributions in transcriptomic studies, particularly when addressing evolutionary questions. The reviewer identified two major concerns regarding our study design: (1) the use of different monitoring periods across species, and (2) the use of samples collected from different study sites. We addressed both concerns with additional analyses using 112 new samples and now present new evidence that supports the robustness of our conclusions.

      (1) Monitoring period variation does not bias our conclusions<br /> To address concerns about the differing monitoring periods, we added new RNA-seq data (42 samples each for bud and leaf samples for L. glaber and 14 samples each for bud and leaf samples for _L. eduli_s) collected from November 2021 to November 2022, enabling direct comparison across species within a consistent timeframe. Hierarchical clustering of this expanded dataset (Fig. S6) yielded results consistent with our original findings: winter-collected samples cluster together regardless of species identity. This strongly supports our conclusion that the seasonal synchrony observed in winter is not an artifact of the monitoring period and demonstrates the robustness of our conclusions across datasets.

      (2) Site variation is limited and does not confound our findings<br /> Although the study included three sites, two of them (Imajuku and Ito Campus) are only 7.3 km apart, share nearly identical temperature profiles (see Fig. S2), and are located at the edge of similar evergreen broadleaf forests. Only Q. acuta was sampled from a higher-altitude, cooler site. To assess whether the higher elevation site of Q. acuta introduced confounding environmental effects, we reanalyzed the data after excluding this species. Hierarchical clustering still revealed that winter bud samples formed a distinct cluster regardless of species identity (Fig. S7), consistent with our original finding.

      Furthermore, we recalculated the molecular phenology divergence index D (Fig. 4C) and the interspecific Pearson’s correlation coefficients (Fig. 5A) without including Q. acuta. These analyses produced results that were similar to those obtained from the full dataset (Fig. S12; Fig. S14), indicating that the observed patterns are not driven by environmental differences associated with elevation.

      (3) Justification for our approach in natural systems<br /> We agree with the reviewer that experimental approaches such as common gardens, reciprocal transplants, and the use of co-occurring species are valuable for disentangling genetic and environmental effects. In fact, we have previously implemented such designs in studies using the perennial herb Arabidopsis halleri (Komoto et al., 2022, https://doi.org/10.1111/pce.14716) and clonal Someiyoshino cherry trees (Miyawaki-Kuwakado et al., 2024, https://doi.org/10.1002/ppp3.10548) to examine environmental effects on gene expression. However, extending these approaches to long-lived tree species in diverse natural ecosystems poses significant logistical and biological challenges. In this study, we addressed this limitation by including three co-occurring species at the same site, which allowed us to evaluate interspecific differences under comparable environmental conditions. Importantly, even when we limited our analyses to these co-occurring species, the results remained consistent, indicating that the observed variation in transcriptomic profiles cannot be attributed to environmental factors alone and likely reflects underlying genetic influences.

      Accordingly, we added four new figures (Fig. S6, Fig. S7, Fig. S12 and Fig. S14) and revised the manuscript to clarify the limitations and strengths of our design, to tone down the evolutionary claims where appropriate, and to more explicitly define the scope of our conclusions in light of the data. We hope that these efforts sufficiently address the reviewer’s concerns and strengthen the manuscript.

      To better support the seasonal expression analysis, the early RNA-seq analysis sections should be strengthened. There is little discussion of biological replicate variation or variation among branches of the same individual. These could be important factors to analyze. In line 137, the mapping rate for two species is mentioned, but the rates for each species should be clearly reported. One RNA-seq dataset is based on a species different from the reference genome, so a lower mapping rate is expected. While this likely does not hinder downstream analysis, quantification is important.

      We thank the reviewer 1 for the helpful comment. To evaluate the variation among biological replicates, we compared the expression level of each gene across different individuals. We observed high correlation between each pair of individuals (Q. glauca (n=3): an average correlation coefficient r = 0.947; Q. acuta (n=3): r = 0.948; L. glaber (n=3): r = 0.948)). This result suggests that the seasonal gene expression pattern is highly synchronized across individuals within the same species. We mentioned this point in the Result section in the revised manuscript. We also calculated the mean mapping rates for each species. As the reviewer expected, the mapping rate was slightly lower in Q. acuta (88.6 ± 2.3%) and L. glaber (84.3 ± 5.4%), whose RNA-Seq data were mapped to reference genomes of related but different species, compared to that in Q. glauca (92.6 ± 2.2%) and L. edulis (89.3 ± 2.7%). However, we minimized the impact of these differences on downstream analysis. These details have been included in the revised main text.

      In Figures 2A and 2B, clustering is used to support several points discussed in the Results section (e.g., lines 175-177). However, clustering is primarily a visualization method or a hypothesis-generating tool; it cannot serve as a statistical test. Stronger conclusions would require further statistical testing.

      We thank the reviewer for the helpful comment. As noted, we acknowledge that hierarchical clustering (Fig. 2A) is primarily a visualization and hypothesis-generating method. To assess the biological relevance of the clusters identified, we conducted a Mann-Whitney U test or the Steel-Dwass test to evaluate whether the environmental temperatures at the time of sample collection differed significantly among the clusters. This analysis (Fig. 2B) revealed statistically significant differences in temperature in the cluster B3 (p < 0.01), indicating that the gene expression clusters are associated with seasonal thermal variation. These results support the interpretation that the clusters reflect coordinated transcriptional responses to environmental temperature. We revised the Results section to clarify this point.

      The quality of the genome assemblies appears adequate, but related assemblies should be cited and discussed. Several assemblies of Fagaceae species already exist, including Quercus mongolica (Ai et al., Mol Ecol Res, 2022), Q. gilva (Front Plant Sci, 2022), and Fagus sylvatica (GigaScience, 2018), among others. Is there any novelty here? Can you compare your results with these existing assemblies?

      We agree that genome assemblies of Fagaceae species are becoming increasing available. However, our study does not aim to emphasize the novelty of the genome assemblies per se. Rather, with the increasing availability of chromosome-level genomes, we regard genome assembly as a necessary foundation for more advanced analyses. The main objective of our study is to investigate how each gene is expressed in response to seasonal environmental changes, and to link genome information with seasonal transcriptomic dynamics. To address the reviewer’s comment in line with this objective, we added a discussion on the syntenic structure of eight genome assemblies spanning four genera within the Fagaceae, including a species from the genus Fagus (Ikezaki et al. 2025, https://doi.org/10.1101/2025.07.31.667835). This addition helps to position our work more clearly within the context of existing genomic resources.

      Most importantly, Figure 1B-D shows synteny between the two genera but also indicates homology between different chromosomes. Does this suggest paleopolyploidy or another novel feature? These chromosome connections should be interpreted in the main text-even if they could be methodological artifacts.

      A previous study on genome size variation in Fagaceae suggested that, given the consistent ploidy level across the family, genome expansion likely occurred through relatively small segmental duplications rather than whole-genome duplications. Because Figure 1B-D supports this view, we cited the following reference in the revised version of the manuscript. Chen et al. (2014) https://doi.org/10.1007/s11295-014-0736-y

      In both the Results and Materials and Methods sections, descriptions of genome and RNA-seq data are unclear. In line 128, a paragraph on genome assembly suddenly introduces expression levels. RNA-seq data should be described before this. Similarly, in line 238, the sentence "we assembled high-quality reference genomes" seems disconnected from the surrounding discussion of expression studies. In line 632, Illumina short-read DNA sequencing is mentioned, but it's unclear how these data were used.

      We relocated the explanation regarding the expression levels of single-copy and multi-copy genes to the section titled “Seasonal gene expression dynamics.” Additionally, we clarified in the Materials and Methods section that short-read sequencing data were used for both genome size estimation and phylogenetic reconstruction.

      Reviewer #2 (Public review):

      Summary:

      This study explores how gene expression evolves in response to seasonal environments, using four evergreen Fagaceae species growing in similar habitats in Japan. By combining chromosome-scale genome assemblies with a two-year RNA-seq time series in leaves and buds, the authors identify seasonal rhythms in gene expression and examine both conserved and divergent patterns. A central result is that winter bud expression is highly conserved across species, likely due to shared physiological demands under cold conditions. One of the intriguing implications of this study is that seasonal cycles might play a role similar to ontogenetic stages in animals. The authors touch on this by comparing their findings to the developmental hourglass model, and indeed, the recurrence of phenological states such as winter dormancy may act as a cyclic form of developmental canalization, shaping expression evolution in a way analogous to embryogenesis in animals.

      Strengths:

      (1) The evolutionary effects of seasonal environments on gene expression are rarely studied at this scale. This paper fills that gap.

      (2) The dataset is extensive, covering two years, two tissues, and four tree species, and is well suited to the questions being asked.

      (3) Transcriptome clustering across species (Figure 2) shows strong grouping by season and tissue rather than species, suggesting that the authors effectively controlled for technical confounders such as batch effects and mapping bias.

      (4) The idea that winter imposes a shared constraint on gene expression, especially in buds, is well argued and supported by the data.

      (5) The discussion links the findings to known concepts like phenological synchrony and the developmental hourglass model, which helps frame the results.

      We are grateful for the reviewer for the detailed and thoughtful review of our manuscript.

      Weaknesses:

      (1) While the hierarchical clustering shown in Figure 2A largely supports separation by tissue type and season, one issue worth noting is that some leaf samples appear to cluster closely with bud samples. The authors do not comment on this pattern, which raises questions about possible biological overlap between tissues during certain seasonal transitions or technical artifacts such as sample contamination. Clarifying this point would improve confidence in the interpretation of tissue-specific seasonal expression patterns.

      Leaf samples clustered into the bud are newly flushed leaves collected in April for Q. glauca, May for Q. acuta, May and June for L. edulis, and August and September for L. glaber. To clarify this point, we highlighted these newly flushed leaf samples as asterisk in the revised figure (Fig. 2A).

      (2) While the study provides compelling evidence of conserved and divergent seasonal gene expression, it does not directly examine the role of cis-regulatory elements or chromatin-level regulatory architecture. Including regulatory genomic or epigenomic data would considerably strengthen the mechanistic understanding of expression divergence.

      We thank the reviewer for this insightful comment. As noted in the Discussion section, we hypothesize that such genome-wide seasonal expression patterns—and their divergence across species—are likely mediated by cis-regulatory elements and chromatin-level mechanisms. While a direct investigation of regulatory architecture was beyond the scope of the present study, we fully agree that incorporating regulatory genomic and epigenomic data would significantly deepen the mechanistic understanding of expression divergence. In this regard, we are currently working to identify putative cis-regulatory elements in non-coding regions and are collecting epigenetic data from the same tree species using ChIP-seq. We believe the current study provide a foundation for these future investigations into the regulatory basis of seasonal transcriptome variation. We made a minor revision to the Discussion to note that an important future direction is to investigate the evolution of non-coding sequences that regulate gene expression in response to seasonal environmental changes.

      (3) The manuscript includes a thoughtful analysis of flowering-related genes and seasonal GO enrichment (e.g., Figure 3C-D), providing an initial link between gene expression timing and phenological functions. However, the analysis remains largely gene-centric, and the study does not incorporate direct measurements of phenological traits (e.g., flowering or bud break dates). As a result, the connection between molecular divergence and phenotypic variation, while suggestive, remains indirect.

      We would like to note that phenological traits have been observed in the field on a monthly basis throughout the sampling period and the phenological data were plotted together with molecular phenology (e.g. Fig. 2A, C; Fig. 3C, D). Although the temporal resolution is limited, these observations captured species-specific differences in key phenological events such as leaf flushing and flowering times. We revised the manuscript to clarify this point.

      (4) Although species were sampled from similar habitats, one species (Q. acuta) was collected at a higher elevation, and factors such as microclimate or local photoperiod conditions could influence expression patterns. These potential confounding variables are not fully accounted for, and their effects should be more thoroughly discussed or controlled in future analyses.

      We fully agree with the reviewer that local environmental conditions, including microclimate and photoperiod differences, could potentially influence gene expression patterns. To assess whether the higher elevation site of Q. acuta introduced confounding environmental effects, we reanalyzed the data after excluding this species. Hierarchical clustering still revealed that winter bud samples formed a distinct cluster regardless of species identity (Fig. S7), consistent with our original finding.

      Furthermore, we recalculated the molecular phenology divergence index D (Fig. 4C) and the interspecific Pearson’s correlation coefficients (Fig. 5A) without including Q. acuta. These analyses produced results that were qualitatively similar to those obtained from the full dataset (Fig. S12; Fig. S14), indicating that the observed patterns are not driven by environmental differences associated with elevation.

      We believe these additional analyses help to decouple the effects of environment and genetics, and support our conclusion that both seasonal synchrony and phylogenetic constraints play key roles in shaping transcriptome dynamics. We added four new figures (Fig. S6, Fig. S7, Fig. S12 and Fig. S14) and revised the text accordingly to clarify this point and to acknowledge the potential impact of site-specific environmental variation.

      (5) Statistical and Interpretive Concerns Regarding Δφ and dN/dS Correlation (Figures 5E and 5F):

      a) Statistical Inappropriateness: Δφ is a discrete ordinal variable (likely 1-11), making it unsuitable for Pearson correlation, which assumes continuous, normally distributed variables. This undermines the statistical validity of the analysis.

      We thank the reviewer for the insightful comment. We would like to clarify that the analysis presented in Figures 5E and 5F was based on linear regression, not Pearson’s correlation. Although Δ_φ_ is a discrete variable, it takes values from 0 to 6 in 0.5 increments, resulting in 13 levels. We treated it as a quasi-continuous variable for the purposes of linear regression analysis. This approach is commonly adopted in practice when a discrete variable has sufficient resolution and ordering to approximate continuity. To enhance clarity, we revised the manuscript to explicitly state that linear regression was used, and we now reported the regression coefficient and associated p-value to support the interpretation of the observed trend.

      b) Biological Interpretability: Even with the substantial statistical power afforded by genome-wide analysis, the observed correlations are extremely weak. This suggests that the relationship, if any, between temporal divergence in expression and protein-coding evolution is negligible.

      Taken together, these issues weaken the case for any biologically meaningful association between Δφ and dN/dS. I recommend either omitting these panels or clearly reframing them as exploratory and statistically limited observations.

      We agree with the reviewer’s comment. While we retained the original panels, we reframed our interpretation to emphasize that, despite statistical significance, the observed correlation is very weak—suggesting that coding region variation is unlikely to be the primary driver of seasonal gene expression patterns. Accordingly, we revised the “Relating seasonal gene expression divergence to sequence divergence” section in the Results, as well as the relevant part of the Discussion.

      Recommendations for the authors:

      Reviewer #1 (Recommendations for the authors):

      Sentences around lines 250-251 are incomplete and need revision.

      We thank the reviewer for pointing this out. We revised the sentences in the subsection “Peak month distribution of rhythmic genes and intra-genus and inter-genera comparison” in the Results section to ensure clarity and completeness. In addition, to improve the interpretability of the peak month distribution, we added arrows to indicate the major peaks in the circular histograms shown in Fig. 3C and 3D.

      Reviewer #2 (Recommendations for the authors):

      (1) In Figure 1E-G, the term Copy number or Copy number variation could be misleading, as it is commonly associated with inter-individual gene copy number variation in a population. Since the analysis here refers to orthology relationships rather than population-level variation, a more precise term, such as orthogroup classification, may be preferable.

      We thank the reviewer for this helpful suggestion. We agree that the term “copy number” could be misleading in this context. Accordingly, we updated the labeling in Fig. 1 to reflect the more precise term “orthogroup classification.”

      (2) In Figure 3A, the x-axis label Period (month) may be misleading, as it could be mistaken for calendar months rather than referring to the periodicity of gene expression cycles. A more explicit label, such as Expression periodicity (months), might improve clarity for the reader.

      We thank the reviewer for this valuable suggestion. In the original version of Fig. 3A, we used the label “Period (month),” which could indeed be misinterpreted as referring to calendar months. To clarify that this axis represents the length of gene expression cycles, we revised the label to “Period length (months).” This change also aligns with the terminology used throughout the manuscript, where “Period” refers specifically to cycle length, and “Periodicity” denotes the presence or absence of rhythmic expression.

      Other minor revisions

      We also made minor revisions for the reference list and the grant number details, and included the accession numbers for all DNA and RNA sequence data deposited in the DNA Data Bank of Japan (DDBJ) in the Data deposition and code availability section, in addition to the BioProject ID.

    1. Author Response:

      The following is the authors’ response to the current reviews.

      Reviewer #1 (Public review):

      The scale bar for fly and ovary images should be included in Figures 9, 10, and 12.

      We agree with this comment and apologize for the oversight. We have now modified Figures 9, 10, and 12 to include the scale bars for the ovary images. The fly images were acquired using a stereo microscope where scale bar calculation was not possible. However, all images were acquired at the same magnification for consistency.

      Reviewer #2 (Public review):

      A weakness of this paper is the phylogenetic analysis to investigate if there is correspondence in the phylogenetic distribution of ITP-type and Gyc76C-type genes/proteins. Unfortunately, the evidence presented is rather limited in scope. Essentially, the authors report that they only found ITP-type and Gyc76C-type genes/proteins in protostomes, but not in deuterostomes. What is needed is a more fine-grained analysis at the species level within the protostomes. However, I recognise that such a detailed analysis may extend beyond the scope of this paper, which is already rich in data.

      We thank the reviewer for their comment and the suggestion to perform a fine-grained species level comparison of ITP and Gyc76C genes across protostomes. We are unsure of the utility of this analysis for the present study given that we have now shown that ITPa can activate Gyc76C using both an ex vivo and a heterologous assay, the latter being the gold standard in GPCR and guanylate cyclase discovery (see Huang et al 2025 https://doi.org/10.1073/pnas.2420966122; Beets et al 2023 https://doi.org/10.1016/j.celrep.2023.113058); Chang et al 2009 https://doi.org/10.1073/pnas.0812593106.

      Additionally, absence of a gene in a genome/proteome is hard to prove especially when many/most of the protostomian datasets are not as high-quality as those of model systems (e.g. Drosophila melanogaster and Caenorhabditis elegans). Secondly, based on previous findings in Bombyx mori (Nagai et al. 2014 https://doi.org/10.1074/jbc.m114.590646 and Nagai et al. 2016 https://doi.org/10.1371/journal.pone.0156501) and Drosophila (Xu et al. 2023 https://doi.org/10.1038/s41586-023-06833-8 and our study) it is evident that different products of the ITP gene (ITPa and ITPL) could signal via different receptor types depending on the species. Hence, we would need to explore the presence of several genes (ITP, tachykinin, pyrokinin, tachykinin receptor, pyrokinin receptor, CG30340 orphan receptor and Gyc76C) to fully understand which components of these diverse signaling systems are present in a given species to decipher the potential for cross-talk.

      While this species-level comparison will certainly be useful in the context of ITP-Gyc76C evolution, it will not alter the conclusions of the present study – ITPa acts via Gyc76C in Drosophila. We therefore agree with the reviewer that these analyses are beyond the scope of this paper.


      The following is the authors’ response to the original reviews.

      Reviewer #1 (Public Review):  

      Summary:  

      In Drosophila melanogaster, ITP has functions on feeding, drinking, metabolism, excretion, and circadian rhythm. In the current study, the authors characterized and compared the expression of all three ITP isoforms (ITPa and ITPL1&2) in the CNS and peripheral tissues of Drosophila. An important finding is that they functionally characterized and identified Gyc76C as an ITPa receptor in Drosophila using both in vitro and in vivo approaches. In vitro, the authors nicely confirmed that the inhibitory function of recombinant Drosophila ITPa on MT secretion is Gyc76C-dependent (knockdown Gyc76C specifically in two types of cells abolished the anti-diuretic action of Drosophila ITPa on renal tubules). They also used a combination of multiple approaches to investigate the roles of ITPa and Gyc76C on osmotic and metabolic homeostasis modulation in vivo. They revealed that ITPa signaling to renal tubules and fat body modulates osmotic and metabolic homeostasis via Gyc76C.  

      Furthermore, they tried to identify the upstream and downstream of ITP neurons in the nervous system by using connectomics and single-cell transcriptomic analysis. I found this interesting manuscript to be well-written and described. The findings in this study are valuable to help understand how ITP signals work on systemic homeostasis regulation. Both anatomical and single-cell transcriptome analysis here should be useful to many in the field. 

      We thank this reviewer for the positive and thorough assessment of our manuscript.  

      Strengths:  

      The question (what receptors of ITPa in Drosophila) that this study tries to address is important. The authors ruled out the Bombyx ITPa receptor orthologs as potential candidates. They identified a novel ITP receptor by using phylogenetic, anatomical analysis, and both in vitro and in vivo approaches. 

      The authors exhibited detailed anatomical data of both ITP isoforms and Gyc76C (in the main and supplementary figures), which helped audiences understand the expression of the neurons studied in the manuscript.  

      They also performed connectomes and single-cell transcriptomics analysis to study the synaptic and peptidergic connectivity of ITP-expressing neurons. This provided more information for better understanding and further study on systemic homeostasis modulation.  

      Weaknesses:  

      In the discussion section, the authors raised the limitations of the current study, which I mostly agree with, such as the lack of verification of direct binding between ITPa and Gyc76C, even though they provided different data to support that ITPa-Gyc76C signaling pathway regulates systemic homeostasis in adult flies. 

      We now provide evidence of Gyc76C activation by ITPa in a heterologous system (new Figure 7 and Figure 7 Supplement 1).

      Reviewer #2 (Public Review):  

      Summary:  

      The physiology and behaviour of animals are regulated by a huge variety of neuropeptide signalling systems. In this paper, the authors focus on the neuropeptide ion transport peptide (ITP), which was first identified and named on account of its effects on the locust hindgut (Audsley et al. 1992). Using Drosophila as an experimental model, the authors have mapped the expression of three different isoforms of ITP (Figures 1, S1, and S2), all of which are encoded by the same gene.  

      The authors then investigated candidate receptors for isoforms of ITP. Firstly, Drosophila orthologs of G-protein coupled receptors (GPCRs) that have been reported to act as receptors for ITPa or ITPL in the insect Bombyx mori were investigated. Importantly, the authors report that ITPa does not act as a ligand for the GPCRs TkR99D and PK2-R1 (Figure S3). Therefore, the authors investigated other putative receptors for ITPs. Informed by a previously reported finding that ITP-type peptides cause an increase in cGMP levels in cells/tissues (Dircksen, 2009, Nagai et al., 2014), the authors investigated guanylyl cyclases as candidate receptors for ITPs. In particular, the authors suggest that Gyc76C may act as an ITP receptor in Drosophila.  

      Evidence that Gyc76C may be involved in mediating effects of ITP in Bombyx was first reported by Nagai et al. (2014) and here the authors present further evidence, based on a proposed concordance in the phylogenetic distribution ITP-type neuropeptides and Gyc76C (Figure 2). Having performed detailed mapping of the expression of Gyc76C in Drosophila (Figures 3, S4, S5, S6), the authors then investigated if Gyc76C knockdown affects the bioactivity of ITPa in Drosophila. The inhibitory effect of ITPa on leucokinin- and diuretic hormone-31-stimulated fluid secretion from Malpighian tubules was found to be abolished when expression of Gyc76C was knocked down in stellate cells and principal cells, respectively (Figure 4). However, as discussed below, this does not provide proof that Gyc76C directly mediates the effect of ITPa by acting as its receptor. The effect of Gyc76C knockdown on the action of ITPa could be an indirect consequence of an alteration in cGMP signalling.  

      Having investigated the proposed mechanism of ITPa in Drosophila, the authors then investigated its physiological roles at a systemic level. In Figure 5 the authors present evidence that ITPa is released during desiccation and accordingly, overexpression of ITPa increases survival when animals are subjected to desiccation. Furthermore, knockdown of Gyc76C in stellate or principal cells of Malphigian tubules decreases survival when animals are subject to desiccation. However, whilst this is correlative, it does not prove that Gyc76C mediates the effects of ITPa. The authors investigated the effects of knockdown of Gyc76C in stellate or principal cells of Malphigian tubules on i). survival when animals are subject to salt stress and ii). time taken to recover from of chill coma. It is not clear, however, why animals overexpressing ITPa were also not tested for its effect on i). survival when animals are subject to salt stress and ii). time taken to recover from of chill coma. In Figures 6 and S8, the authors show the effects of Gyc76C knockdown in the female fat body on metabolism, feeding-associated behaviours and locomotor activity, which are interesting. Furthermore, the relevance of the phenotypes observed to potential in vivo actions of ITPa is explored in Figure 7. The authors conclude that "increased ITPa signaling results in phenotypes that largely mirror those seen following Gyc76C knockdown in the fat body, providing further support that ITPa mediates its effects via Gyc76C." Use of the term "largely mirror" seems inappropriate here because there are opposing effects- e.g. decreased starvation resistance in Figure 6A versus increased starvation resistance in Figure 7A. Furthermore, as discussed above, the results of these experiments do not prove that the effects of ITPa are mediated by Gyc76C because the effects reported here could be correlative, rather than causative. 

      We thank this reviewer for an extremely thorough and fair assessment of our manuscript. 

      We have now performed salt stress tolerance and chill coma recovery assays using flies over-expressing ITPa (new Figure 10 Supplement 1).

      We agree that the use of the term “largely mirrors” to describe the effects of ITPa overexpression and Gyc76C knockdown is not appropriate and have changed this sentence. We also agree that the experiments did not provide direct evidence that the effects of ITPa are mediated by Gyc76C. To address this, we now provide evidence of Gyc76C activation by ITPa in a heterologous system (new Figure 7 and Figure 7 Supplement 1).

      Lastly, in Figures 8, S9, and S10 the authors analyse publicly available connectomic data and single-cell transcriptomic data to identify putative inputs and outputs of ITPa-expressing neurons. These data are a valuable addition to our knowledge ITPa expressing neurons; but they do not address the core hypothesis of this paper - namely that Gyc76C acts as an ITPa receptor.  

      The goal of our study was to comprehensively characterize an anti-diuretic system in Drosophila. Hence, in addition to identifying the receptor via which ITPa exerts its effects, we also wanted to understand how ITPa-producing neurons are regulated. Connectomic and single-cell transcriptomic analyses are highly appropriate for this purpose. We have now updated the connectomic analyses using an improved connectome dataset that was released during the revision of this manuscript. Our new analysis shows that lNSC<sup>ITP</sup> are connected to other endocrine cells that produce other homeostatic hormones (new Figure 13F). We also identify a pathway through which other ITP-producing neurons (LNd<sup>ITP</sup>) receive hygrosensory inputs to regulate water seeking behavior (new Figure 13E). Moreover, we now include results which showcase that ITPa-producing neurons (l-NSC<sup>ITP</sup>) are active (new Figure 8A and B) and release ITPa under desiccation. Together with other analyses, these data provide a comprehensive outlook on the when, what and how ITPa regulates systemic homeostasis.  

      Strengths:  

      (1) The main strengths of this paper are i) the detailed analysis of the expression and actions of ITP and the phenotypic consequences of overexpression of ITPa in Drosophila. ii). the detailed analysis of the expression of Gyc76C and the phenotypic consequences of knockdown of Gyc76C expression in Drosophila.  

      (2) Furthermore, the paper is generally well-written and the figures are of good quality. 

      We thank this reviewer for highlighting the strengths of this manuscript.

      Weaknesses:  

      (1) The main weakness of this paper is that the data obtained do not prove that Gyc76C acts as a receptor for ITPa. Therefore, the following statement in the abstract is premature: "Using a phylogenetic-driven approach and the ex vivo secretion assay, we identified and functionally characterized Gyc76C, a membrane guanylate cyclase, as an elusive Drosophila ITPa receptor." Further experimental studies are needed to determine if Gyc76C acts as a receptor for ITPa. In the section of the paper headed "Limitations of the study", the authors recognise this weakness. They state "While our phylogenetic analysis, anatomical mapping, and ex vivo and in vivo functional studies all indicate that Gyc76C functions as an ITPa receptor in Drosophila, we were unable to verify that ITPa directly binds to Gyc76C. This was largely due to the lack of a robust and sensitive reporter system to monitor mGC activation." It is not clear what the authors mean by "the lack of a robust and sensitive reporter system to monitor mGC activation". The discovery of mGCs as receptors for ANP in mammals was dependent on the use of assays that measure GC activity in cells (e.g. by measuring cGMP levels in cells). Furthermore, more recently cGMP reporters have been developed. The use of such assays is needed here to investigate directly whether Gyc76C acts as a receptor for ITPa. In summary, insufficient evidence has been obtained to conclude that Gyc76C acts as a receptor for ITPa. Therefore, I think there are two ways forward, either:  

      (a) The authors obtain additional biochemical evidence that ITPa is a ligand for Gyc76C.  

      or  

      (b) The authors substantially revise the conclusions of the paper (in the title, abstract, and throughout the paper) to state that Gyc76C MAY act as a receptor for ITPa, but that additional experiments are needed to prove this. 

      We thank the reviewer for this comment and agree with the two options they propose. We had previously tried different a cGMP reporter (Promega GloSensor cGMP assay) to monitor activation of Gyc76C by ITPa in a heterologous system. Unfortunately, we were not successful in monitoring Gyc76C activation by ITPa. We now utilized another cGMP sensor, Green cGull, to show that ITPa can indeed activate Gyc76C heterologously expressed in HEK cells (new Figure 7 and Figure 7 Supplement 1). However, we still cannot rule out the possibility that ITPa can act on additional receptors in vivo. This is based on our ex vivo Malpighian tubule assays (new Figure 6E and F). ITPa inhibits DH31- and LK-stimulated secretion and we show that this effect is abolished in Gyc76C knockdown specifically in principal and stellate cells, respectively. Interestingly, application of ITPa alone can stimulate secretion when Gyc76C is knocked down in principal cells (new Figure 6E). This could be explained by: 1) presence of another receptor for ITPa which results in diuretic actions and/or 2) low Gyc76C signaling activity (RNAi based knockdown lowers signaling but does not abolish it completely) could alter other intracellular messenger pathways that promote secretion. We have added text to indicate the possibility of other ITPa receptors. Nonetheless, our conclusions are supported by the heterologous assay results which indicate that ITPa can activate Gyc76C. Therefore, we do not alter the title. 

      (2) The authors state in the abstract that a phylogenetic-driven approach led to their identification of Gyc76C as a candidate receptor for ITPa. However, there are weaknesses in this claim. Firstly, because the hypothesis that Gyc76C may be involved in mediating effects of ITPa was first proposed ten years ago by Nagai et al. 2014, so this surely was the primary basis for investigating this protein. Nevertheless, investigating if there is correspondence in the phylogenetic distribution of ITP-type and Gyc76C-type genes/proteins is a valuable approach to addressing this issue. Unfortunately, the evidence presented is rather limited in scope. Essentially, the authors report that they only found ITP-type and Gyc76C-type genes/proteins in protostomes, but not in deuterostomes. What is needed is a more fine-grained analysis at the species level within the protostomes. Thus, are there protostome species in which both ITP-type and Gyc76C-type genes/proteins have been lost? Furthermore, are there any protostome species in which an ITP-type gene is present but an Gyc76C-type gene is absent, or vice versa? If there are protostome species in which an ITP-type gene is present but a Gyc76C-type gene is absent or vice versa, this would argue against Gyc76C being a receptor for ITPa. In this regard, it is noteworthy that in Figure 2A there are two ITP-type precursors in C. elegans, but there are no Gyc76Ctype proteins shown in the tree in Figure 2B. Thus, what is needed is a more detailed analysis of protostomes to investigate if there really is correspondence in the phylogenetic distribution of Gyc76C-type and ITP-type genes at the species level. 

      We thank the reviewer for this comment. While the previous study by Nagai et al had implicated Gyc76C in the ITP signaling pathway, how they narrowed down Gyc76C as a candidate was not reported. Therefore, our unbiased phylogenetic approach was necessary to ensure that we identified all suitable candidate receptors. Indeed, our phylogenetic analysis also identified Gyc32E as another candidate ITP receptor. However, we did not pursue this receptor further as our expression data (new Figure 4 Supplement 2) indicated that Gyc32E is not expressed in osmoregulatory tissues and therefore likely does not mediate the osmotic effects of ITPa. 

      We also appreciate the suggestion to perform a more detailed phylogenetic analysis for the peptide and receptor. We did not include C. elegans receptors in the phylogenetic analysis because they tend to be highly evolved and routinely cause long-branch attraction (see: Guerra and Zandawala 2024: https://doi.org/10.1093/gbe/evad108). We (specifically the senior author) have previously excluded C. elegans receptors in the phylogenetic analysis of GnRH and Corazonin receptors for similar reasons (see: Tian and Zandawala et al. 2016: 10.1038/srep28788). 

      Unfortunately, absence of a gene in a genome is hard to prove especially when they are not as high-quality as the genomes of model systems (e.g. Drosophila and mice). Moreover, given the concern of this reviewer that our physiological and behavioral data on ITPa and Gyc76C only provide correlative evidence, we decided against performing additional phylogenetic analysis which also provides correlative evidence. Our only goal with this analysis was to identify a candidate ITPa receptor. Since we have now functionally characterized this receptor using a heterologous system, we feel that the current phylogenetic analysis was able to successfully serve its purpose.  

      (3) The manuscript would benefit from a more comprehensive overview and discussion of published literature on Gyc76C in Drosophila, both as a basis for this study and for interpretation of the findings of this study.  

      We thank the reviewer for this comment. We have now included a broader discussion of Gyc76C based on published literature.  

      Reviewer #3 (Public Review):  

      Summary:  

      The goal of this paper is to characterize an anti-diuretic signaling system in insects using Drosophila melanogaster as a model. Specifically, the authors wished to characterize a role of ion transport peptide (ITP) and its isoforms in regulating diverse aspects of physiology and metabolism. The authors combined genetic and comparative genomic approaches with classical physiological techniques and biochemical assays to provide a comprehensive analysis of ITP and its role in regulating fluid balance and metabolic homeostasis in Drosophila. The authors further characterized a previously unrecognized role for Gyc76C as a receptor for ITPa, an amidated isoform of ITP, and in mediating the effects of ITPa on fluid balance and metabolism. The evidence presented in favor of this model is very strong as it combines multiple approaches and employs ideal controls. Taken together, these findings represent an important contribution to the field of insect neuropeptides and neurohormones and have strong relevance for other animals. 

      We thank this reviewer for the positive and thorough assessment of our manuscript.

      Strengths:  

      Many approaches are used to support their model. Experiments were wellcontrolled, used appropriate statistical analyses, and were interpreted properly and without exaggeration.  

      Weaknesses:  

      No major weaknesses were identified by this reviewer. More evidence to support their model would be gained by using a loss-of-function approach with ITPa, and by providing more direct evidence that Gyc76C is the receptor that mediates the effects of ITPa on fat metabolism. However, these weaknesses do not detract from the overall quality of the evidence presented in this manuscript, which is very strong.  

      We agree with this reviewer regarding the need to provide additional evidence using a loss-of-function approach with ITPa. We now characterize the phenotypes following knockdown of ITP in ITP-producing cells (new Figure 9). Our results are in agreement with phenotypes observed following Gyc76C knockdown, lending further support that ITPa mediates its effects via Gyc76C. Unfortunately, we are not able to provide evidence that ITPa acts on Gyc76C in the fat body using the assay suggested by this reviewer (explained in detail below). Instead, we now provide direct evidence of Gyc76C activation by ITPa in a heterologous system (new Figure 7 and Figure 7 Supplement 1).

      Reviewer #1 (Recommendations For The Authors):  

      Here, I have several extra concerns about the work as below:  

      (1) The authors confirmed the function of ITPa in regulating both osmotic and metabolic homeostasis by specifically overexpressing ITPa driven by ITP-RCGal4 in adult flies (Figures. 5 and 7). Have authors ever tried to knock down ITP in ITP-RC-Gal4 neurons? What was the phenotype? Especially regarding the impact on metabolic homeostasis, does knocking down ITP in ITP neurons mimic the phenotypes of Gyc76C fat body knockdown flies? 

      We thank the reviewer for this suggestion. We now characterize the phenotypes following knockdown of ITP using ITP-RC-Gal4 (new Figure 9). Our results are in agreement with phenotypes observed following Gyc76C knockdown, lending further support that ITPa mediates its effects via Gyc76C.

      The authors mentioned that the existing ITP RNAi lines target all three isoforms. It would be interesting if the authors could overexpress ITPa in ITPRC-Gal4>ITP-RNAi flies and confirm whether any phenotypes induced by ITP knockdown could be rescued. It will further confirm the role of ITPa in homeostasis regulation.  

      We thank the reviewer for this suggestion. Unfortunately, this experiment is not straightforward because knockdown with ITP RNAi does not completely abolish ITP expression (see Figure 9A). Hence, the rescue experiment needs to be ideally performed in an ITP mutant background. However, ITP mutation leads to developmental lethality (unpublished observation) so we cannot generate all the flies necessary for this experiment. Therefore, we cannot perform the rescue experiments at this time. In future studies, we hope to perform knockdown of specific ITP isoforms using the transgenes generated here (Xu et al 2023: 10.1038/s41586-023-06833-8).   

      (2) In Figures 5A and B, the authors nicely show the increased release of ITPa under desiccation by quantifying the ITPa immunolabelling intensity in different neuronal populations. It may be induced by the increased neuronal activity of ITPa neurons under the desiccated condition. Have the authors confirmed whether the activity of ITPa-expressing neurons is impacted by desiccation?  

      The TRIC system may be able to detect the different activity of those neurons before and after desiccation. This may further explain the reduced ITPa peptide levels during desiccation.  

      We thank the reviewer for this suggestion. We have now monitored the activity of ITPa-expressing neurons using the CaLexA system (Masuyama et al 2012: 10.3109/01677063.2011.642910). Our results indicate that ITPa neurons are indeed active under desiccation (new Figure 8A and B). These results are also in agreement with ITPa immunolabelling showing increased peptide release during desiccation (new Figure 8C and D). Together, these results show that ITPa neurons are activated and release ITPa under desiccation.  

      (3) What about the intensity of ITPa immunolabelling in other ITPa-positive neurons (e.g., VNC) under desiccation? If there is no change in other ITPa neurons, it will be a good control. 

      We thank the reviewer for this suggestion. Unfortunately, ITPa immunostaining in VNC neurons is extremely weak preventing accurate quantification of ITPa levels under different conditions. We did hypothesize that ITPa immunolabelling in clock neurons (5<sup>th</sup>-LN<sub>v</sub> and LN<Sub>d</sub><sup>ITP</sup>) would not change depending on the osmotic state of the animal. However, our results (Figure 8C and D) indicate that ITPa from these neurons is also released under desiccation. Interestingly, LNd<sup>ITP</sup>, which also coexpress Neuropeptide F (NPF) have recently been implicated in water seeking during thirst (Ramirez et al, 2025: 10.1101/2025.07.03.662850). Our new connectomic-driven analysis shows that these neurons can receive thermo/hygrosensory inputs (new Figure 13E). Hence, it is conceivable that other ITPa-expressing neurons also release ITPa during thirst/desiccation.

      (4) The adult stage, specifically overexpression of ITPa in ITP neurons, does show significant phenotypes compared to controls in both osmotic and metabolic homeostasis-related assays. It would be helpful if authors could show how much ITPa mRNA levels are increased in the fly heads with ITPa overexpression (under desiccation & starvation or not). 

      We thank the reviewer for this suggestion. We have now included immunohistochemical evidence showing increase in ITPa peptide levels in flies with ITPa overexpression (new Figure 10A). We feel that this is a better indicator of ITPa signaling level instead of ITPa mRNA levels.   

      (5) Another question concerns the bloated abdomens of ITPa-overexpressing flies. Are the bloated abdomens of ITPa OE female flies (Figure 5E) due to increased ovary size (Figure 7G)? Have the authors also detected similar bloated abdomens in male flies with ITPa overexpression? Since both male and female flies show more release of ITPa during the desiccation.  

      We thank the reviewer for this comment. The bloated abdomen phenotype seen in females can be attributed to increased water content since we see a similar phenotype in males (see Author response image 1 below).

      Author response image 1.

      Reviewer #2 (Recommendations For The Authors):  

      (1) Page 1 - change "Homeostasis is obtained by" to "Homeostasis is achieved by".  

      Changed

      (2) Page 1 - change "Physiological responses" to "Physiological processes". 

      Changed

      (3) Page 2 - Change "Recently, ITPL2 was also shown to mediate anti-diuretic effects via the tachykinin receptor" to "Recently, ITPL2 was also shown to exert anti-diuretic effects via the tachykinin receptor". 

      Changed

      (4) Page 9 - "(C) Adult-specific overexpression of ITPa using ITP- RC-GAL4TS (ITP-RC-T2A-GAL4 combined with temperature-sensitive tubulinGAL80) increases desiccation" Unless I am misunderstanding Fig 5C, I think what is shown is that overexpression of ITPa prolongs survival during a period of desiccation. I am not sure what the authors mean by "increases desiccation". In the text (page 9) the authors state "ITPa overexpression improves desiccation tolerance, which is a much clearer statement than what is in the figure legend. 

      We thank the reviewer for identifying this oversight. We have now changed the caption to “increases desiccation tolerance”.  

      (5) Page 11 - The authors conclude that "increased ITPa signaling results in phenotypes that largely mirror those seen following Gyc76C knockdown in the fat body, providing further support that ITPa mediates its effects via Gyc76C." Use of the term "largely mirror" seems inappropriate here because there are opposing effects- e.g. decreased starvation resistance in Figure 6A versus increased starvation resistance in Figure 7A.  

      Perhaps there is a misunderstanding of what is meant by "mirroring" - it means the same, not the opposite. 

      We thank the reviewer for this comment. We agree that the use of the term “largely mirrors” to describe the effects of ITPa overexpression and Gyc76C knockdown is not appropriate and have changed this sentence as follows: “Taken together, the phenotypes seen following Gyc76C knockdown in the fat body largely mirror those seen following ITP knockdown in ITP-RC neurons, providing further support that ITPa mediates its effects via Gyc76C.”

      (6) Page 12 - There appear to be words missing between "neurons during desiccation, as well as their downstream" and "the recently completed FlyWire adult brain connectome" 

      We thank the reviewer for highlighting this mistake. We have changed the sentence as following: “Having characterized the functions of ITP signaling to the renal tubules and the fat body, we wanted to identify the factors and mechanisms regulating the activity of ITP neurons during desiccation, as well as their downstream neuronal pathways. To address this, we took advantage of the recently completed FlyWire adult brain connectome (Dorkenwald et al., 2024, Schlegel et al., 2024) to identify pre- and post-synaptic partners of ITP neurons.”

      (7) Page 15 - "can release up to a staggering 8 neuropeptides" - I suggest that the word "staggering" is removed. The notion that individual neurons release many neuropeptides is now widely recognised (both in vertebrates and invertebrates) based on analysis of single-cell transcriptomic data. 

      Removed staggering.

      (8) Page 16 - "(Farwa and Jean-Paul, 2024)" - this citation needs to be added to the reference list and I think it needs to be changed to "Sajadi and Paluzzi, 2024". 

      We thank the reviewer for highlighting this oversight. The correct citation has now been added.

      (9) It is noteworthy that, based on a PubMed search, there are at least thirteen published papers that report on Gyc76C in Drosophila (PMIDs: 34988396, 32063902, 27642749, 26440503, 24284209, 23862019, 23213443,  21893139, 21350862, 16341244, 15485853, 15282266, 7706258). However, none of these papers are discussed/cited by the authors. This is surprising because the authors' hypothesis that Gyc76C acts as a receptor for ITPa surely needs to be evaluated and discussed with reference to all the published insights into the developmental/physiological roles of this protein. 

      We thank the reviewer for this comment. Some of the references mentioned above (21350862, 16341244, 15485853) mainly report on soluble guanylyl cyclases and not membrane guanylyl cyclase like Gyc76C. Based on other studies on Gyc76C and its role in immunity and development, we have now expanded the discussion on additional roles of ITPa.

      Reviewer #3 (Recommendations For The Authors):  

      I have only a few comments that will help the authors strengthen a couple of aspects of their model.  

      (1) The case for Gyc76C as a receptor for ITPa in regulating fluid homeostasis is clear, given the experiments the authors carried out where they applied ITPa to tubules and showed that the effects of ITPa on tubule secretion were blocked if Gyc76C was absent in tubules. This approach, or something similar, should be used to provide conclusive proof that ITPa's metabolic effects on the fat body go through Gyc76C.  

      At present (unless I missed it) the authors only show that gain of ITPa has the opposite phenotype to fat body-specific loss of Gyc76C. While this would be the expected result if ITPa/Gyc76C is a ligand-receptor pair, it is not quite sufficient to conclusively demonstrate that Gyc76C is definitely the fat body receptor. Ex vivo experiments such as soaking the adult fat body carcasses with and without Gyc76C in ITPa and monitoring fat content via Nile Red could be one way to address this lack of direct evidence. The authors could also make text changes to explicitly mention this lack of conclusive evidence and suggest it as a future direction.

      We thank the reviewer for this comment. We have now conclusively demonstrated that Gyc76C is activated by ITPa in a heterologous assay (new Figure 7 and Figure 7 Supplement 1). With this evidence, we can confidently claim that ITPa can mediate its actions via Gyc76C in various tissues including the Malpighian tubules and fat body. Nonetheless, we liked the suggestion by this reviewer to perform the ex vivo assay and test the effect of ITPa on the fat body. Unfortunately, it is challenging to do this because increased ITPa signaling (chronically using ITPa overexpression) results in increased lipid accumulation in the fat body in vivo. Therefore, we would likely not see the effect of ITPa addition in an ex vivo fat body preparation since lipogenesis will not occur in the absence of glucose. However, ITPa could counteract the effects of other lipolytic factors such as adipokinetic hormone (AKH). To test this hypothesis, we monitored fat content in the fat body incubated with and without AKH (see Author response image 2 below showing representative images from this experiment). Since we did not observe any differences in fat levels between these two conditions, we were unable to test the effects of ITPa on AKH-activity using this assay.

      Author response image 2.

      (2) I did not see any loss of function data for ITPa - is this possible? If so this would strengthen the case for a 1:1 relationship between loss of ligand and loss of receptor. Alternatively, the authors could suggest this as an important future direction. 

      We agree with this reviewer regarding the need to provide additional evidence using a loss-of-function approach with ITPa. We have now characterized the phenotypes following knockdown of ITP in ITP-producing cells (new Figure 9). Our results are in agreement with phenotypes observed following Gyc76C knockdown, lending further support that ITPa mediates its effects via Gyc76C.

      (3) For clarity, please include the sex of all animals in the figure legend. Even though the methods say 'females used unless otherwise indicated' it is still better for the reader to know within the figure legend what sex is displayed. 

      We thank the reviewer for this suggestion and have now included sex of the animals in the figure legends.  

      (4) Please state whether females are mated or not, as this is relevant for taste preferences and food intake. 

      We apologize for this oversight. We used mated females for all experiments. This has now been included in the methods.  

      (5) More discussion on the previous study on metabolic effects of ITP in this study compared with past studies would help readers appreciate any similarities and/or differences between this study and past work (Galikova 2018, 2022) 

      We thank the reviewer for this suggestion. Unfortunately, it is difficult to directly compare our phenotypes with the metabolic effects of ITP reported in Galikova and Klepsatel 2022 because the previous study used a ubiquitous driver (Da-GAL4) to manipulate ITP levels. Ectopically overexpressing ITPa in non-ITP producing cells can result in non-physiological phenotypes. This is evident in their metabolic measurements where both global overexpression and knockdown of ITP results in reduced glycogen and fat levels, and starvation tolerance. Moreover, ITP-RC-GAL4 used in our study to overexpress and knockdown ITPa is more specific than the Da-GAL4 used previously. Da-GAL4 would include other ITP cells (e.g. ITP-RD producing cells). Since ITP is broadly expressed across the animal, it is difficult to parse out the phenotypes of ITPa and other isoforms using manipulations performed with Da-GAL4. We have mentioned this limitation in the results for ITP knockdown as follows: “A previous study employing ubiquitous ITP knockdown and overexpression suggests that Drosophila ITP also regulates feeding and metabolic homeostasis (Galikova and Klepsatel, 2022) in addition to osmotic homeostais (Galikova et al., 2018). However, given the nature of the genetic manipulations (ectopic ITPa overexpression and knockdown of ITP in all tissues) utilized in those studies, it is difficult to parse the effects of ITP signaling from ITPa-producing neurons.”

    1. Author response:

      The following is the authors’ response to the original reviews

      We would like to thank the reviewers for taking the time to thoroughly revise our work. We have considered their suggestions carefully and tried our best to respond to them point by point. Based on their recommendations, two major issues came forward: (1) the strength of our claims about the involvement of cohesin in HR-driven repair in late mitosis; and (2) the underlying mechanism that reconstitutes cohesin in late mitosis after DNA damage. In this revision, we focused on the former and left the latter out (yet it is discussed). We considered that the question of how cohesin returns in late mitosis after DNA damage is important and worthy of further research, but it is beyond the scope of this study (as it is the putative role of condensin). Thus, we have focused on buttressing our main claims, as otherwise pointed out by the reviewers. What have we done to strengthen the role of cohesin in late mitotic DSB repair?

      (1) We have biologically replicated and quantified the reappearance of Scc1 after DSB generation (new Figure 1e). We have also quantified changes for the other core subunits (new Figure 1c-e).

      (2) We now show that the newly synthetized Scc1 serves to assemble back the cohesin complex (new Figure 2a and S1).

      (3) We have performed chromatin fractionation and show that cohesin binding to chromatin increases after the HO-induced DSB (new Figure 2b and S2).

      (4) We have performed ChIP assays and show that, despite the increase in the chromatin-bound fraction, the HOcs DSB does not recruit new cohesin to the locus (new Figure 2c and S3).

      (5) A key assertion in the preprint version was that depleting cohesin using the auxin degron system impairs HR-driven MAT switching. This claim was based on a direct comparison of cultures treated or not with auxin (-/+ IAA). However, during the revision process, we realized that auxin treatment itself could interfere with MAT switching. Firstly, we noticed a diminished HOcs cutting efficiency by HO in +IAA cultures (Figure S6). Secondly, the apparently dramatic delay in gene conversion to MAT_α could actually be related to other undesirable effects of IAA downstream in the repair process. Thus, we decided to repeat this experiment with strains that differ in their response to auxin, so that we could compare all strains in the presence of auxin. We compared four isogenic strains: _SMC3; SMC3-aid*; SMC3 + OsTIR1; and SMC3-aid* + OsTIR1. As a result, we can now show that cohesin depletion does not affect MAT switching (see new Figure 4b-d).

      (6) We recently reported a negative chemical interaction between auxin and phleomycin. Auxin appears to diminish the ability of phleomycin to generate DSBs (Comm Biol 2025, doi: 10.1038/s42003-025-08416-x; see Figures S14 and S15 in that paper). While the underlying nature of this interaction is unknown to us (we are working on it), this leads us to omit the coalescence assay included in the preprint version (old Figure 4c), as the diminished coalescence upon IAA addition is actually due to this effect rather than cohesin depletion. This is also in agreement with the new data we include in the revised version, in which we observed only minor changes in cohesin reconstitution and chromatin binding after phleomycin (Figure 2a,b; S1 and S2).  

      (7) In addition to addressing these reviewers’ requests, we have better characterized the MAT switching in late mitosis by incorporating the kinetics of _rad9_Δ (deficient in the DNA damage checkpoint), _yku70_Δ (deficient in non-homologous end joining) and _mre11_Δ (deficient in DSB end tethering). The effect of _rad52_Δ (deficient in HR) has been described elsewhere (our iScience 2024, 10.1016/j.isci.2024.110250).

      As a result of these new experiments, new figure panels have been added in the main figures and as supplementary figures. To make room for the these panels in the main figures and keep the short report format, the following changes have been made: (i) old figures and new panels have been combined into four main figures, (ii) some panels from the old figures have been moved to supplementary figures, and (iii) some panels have been reordered for the sake of simplicity and fluidity in the main text. 

      Public Reviews:

      Reviewer #1 (Public Review):

      Summary:

      The cohesin complex maintains sister chromatid cohesion from S phase to anaphase. Beyond that, DSBs trigger cohesin recruitment and post-replication cohesion at both damage sites and globally, which was originally reported in 2004. In their recent study, Ayra-Plasencia et al reported in telophase, DSBs are repaired via HR with re-coalesced sister chromatids (Ayra-Plasencia & Machín, 2019). In this study, they show that HR occurs in a Smc3-dependent way in late mitosis.

      Strengths:

      The authors take great advantage of the yeast system, they check the DSB processing and repair of a single DSB generated by HO endonuclease, which cuts the MAT locus in chromosome III. In combination with cell synchronization, they detect the HR repair during G2/M or late mitosis. and the cohesin subunit SMC3 is critical for this repair. Beyond that, full-length Scc1 protein can be recovered upon DSBs.

      Weaknesses:

      These new results basically support their proposal although with a very limited molecular mechanistic progression, especially compared with their recent work.

      Reviewer #2 (Public Review):

      Summary:

      The manuscript "Cohesin still drives homologous recombination repair of DNA double-strand breaks in late mitosis" by Ayra-Plasencia et al. investigates regulations of HR repair in conditional cdc15 mutants, which arrests the cell cycle in late anaphase/telophase. Using a non-competitive MAT switching system of S. cerevisiae, they show that a DSB in telophase-arrested cells elicits a delayed DNA damage checkpoint response and resection. Using a degron allele of SMC3 they show that MATa-to-alpha switching requires cohesin in this context. The presence of a DSB in telophase-arrested cells leads to an increase in the kleisin subunit Scc1 and a partial rejoining of sister chromatids after they have separated in a subset of cells.

      Strengths:

      The experiments presented are well-controlled. The induction systems are clean and well thought-out.

      Weaknesses:

      The manuscript is very preliminary, and I have reservations about its physiological relevance. I also have reservations regarding the usage of MAT to make the point that inter-sister repair can occur in late mitosis.

      Regarding these two weaknesses:

      - Physiological relevance: This is something we already addressed in our previous research work (Nat Commun. 2019; 10(1):2862. doi: 10.1038/s41467-019-10742-8), and which was further discussed in a follow-up theoretical paper (Bioessays. 2020 ;42(7):e2000021. doi: 10.1002/bies.202000021). In summary, this is physiologically relevant because a DSB in anaphase activates a late-mitotic checkpoint so the DSB can be repaired before cytokinesis. The fact that anaphase is quick and only a minor fraction of cells get a DSB in this cell cycle stage in an asynchronous population does not preclude its importance since it is enough a single mis-repaired DSB in hundreds of cells to mutate a population in an health- or evolution-relevant way.

      - MAT system in late mitosis: It was not our intention to use the MAT switching assay to state that inter-sister repair can occur in late-M. The purpose was to address whether HR was fully functional in this non-G2/M non-G1 stage. Having said that, it is very challenging to design a strategy based on sequence-specific DSB to tackle the inter-sister repair in late-M. Any endonuclease-generated DSB is going to cut in both sisters. This is something we also deeply discussed in our previous works (Nat Commun & Bioessays).    

      Recommendations for the authors:

      Reviewer #1 (Recommendations For The Authors):

      Major points:

      (1) Smc3 degradation affects Rad53 activation upon DSBs, and this may directly lead to HR repair deficiency. Smc3 also could be phosphorylated by ATM and functions in DNA damage checkpoint activation, these alternative possibilities should also be tested before addressing the bona fide role of Smc3 in this context.

      Our previous data already suggested that Rad53 hyperphosphorylation still occurs after Smc3 degradation (Figure S6). Regardless, the question of whether the DNA damage checkpoint (DDC) may play a distinct role in the MAT switching has been addressed in this revision by comparing RAD9 versus rad9_Δ. Rad9 is a mediator in the DDC required for the activation of Rad53. We have seen that MAT switching in _rad9_Δ is as efficient as in _RAD9 (new Figure S5d-f).

      On the other hand, our new results, in which we have compared four different strains with all auxin system combinations in the presence of auxin, show that cohesin depletion does not affect MAT switching. Previously, we compared minus versus plus auxin and noticed diminished HO cutting efficiency. Thus, we repeated this experiment with four isogenic strains (SMC3; SMC3-aid*; SMC3 + OsTIR1; and SMC3-aid* + OsTIR1) that differ in their response to auxin and ability to degrade cohesin, so that we could compare all strains in the presence of auxin. As a result, we can now affirm that cohesin depletion does not affect MAT switching (see new Figure 4b-d). Therefore, HR appears efficient after cohesin depletion.

      (2) The requirement of cohesin subunit Smc3 and "coincidently" recovery of Scc1 are not sufficient to claim they act as a cohesin complex in this scenario. CoIP in the chromatin fraction after DSBs to prove the cohesin complex formation is recommended. If they act as a complex, are cohesin loader Scc2/4 required?

      We have constructed a SMC3-HA SCC1-myc strain. We have purified the chromatin-bound fraction as well as performing the co-IP. We have found Smc1-acSmc3-Scc1 forms a complex after Scc1 returns, and that at least a fraction of this complex binds to the chromatin in our HO model of DSBs in late anaphase (the cdc15-2 arrest). This is now shown in the new Figures 2a,b and S1,S2.

      As for the requirement of Scc2/4, we consider that the mechanisms underlying how Scc1 comes back, how a new cohesin complex is reassembled, and how it can partly bind to the chromatin in late anaphase are beyond the scope of this study and worth pursuing in a follow-up story.

      (3) Figure 3b. acetylated SMC3 was prominently detected in the absence of DSBs. During the cohesion cycle, the cohesin was released from chromatin in a separase-dependent manner at the anaphase onset. Released Smc3 was deacetylated by Hos1 subsequently. In principle, the acSMC3 level could be very low in late mitosis.

      In that figure (now renumbered as Fig S6), we did detect acetylated Smc3 for the remnant Smc3 still found in late mitosis, however, a direct comparison between the acetylated versus non-acetylated pools was not performed, and would require more sophisticated approaches. Note that blots are distinctly exposed until the band is detected, and that signal intensity is antibody-specific. The presence of an acSmc3 pool in the cdc15-2 arrest is now further confirmed by the new blots in Figures 2a, S1 and S2b.

      On the other hand, previous time course experiments from G1 and G2/M releases point out that Smc3 deacetylation is incomplete in anaphase, with up to 30% of acetylated Smc3 remaining (Beckouët et al, 2010 doi:10.1016/j.molcel.2010.08.008). This is consistent with the presence of acSmc3 in the cdc15-2 arrest.   

      (4) Did the author examine the acSMC3 levels returning after DSB, as Scc1's levels? If so, how about the Eco1's protein level? Chromatin fractionation could be conducted to check the chromatin-bound SMC3, acSMC3/Eco1, SCC1, SCC1 phosphorylation, and SMC1. These results will tell us whether cohesin functions in DSB repair in late M in a cohesion state.

      As stated above, we have now determined that cohesin depletion does not affect HR-driven MAT switching. As for the other questions, yes, we have performed both an assessment of acSmc3 in the pull down and chromatin fractionation, before and after DSBs (new Figures 2a, S1 and S2b). Interestingly, we have noticed a difference between the HO-generated and the phle-generated DSBs. It appears that the former leads to a better reconstituted Smc1-acSmc3-Scc1 complex and more chromatin-bound cohesin. The overall acSmc3 levels do not appear to significantly change in the whole cell extracts, although there could be further posttranslational modifications in telophase (see the changes in intensity between the two acSmc3 bands in Figure S1).

      The role of Eco1 has not been directly addressed but is discussed. The main point here is that Eco1 levels may be low after G2/M (e.g., Lyons and Morgan, 2011), but there is still a significant acSmc3 pool in anaphase as Hof1 does not deacetylate all Smc3 (Beckouët et al., 2010). 

      (5) Figure 4a, the return of full-length Scc1 is based on a single experiment. What's the mechanism? Inhibition of cleavage or re-expression? How about its mRNA levels?

      We have repeated the full-length Scc1 experiment two more times. Now, an expression graph is included as a new Figure 1e. The two other subunits, Smc1 and Smc3, have been assessed as well, with no major changes in abundance (new Figure 1c and d).

      We feel that the exact molecular mechanism of how Scc1 returns is beyond the scope of this study, but we discuss that the DDC may either inactivate separase or protect Scc1 against it. Indeed, there is literature that supports both mechanisms (e.g., Heidinger-Pauli et al., 2008 doi:10.1016/j.molcel.2008.06.005; Yam et al., 2020 doi:10.1093/nar/gkaa355).   

      Minor points:

      (6) FACS data should be shown for all cell synchronization experiments.

      From our previous own works, FACS profiles add little to late-M experiments. To properly confirm late-M, microscopy is a must. FACS cannot differentiate between G2/M (metaphase-like), anaphase, telophase and the ensuing G1 (as cdc15-2 cells do not immediately split apart after re-entering G1). In all experiments, Tel samples (late-M cdc15-2 arrest) were characterized by >95% large budded binucleated cells.

      (7) Figure 1d, A loading control of Rad53-P in is missing. The "Arrest" samples should be loaded again on the right to confirm the shift of Rad53, but not due to "smiling gels".

      It is true that the blot on the right has a right-handed smile; however, it is very clear the presence of the Rad53/Rad53-P partner. Because there is not a full shift from Rad53 to Rad53-P, the concern of misidentifying Rad53-P as a result of a blot smile is unfounded.

      (8) Figure 1c, After the HO cut, the resected DNA at the 726 bp site reaches to platform at about 4 hrs, while it still increases at the 5.6 kb site. Thus, it is difficult to conclude that "The time to reach half of the maximum possible resection (t1/2) was ~1 h at 0.7 Kb and ~2.5 h at 5.7 Kb from the DSB, respectively".

      We assumed that both loci reach the plateau at 0.8 (which is consistent with other studies), so the t1/2 was calculated when the resected intersected 0.4.

      (9) Figure 2b and 2c are wrongly labeled.

      We have fixed this (now Fig. 3d and e).

      (10) Figure 2d, Double check and make sure the quantitative data reflects the representative result. E.g. in Figure 2b (in fact should be 2c). For instance, in Figure 2b, the MATα signals seem to remain stable from 60' to 180', but they keep increasing in Figure 2d. In Yamaguchi & James E. Haber's paper, the signals and changes of MATa and MATα over time are way stronger compared to this study.

      We have double checked this. It is true that the sum of MATα, MATalpha and cut HOcs bands throughout the assay does not have the intensity seen for MATa before the HO induction (Tel), but MATalpha and HOcs signals cannot be established based on the equimolarity of the reaction as all band signals are probe-specific (the best indication of this can be seen in the signal comparison between MAT_α and _MAT distal at Tel). Alternatively, some resected HOcs may remain unrepaired.

      As for the referred example (now Figure 3e), note that they are double normalized to ACT1 and MAT_α (Tel), and the _ACT1 band gets fainter after 60’. This explains the increase in the MATalpha quantification in spite of what is apparently seen in the blot.

      (11) Typos and fonts: e.g. lines 111-112; line 76 "his link".

      We have fixed this. Thanks.

      Reviewer #2 (Recommendations For The Authors):

      Major concerns:

      (1) Physiological relevance. The authors show that HR can happen in the anaphase to telophase interval, yet does it outside of an hours-long artificial arrest upon inactivation of Cdc15? It is this reviewer's understanding that the duration of the anaphase to telophase transition is short, in the order of minutes. In fact, break signaling and resection are delayed by ~1 hour (Fig. 1), which suggests that cells avoid dealing with the damage and engaging in HR in the anaphase-telophase interval. Is there any described physiological context or checkpoint that blocks this transition for extended periods, that would make any of the findings in this paper relevant?

      This concern about the physiological relevance was addressed in our previous study (Nat Commun. 2019; 10(1):2862. doi: 10.1038/s41467-019-10742-8). In that paper’s Figure 1, we showed that G1 re-entry after a cdc15-2 release was delayed by several hours when DSBs had been previously generated at the cdc15-2 arrest. We also showed that such a delay depended on Rad9 (i.e., the DNA damage checkpoint). In addition, synchronized (not arrested) cells transiting through anaphase responded to DSB generation by slowing anaphase transition while partly regressing chromosome segregation (Figure S7 in that paper).

      (2) Methodological caveats. It is unclear why the authors chose to study DSB-repair in the context of MATa-to-alpha switching (which uses an ectopic donor on the other chromosome arm) as a model for inter-sister repair. It creates a disconnect in the claims of the paper, which means to study inter-sister repair. Studying the kinetics of DSB repair by cytology following low-dose irradiation or radiomimetic drugs would have been a better option. Phleomycin is used in Fig. 4, but the repair kinetics (e.g. Rad52 foci) is not studied.

      The MAT switching assay was used here to address how much HR was functional in late-M compared to G2/M (metaphase-like). Then, it was employed to check how cohesin depletion hampers HR in late-M. Even though this is something we already deeply discussed previously (Nat Commun. 2019; 10(1):2862. doi: 10.1038/s41467-019-10742-8; Bioessays. 2020 ;42(7):e2000021. doi: 10.1002/bies.202000021), it is worth recapitulating the methodological challenges that the study of inter-sister repair has in late-M: (i) endonuclease-based DSBs are going to generate two DSBs, one per sister chromatid; (ii) the use of a homologous chromosome without the cutting site as a template is pointless because a sister of the homolog is always going to co-segregate with the broken chromatid, and the same caveat applies for any other ectopic sequence. In this context, the MATa with the HML ectopic intrachromosomal sequence is as valid as any other option, with the advantage that it is a very well-known system.

      On the other hand, most of the reviewer’s concerns about the inter-sister repair by cytology and the role of Rad52 was addressed in our previous paper (Nat Commun). Note that our new results about the cohesin role on MAT switching show that this HR-mediated DSB repair does not depend on cohesin (new Figure 4b-d).

      (3) Preliminary work. The requirement of cohesin for MAT switching in cdc15 mutants would have warranted several additional experiments. Indeed, Cohesin has been shown to regulate homology search in multiple ways upon DNA damage checkpoint-induced metaphase-arrest (see Piazza et al. Nat Cell Biol 2021 (10.1038/s41556-021-00783-x), not cited in the current manuscript). Consequently, is the effect of cohesin observed in the MAT system specific to telophase or is it true in other cell-cycle phases? What is the mechanism behind this requirement (one may expect it not to depend on the sister since the HML donor is available within the damaged chromatid)? Does cohesin re-accumulate around the DSB site or genome-wide? How does the Esp1 activity decay from anaphase onset? Is cohesin required for the horseshoe folding of chr. III involved in MATa-to-alpha switching? Furthermore, condensin is involved in MATa-specific switching (Li et al. PLoS Genet 2019, 10.1371/journal.pgen.1008339), and condensin remains active on chromatin in cdc15 arrested cells, as shown on chr. XII (Lazar-Stefanita et al. EMBO J. 2017 10.15252/embj.201797342), which calls for determining the impact contribution of condensin in the recoil of the right ch.XII arm (Fig 4c) and on MAT switching.

      There are several points here:

      - Is the effect of cohesin observed in the MAT system specific to telophase or is it true in other cell-cycle phases?

      Our new results show that cohesin depletion does not affect MAT switching when four different strains with all auxin system combinations are compared in the presence of auxin. Previously, when we compared minus versus plus auxin, we noticed diminished HO cutting efficiency. Therefore, we repeated the experiment using four isogenic strains (SMC3, SMC3-aid*, SMC3 + OsTIR1, and SMC3-aid* + OsTIR1), which differ in their response to auxin and ability to degrade cohesin. This allowed us to compare all strains in the presence of auxin. As a result, we can now confirm that cohesin depletion does not affect MAT switching (see the new Figures 4b–d). Therefore, HR appears efficient after cohesin depletion. In agreement, the new ChIPs we have performed do not detect an increment in local cohesin after the HO DSB in telophase (but it does in cells arrested in G2/M).

      - What is the mechanism behind this requirement (one may expect it not to depend on the sister since the HML donor is available within the damaged chromatid)?

      As just said, we have changed our previous conclusion on cohesin and MAT switching. It was an effect of auxin addition rather than cohesin depletion.

      - Does cohesin re-accumulate around the DSB site or genome-wide?

      We have performed ChIP around the HOcs. We have found that it does accumulate in G2/M after HO induction, but it does not in telophase (new Figures 2c and S3). As for the global binding of cohesin, our chromatin fractionation data suggest there is ~2-fold increase in Smc1-Smc3, which also binds to the newly formed Scc1, rendering an overall increase in the chromatin-bound canonical complex (new Figures 2b and S2). Altogether, this suggests a genome-wide binding but with little role in the repair of HO DSBs.

      - How does the Esp1 activity decay from anaphase onset?

      We have not checked this here but it is an interesting question for a follow-up story.

      - Is cohesin required for the horseshoe folding of chr. III involved in MATa-to-alpha switching?

      Probably not in view of our new data in Figures 2c and 4b-d. The Piazza papers are cited and discussed.

      - Contribution of condensin in the recoil of the right ch.XII arm (Fig 4c) and on MAT switching.

      The role of condensin, which overtakes some cohesin function in late-M as the reviewer reminds, is worth studying indeed. However, we feel this deserves a separate and focus-on study. We does discuss, though, that condensin loading onto the arms in anaphase may prevent Smc1-Smc3 from loading after DSBs.

      Other points:

      (4) Is the retrograde behavior in Fig. 4c dependent on recombination?

      No, this is something we addressed in our previous paper (see Figure 4 in Nat Commun. 2019; 10(1):2862. doi: 10.1038/s41467-019-10742-8).

      (5) Fig 3c: add a scheme of the system.

      A scheme was already shown in the old Figure 2a (note that the old Fig 3c is now Fig S6).

      (6) Fig 3b: annotate as in Fig 2b.

      We have fixed this (now the referred figures are S6a and 3d, respectively).

      (7) Authors used IAA concentrations 4- to 8-fold higher than commonly used. Given the solubility of IAA in DMSO (the most commonly used solvent), it is likely that authors treated their cells with >2% DMSO. This is expected to have broad transcriptional and physiological effects on yeast. A comparison of +IAA samples with a mock (DMSO) treatment would be more appropriate than a lack of treatment.

      The IAA stock solution was 500 mM in DMSO, so the final DMSO concentration for an 8 mM IAA solution was 1.6% (v/v). Although the stock concentration was high and some precipitation was observed during preparation, we always heated, sonicated, and vigorously vortexed the stock tube before adding IAA to the cultures. Thus, we kept the uncertainty in the final IAA concentration to a minimum.

    1. Author response:

      The following is the authors’ response to the original reviews

      Public Reviews:

      Reviewer #1 (Public Review):

      Summary:

      What are the overarching principles by which prokaryotic genomes evolve? This fundamental question motivates the investigations in this excellent piece of work. While it is still very common in this field to simply assume that prokaryotic genome evolution can be described by a standard model from mathematical population genetics, and fit the genomic data to such a model, a smaller group of researchers rightly insists that we should not have such preconceived ideas and instead try to carefully look at what the genomic data tell us about how prokaryotic genomes evolve. This is the approach taken by the authors of this work. Lacking a tight theoretical framework, the challenge of such approaches is to devise analysis methods that are robust to all our uncertainties about what the underlying evolutionary dynamics might be.

      The authors here focus on a collection of ~300 single-cell genomes from a relatively well-isolated habitat with relatively simple species composition, i.e. cyanobacteria living in hotsprings in Yellowstone National Park, and convincingly demonstrate that the relative simplicity of this habitat increases our ability to interpret what the genomic data tells us about the evolutionary dynamics.

      Using a very thorough and multi-faceted analysis of these data, the authors convincingly show that there are three main species of Synechococcus cyanobacteria living in this habitat, and that apart from very frequent recombination within each species (which is in line with insights from other recent studies) there is also a remarkably frequent occurrence of hybridization events between the different species, and with as of yet unidentified other genomes. Moreover, these hybridization events drive much of the diversity within each species. The authors also show convincing evidence that these hybridization events are not neutral but are driven by selected by natural selection.

      Strengths:

      The great strength of this paper is that, by not making any preconceived assumptions about what the evolutionary dynamics is expected to look like, but instead devising careful analysis methods to tease apart what the data tells us about what has happened in the evolution in these genomes, highly novel and unexpected results are obtained, i.e. the major role of hybridization across the 3 main species living in this habitat.

      The analysis is very thorough and reading the detailed supplementary material it is clear that these authors took a lot of care in devising these methods and avoiding the pitfalls that unfortunately affect many other studies in this research area.

      The picture of the evolutionary dynamics of these three Synechococcus species that emerge from this analysis is highly novel and surprising. I think this study is a major stepping stone toward the development of more realistic quantitative theories of genome evolution in prokaryotes.

      The analysis methods that the authors employ are also partially novel and will no doubt be very valuable for analysis of many other datasets.

      We thank the reviewer for their appreciation of our work.

      Weaknesses:

      I feel the main weakness of this paper is that the presentation is structured such that it is extremely difficult to read. I feel readers have essentially no chance to understand the main text without first fully reading the 50-page supplement with methods and 31 supplementary materials. I think this will unfortunately strongly narrow the audience for this paper and below in the recommendations for the authors I make some suggestions as to how this might be improved.<br /> A very interesting observation is that a lot of hybridization events (i.e. about half) originate from species other than the alpha, beta, and gamma Synechococcus species from which the genomes that are analyzed here derive. For this to occur, these other species must presumably also be living in the same habitat and must be relatively abundant. But if they are, why are they not being captured by the sampling? I did not see a clear explanation for this very common occurrence of hybridization events from outside of these Synechococcus species. The authors raise the possibility that these other species used to live in these hot springs but are now extinct. I'm not sure how plausible this is and wonder if there would be some way to find support for this in the data (e.g that one does not observe recent events of import from one of these unknown other species). This was one major finding that I believe went without a clear interpretation.

      We agree with the reviewer that the extent of hybridization with other species is surprising. While we do feel that our metagenome data provide convincing evidence that “X” species are not present in MS or OS, we cannot currently rule out the presence of X in other springs. In the revision we explicitly mention the alternative hypothesis (Lines 239-242).

      The core entities in the paper are groups of orthologous genes that show clear evidence of hybridization. It is thus very frustating that exactly the methods for identifying and classifying these hybridization events were really difficult to understand (sections I and V of the supplement). Even after several readings, I was unsure of exactly how orthogroups were classified, i.e. what the difference between M and X clusters is, what a `simple hybrid' corresponds to (as opposed to complex hybrids?), what precisely the definitions of singlet and non-singlet hybrids are, etcetera. It also seems that some numbers reported in the main text do not match what is shown in the supplement. For example, the main text talks about "around 80 genes with more than three clusters (SM, Sec. V; fig. S17).", but there is no group with around 80 genes shown in Fig S17! And similarly, it says "We found several dozen (100 in α and 84 in β) simple hybrid loci" and I also cannot match those numbers to what is shown in the supplement. I am convinced that what the authors did probably made sense. But as a reader, it is frustrating that when one tries to understand the results in detail, it is very difficult to understand what exactly is going on. I mention this example in detail because the hybrid classification is the core of this paper, but I had similar problems in other sections.

      We thank the reviewer for pointing out these issues with our original presentation. In the revision, we have redone most of the analysis to simplify the methods and check the consistency of the results. We did not find any qualitative differences in our results after reanalysis, but some of the numbers for different hybridization patterns have changed. The most notable difference is an increase in the number of alpha-gamma simple hybrids and a corresponding decrease in mixed-species clusters (now labeled mosaic hybrids). These transfers are difficult to assign because we only have access to a single gamma genome. We have added a short explanation of this point in Lines 219-222.

      To improve the presentation, we significantly expanded the “Results” section to better explain our analysis and the different steps we take. We included two additional figures (Figs. 3 and 4) that illustrate the different types of hybrids and the heterogeneity in the diversity of alpha which is discussed in the main text and is important for interpreting our results. We also included two additional figures (Figs. 2 and 6) that were previously in the Appendix but were mentioned in the main text. We believe these changes should address most of the issues raised by the reviewer and hopefully make the manuscript easier to read.

      Although I generally was quite convinced by the methods and it was clear that the authors were doing a very thorough job, there were some instances where I did not understand the analysis. For example, the way orthogroups were built is very much along the lines used by many in the field (i.e. orthoMCL on the graph of pairwise matchings, building phylogenies of connected components of the graph, splitting the phylogenies along long branches). But then to subdivide orthogroups into clusters of different species, the authors did not use the phylogenetic tree already built but instead used an ad hoc pairwise hierarchical average linkage clustering algorithm.

      The reviewer is correct that there is an unexplained discrepancy between the clustering methods we used at different steps in our pipeline. We followed previous work by using phylogenetic distances for the initial clustering of orthogroups. On these scales we expect hybridization to play a minor role and phylogenetic distances to correlate reasonably well with evolutionary divergence. However, because of the extensive hybridization we observed, the use of phylogenetic models for species clustering is more difficult to justify. We therefore chose to simply use pairwise nucleotide distances, which make fewer assumptions about the underlying evolutionary processes and should be more robust. We have briefly explained our reasoning and the details of our clustering method in the revision (Lines 182-190).

      Reviewer #2 (Public Review):

      Summary:

      Birzu et al. describe two sympatric hotspring cyanobacterial species ("alpha" and "beta") and infer recombination across the genome, including inter-species recombination events (hybridization) based on single-cell genome sequencing. The evidence for hybridization is strong and the authors took care to control for artefacts such as contamination during sequencing library preparation. Despite hybridization, the species remain genetically distinct from each other. The authors also present evidence for selective sweeps of genes across both species - a phenomenon which is widely observed for antibiotic resistance genes in pathogens, but rarely documented in environmental bacteria.

      Strengths:

      This manuscript describes some of the most thorough and convincing evidence to date of recombination happening within and between cohabitating bacteria in nature. Their single-cell sequencing approach allows them to sample the genetic diversity from two dominant species. Although single-cell genome sequences are incomplete, they contain much more information about genetic linkage than typical short-read shotgun metagenomes, enabling a reliable analysis of recombination. The authors also go to great lengths to quality-filter the single-cell sequencing data and to exclude contamination and read mismapping as major drivers of the signal of recombination.

      We thank the reviewer for their appreciation of our work.

      Weaknesses:

      Despite the very thorough and extensive analyses, many of the methods are bespoke and rely on reasonable but often arbitrary cutoffs (e.g. for defining gene sequence clusters etc.). Much of this is warranted, given the unique challenges of working with single-cell genome sequences, which are often quite fragmented and incomplete (30-70% of the genome covered). I think the challenges of working with this single-cell data should be addressed up-front in the main text, which would help justify the choices made for the analysis.

      We have significantly expanded the “Results” section to better justify and explain the choices we made during our analysis. We hope these changes address the reviewer’s concerns and make the manuscript more accessible to readers.

      The conclusions could also be strengthened by an analysis restricted to only a subset of the highest quality (>70% complete) genomes. Even if this results in a much smaller sample size, it could enable more standard phylogenetic methods to be applied, which could give meaningful support to the conclusions even if applied to just ~10 genomes or so from each species. By building phylogenetic trees, recombination events could be supported using bootstraps, which would add confidence to the gene sequence clustering-based analyses which rely on arbitrary cutoffs without explicit measures of support.

      It seems to us that the reviewer’s suggestion presupposes that the recombination events we find can be described as discrete events on an asexual phylogeny, similar to how rare mutations are treated in standard phylogenetic inference. Popular tools, such as ClonalFrame and its offshoots, have attempted to identify individual recombination events starting from these assumptions. But the main conclusion of both our linkage and SNP block analysis is that the ClonalFrame assumptions do not hold for our data. Under a clonal frame, the SNP blocks we observe should be perfectly linked, similar to mutations on an asexual tree. But our results in Fig. 7D show the opposite. Part of the issue may have been that in our original presentation, we only briefly discuss the results of our linkage analysis and refer readers to the Appendix for more details. To fix this issue we have added an extra figure (Fig. 2), showing rapid linkage decrease in both species and that at long distances the linkage values are essentially identical to the unlinked case, similar to sexual populations. We hope that this change will help clarify this point.

      The manuscript closes without a cartoon (Figure 4) which outlines the broad evolutionary scenario supported by the data and analysis. I agree with the overall picture, but I do think that some of the temporal ordering of events, especially the timing of recombination events could be better supported by data. In particular, is there evidence that inter-species recombination events are increasing or decreasing over time? Are they currently at steady-state? This would help clarify whether a newly arrived species into the caldera experiences an initial burst of accepting DNA from already-present species (perhaps involving locally adaptive alleles), or whether recombination events are relatively constant over time.

      The reviewer raises some very interesting questions about the dynamics of recombination in the population, which we hope to pursue in future work. We have added this as an open question in the Discussion (Lines 365-382).

      These questions could be answered by counting recombination events that occur deeper or more recently in a phylogenetic tree.

      The reviewer here seems to presuppose that recombination is rare enough that a phylogenetic tree can reliably be inferred, which is contrary to our linkage analysis (see the response to an earlier comment). Perhaps the reviewer missed this point in our original manuscript since it was discussed primarily in the Appendix. See also our response to a previous comment by the reviewer.

      The cartoon also shows a 'purple' species that is initially present, then donates some DNA to the 'blue' species before going extinct. In this model, 'purple' DNA should also be donated to the more recently arrived 'orange' species, in proportion to its frequency in the 'blue' genome. This is a relatively subtle detail, but it could be tested in the real data, and this may actually help discern the order of the inferred recombination events.

      We have included an extra figure in the main text (Fig. 6) that addresses the question of timing of events. A quantitative test of our cartoon model along the lines the reviewer suggested would certainly be worthwhile and we hope to do that in future work.  

      The abstract also makes a bold claim that is not well-supported by the data: "This widespread mixing is contrary to the prevailing view that ecological barriers can maintain cohesive bacterial species..." In fact, the two species are cohesive in the sense that they are identifiable based on clustering of genome-wide genetic diversity (as shown in Fig 1A). I agree that the mixing is 'widespread' in the sense that it occurs across the genome (as shown in Figure 2A) but it is clearly not sufficient to erode species boundaries. So I believe the data is consistent with a Biological Species Concept (sensu Bobay & Ochman, Genome Biology & Evolution 2017) that remains 'fuzzy' - such that there are still inter-species recombination events, just not sufficient to erode the cohesion of genomic clusters. Therefore, I think the data supports the emerging picture of most bacteria abiding by some version of a BSC, and is not particularly 'contrary' to the prevailing view.

      We have revised the phrase mentioned by the reviewer to “prevent genetic mixture between bacterial species,” which more accurately represents our conclusions. 

      The final Results paragraph begins by posing a question about epistatic interactions, but fails to provide a definitive answer to the extent of epistasis in these genomes. Quantifying epistatic effects in bacterial genomes is certainly of interest, but might be beyond the scope of this paper. This could be a Discussion point rather than an underdeveloped section of the Results.

      We agree with the reviewer that an exhaustive analysis of epistasis in the population is beyond the scope of the manuscript. Our original intention was to answer whether SNP blocks we discovered showed evidence of strong linkage, as might be expected if only a small number of strains are present in the population. In light of the previous comments by the reviewer regarding the consistency with the clonal frame hypothesis, we believe this is especially relevant for our results. Moreover, the results we found‑especially for the beta population‑were quite conclusive: SNP block linkages in beta are indistinguishable from an unlinked model. To avoid misdirecting the reader about the significance of our results, we have revised the relevant paragraph (Lines 316-319).

      Recommendations For The Authors:

      Reviewer #1 (Recommendations For The Authors):

      Although I am entirely convinced of the validity of the results, methodology, and interpretations presented in this work, I must say I found the paper very hard to read. And I think I am really quite familiar with these kinds of approaches. I fear that for people other than experts on these kinds of comparative genomic analyses, this paper will be almost impossible to read. With the aim of expanding the audience for this compelling work, I think the authors might want to consider ways to improve the presentation.

      At the end of a long project, the obtained results typically form a web of mutual interconnections and dependencies and one of the key challenges in presenting the results in a paper is having to untangle this web of connected results and analysis into a linear ordered narrative so that, at any point in the narrative, understanding the next point only depends on previous points in the narrative. I frankly feel that this paper fails at this.

      The paper reads to me as if one author put together the supplement by essentially writing a report of all the analyses that were done together with supplementary figures summarizing all those analyses, and that another author then wrote the main text by using the materials in the supplement almost in the way a cook uses ingredients for a dish. Almost every other sentence in the main text refers to results in the (31!) supplementary figures and can only be understood by reading the appropriate corresponding sections in the supplementary materials. I found it essentially impossible to read the main text without having first read the entire 50-page supplement.

      I think the paper could be hugely improved by trying to restructure the presentation so as to make it more linear. The main text can be expanded to include a summary of the crucial methods and analysis results from the supplement needed to understand the narrative in the main text. For example, as it currently stands it is really challenging to understand what is shown in figures 2 and 3 of the main text without having to first read a very substantial part of the supplement. Figure 3, even after having read the relevant sections in the supplement, took me quite a while to understand and almost felt like a puzzle to decypher. Rethinking which parts of the supplement are really necessary would also help. Finally, it would also help if the terminology was kept as simple, transparent, and consistent as possible.

      I understand that my suggestion to thoroughly reorganize the presentation may feel like a big hassle, but I am afraid that in its current form, these important results are essentially rendered inaccessible to all but a small group of experts in this area. This paper deserves a wider readership.

      We thank the reviewer for these valuable suggestions. In the revision, we have significantly expanded and restructured the “Results” section to make the presentation more linear, as the reviewer suggested (see our reply to the public comment by the reviewer for details). We hope these changes will make the manuscript easier to read.

      Reviewer #2 (Recommendations For The Authors):

      I found this paper challenging to follow since the main text was so condensed and the supplementary material so extensive. Given that eLife does not impose strong limits on the length of the main text, I suggest moving some key sections from the supplement into the main text to make it easier for the reader to follow rather than flipping back and forth. Adding to the confusion, supplementary figures were referenced out of order in the main text (e.g. S23 is referenced before S1). Please check the numbering and ensure figures are mentioned in the main text in the correct order.

      We thank the reviewer for their feedback on the presentation of the results. In response to similar comments from Reviewer #1, we have significantly expanded and restructured the “Results” section to make it easier to read (see also our responses to Reviewer #1).

      Page 2: The term 'coevolution' is typically reserved for two species that mutually impose selective pressures on one another (e.g. predator-prey interactions; see Janzen, Evolution 1980). In the context of these two cyanobacterial species, it's not clear that this is the case so I would simply refer to them 'cohabitating' or being sympatric in the same environment.

      It is true that the term "coevolution” has become associated with predator-prey interactions, as the reviewer said. However, we feel that in our case “coevolution” fairly accurately describes the continual hybridization over long time scales we observe. We have therefore chosen to keep the term.

      Page 3: The authors mention that the gamma SAG is ~70% complete, which turns out to be quite high. It would be useful to mention early in the Results the mean/median completeness across SAGs, and how this leads to some challenges in analysing the data. Some of the material from the Supplement could be moved into the Results here.

      We have added a short note on the completeness in the Results (Lines 153-154). We have also added an extra figure in Appendix 1 with the completeness of all the SAGs for interested readers.

      I was left puzzled by the sentence: "Alternatively, high rates of recombination could generate different genotypes within each genome cluster that are adapted to different temperatures, with the relative frequencies of each cluster being only a correlated and not a causal driver of temperature adaptation." This is suggesting that individual genes or alleles, rather than entire genomes, could be adapted to temperature. But figure 1B seems to imply that the entire genome is adapted to different temperatures. Anyway, this does not seem to be a key point and could probably be removed (or clarified if the authors deem this an important point, which I failed to understand).

      We have revised this section to clarify the alternative hypothesis mentioned by the reviewer (Lines 100-103).

      Page 4. 'Several dozen' hybrid genes were found, but please also specify how many genes were tested. In general, it would be good to briefly outline the sample size (SAGs or genes) considered for each analysis.

      We have added the total numbers of genes we analyzed at each step of our analysis.

      'Mosaic hybrid loci' are mentioned alongside the issue of poor alignment. Presumably, the mosaic hybrid loci are first filtered to remove the poor alignments? This should be specified, and please mention how many loci are retained before/after this filter.

      We thank the reviewer for highlighting this important point. In the revision, we have implemented a more aggressive filtering of genes with poor alignments. We have added an extra paragraph to Appendix 1 (step 5 in the pipeline analysis) briefly explaining the issue.

      Page 5. "By contrast, the diversity of mosaic loci was typical of other loci within beta, suggesting most of the beta genome has undergone hybridization." Please point to the data (figure) to support this statement.

      We have restructured our discussion of the different hybrid loci so this comment is no longer relevant. In case the reviewer is interested, the synonymous diversity within beta was 0.047, while in mosaic hybrids it was 0.064.

      Page 6. "The largest diversity trough contained 28 genes." Since this trough is discussed in detail and seems to be of interest, it would be nice to illustrate it, perhaps as an inset in Figure 2 or as a separate figure. If I understood correctly, this trough includes genes (in a nitrogen-fixation pathway) that are present in all genomes, but are exchanged by homologous recombination. So I don't think it's correct to say that the "ancestors acquired the ability to fix nitrogen." Rather, the different alleles of these same genes were present in the ancestor. So perhaps there was a selective sweep involving alleles in this region that provided adaptation to local nitrogen sources or concentrations, but not a gain of new genes. Perhaps I misunderstood, in which case clarification would be appreciated.

      The reviewer raises an interesting possibility. We agree that it is in principle possible that the ancestor contained the nitrogen fixation genes and the selective sweep simply replaced the ancestral alleles. In this particular case, there is additional evidence that the entire pathway was acquired around roughly the same time from gene order. The gene order between alpha and beta is almost entirely different, with only a few segments containing more than 2-3 genes in the same order, as shown by Bhaya et al. 2007 and confirmed by additional unpublished analysis of the SAGs. One of the few exceptions is the nitrogen fixation pathway, which has essentially the same gene order over more than 20 kbp. Thus, if the ancestor of both alpha and beta contained the nitrogen-fixation pathway, we would expect these genes to be scatter across the genome. We have revised the sentences in question to clarify this point (Lines 260-271).

      Page 6. Last paragraph on epistasis references Fig 3C, but I believe it should be Fig 3D.

      Fixed.

      Page 7. Figure 3 legend. "Note that alpha-2 is identical to gamma here." I believe it should be beta, not gamma.

      The reviewer is correct. We have fixed this error.

      Page 8. What is the evidence for "at least six independent colonizers"? I could not find the data supporting this claim.

      The statement mentioned by the reviewer was based on the maximum number of species clusters we identified in different core genes. However, during the revision, we found that only a handful of genes contained five or more clusters. We did find several tens of genes with four clusters. In addition, Rosen et al. (2018) also found additional 16S clusters at low frequency in the same springs. Based on these results we conservatively estimate that at least four independent strains colonized the caldera, but the number could be much greater. We have revised the text in question accordingly (Lines 336-339) and added Fig. 2 in Appendix 1 to support the conclusion.

      Page 9. Line 200: "acting to homogenize the population." It should be specified that the population is only homogenized at these introgressed loci, not genome-wide. Otherwise, the genome-wide species clusters seen in Fig 1 would not be maintained.

      It is true that the selective sweeps that lead to diversity throughs only homogenize the introgressed loci. But other hybrid segments could also rise to high frequency in the population during the sweep through hitchhiking. The fact that we observe SNP blocks generated through secondary recombination events of introgressed segments throughout the genome supports this view. While we do not fully understand the dynamics of this process currently, we do feel that the current evidence supports the statement that mixing is occurring throughout the genome and not just at a few loci so we have kept the original statement.

      The final sentence (lines 221-222) is vague and uninformative. On the one hand, "investigating whether hybridization plays a major role" is what the current manuscript has already done - depending on what is meant by 'major' (how much of the genome? Or whether there are ecological implications?). It is also not clear what is meant by a predictive theory and 'possible evolutionary scenarios. This should be elaborated upon, otherwise, it is not clear what the authors mean. Otherwise, this sentence could be cut.

      We thank the reviewer for their feedback. One possible source of confusion could be that in this sentence we were referring to detecting hybridization in other communities. We have changed “these communities” to “other communities” to make this clearer.

      Supplement.

      Broadly speaking, I appreciate the thorough and careful analysis of the single cell data. On the other hand, it is hard to evaluate whether these custom analyses are doing what is intended in many cases. Would it be possible to consider an analysis using more established methods, e.g. taking a subset of genomes with 'good' completeness and using Panaroo to find the core and accessory genome, then ClonalFrameML or Gubbins to infer a phylogeny and recombination events? Such analyses could probably be applied to a subset of the sample with relatively complete genomes. I don't want to suggest an overly time-consuming analysis, but the authors could consider what would be feasible.

      We have added a comparison between our analysis and that from two other methods, including ClonalFrameML mentioned by the author. One important point that we feel might have been lost in the first version is that our linkage results imply that recombination is not rare such that it can be mapped onto an asexual tree as assumed by ClonalFrameML. Note that this is not simply due to technical limitations due to incomplete coverage and is instead a consequence of the evolutionary dynamics of the population. Consistent with this, we found several inconsistencies in how recombination events were assigned by ClonalFrameML. We have summarized these conclusions in Appendix 7 of the revised manuscript.

      Page 8. Line 190. What is meant by 'minimal compositional bias'?

      We mean that the sample is not biased towards strains that grow in the lab. We have revised the sentence to clarify.

      Page 25. Figure S14 is not referenced in the text.

      We have added part of this figure to the main text since it illustrates one of our main results, namely that sites at long genomic distances are essentially unlinked.

      Page 26. The 'unlinked controls' (line 530) are very useful, but it would be even more informative to see if these controls also show the same decline in linkage with distance in the genome as observed in the real data. In particular, it would be good to know if the observed rapid decline in linkage with distance in the low-diversity regions is also observed in controls. Currently, it is unclear if this observation might be due to higher uncertainty in inferring linkage in low-diversity regions, which by definition have less polymorphism to include in the linkage calculation.

      We thank the reviewer for the suggestion. After further consideration, we have decided to remove the subsection on linkage decrease in the low-diversity regions. We feel such detailed quantitative analysis would be better suited for a more technical paper, which we hope to do at a later time.

      Page 26. There are some sections with missing identifiers (Sec ??).

      Fixed.

      Page 27. The information about the typical breadth of SAG coverage (~30%) would be better to include earlier in the Supplement, and also mentioned in the main text so the reader can more easily understand the nature of the dataset.

      We have added an extra figure with the SAG coverages to Appendix 1.

      Page 29. Any sensitivity analysis around the S = 0.9 value? Even if arbitrary, could the authors provide justification why they think this value is reasonable?

      We have significantly revised this section in response to earlier comments by one of the reviewers. We hope that this would clarify the details of our methods to interested readers. To answer the reviewer’s specific question, we chose this heuristic after examining the fraction of cells of each species in different species clusters. For the clusters assigned to alpha and beta, we found a sharp peak near one and that a cutoff of 0.9 captured most clusters while still being high enough to inconsistent with a mixed cluster.

      Page 30. I could not see where Fig. S17 was mentioned in the text. Also, how are 'simple hybrid genes' defined?

      We have removed this figure in the revision. The definition of the different types of hybrid genes have been added to the main text in response to a comment from the other reviewer.

      Page 36. It is hard to see that divergence is 'high' relative to what reference. Would it be possible to include the expected value (from ref. 12) in the plot, or at least explicitly mentioned in the text?

      We have added the mean synonymous and non-synonymous divergences between alpha and beta to the figures for reference.

      Page 38. Line 770 "would be comparable to that of beta." This is not necessarily the case since beta could have a different time to its most recent common ancestor. It could have a different time to the last bottleneck or selective sweep, etc.

      We thank the reviewer for pointing out this misleading statement. Our point here was that in the first scenario the TMRCA of alpha and beta would be similar since the diversity in the high-diversity alpha genes is similar to beta. We have clarified this statement in the revision.

      Page 39. Line 793. The use of the term 'genomic backbone' implies the presence of a clonal frame, which is not what the data seems to support. Perhaps another term such as 'genetic diversity' would more appropriately capture the intended meaning here.

      We agree with the reviewer that the low-diversity regions may not be asexual. We used “genomic backbone” to distinguish from the “clonal frame,” which is usually used to mean that the backbone is asexual. We have added a note in the revision to clarify this point.

      Page 39. Lines 802-805. I found this explanation hard to follow. Could the logic be clarified?

      We simply meant that although the beta distribution is unimodal, it is not consistent with a simple Poisson distribution, unlike in alpha. We have added an extra sentence to clarify this.

    1. Author response:

      The following is the authors’ response to the previous reviews

      Public Reviews:

      Reviewer #2 (Public review):

      In this valuable manuscript, Lin et al attempt to examine the role of long non coding RNAs (lncRNAs) in human evolution, through a set of population genetics and functional genomics analyses that leverage existing datasets and tools. Although the methods are incomplete and at times inadequate, the results nonetheless point towards a possible contribution of long non coding RNAs to shaping humans, and suggest clear directions for future, more rigorous study.

      Comments on revisions:

      I thank the authors for their revision and changes in response to previous rounds of comments. As it had been nearly two years since I last saw the manuscript, I reread the full text to familiarise myself again with the findings presented. While I appreciate the changes made and think they have strengthened the manuscript, I still find parts of it a bit too speculative or hyperbolic. In particular, I think claims of evolutionary acceleration and adaptation require more careful integration with existing human/chimpanzee genetics and functional genomics literature.

      We thank the reviewer heartfully for the great patience and valuable comments, which have helped us further improve the manuscript. Before responding to comments point by point, we provide a summary here.

      (1) On parameters and cutoffs.

      Parameters and cutoffs influence data analysis. The large number of Supplementary Notes, Supplementary Figures, and Supplementary Tables indicates that we paid great attention to the influence of parameters and robustness of analyses. Specifically, here we explain the DBS sequence distance cutoff of 0.034, which determines the top 20% genes that most differentiate humans from chimpanzees and influences the gene set enrichment analysis (Figure 2). As described in the revised manuscript, we estimated this cutoff based on Song et al., verified its rationality based on Prufer et al. (Song et al. 2021; Prufer et al. 2017), and measured its influence by examining slightly different cutoff values (e.g., 0.035).

      (2) Analyses of HS TFs and HS TF DBSs.

      It is desirable to compare the contribution of HS lncRNAs and HS TFs to human evolution. Identifying HS TFs faces the challenges that different institutions (e.g., NCBI and Ensembl) annotate orthologous genes using different criteria, and that multiple human TF lists have been published by different research groups. Recently, Kirilenko et al. identified orthologous genes in hundreds of placental mammals and birds and organized different types of genes into datasets of parewise comparison (e.g., hg38-panTro6) using humans and mice as references (Kirilenko et al. Integrating gene annotation with orthology inference at scale. Science 2023). Based on (a) the many2zero and one2zero gene lists in the “hg38-panTro6” dataset, (b) three human TF lists reported by two studies (Bahram et al. 2015; Lambert et al. 2018) and used in the SCENIC package, we identified HS TFs. The number of HS TFs and HS lncRNAs (5 vs 66) alone lends strong evidence suggesting that HS lncRNAs have contributed more significantly to human evolution than HS TFs (note that 5 is the union of three intersections between <many2zero + one2zero> and the three <human TF list>).

      TF DBS (i.e., TFBS) prediction has also been challenging because they are very short (mostly about 10 bp) and TF-DNA binding involves many cofactors (Bianchi et al. Zincore, an atypical coregulator, binds zinc finger transcription factors to control gene expression. Science 2025). We used two TF DBS prediction programs to predict HS TF DBSs, including the well-established FIMO program (whose results have been incorporated into the JASPAR database) (Rauluseviciute et al. JASPAR 2024: 20th anniversary of the open-access database of transcription factor binding profiles Open Access. NAR 2023) and the recently reported CellOracle program (Kamimoto et al. Dissecting cell identity via network inference and in silico gene perturbation. Nature 2023). Then, we performed downstream analyses and obtained two major results. One is that on average (per base), fewer selection signals are detected in HS TF DBSs (anyway, caution is needed because TF DBSs are very short); the other is that HS TFs and HS lncRNAs contribute to human evolution in quite different ways (Supplementary Figs. 25 and 26).

      (3) On genes with more transcripts may appear as spurious targets of HS lncRNAs.

      Now, the results of HS TF DBSs allow us to address the question of whether genes with more transcripts may appear as spurious targets of HS lncRNAs. We note that (a) we predicted HS lncRNA DBSs and HS TF DBSs in the same promoter regions before the same 179128 Ensembl-annotated transcripts (release 79), (b) we used the same GTEx transcript expression matrices in the analyses of HS TF DBSs and HS lncRNA DBSs (the GTEx database includes gene expression matrices and transcript expression matrices, the latter includes multiple transcripts of a gene). Thus, the analyses of HS TF DBSs provide an effective control for examining the question of whether genes with more transcripts may appear as spurious targets of HS lncRNAs, and consequently, cause the high percentages of HS lncRNA-target transcript pairs that show correlated expression in the brain (Figure 3). We find that the percentages of HS TF-target transcript pairs that show correlated expression are also high in the brain, but the whole profile in GTEx tissues is significantly different from that of HS lncRNA DBSs (Figure 3A; Supplementary Figure 25). On the other hand, on the distribution of significantly changed DBSs in GTEx tissues, the difference between HS lncRNA DBSs and HS TF DBSs is more apparent (Figure 3B; Supplementary Figure 26). Together, these suggest that the brain-enriched distribution of co-expressed HS lncRNA-target transcript pairs must arise from HS lncRNA-mediated transcriptional regulation rather than from the transcript number difference.

      (4) Additional notes on HS TFs and HS TF DBSs.

      First, the “many2zero” and “one2zero” gene lists in the “hg38-panTro6” dataset of Kirilenko et al. provide the most update, but not most complete, data on human-specific genes because “hg38-panTro6” is a pairwise comparison. On the other hand, the Ensembl database also annotates orthologous genes, but lacks such pairwise comparisons as “hg38-panTro6”. Therefore, not all HS genes based on “hg38-panTro6” agree with orthologous genes in the Ensembl database. Second, if HS genes are identified based on both Ensembl and Kirilenko et al., HS TFs will be fewer.

      (5) On speculative or hyperbolic claims.

      First, the title “Human-specific lncRNAs contributed critically to human evolution by distinctly regulating gene expression” is now further supported by HS TF DBSs analyses. Second, we have carefully revised the entire manuscript, trying to make it more readable, accurate, logically reasonable, and biologically acceptable. Third, specifically, in the revision, we avoid speculative or hyperbolic claims in results, interpretations, and discussions as possible as we can. This includes the tone-down of statements and claims, for example, using “reshape” to replace “rewire” and using “suggest” to replace “indicate”. Since the revisions are pervasive, we do not mark all of them, except those that are directly relevant to the reviewer’s comments.

      (1) Line 155: "About 5% of genes have significant sequence differences in humans and chimpanzees," This statement needs a citation, and a definition of what is meant by 'significant', especially as multiple lines below instead mention how it's not clear how many differences matter, or which of them, etc.

      Different studies give different estimates, from 1.24% (Ebersberger et al. Genomewide Comparison of DNA Sequences between Humans and Chimpanzees. Am J Hum Genet. 2002) to 5% (Britten RJ. Divergence between samples of chimpanzee and human DNA sequences is 5%, counting indels. PNAS 2002). The 5% for significant gene sequence differences arises when considering a broader range of genetic variations, particularly insertions and deletions of genetic material (indels). To provide more accurate information, we have replaced this simple statement with a more comprehensive one and cited the above two papers.

      (2) line 187: "Notably, 97.81% of the 105141 strong DBSs have counterparts in chimpanzees, suggesting that these DBSs are similar to HARs in evolution and have undergone human-specific evolution." I do not see any support for the inference here. Identifying HARs and acceleration relies on a far more thorough methodology than what's being presented here. Even generously, pairwise comparison between two taxa only cannot polarise the direction of differences; inferring human-specific change requires outgroups beyond chimpanzee.

      Here, we actually made an analogy but not an inference; therefore, we used such words as “suggesting” and “similar” instead of using more confirmatory words. We have revised the latter half sentence, saying “raising the possibility that these sequences have evolved considerably during human evolution”.

      (3) line 210: "Based on a recent study that identified 5,984 genes differentially expressed between human-only and chimpanzee-only iPSC lines (Song et al., 2021), we estimated that the top 20% (4248) genes in chimpanzees may well characterize the human-chimpanzee differences". I do not agree with the rationale for this claim, and do not agree that it supports the cutoff of 0.034 used below. I also find that my previous concerns with the very disparate numbers of results across the three archaics have not been suitably addressed.

      (1) Indeed, “we estimated that the top 20% (4248) genes in chimpanzees may well characterize the human-chimpanzee differences” is an improper claim; we made this mistake due to the flawed use of English.

      (2) What we need is a gene number, which (a) indicates genes that effectively differentiate humans from chimpanzees, (b) can be used to set a DBS sequence distance cutoff. Since this study is the first to systematically examine DBSs in humans and chimpanzees, we must estimate this gene number based on studies that identify differentially expressed genes in humans and chimpanzees. We choose Song et al. 2021 (Song et al. Genetic studies of human–chimpanzee divergence using stem cell fusions. PNAS 2021), which identified 5984 differentially expressed genes, including 4377 genes whose differential expression is due to trans-acting differences between humans and chimpanzeees. To the best of our knowledge, this is the only published data on trans-acting differences between humans and chimpanzeees, and most HS lncRNAs and their DBSs/targets have trans-acting relationships (see Supplementary Table 2). Based on these numbers, we chose a DBS sequence distance cutoff of 0.034, which corresponds to 4248 genes (the top 20%), slightly fewer than 4377.

      (3) If we chose DBS sequence distance cutoff=0.033 or 0.035, slightly more or fewer genes would be determined, raising the question of whether they would significantly influence the downstream gene set enrichment analysis (Figure 2). We found that 91 genes have a DBS sequence distance of 0.034. Thus, if cutoff=0.035, 4248-91=4157 genes were determined, and the influence on gene set enrichment analysis was very limited.

      (4) On the disparate numbers of results across the three archaics. Figure 1A is based on Figure 2 in Prufer et al. 2017. At first glance, our Figure 1A indicates that Altai Neanderthal is older than Denisovan (upon kya), making our result “identified 1256, 2514, and 134 genes in Altai Neanderthals, Denisovans, and Vindija Neanderthals” unreasonable. However, Prufer et al. (2017) reported that “It has been suggested that Denisovans received gene flow from a hominin lineage that diverged prior to the common ancestor of modern humans, Neandertals, and Denisovans……In agreement with these studies, we find that the Denisovan genome carries fewer derived alleles that are fixed in Africans, and thus tend to be older, than the Altai Neandertal genome”. This note by Prufer et al. provides an explanation for our result, which is that more genes with large DBS sequence distances were identified in Denisovans than in Altai Neanderthals. Of course, the 1256, 2514, and 134 depend on the cutoff of 0.034. If cutoff=0.035, these numbers change slightly, but their relationships remain (i.e., more genes in Denisovans). We examined multiple cutoff values and found that more genes in Denisovans have large DBS sequence distances than in Altai Neanderthals.

      (4) I also think that there is still too much of a tendency to assume that adaptive evolutionary change is the only driving force behind the observed results in the results. As I've stated before, I do not doubt that lncRNAs contribute in some way to evolutionary divergence between these species, as do other gene regulatory mechanisms; the manuscript leans down on it being the sole, or primary force, however, and that requires much stronger supporting evidence. Examples include, but are not limited to:

      (1) Indeed, the observed results are also caused by other genomic elements and mechanisms (but it is hardly feasible to identify and differentiate them in a single study), and we do not assume that adaptive evolutionary change is the only driving force. Careful revisions have been made to avoid leaving readers the impression that we have this tendency or hold the simple assumption.

      (2) Comparing HS lncRNAs to HS TFs is critical, and we have done this.

      (5) line 230: "These results reveal when and how HS lncRNA-mediated epigenetic regulation influences human evolution." This statement is too speculative.

      We have toned down the statement, just saying “These results provide valuable insights into when and how HS lncRNA-mediated epigenetic regulation impacts human evolution”.

      Line 268: "yet the overall results agree well with features of human evolution." What does this mean? This section is too short and unclear.

      (1) First, the sentence “Selection signals in YRI may be underestimated due to fewer samples and smaller sample sizes (than CEU and CHB), yet the overall results agree well with features of human evolution” has been deleted, because CEU, CHB, and YRI samples are comparable (100, 99, and 97, respectively).

      (2) Now the sentence has been changed to “These results agree well with findings reported in previous studies, including that fewer selection signals are detected in YRI (Sabeti et al., 2007; Voight et al., 2006)”.

      (3) On “This section is too short and unclear” - To make the manuscript more readable, we adopt short sections instead of long ones. This section expresses that (a) our finding that more selection signals were detected in CEU and CHB than in YRI agrees with well-established findings (Voight et al. A Map of Recent Positive Selection in the Human Genome. PLoS Biology 2006; Sabeti et al. Genome-wide detection and characterization of positive selection in human populations. Nature 2007), (b) in considerable DBSs, selection signals were detected by multiple tests.

      Line 325: "and form 198876 HS lncRNA-DBS pairs with target transcripts in all tissues." This has not been shown in this paper - sequence based analyses simply identify the “potential” to form pairs.

      This section describes transcriptomic analysis using the GTEx data. Indeed, target transcripts of HS lncRNAs are results of sequence-based analysis, and a predicted target is not necessarily regulated by the HS lncRNA in a tissue. Here, “pair” means a pair of HS lncRNA-target transcript whose expression shows significant Pearson correlation in a GTEx tissue (by the way, we do not mean correlation equals regulation; actually, we identified HS lncRNA-mediated transcriptional regulation upon both DBS-targeting relationship and correlation relationship).

      Line 423: "Our analyses of these lncRNAs, DBSs, and target genes, including their evolution and interaction, indicate that HS lncRNAs have greatly promoted human evolution by distinctly rewiring gene expression." I do not agree that this conclusion is supported by the findings presented - this would require significant additional evidence in the form of orthogonal datasets.

      (1) As mentioned above, we have used “reshape” to replace “rewire” and used “suggest” to replace “indicate”. In addition, we have substantially revised the Discussion, in which this sentence is replaced by “our results suggest that HS lncRNAs have greatly reshaped (or even rewired) gene expression in humans”.

      (2) Multiple citations have been added, including Voight et al. 2006 (Voight et al. A Map of Recent Positive Selection in the Human Genome. PLoS Biology 2006) and Sabeti et al. 2007 (Sabeti et al. Genome-wide detection and characterization of positive selection in human populations. Nature 2007).

      (3) We have analyzed HS TF DBSs, and the obtained results also support the critical contribution of HS lncRNAs.

      I also return briefly to some of my comments before, in particular on the confounding effects of gene length and transcript/isoform number. In their rebuttal the authors argued that there was no need to control for this, but this does in fact matter. A gene with 10 transcripts that differ in the 5' end has 10 times as many chances of having a DBS than a gene with only 1 transcript, or a gene with 10 transcripts but a single annotated TSS. When the analyses are then performed at the gene level, without taking into account the number of transcripts, this could introduce a bias towards genes with more annotated isoforms. Similarly, line 246 focuses on genes with "SNP numbers in CEU, CHB, YRI are 5 times larger than the average." Is this controlled for length of the DBS? All else being equal a longer DBS will have more SNPs than a shorter one. It is therefore not surprising that the same genes that were highlighted above as having 'strong' DBS, where strength is impacted by length, show up here too.

      (1) In gene set enrichment analysis (Figure 2, which is a gene-level analysis), when determining genes differentiating humans from chimpanzees based on DBS sequence distance, if a gene has multiple transcripts/DBSs, we choose the DBS with the largest distance. That is, the input to g:Profiler is a non-redundant gene list.

      (2) In GTEx data analysis (Figure 3, which is a transcriptome-level analysis), the analyses of HS TF DBSs using the GTEx data provide evidence suggesting that different DBS/transcript numbers of genes are unlikely to cause confounding effects. As explained above, we predicted HS TF DBSs in the same promoter regions of 179128 Ensembl-annotated transcripts (release 79), but Supplementary Figures 25 and 26 are distinctly different from Figure 3AB.

      (3) In evolutionary analysis, a gene with 10 DBSs has a higher chance of having selection signals than a gene with 1 DBS. This is biologically plausible, because many conserved genes have novel transcripts whose expression is species-, tissue-, or developmental period-specific, and DBSs before these novel transcripts may differ from DBSs before conserved transcripts.

      (4) “line 246 focuses on genes with "SNP numbers in CEU, CHB, YRI are 5 times larger than the average." Is this controlled for the length of the DBS?” - This is a defect. We have now computed SNP numbers per base and used the new table to replace the old Supplementary Table 8. After examining the new table, we find that the major results of SNP analysis remain.

      (5) On “Is this controlled for length of the DBS? All else being equal a longer DBS will have more SNPs than a shorter one” - We do not think there are reasons to control for the length of DBSs; also, what “All else being equal” means matters. First, DBS sequences have specific features; thus, the feature of a long DBS is stronger than the feature of a short one, making a long DBS less likely to be generated by chance in the genome and less likely to be predicted wrongly than a short one. This means that longer DBSs are less likely to be false ones (note our explanation that the chance of a DBS of 147 bp, the mean length of DBSs, to be wrongly predicted is extremely low, p<8.2e-19 to 1.5e-48). Second, the difference in length suggests a difference in binding affinity, which in turn influences the regulation of the specific transcripts and influences the analysis of GTEx data. Third, it cannot be excluded that some SNPs may be selection signals (detecting selection signal is challenging, and many selection signals cannot be detected by statistical tests, see Grossman et al. A composite of multiple signals distinguishes causal variants in regions of positive selection. Science 2010).

      (6) On “It is therefore not surprising that the same genes that were highlighted above as having 'strong' DBS, where strength is impacted by length” - Indeed, strength is influenced by length, see the above response.

      Recommendations for the authors:

      Reviewer #2 (Recommendations for the authors):

      Finally, figure 1 panels D and F are not legible - the font is tiny! There's also a typo in panel A, where "Homo Sapien" should be "Homo sapiens".

      (1) “Homo sapien” is changed to “Homo sapiens”.

      (2) Even if we double the font size, they are still too small. Inserting a very large panel D into Figure 1 will make Figure 1 ugly, and converting Figure 1D into an independent figure is unnecessary. Actually, panels 1D and F are illustrative figures; the full Fig.1D is Supplementary Figure 6, and the full Fig.1F is Figure 3. We have revised Fig.1’s legend to explain these.

    1. Author response:

      We would like to express our gratitude to all three reviewers for their time and valuable feedback on the manuscript. Below, we provide our point-by-point responses to their comments. Additionally, we summarize here the experiments we plan to conduct in accordance with the reviewers' suggestions:

      Revision plan 1. To further explore the mechanisms of Notch signaling in the decision of regional EE pattern.

      Our observation of EE subtype changes in Notch mutant clones revealed that Notch is required for the specification of Type II EEs, but whether it promotes the generation of Type III EEs is not quite clear. In this revision, we will complete the quantification of Type I and Type III EEs in Notch mutant clones to demonstrate whether Notch signaling participate the determination of these two EE subtypes. Further, we will attempt to combine Notch mutant with different manipulation of WNT and BMP gradients to investigate their interplays.

      Revision plan 2. To supplement the global pattern of WNT and BMP gradient along the whole gut.

      The levels of WNT and BMP gradients are variable in different gut regions both under normal condition and genetic manipulation, leading to different outcomes of EE subtype composition. To further support our model, we will supply the changes of WNT and BMP gradients along the whole gut after genetic manipulation, and perform semi-quantification of their levels to correlate with EE subtype compositions. Additionally, we will also test the gradient levels at different time point during pupal stage to interpret the establishment of regional identity during the development.

      Revision plan 3. To investigate the involvement of apical-basal polarity in the determination of regional EE diversity.

      Although we have demonstrated WNT and BMP gradients orchestrate the regional EE identity, but some observations cannot be fully explained by their roles, such as asymmetric expression of CCHa2 in EE pairs from R4b. A potential mechanism is apical-basal polarity, which has been reported to determine cell fate of ISC progenies at pupal stage. We will specifically knockdown or overexpress key genes related to apical-basal polarity in ISCs or EEs to test whether they are involved preliminarily.

      Please find our detailed point-by-point responses below.

      Public Reviews:

      Reviewer #1 (Public review):

      This valuable study explores the regulatory mechanisms underlying the regional distribution of enteroendocrine cell subtypes in the Drosophila midgut. The regional distribution of EE cell subtypes is carefully documented, and the data convincingly show that each EE cell subtype has a unique spatial pattern. The study aims at determining how the spatial distribution of EE cell subtypes is established and maintained, and explores the roles of three pathways: Notch, WNT, and BMP. The data show evidence that Notch signaling regulates the subtype specificity, being necessary for the specification of Type II, but not Type I and III EE cell subtype specification. The immunofluorescence data in Figure 3 are convincing, but the analysis is incomplete due to a lack of quantification. How Notch signaling activity relates to the emergence of the regional EE cell patterns remains unclear.

      Indeed, the role of Notch signaling in regional EE determination was not fully characterized in this work. As the requirement of Notch activation for the differentiation of enterocytes, introduction of Notch or Delta mutant led to rapid accumulation of ISCs and EEs, making it being a challenge to dive into the details of how EE subtypes were generated. We will try to complete the quantification of Type I and Type III EEs in the Notch mutant clones from different gut regions to figure out whether Notch could influence the specification of these two EE subtypes. Additionally, different from WNT and BMP gradients, Notch signaling can only function locally and is not significantly changed along the whole gut, including Type II EE-enriched R1a and Type I EE-enriched R4b, which implies that function of Notch signaling may can be overridden by the impact of specific combination of WNT and BMP gradients. To test this hypothesis, we will attempt to combine Notch mutant with the activation or inhibition of WNT and BMP signaling since pupal stage, and further examine whether the regional EE identity could be altered, especially in R1a and R4b regions.

      As WNT and BMP are known as morphogens, the study explores their expression patterns and their roles in establishing and maintaining the subtype identities. The observed patterns of WNT and BMP are consistent with earlier studies. Manipulation of WNT and BMP pathway activities in intestinal stem cells is shown to have some region-specific effects on specific EE cell subtypes. The overall conclusion that both WNT and BMP have local effects on EE cell subtypes is based on solid evidence. However, the study falls short in achieving its main objective, i.e., to explain the regional subtype patterns by the action of WNT and BMP gradients. Despite displaying a large volume of phenotypic data in Figures 4-7, the study remains incomplete in providing sufficient evidence to support the models shown in Figures 7 M and N. The main challenge is that the reader is provided with a large volume of individual data fragments of selected regions (e.g., Figures 4 and 5) or images of whole midgut without proper quantification (Figure 7). There is not sufficient effort made to display the data in a way that allows observing changes in the global patterns of EE cell subtypes throughout the midgut and compare these patterns with the observed WNT and BMP gradients.

      As the variation of WNT and BMP gradients along the whole gut, manipulating these two pathways is not able to align their activation levels in different gut regions. This forced us to analyze the change of each region separately, making it to be a challenge to provide a comprehensive global overview. We will supplement the comprehensive profile of WNT and BMP activity under the manipulation of these two signaling pathways to correlated with the change of EE identity, and also try to perform a semi-quantitative interpretation to further support the model in Figure 7M and 7N.

      Reviewer #2 (Public review):

      Summary:

      By labeling the three major enteroendocrine cell markers - AstC, Tk, and CCHa2-the authors systematically investigated the distribution of distinct EE subtypes along the Drosophila midgut, as well as their emergence via symmetric and asymmetric divisions of enteroendocrine progenitor cells. Moreover, they dissected the molecular mechanisms underlying regional patterning by modulating Wnt and BMP signaling pathways, revealing that these compartment boundary signals play key roles in regulating EE subtype diversity.

      Strengths:

      This work establishes a solid methodological and conceptual foundation for future studies on how stem cells acquire positional identity and modulate region-specific behaviors.

      Weaknesses:

      Given that the transcriptional profiles of intestinal stem cells across different regions are highly similar, it is reasonable to hypothesize that the behavior of ISCs and enteroendocrine precursor cells may be regulated non-autonomously, potentially through interactions with enterocytes, which exhibit more distinct region-specific characteristics.

      This is a quite complicated point to discuss. Drosophila adult midgut is established by pISCs (pupal ISCs), which arise from AMPs (adult midgut progenitors) in larval midgut. AMPs are encased by PCs (peripheral cells) to be islands, scattered throughout the entire larval midgut by mid L3 stage (Mathur D. et al. Science. 2010). After pupariation, larval midgut is delaminated to become the yellow body and finally meconium in the pupal midgut. Simultaneously, PCs break down and die, allowing AMPs to give rise to the presumptive adult epithelium (generating enterocyte precursors) and the specification of ISCs in the adult midgut (Jiang H, Edgar BA. Development. 2009; Micchelli CA. et al. Gene Expr Patterns. 2011). During the pupal stage, pISCs only proliferate to generate new ISCs and EE lineages, while adult enterocytes start to appear after eclosion (Takashima S. et al. Dev Biol. 2011). This rules out the possibility that the interaction with enterocytes regulates regional ISC identity during pupal stage.

      However, whether AMPs already acquire the regional identity during larval stage, and whether pISCs interact with enterocyte precursors at pupal stage, are not quite clear. Our study revealed that pISCs can be influenced by WNT and BMP gradients to acquire regional identity, and further establish regional EE diversity. The change of WNT and BMP gradients during the metamorphosis will be supplemented in revision. While WNT and BMP signaling ligands are provided by muscles and adult enterocytes, and even other surrounding tissues, to regulate regional ISC identity, which indicates that non-autonomous mechanisms indeed exist.

      Reviewer #3 (Public review):

      Summary:

      The authors aimed to elucidate the mechanisms underlying the regional patterning of enteroendocrine cell (EE) subtypes along the Drosophila midgut. Through detailed immunohistochemical mapping and genetic perturbation of Notch, WNT, and BMP signaling pathways, they sought to determine how extrinsic morphogen gradients and intrinsic stem cell identity contribute to EE diversity.

      Strengths:

      A major strength of this work is the meticulous regional analysis of EE pairs and the use of multiple genetic tools to manipulate signaling pathways in a spatiotemporally controlled manner. The data robustly demonstrate that WNT and BMP signaling gradients play key roles in specifying EE subtypes and division modes across different gut regions.

      Weaknesses:

      However, the study does not fully explore the mechanistic basis for the region-specific dependence on Notch signaling. Additionally, while the authors propose that symmetric divisions occur in R1a and R4b, the observed heterogeneity in CCHa2 expression within AstC+ pairs in R4b suggests that asymmetric mechanisms may still be at play, possibly involving apical-basal polarity as previously reported.

      As previously mentioned, we acknowledge that the role of Notch signaling in regional EE determination remains further exploration. We will supplement the quantification of Type I and Type III EEs in Figure 3 and Figure S4, and further combine Notch mutant with activation or inhibition of WNT and BMP signaling to test whether they have any interplays, especially in R1a and R4b.

      Apical-basal polarity has been reported to determine the precise segregation of Pros to control ISC number and cell fate at the pupal stage (Wu S. et al. Cell Rep. 2023). During this time, generation of regional EEs are completed and may also be affected except for the influence of Notch, WNT and BMP pathways. Therefore, the apical-basal polarity is quite a potential mechanism to induce asymmetric cell division in R4b, which we will perform experiments to test.

      Appraisal of achievements:

      The authors successfully achieve their aims by providing a compelling model in which intercalated WNT and BMP gradients regulate EE subtype specification and EEP division modes. The genetic data strongly support the conclusion that these pathways are central to establishing regional EE diversity during pupal development.

  3. www.planalto.gov.br www.planalto.gov.br
    1. ou
      • Informativo nº 815
      • 11 de junho de 2024.
      • TERCEIRA TURMA
      • Processo: REsp 2.136.190-RS, Rel. Ministra Nancy Andrighi, Terceira Turma, por unanimidade, julgado em 4/6/2024, DJe 6/6/2024.

      Ramo do Direito DIREITO PROCESSUAL CIVIL

      TemaPaz, Justiça e Instituições Eficazes <br /> Ação de produção antecipada de prova. Local da realização da perícia diverso do local de sede da empresa ré e de eleição. Foro do objeto a ser periciado. Questão de praticidade da instrução. Inexistência de prejuízo.

      Destaque - A produção antecipada de prova pericial pode ser processada no foro onde situado o objeto a ser periciado ao invés do foro de sede da empresa ré, que coincide com o foro eleito em contrato.

      Informações do Inteiro Teor - Ressalta-se de início que a norma de competência (i) do juízo do foro onde a prova deva ser produzida ou (ii) do juízo do foro de domicílio do réu, para fins de apreciar ação de produção antecipada de provas (art. 381, § 2º, do CPC/2015), não possui norma equivalente no CPC/1973.

      • O CPC/1973 tinha como regra geral para fixar a competência do juízo cautelar como sendo a mesma do juízo da ação principal (art. 800 do referido código). Esta Corte, contudo, já permitia a relativização da competência do juízo da ação principal em relação aos procedimentos cautelares, especialmente em se tratando de produção cautelar de provas na forma antecipada.

      • Nesse sentido, o STJ entendia que "poderá haver a mitigação da competência prevista no art. 800 do CPC/1973 quando se tratar de ação cautelar de produção antecipada de provas, podendo ser reconhecida a competência do foro em que se encontra o objeto da lide, por questões práticas e processuais, notadamente para viabilizar a realização de diligências e perícias" (AgInt no AREsp n. 1.321.717/SP, Terceira Turma, DJe de 19/10/2018).

      • A relativização da competência estava igualmente fundamentada na facilitação de inspeção judicial "possibilitando maior celeridade à prestação jurisdicional" em hipótese de ação cautelar de produção antecipada de provas (AgRg no Ag n. 1.137.193/GO, Quarta Turma, DJe de 16/11/2009).

      • Nesse sentido, a facilitação da realização da perícia <u>prevalece</u> sobre a regra geral do ajuizamento no foro do réu por envolver uma questão de ordem prática tendo em vista a necessidade de exame no local onde está situado o objeto a ser periciado.

      • Diferentemente do código anterior, o CPC/2015 expressamente dispõe que o foro de exame prévio de prova não torna ele prevento para a futura eventual ação principal (art. 381, § 3º, do CPC/2015).

      • Dessa forma, inexiste prejuízo presumido neste procedimento prévio, pois - a depender do resultado da perícia - a ação principal sequer poderá ser ajuizada, ou, caso seja ajuizada, o foro de eleição - que coincide com o foro do local de sede da empresa ré - poderá prevalecer.

    2. inclusive
      • Informativo nº 832
      • 5 de novembro de 2024.
      • QUARTA TURMA
      • Processo: REsp 2.152.938-DF, Rel. Ministro Antonio Carlos Ferreira, Quarta Turma, por unanimidade, julgado em 22/10/2024.

      Ramo do Direito DIREITO PROCESSUAL CIVIL

      TemaPaz, Justiça e Instituições Eficazes <br /> Localização do réu. Tentativas infrutíferas. Cadastro de órgãos públicos. Concessionárias de serviços públicos. Ofício. Expedição antes da citação por edital. Obrigatoriedade. Ausência. Avaliação do magistrado. Possibilidade.

      Destaque - A expedição de ofícios a cadastros públicos e concessionárias de serviços públicos para localizar o réu antes da citação por edital não é obrigatória, mas uma <u>possibilidade</u> a ser avaliada pelo magistrado.

      Informações do Inteiro Teor - O tema em discussão consiste em definir se há obrigatoriedade de expedição de ofício a cadastros de órgãos públicos e concessionárias de serviços públicos para localizar o réu antes da citação por edital.

      • Segundo a jurisprudência do STJ, a citação por edital pressupõe o esgotamento dos meios necessários para localização do réu, sob pena de nulidade. Isso porque a citação por edital é uma forma de citação presumida, utilizada em caráter extremamente excepcional. Sua aplicação é restrita às seguintes situações enumeradas no art. 256 do Código de Processo Civil: (i) quando o réu for desconhecido ou sua identidade incerta; (ii) quando seu paradeiro for ignorado, incerto ou inacessível; ou (iii) nas demais hipóteses previstas em lei.

      • No mais, o § 3º do art. 256 do mesmo dispositivo dispõe que o réu será considerado em local ignorado ou incerto se resultarem infrutíferas as tentativas de sua localização, "inclusive mediante requisição pelo juízo de informações sobre seu endereço nos cadastros de órgãos públicos ou de concessionárias de serviços públicos." Note-se que o legislador empregou o termo "inclusive", o que indica que essa providência é uma possibilidade, mas não necessariamente uma imposição.

      • O princípio da celeridade processual, previsto no art. 4º do CPC/2015, determina que o processo deve se desenvolver de maneira eficiente e ágil, evitando formalismos excessivos. Se as tentativas de localização do réu forem suficientes e conduzidas de maneira razoável, a ausência de requisição às concessionárias ou órgãos públicos não implica invalidade do procedimento.

      • A expedição de ofícios a órgãos públicos e concessionárias, embora recomendável na maioria das situações, <u>não é uma exigência automática</u>. O Julgador tem discricionariedade para avaliar, caso a caso, se a requisição de tais informações é necessária, conforme o contexto fático e as tentativas já realizadas. A obrigatoriedade absoluta dessas medidas oneraria o processo com formalidades que, em muitos casos, não trariam resultados práticos.

      • Portanto, a norma processual não obriga à expedição de ofícios a cadastros públicos e concessionárias de serviços públicos antes da citação por edital, mas prevê essa possibilidade como uma ferramenta importante, a ser utilizada conforme o juízo de valor do Magistrado, sempre levando em consideração a razoabilidade e a celeridade do processo.

    3. X
      • Informativo nº 645
      • 26 de abril de 2019.
      • TERCEIRA TURMA
      • Compartilhe:
      • Processo: REsp 1.799.166-GO, Rel. Min. Nancy Andrighi, por unanimidade, julgado em 02/04/2019, DJe 04/04/2019

      Ramo do Direito DIREITO PROCESSUAL CIVIL

      Tema <br /> Tempestividade. Suspensão do prazo recursal. Nascimento do filho do único patrono da causa. Comunicação imediata ao juízo. Desnecessidade.

      Destaque - A suspensão do processo em razão da paternidade do único patrono da causa se opera tão logo ocorra o fato gerador (nascimento ou adoção), <u>independentemente da comunicação imediata ao juízo</u>.

    4. V
      • Informativo nº 771
      • 25 de abril de 2023.
      • SEGUNDA SEÇÃO
      • Compartilhe:
      • Processo: AR 6.463-SP, Rel. Ministra Maria Isabel Gallotti, Segunda Seção, por unanimidade, julgado em 12/4/2023.

      Ramo do Direito DIREITO PROCESSUAL CIVIL

      TemaPaz, Justiça e Instituições Eficazes <br /> Ação rescisória. Decisão rescindenda publicada em nome de advogado que nunca representou o autor nos autos da ação originária. Nulidade. Determinação de nova publicação da decisão rescindenda com reabertura do prazo do recurso.

      Destaque - A ausência de intimação da decisão que implicou o provimento parcial do recurso interposto pela parte contrária é <u>sempre prejudicial</u> ao recorrido, sendo <u>cabível</u> o manejo de ação rescisória.

      Informações do Inteiro Teor - Cinge-se a controvérsia a analisar a rescisão da decisão impugnada por ausência de intimação válida do advogado na ação originária.

      • Em caso versando sobre "a possibilidade do manejo da ação rescisória, no caso de reconhecimento de nulidade absoluta, pela falta de intimação do procurador do recorrente acerca dos atos processuais praticados", esta Corte concluiu que "a exclusividade da querela nullitatis para a declaração de nulidade de decisão proferida sem regular citação das partes, representa solução extremamente marcada pelo formalismo processual. [...] A desconstituição do acórdão rescindendo pode ocorrer tanto nos autos de ação rescisória ajuizada com fundamento no art. 485, V, do CPC/1973 quanto nos autos de ação anulatória, declaratória ou de qualquer outro remédio processual" (STJ, REsp 1.456.632/MG, Rel. Ministra Nancy Andrighi, Terceira Turma, julgado em 7/2/2017, DJe 14/2/2017).

      • Assim sendo, é admissível a presente ação rescisória para declarar a nulidade da intimação do autor após o julgamento unipessoal do recurso especial interposto pelo réu.

      • Na hipótese, após o julgamento unipessoal do AREsp 1.370.930/SP em 29/11/2018, a Secretaria desta Corte, em virtude de equívoco na autuação, efetuou a publicação, em 7/12/2018, em nome de advogado que não tinha e nunca teve representação nos autos e não em nome do único advogado constituído pelo autor na ação originária.

      • O § 2º do art. 272 do CPC 2015 dispõe que: "Sob pena de nulidade, é indispensável que da publicação constem os nomes das partes e de seus advogados, com o respectivo número de inscrição na Ordem dos Advogados do Brasil, ou, se assim requerido, da sociedade de advogados". Assim, a publicação da decisão unipessoal desta Corte em nome de advogado que nunca representou o autor nos autos da ação originária implicou violação manifesta ao disposto no § 2º do art. 272 do CPC 2015.

      • Como decidido por esta Corte, em mais de uma oportunidade, a ausência de intimação da parte em virtude de equívoco na autuação autoriza a rescisão do julgado. "A ausência de intimação do recorrido, por erro na autuação do recurso especial, para a apresentação de contrarrazões e demais atos da parte constitui violação literal ao disposto no § 1º do art. 236 do Código de Processo Civil de 1973, possibilitando-se a rescisão do julgado com fundamento no art. 485, V, do mesmo estatuto".

      • Em suma, a ausência de intimação da decisão que implicou o provimento parcial do recurso interposto pela parte contrária <u>é sempre prejudicial</u> ao recorrido. Nessa direção, esta Corte já observou que "o defeito ou a ausência de intimação - requisito de validade do processo (arts. 236, § 1º, e 247 do CPC/1973) - impedem a constituição da relação processual e constituem temas passíveis de exame em qualquer tempo e grau de jurisdição, independentemente de forma, alegação de prejuízo ou provocação da parte. <u>Trata-se de vícios transrescisórios</u>".

      • Impõe-se concluir pela procedência do primeiro pedido rescisório (CPC 2015, art. 968, inciso I) para reconhecer que a publicação da decisão rescindenda em nome de advogado que nunca representou o autor nos autos da ação originária violou literalmente o disposto no art. 272, § 2º, do CPC 2015.

    5. § 14
      • Informativo 1177
      • AR 2876 QO / DF
      • Órgão julgador: Tribunal Pleno
      • Relator(a): Min. GILMAR MENDES
      • Julgamento: 23/04/2025 (Presencial)
      • Ramo do Direito: Processual Civil
      • Matéria: Ação Rescisória; Questão de Ordem; Decisão Superveniente do STF; Coisa Julgada; Efeitos Temporais

      Ação rescisória: prazo para ajuizamento nos casos de decisão superveniente do STF declarando a inconstitucionalidade de norma

      Tese fixada

      O § 15 do art. 525 e o § 8º do art. 535 do Código de Processo Civil devem ser interpretados conforme à Constituição, com efeitos ex nunc, no seguinte sentido, com a declaração incidental de inconstitucionalidade do § 14 do art. 525 e do § 7º do art. 535:

        1. Em cada caso, o Supremo Tribunal Federal poderá definir os efeitos temporais de seus precedentes vinculantes e sua repercussão sobre a coisa julgada, estabelecendo inclusive a extensão da retroação para fins da ação rescisória ou mesmo o seu não cabimento diante do grave risco de lesão à segurança jurídica ou ao interesse social.
        1. Na ausência de manifestação expressa, os efeitos retroativos de eventual rescisão não excederão cinco anos da data do ajuizamento da ação rescisória, a qual deverá ser proposta no prazo decadencial de dois anos contados do trânsito em julgado da decisão do STF.
        1. O interessado poderá apresentar a arguição de inexigibilidade do título executivo judicial amparado em norma jurídica ou interpretação jurisdicional considerada inconstitucional pelo STF, seja a decisão do STF <u>anterior ou posterior</u> ao trânsito em julgado da decisão exequenda, salvo preclusão (Código de Processo Civil, arts. 525, caput, e 535, caput).”

      Resumo

      Os efeitos temporais das decisões do STF e o prazo para o ajuizamento de ação rescisória podem ser definidos caso a caso pela Corte e, em hipóteses de grave risco de lesão à segurança jurídica ou ao interesse social, é possível estabelecer o não cabimento da ação.

      • Essas prerrogativas objetivam equilibrar a necessidade de corrigir decisões baseadas em fundamentos que o próprio Tribunal declarou inconstitucionais com o princípio da segurança jurídica e a estabilidade das relações jurídicas já consolidadas pela coisa julgada.

      • Ademais, quando esta Corte não definir, de forma expressa, a partir de quando seus precedentes vinculantes devem valer no tempo, a eficácia retroativa para fins de propositura de ação rescisória fica limitada ao período de até cinco anos anteriores à data de seu ajuizamento, observando-se, em todo caso, o prazo decadencial de dois anos a contar do trânsito em julgado da decisão que fundamenta o pedido rescisório.

      • Por fim, ressalvados os casos de preclusão (1), admite-se a arguição da inexigibilidade de título executivo judicial fundado em interpretação judicial ou em norma declaradas inconstitucionais pelo STF, <u>independentemente da anterioridade ou posterioridade</u> dessa decisão em relação ao trânsito em julgado da sentença exequenda.

      • Com base nesses entendimentos, o Plenário resolveu questão de ordem e fixou a tese anteriormente citada, com ressalvas de alguns ministros ao ponto 2. Vale destacar que, nessa sessão de julgamento, decidiu-se apenas a questão de ordem, de modo que a análise do caso concreto deverá ocorrer já se considerando as diretrizes ora fixadas.

    1. e)
      • Informativo 1157
      • ADPF 1178 MC-Ref / DF
      • Órgão julgador: Tribunal Pleno
      • Relator(a): Min. FLÁVIO DINO
      • Julgamento: 05/11/2024 (Virtual)
      • Ramo do Direito: Processual Civil
      • Matéria: Jurisdição e competência; litígios perante jurisdições estrangeiras; ajuizamento por ente subnacional; contratos de risco; pagamento de honorários de êxito

      Litígios internacionais: pagamento de honorários advocatícios contratuais com base em cláusula “ad exitum”

      Resumo Encontram-se presentes os requisitos para a concessão da medida cautelar, pois: (i) há plausibilidade jurídica no que se refere à vedação, em princípio, de pagamento por entes públicos dos chamados honorários de êxito, notadamente quando associados a elevadas taxas de retorno sobre o valor obtido em favor do Poder Público; e (ii) há perigo da demora na prestação jurisdicional, consubstanciado na proximidade de possível julgamento de demandas ajuizadas por municípios pátrios perante tribunais estrangeiros com pedido de indenização de elevada proporção.

      • Conforme entendimento do Tribunal de Contas da União, as estipulações de êxito em contratos com a Administração Pública constituem atos ilegais, ilegítimos e antieconômicos. Nesse contexto, a celebração de contratos de risco, baseados em honorários de êxito (“taxa de sucesso”), com previsão de pagamento de elevado percentual do valor indenizatório eventualmente alcançado aos escritórios de advocacia contratados, representa grave risco de lesão econômica às vítimas e aos cofres públicos, porque permite que os próprios causídicos se tornem os grandes beneficiários de eventual reparação obtida judicialmente.

      • Na espécie, diversos municípios ajuizaram ações de ressarcimento em virtude de desastres socioambientais, especialmente com relação aos acidentes nos municípios mineiros de Mariana e Brumadinho, de modo que é pertinente a aferição das condições dos contratos eventualmente celebrados, com vistas a proteger o patrimônio público nacional e a efetiva e integral reparação de danos perpetrados em solo brasileiro.

      • Com base nesses e em outros entendimentos, o Plenário, por maioria, referendou a decisão que deferiu em parte medida liminar, para determinar aos municípios relacionados como interessados nos autos que (i) juntem cópias dos contratos porventura celebrados com os escritórios de advocacia para atuarem em outros países; e (ii) se abstenham de efetuar qualquer pagamento de honorários, contratados ad exitum, relativos às ações judiciais perante tribunais estrangeiros, sem que previamente haja o exame da legalidade por parte das instâncias soberanas do País, sobretudo o próprio STF.

    2. IV
      • Informativo 1062
      • RE 964659 / RS
      • Órgão julgador: Tribunal Pleno
      • Relator(a): Min. DIAS TOFFOLI
      • Julgamento: 05/08/2022 (Virtual)
      • Ramo do Direito: Administrativo, Constitucional
      • Matéria: Servidor Público; Remuneração/ Direitos e Garantias Fundamentais; Salário Mínimo

      Servidor público: jornada de trabalho reduzida e remuneração inferior ao salário mínimo

      Tese fixada - É defeso o pagamento de remuneração em valor inferior ao salário mínimo ao servidor público, ainda que labore em <u>jornada reduzida</u> de trabalho.

      Resumo - É inconstitucional remunerar servidor público, mesmo que exerça jornada de trabalho reduzida, em patamar inferior a um salário mínimo.

      • O direito fundamental ao salário mínimo é previsto constitucionalmente para garantir a dignidade da pessoa humana por meio da melhoria de suas condições de vida (CF/1988, art. 7º, IV), garantia que foi estendida aos servidores públicos sem qualquer sinalização no sentido da possibilidade de flexibilizá-la no caso de jornada reduzida ou previsão em legislação infraconstitucional (CF/1988, art. 39, § 3º).

      • A leitura conjunta dos dispositivos constitucionais atinentes ao tema, somado ao postulado da vedação do retrocesso de direitos sociais, denota a finalidade de assegurar o mínimo existencial aos integrantes da Administração Pública Direta e Indireta com a fixação do menor patamar remuneratório admissível, especialmente se consideradas as limitações inerentes ao regime jurídico dos servidores públicos, cujas características se distinguem do relativo às contratações temporárias ou originadas de vínculos decorrentes das recentes reformas trabalhistas.

      • Com base nesse entendimento, o Plenário, por maioria, ao apreciar o Tema 900 da repercussão geral, deu provimento ao recurso extraordinário para devolver os autos ao tribunal de origem para continuidade de julgamento, a fim de que sejam decididas as demais questões postas no apelo, observados os parâmetros ora decididos.

      Legislação: CF/1988: arts. 7º, IV; e 39, § 3º.

    1. Author response:

      The following is the authors’ response to the current reviews.

      Reviewer #2 had several remaining suggestions:

      In some instances, the authors face well-known limitations. For example, bath application of drugs. Blockers of Gly and Gaba receptors are likely problematic when studying a network that includes a diverse set of inhibitory interneurons. Likewise, the results derived from application of AMPAR and KAR blockers should impact HC cell fxn, and presumably inner retina interneuron networks. In the Discussion the authors are encouraged to address more of these concerns (e.g., Discussion line 709).

      Rather than concluding that the bath application of drugs is without complications, they can conclude that under the experimental conditions, glutamate release from these On-bipolars continues to exhibit Transient and Sustained release. This is really the key point of their study.

      This is a good suggestion.  We have added a discussion of the complications of the pharmacology starting on line 754.  

      If indeed sustained release is a reflection of higher release rates, ribbon size is what point to but, there are many other possibilities, such as SV recycling, or recruitment of reserve pools of SVs, fusion machinery, Cav channel behavior. The authors could cite more literature in the Discussion.

      We added a sentence to this effect in the discussion, starting on line 866.


      The following is the authors’ response to the original reviews.

      Reviewer #1 (Public Review): 

      Summary: 

      In the retina, parallel processing of cone photoreceptor output under bright light conditions dissects critical features of our visual environment and is fundamental to visual function. Cone photoreceptor signals are sampled by several types of bipolar cells and passed onto the ganglion cells. At the output of retinal processing, retinal ganglion cells send about 40 different codes of the visual scene to the brain for further processing. In this study, the authors focus on whether subtype-specific differences in the size of synaptic ribbon-associated vesicle pools of bipolar cells contribute to different retinal ganglion cell (RGC) responses. Specifically, inputs to ON alpha RGCs producing transient versus sustained kinetics (ON-S vs. ON-T, respectively) are compared. The authors first demonstrate that ON-S vs. ON-T RGCs are readily identifiable in a whole mount preparation and respond differently to both static and to a spatially uniform, randomly fluctuating (Gaussian noise) light stimulus. Liner-nonlinear (LN) models were used to estimate the transformation between visual input and excitatory synaptic input for each RGCs; these models suggested the presence of transient versus sustained kinetics already in the excitatory inputs to ON-T and ON-S RGCs. Indeed, the authors show that (glutamatergic) excitatory inputs to ON-S vs. ON-T RGCs are of distinct kinetics. The subtypes of bipolar cells providing input to ON-S are known (i.e., type 6 and 7), but the source of excitatory bipolar inputs to ON-T RGCs needed to be determined. In a tedious process, it is elegantly shown here that ON-T RGCs receive most of their excitatory inputs from type 5 and 6 bipolars. Interestingly, the temporal properties of light-evoked responses of type 5, 6, and 7 bipolars recorded from the somas were indistinguishable and rather sustained, suggesting that the origin of transient kinetics of excitatory inputs to ON-T RGCs suggested by the LN model might be found in the processing of visual signals at the bipolar cell axon terminal. Blocking GABA- or glycinergic inhibitory inputs did not alter the light-evoked excitatory input kinetics to ON-T and ON-S RGCs. Twophoton glutamate sensor imaging revealed significantly faster kinetics of light-evoked glutamate signals at ON-T versus ON-S RGCs. Detailed EM analysis of bipolar cell ribbon synapses onto ON-T and ON-S RGCs revealed fewer ribbon-associated vesicles at ON-T synapses, which is consistent with stronger paired-flash depression of lightevoked excitatory currents in ON-T RGCS versus ON-S RGCs. This study suggests that bipolar subtype-specific differences in the size of synaptic ribbon-associated vesicle pools contribute to transient versus sustained kinetics in RGCs. 

      Strengths: 

      The use of multiple, state-of-the-art tools and approaches to address the kinetics of bipolar to ganglion cell synapse in an identified circuit. 

      Weaknesses: 

      For the most part, the data in the paper support the conclusions, and the authors were careful to try to address questions in multiple ways. Two-photon glutamate sensor imaging experiment showing that blocking GABA- and glycinergic inhibition does not change the kinetics of light-evoked glutamate signals at ON-T RGCs would strengthen the conclusion that bipolar subtype-specific differences in the size of synaptic ribbon-associated vesicle pools contribute to transient versus sustained kinetics in RGCs. 

      Thank you for this suggestion. We have revised the text throughout to be careful not to imply that amacrine cells have no role in shaping EPSCs and spike output, but instead that the transience of the On-T responses persists without amacrine cells (see for example lines 91, 450-453, 514-518, 696-714). We have also added additional iGluSnFR experiments to the paper to further test this conclusion (new Figure 7). The new data shows that the transience of glutamate release from the On-T cells is retained when 1) spiking amacrine cell activity is suppressed by blocking voltage-gated Na<sup>+</sup> channels with TTX or 2) all amacrine cell activity is suppressed by blocking AMPA receptors with NBQX. This does provide nice additional evidence that amacrine cells are not necessary for the sustained/transient distinction.

      Reviewer #2 (Public Review): 

      Summary: 

      Goal of the study. The authors tried to pinpoint the origins of transient and sustained responses measured at retinal ganglion cells (rgcs), which is the output layer of the retina. Response characteristics of rgcs are used to group them into different types. The diversity of rgc types represents the ability of the retina to transform visual inputs into distinct output channels. They find that the physical dimensions of bipolar cell's synaptic ribbons (specialized release sites/active zones) vary across the different types of cone on-bpcs, in ways that they argue could facilitate transient or sustained release. This diversity of release output is what they argue underlies the differences in on-rgcs response characteristics, and ultimately represents a mechanism for creating parallel cone-driven channels. 

      Strengths: 

      The major strengths of the study are the anatomical approaches employed and the use of the "glutamate sniffer" to assay synaptic glutamate levels. The outline of the study is elegant and reflects the strengths of the authors. 

      Weaknesses: 

      The major weakness is that the ambitious outline is not matched with a complete set of results, and the set of physiological protocols is disjointed, not sufficient to bridge the systems-level question with the presynaptic release question. 

      Thank you for this comment as it provides an opportunity (here and in the paper) for us to clarify our main goal. We wanted to link the well-established distinction between transient and sustained retinal responses to anatomy. This required locating where this difference arises within the circuitry – which we show to be at least largely the bipolar output synapse – and then examining the structure of this synapse in detail. While we would certainly be interested in connecting our results to a biophysical description of the synapse, that was not the primary focus of our study and was not something we could add without substantial additional work.  

      Major comments on the results and suggestions. 

      The ribbon model of release has been explored for decades and needs to be further adapted to systems-level work. The study under consideration by Kuo et al. takes on this task. Unfortunately, the experimental design does not permit a level of control over presynaptic/bpc behavior that is comparable to earlier studies, nor do they manipulate release in ways that test the ribbon model (i.e., paired recordings or Ribeye-ko). Furthermore, the data needs additional evaluation, and the presentation and interpretations should draw on published biophysical and molecular studies. 

      As described above, our goal was to test several possible explanations for the difference between transient and sustained responses in OnT and OnS ganglion cells: (1) differences in the light responses of the bipolar cells that convey photoreceptor signals to the relevant ganglion cells; (2) shaping of bipolar transmitter release by presynaptic inhibition; (3) shaping of ganglion cell responses by postsynaptic inhibition or spike generation; (4) differences in feedforward bipolar synapses. We were surprised to find that the feedforward bipolar synapses play a central role in this difference, and your comment nicely prompts us to relate this to the large literature on biophysical studies of release from ribbon synapses. We have made substantial revisions in the text to do this. This includes anticipating the importance of feedforward synaptic properties in the abstract and introduction (lines 36-37 and 61-64), pointers in the results (lines 539-548), and several new paragraphs in the discussion (starting on lines 751, 773 and 787). By showing that the transient/sustained differences originates largely at feedforward bipolar synapses, we set the stage for future work that shows how biophysical properties of the synapse shape physiological signals that traverse it.

      To build a ribbon-centric context, consider recent literature that supports the assertion that ribbons play a role in forming AZ release sites and facilitating exocytosis. Reference Ribeye-ko studies. For example, ribbonless bpcs show an 80% reduction in release (Maxeiner et al EMBO J 2016), the ribbonless retina exhibits signaling deficits at the output layer (Okawa et al ...Rieke, ..Wong Nat Comm 2019), and ribbonless rods show an 80% reduction the readily releasable pool (RRP) of SVs (Grabner Moser, elife 2021). In addition, the authors could refer to whole-cell membrane capacitance studies on mammalian rods, cones, and bpcs, because the size of the RRP of SVs scales with the dimensions and numbers of ribbons (total ribbon footprint). For comparison, bipolars see the review by Wan and Heidelberger 2011. For a comparison of mammalian rods and cones, see, rods: Grabner and Moser (2021 eLife), Mueller.. Regus Leidig et al. (2019; J Neurosci) and cones Grabner ...DeVries (Nat Comm 2023). A comparison of cell types shows that the extent of release is (1) proportional to the total size of the ribbon footprint, and (2) less release is witnessed when ribbons are deleted (also see photo ablation studies by Snellman.... And Mehta..Zenisek, Nat Neurosci and Neuron).

      Thank you for these pointers into the literature.  We have included much of this work in the revised Discussion (see three paragraphs starting on line 751). The revised text focuses on the evidence that larger and more numerous ribbons lead to increased release. The direct evidence from previous work for this relationship supports our (indirect) conclusions in the current paper about the role of ribbon size and associated vesicle pools in transient vs sustained responses.  

      Ribbon morphology may change in an activity-dependent manner. The rod ribbon AZ has been reported to lengthen in the dark (Dembla et al 2020), and deletion of the ribbon shortens the length of the AZ (defined by Cav1,4 or RIM2); in addition, the Ribeye-ko AZs fail to change in size with light and dark conditioning. Furthermore, EM studies on rod and cone AZs in light and dark argue that the number of SVs at the base of the ribbon increases in the dark, when PRs are depolarized (see Figure 10, Babai et al 2016 JNeurosci). Lastly, using goldfish Mb1 on-bipolars, Hull et al (2006, J Neurophysio) correlated an increase in release efficiency with an increase in ribbon numbers, which accompanied daylight. >> When release activity is high, ribbon AZ length increases (Dembla, rods), the number of docked SVs increases (Babai, rods cones), and the number of ribbons increases (Hull, diurnal Mb1s). 

      We have extensively revised the discussion section to include more discussion of ribbons, particularly emphasizing evidence supporting the general argument that larger ribbons support higher release rates. We focused on studies that provided direct links between release rates and ribbon size or number of ribbon-associated vesicles.  This includes studies that pair electrophysiology and anatomy and those that measure the consequences of ablating ribbons,

      The results under review, Kuo et al., were attained with SBF-SEM, which has the benefit of addressing large-volume questions as required here, yet it achieves lower spatial resolution than what is attained with TEM tomography and FIB-EM. Ideally, the EM description would include SV size, and the density of ribbon-tethered SVs that are docked at the plasma membrane, because this is where the SVs fuse (additional non-ribbon release sites may also exist? Mehta ... Singer 2014 J Neurosci). Studies by Graydon et al 2011 and 2014 (both in J Neurosci), and Jean ... Moser et al 2018 (eLife) are good examples of quantitative estimates of SVs docking sites at ribbons. SBF-SEM does not allow for an assessment of SVs within 5 nm of the PM, but if the authors can identify the number of SVs that appear within the limit of resolution (10 to 15 nm) from the PM, then this data would be useful. Also, what dimension(s) of the large ribbons make them larger? Typically, ribbons are fixed in height (at least in the outer retina, 200 to 250 nm), but their length varies and the number ribbons per terminal varies. Is the larger ribbon size observed in type 6 bpcs do to longer ribbons, or taller ribbons? A longer ribbon likely has more docked SVs. An additional possibility is that more SVs are about the ribbon-PM footprint, either more densely packed and/or expanding laterally (see definitions in Jean....Moser, elife 2018). 

      We have included an additional analysis of ribbon surface area from our 3D SBFSEM reconstructions. As with the volume measurements included in the original submission, ribbon surface areas are distinct between type 5i and type 6 bipolar cells (Fig. S10A), ON-T RGCs on average receive input from ribbons with smaller surface area than ON-S RGCs (Fig. S10B), and ribbon surface area predicts the number of adjacent vesicles across bipolar cell types (Fig. S10C).  We agree that a higher resolution view of presynaptic structures would be very helpful, but the resolution of our SBF-SEM data is limited (e.g. each pixel is 40 nm on a side).  This resolution does not allow us to distinguish between vesicles at vs near the membrane. 

      In our observations, both length and height of the ribbons showed variability across individual bipolar cells. And ribbons in type 6 bipolar cells tended to be either longer and/or taller compared to those in type 5 cells. We agree that a longer ribbon may accommodate more docked SVs. A more definitive analysis would benefit from higher-resolution, isotropic 3D reconstructions of ribbons, which would allow more precise shape analysis and ,together with a detailed assessment of docked SVs at the ribbons.

      The ribbon literature given above makes the argument that ribbons increase exocytotic output, and morphological studies suggest that release activity enhances 1) ribbon length (Dembla) and 2) the density of SVs near the PM (Babai). These findings could lead one to propose that type 6 bpcs (inputs to On-sustained) are more active than type 5i (feed into On-transient). Here Kuo et al. show that the bpcs have similar Vm (measured from the soma) in response to light stimulation. Does Vm predict release? Not entirely as the authors acknowledge, because: Cav channel properties, SV availability, and negative feedback are all downstream of bpc Vm. The only experiment performed to test downstream factors focused on negative feedback from amacrines. The data presented in Figures 5C-F led me to conclude the opposite of what the authors concluded. My impression is that the T-ON rgc exhibits strong disinhibition when GABA-blockers are applied (the initial phase is greatly increased in amplitude and broadened with the drug), which contrasts with the S-On rgc responses that show a change in the amplitude of the initial phase but not its width (taus would be nice). Here and in many places the authors refer to changes in release kinetics, without implementing a useful description of kinetics. For instance, take the cumulative current (charge) in Figure 5C and fit the control and drug traces to arrive at taus, and their respective amplitudes, and use these values to describe kinetic phases. One final point, the summary in Figure 5D has a p: 0.06, very close to the cutoff for significance, which begs for more than an n = 5. Given that previous studies have shown that bpc output is shaped by immediate msec GABA feedback, in ways that influence kinetic phases of release (..Mb1 bipolars, see Vigh et al 2005 Neuron), more attention to this matter is needed before the authors rule out feedback inhibition in favor of ribbon size. If by chance, type 5i bpcs are under uniquely strong feedback inhibition, then ribbon size may result from less activity, not less output resulting from smaller ribbons.

      The text surrounding Figure 5 led to some confusion, and we have revised that text and the figure for clarity.  First, the data in that figure is entirely from On-T cells (the upper and lower panels show block of GABA and glycine receptors separately).  Second, the observation that we make there is that block of inhibitory receptors increases the transience of the On-T excitatory input, rather than decreasing it as would be expected if the transience is created by presynaptic inhibition. We have added additional data and that increase in transience is now significant. Inhibitory block does substantially increase the amplitude of the postsynaptic response, and a likely origin of this change in response is inhibitory feedback to the bipolar synaptic terminal. We now indicate this in the text on page 13, lines 438-453. 

      The key result of this figure for our purposes here is that the transience of the excitatory input to the OffT cell remains with inhibitory input blocked. We have clarified throughout the text that our results indicate that inhibitory feedback is not necessary for the difference between transient release into On-T and sustained release onto On-S. This does not mean that inhibitory feedback does not shape the responses in other ways or contribute to the transient/sustained difference - just that for the specific stimuli we use that difference is retained without presynaptic inhibition. We have also added citations to past work showing that activity of amacrine cells can modulate bipolar transmitter release. 

      Whether strong feedback inhibition limits activity and therefore limits ribbon size in an activity-dependent way is an intriguing possibility. Indeed, addressing why ribbons are larger in type 6 bipolar cells vs. other bipolar types will be an interesting avenue of further study. However, it would be surprising if ribbon sizes changed during the acute pharmacological block conditions (~10-15 minutes) we employed in our study. Our point here is that there is an interesting correlation between presynaptic ribbon size and the kinetics of glutamate release. We do not think that the two possibilities stated in the last sentence (“…ribbon size may result from less activity, not less output resulting from smaller ribbons”) are mutually exclusive.

      We have not further quantified the response kinetics in the experiments of Figure 5 as the large changes induced by the pharmacology (especially GABA receptor block) make it unclear how to interpret quantitative differences.  In other places we have quantified kinetics through the STA or specified that our focus was more qualitative (i.e. transient vs sustained kinetics). 

      As mentioned above, the behavior of Cav channels is important here. This is difficult to address with voltage clamps from the soma, especially in the Vm range relevant to this study. Given that it has previously been modeled that the rod bpc to AII pathway adapts to prolonged depolarization of rbcs through downregulating Cav channel-mediated Ca<sup>2+</sup> influx (Grimes ....Rieke 2014 Neuron), it seems important for Kou et al to test if there is a difference in Cav regulation between type 6 and 5i bpcs. Ca<sup>2+</sup>  imaging with a GCaMP strategy (Baden....Lagnado Current Biology, 2011) or filling the presynapse with Ca dyes (see inner hair cells: Ozcete and Moser, EMBO J 2020) would allow for the correlation of [Ca]intra with GluSnf signals (both local readouts).

      This is a good suggestion but is outside the scope of our current paper. Our focus was on the circuit origin of the difference in response of the OnT and OnS responses rather than the specific biophysical mechanism.  We are of course interested in the mechanism, but the additional experiments needed to pin that down would need to be a part of future experiments. The work here represents an important step in that direction as it greatly reduces the number of possible locations and mechanisms for the sustained/transient difference and hence serves to focus any future mechanistic investigations.

      Stimulation protocol and presentation of Glutamate Sniffer data in Figure 6. In all of your figures where you state steady st as a % of pk amplitude, please indicate in the figure where you estimate steady state. Alternatively, if you take the cumulative dF/F signal, then you can fit the different kinetic phases. From the appearance of the data, the Sustained Glu signals look like square waves (Figure 6B ROI1-4), without a transient at onset, which is not predicted in your ribbon model that assumes different kinetic phases (1. depletion of docked SVs, and 2. refilling and repriming). The Transient responses (Figure 6B ROI5-8) are transient and more compatible with a depressing ribbon scheme. If you take the cumulative, for all of the On-S and compare it to all of the On-T responses, my guess is the cumulative dF/F will be 10 to 20 larger for the S-On. Would you conclude that bpc inputs to On-S (type 6) release 20fold more SVs per 4 seconds on a per ribbon basis, and does the surface area of the type 6 bpcs account for this difference? From Figures 8B and D, the volume of the ribbon is ~2 fold greater for type 6 vs 5i, but the Surface Area (both faces of ribbon) is more relevant to your model that claims ribbon size is the pivotal factor. If making cumulative traces, and comparisons on an absolute scale is unfounded, then we need to know how to compare different observations. The classic ribbon models always have a conversion factor such as the capacitance of an SV or q size that is used to derive SV numbers from total dCm or Qcontent. See Kim ....et al von Gersdorff, 2023, Cell Reports. Why not use the Gaussian noise stimulus in Fig 6 as in Figure 1 and 2? 

      For iGluSnFR recordings, steady-state responses were measured from the mean fluorescence over the last 1 sec of the light step (2 sec duration) response. We have included this information in the figure caption and in the Methods. 

      There is a good deal of variability in the iGluSnR responses from one ROI to another, and the ROIs shown in the original submission had a less prominent transient component than many other ROIs. We have replaced this figure with another that is more representative of the average behavior across ROIs. The full range of behavior is captured in Figure 6C; it is clear across ROIs that glutamate release near ON-S dendrites shows both sustained and transient components. The new experiments in which we block amacrine cell activity also include a few more example ROIs from ON-S cells, and those also show both transient and sustained components.

      Your suggestion to integrate the iGluSnFR signals to compare to our structural analysis of ribbons is interesting. However, we are hesitant to make a quantitative comparison between the two without further experiments to validate how the iGluSnFR signals we measure relate to release of single vesicles. For example, a quantitative measure of release based on the iGluSnR experiments would require accounting for possible differences in the expression of the indicator - which could differ both in overall level and/or location relative to release sites. 

      This comment and one above highlight the importance of measures of ribbon surface area, which we now provide (Figure S10).

      Figure 7. What is the recovery time for mammalian cones derived from ribbon-based models? There are estimates from membrane capacitance studies. Ground squirrel cones take 0.7 to 1 sec to recover the ultrafast, primed pool of SVs when probed with a paired-pulse protocol (Grabner ...DeVries 2016, Neuron). Their off-bpcs take anywhere from under 0.2 sec to a second to recover, which is a combination of many synaptic factors (Grabner ...DeVries Nat Comm 2023). Rod On bpcs take over a second (Singer Diamond 2006, reviewed Wan and Heidelberger 2011). In Figure 7B, the recovery time is ~150 ms for the responses measured at rgcs. This brief recovery time is incompatible with existing ribbon models of release. Whole-cell membrane capacitance measurements would be helpful here.

      Thanks for drawing our attention to this issue. Indeed, we see a relatively rapid recovery in the paired-flash experiments. We now discuss this recovery time in the context of past measurements of recovery of responses in cones and bipolar cells (paragraph starting on line 773). There are many factors that could contribute to the relatively rapid recovery we observe - including synaptic factors such as those highlighted by Grabner et al., (2016) either at the cone-to-bipolar synapses or the bipolar-to-RGC synapses. We are certainly interested in a more detailed understanding of this issue, but the additional experiments are outside the scope of this paper.  

      Experimental Suggestion: Add GABA blockers and see if type 5i bpc responds with more release (GluSniff) and prolonged [Ca2+] intra (GCaMP). Compare this to type 6 bpc behavior with GABA/gly blockers. This will rule in or out whether feedback inhibition is involved. 

      Figure 7 in the revised manuscript includes two new experiments examining glutamate release (without the simultaneous measurement of bipolar cell intracellular calcium) while blocking (1) all/most amacrine cell-mediated inhibition via inclusion of NBQX in the bath solution, and (2) blocking spiking amacrine cells via inclusion of TTX in the bath solution. The transient vs sustained difference in light-evoked glutamate release around ON-T and ON-S RGC dendrites remained with amacrine activity suppressed. These new results are consistent with the anatomical and pharmacological data that were included in the initial submission of the manuscript (Fig. 5) that indicate presynaptic inhibition does not have a major role in shaping release kinetics at these synapses. 

      Reviewer #3 (Public Review): 

      Summary: 

      Different types of retinal ganglion cell (RGC) have different temporal properties - most prominently a distinction between sustained vs. transient responses to contrast. This has been well established in multiple species, including mice. In general, RGCs with dendrites that stratify close to the ganglion cell layer (GCL) are sustained; whereas those that stratify near the middle of the inner plexiform layer (IPL) are transient. This difference in RGC spiking responses aligns with similar differences in excitatory synaptic currents as well as with differences in glutamate release in the respective layers - shown previously and here, with a glutamate sensor (iGluSnFR) expressed in the RGCs of interest. Differences in glutamate release were not explained by differences in the distinct presynaptic bipolar cells' voltage responses, which were quite similar to one another. Rather, the difference in transient vs. sustained responses seems to emerge at the bipolar cell axon terminals in the form of glutamate release. This difference in the temporal pattern of glutamate release was correlated with differences in the size of synaptic ribbons (larger in the bipolar cells with more sustained responses), which also correlated with a greater number of vesicles in the vicinity of the larger ribbons. 

      The main conclusion of the study relates to a correlation (because it is difficult to manipulate ribbon size or vesicle density experimentally): the bipolar cells with increased ribbon size/vesicle number would have a greater possibility of sustained release, which would be reflected in the postsynaptic RGC synaptic currents and RGC firing rates. This model proposes a mechanism for temporal channels that is independent of synaptic inhibition. Indeed, some experiments in the paper suggest that inhibition cannot explain the transient nature of glutamate release onto one of the RGC types. Still, it is surprising that such a diverse set of inhibitory interneurons in the retina would not play some role in diversifying the temporal properties of RGC responses. 

      Strengths: 

      (1) The study uses a systematic approach to evaluating temporal properties of retinal ganglion cell (RGC) spiking outputs, excitatory synaptic inputs, presynaptic voltage responses, and presynaptic glutamate release. The combination of these experiments demonstrates an important step in the conversion from voltage to glutamate release in shaping response dynamics in RGCs. 

      (2) The study uses a combination of electrophysiology, two-photon imaging, and scanning block-face EM to build a quantitative and coherent story about specific retinal circuits and their functional properties. 

      Weaknesses: 

      (1) There were some interesting aspects of the study that were not completely resolved, and resolving some of these issues may go beyond the current study. For example, it was interesting that different extracellular media (Ames medium vs. ACSF) generated different degrees of transient vs. sustained responses in RGCs, but it was unclear how these media might have impacted ion channels at different levels of the circuit that could explain the effects on temporal tuning.

      We do not have an explanation for the quantitative differences in response kinetics we observed in Ames’ medium vs. ACSF. There are modest differences in calcium and magnesium concentration and a larger difference in potassium (2.5 mM in ACSF vs 3.6 mM in Ames). It would be interesting to test which of these (or other) differences accounts for the difference in response kinetics.

      (2) It was surprising that inhibition played such a small role in generating temporal tuning. At the same time, there were some gaps in the investigation of inhibition (e.g., IPSCs were not measured in either of the RGC types; pharmacology was used to investigate responses only in the transient RGCs).

      We were also surprised at this result. We have included additional data on inhibition in the revised manuscript. Figure S3 shows light-evoked IPSC data from both RGC types (Fig. S3) and Fig. 7 shows additional iGluSnFR measurements around both ON-T and ON-S RGC dendrites with inhibition blocked via bath application of NBQX (Fig. 7) and separately with inhibition from spiking amacrine cells blocked with TTX. These experiments provide additional evidence for the small role of inhibition. We attempted to measure the kinetics of excitatory input to ON-S cells with inhibition blocked, but we found that the excitatory input showed strong spontaneous oscillations under these conditions and the light responses were changed so drastically that we did not feel we could make a clear comparison with control conditions.

      (3) There could be additional discussion and references to the literature describing several topics, including: temporal dynamics of glutamate release at different levels of the IPL; previous evidence that release sites from a single presynaptic neuron can differ in their temporal properties depending on the postsynaptic target; previous investigations of the role of inhibition in temporal tuning within retinal circuitry. 

      Thanks, we have included more discussion and references to the relevant literature as you have suggested in the recommendations to authors.

      Reviewer #1 (Recommendations For The Authors): 

      The presented raw data of the pharmacological experiments show that SR95531 and TPMPA robustly increased both the amplitude and duration of the transient component of the light step-evoked excitatory currents, with slight, if any enhancement of the sustained component in ON-T RGCs Figure 5C. Statistical analysis of the population data (n=5) with Wilcoxon signed rank test yielded no significant difference (ln 363). However, reanalyzing the data extracted from the graph (Figure 5D) revealed that the difference between the paired observations is normally distributed (Shapiro-Wilk normality test, P=0.48) allowing parametric statistics to be used, which provides higher statistical power. Accordingly, reanalyzing the presented data with paired Student's t-test data revealed significant differences (P=0.01) in the steady-state amplitude normalized to that of the peak, recorded in the presence of SR95531 and TPMPA. In other words, based on the (rough) analysis of the presented pharmacology data GABAergic feedback inhibition significantly contributes to shaping the transient portion of the light-evoked excitatory currents in ON-T RGCs, by making it more transient. I believe a similar analysis based on the actual data is necessary, and the results should be communicated either way. However, if warranted, two-photon glutamate sensor imaging experiments showing that blocking GABA- and glycinergic inhibition does not change the kinetics of light-evoked glutamate signals at ON-T RGCs should also be performed, as these would be critical in drawing a conclusion regarding the effect of feedback inhibition on glutamate release from bipolar cells.

      Thanks for this feedback. We have added another cell to the data set in Fig. 5D. With this addition, SR95531/TPMPA application significantly increases the response transience of excitatory currents measured in ON-T RGCs compared to control. This enhanced transience in GABA<sub>A/C</sub> receptor blockers is due to an increase in the amplitude of the initial peak component of the response (control peak amplitude: -833.7±103.3 pA; SR95531+TPMPA peak amplitude: 2023±372.7pA; p=0.03, Wilcoxon signed rank test), with no change to the later sustained component (control plateau amplitude: -200.7±14.71pA; SR95531+TPMPA plateau amplitude: -290.9±43.69pA; p=0.15, Wilcoxon signed rank test).

      We should clarify that this result indicates that GABAergic inhibition makes the excitatory inputs to ON-T RGCs less transient. Block of GABA receptors increased transience, thus intact GABAergic transmission appears to limit the initial peak of the response and therefore make excitatory currents more sustained. We unfortunately were not able to examine whether sustained excitatory currents in ON-S RGCs would become more transient using the same approach. In our hands, bath application of SR95531+TPMPA led to the generation of large-amplitude (>1nA) oscillatory bursts of excitatory input that developed within 5 minutes and persisted for the duration of the incubation (up to ~30 min) in drugs. Further, presentation of light steps tended to induce variable amplitude responses, likely dependent on the presence of spontaneous bursts; when large amplitude responses were evoked, these typically oscillated for several seconds after the step.

      To examine a potential role for presynaptic inhibition in transient vs. sustained bipolar cell output, we therefore chose to eliminate amacrine cell-mediated inhibition by bath application of the AMPA/kainate receptor antagonist NBQX in additional iGluSnFR measurements. This manipulation should leave ON bipolar cell responses intact while eliminating most amacrine cell-mediated responses (and OFF bipolar cell driven responses). In separate experiments, we also eliminated inhibition from spiking amacrine cells by bath application of TTX. As shown in new Fig. 7, sustained and transient responses persisted in distal versus proximal RGC dendrites, respectively. Compared to SR95531/TPMPA, bath application of NBQX was not associated with spontaneous bursts of glutamate release around ON-S dendrites. These results show that amacrine cell-mediated inhibition is not required for either sustained or transient glutamate release from bipolar cells that provide input to ON-S and ON-T RGCs.

      Small points: 

      (1) The legend of Figure 1 (D) refers to shaded areas to show {plus minus} SEM, but no shade is visible (at least in my printout).

      The SEM shading is there in Fig. 1D but is mostly obscured by the mean lines for the respective RGC types. We have added this to the figure caption.

      (2) I found the reported Vrest for the ON bipolar cells somewhat depolarized. Perhaps due to the uncompensated junction potentials? 

      These measurements are indeed not corrected for the liquid junction potential (which is approximately -10.8 mV between K-gluconate internal and Ames’ solution). We did not apply this correction since the appropriate value is not clear in perforated patch recordings as the intracellular chloride concentration is unknown (and can differ from that in the pipette solution). We have clarified this in the results text where we describe the Vrest values (lines 335-338).

      (3) It is Wilcoxon signed rank test, not Wilcoxan. 

      Thanks for catching this. This has been corrected in the revised manuscript.

      Reviewer #2 (Recommendations For The Authors): 

      Some amacrines express vesicular Glut-3 transporter and are reported to release glutamate (Marshak, Vis Neurosci 2016). Are Amacrine vGlut3 signals postsynaptic (within ~0.5 um) to cone bpc ribbons?

      We did not characterize VgluT3-expressing amacrine cells in our SEM datasets. A recent study by Friedrichson et al. (Nat. Comm. 2024; PMID 38580652) using 3D SEM reconstructions found that Vglut3-amacrines are postsynaptic to both type 5i and type 6 bipolar cells, as well as other type 5/xbc bipolar cells (and receive >50% of their input from type 3a OFF bipolar cells).

      How far apart are the postsynaptic targets from the ribbon release sites? The ribbons at type 5i bpc/On-T input appear separated from the dendrites of On-T rgcs (Figure 8C). At least further away than the type 6 bpc ribbons are from On-S rgc dendrites (Figure 8C). Distance may create a thresholding phenomenon, whereby only multivesicular bouts at the onset of depolarization are able to elevate synaptic Glu to levels needed to activate On-T GluRs. See Grabner et al Nat Comm 2023 for such scenarios in the outer retina.

      This is an intriguing possibility, but we should point out that the presynaptic ribbons in Fig. 9C (former Fig. 8C) are similar distances (within the resolution of our reconstructions) from the ON-T and ON-S dendrites. We have increased the brightness of the dendrite segments for both RGC types in the resubmission figure; note that ON-T RGCs have spine-like protrusions that may not have been as apparent in the previously submitted version of our manuscript.

      In Figures 1 and 2, Sustained responses look like the derivative of Transient responses, minus the negative going inflection. In addition, the sustained responses appear to have a lower threshold of activation than the transient On rgcs, because there are more bouts of action potentials (and membrane depol in V-clamp) with earlier onset in sustained than transients traces. It would be great if the GLuSniff data captured these differences. Take cumulative dF/F and see what the onset time is, or an initial tau if possible.

      This is a good suggestion. However, we are reluctant to make detailed quantitative comparisons such as this without further validation of how the kinetics of the iGluSnFR signals relate to kinetics of glutamate release.  A specific concern is that differences in the location and amount of iGluSnFR expression could impact any such comparisons.

      A recent study by Kim et al von Gersdorff (Cell Reports, 2023) presents interesting phases of release in response to light flashes, measured from AIIs, and complementary results from pairs of rbcs-AIIs. The findings highlight the complexity of SV pools under well-controlled experiments. Could their results be explained as variations in rbc ribbon size through development, and possibly between rbcs or within an rbc? 

      This certainly seems possible and would be consistent with the dependence of release on ribbon size that our results support.  It would be interesting to see if there are clear anatomical correlates of that change in release properties.  

      Figure 5 is a pivotal point in the study, but my review has identified numerous weaknesses. The feedback inhibition onto bipolar cell terminals is likely to sculpt glutamate release, and the results do not convincingly rule out this possibility. The suggestions for improvements range from the data needing to be reanalyzed with regard to statistical tests, and/or adding a few more data points (n = 5) before concluding a p: 0.06 is insignificant. 

      We have added an additional recording to this data set. With n= 6 cells, there is now a statistically significant difference between ON-T RGC excitatory currents measured in control conditions versus during GABA<sub>A/C</sub> receptor blockade. Please note that all the recordings shown in Figure 5C-F are from ON-T RGCs (the two panels show separately block of GABergic and glycinergic receptors). We did not make it sufficiently clear that the original trend (now statistically significant) is opposite of that expected if presynaptic GABAergic inhibition contributes to response transience in ON-T RGCs.  What we see is that excitatory synaptic inputs to ON-T RGCs become more transient (rather than mpre sustained) during GABA<sub>A/C</sub> receptor blockade. We have revised the text in that section to make this point more clearly.

      We have also included new data from iGluSnFR measurements showing that bath application of NBQX does not affect light step-evoked glutamate release kinetics at proximal (sustained) or distal (transient) RGC dendrites (control: steady-state amp. as % of peak amp. 13 ± 10; mean ± S.D.; n = 189 ROIs/4 FOVs for ON-T dendrites vs 40 ± 12; mean ± S.D.; n = 287 ROIs/8 FOVs for ON-S dendrites; NBQX: 6 ± 3; mean ± S.D.; n = 112 ROIs/1 FOV for ON-T dendrites vs 23 ± 9; mean ± S.D.; n = 97 ROIs/2 FOVs for ON-S dendrites; *p<0.001). By blocking glutamate receptors on amacrine cells, NBQX (AMPA/KAR antagonist) eliminates all/most amacrine cell-mediated signaling in the retina and should therefore abolish presynaptic inhibitory input to bipolar cell terminals across the IPL. Taken together, our results indicate that presynaptic inhibition does not play a critical role in establishing transient versus sustained kinetics for the stimulus conditions we employed in our study.

      There is a need to cite more recent literature on bipolar cell ribbons (e.g. see Wakeham et al., Front. Cell. Neurosci., 2023), in order to support experimental design and interpretation of the results. The authors should discuss their Ribeye-KO data from Okawa et al 2019 Nat Comm, Figure 7, in the context of their new iGluSnFR results. 

      Thank you for prompting us on this issue. We have expanded the discussion regarding ribbons and included more citations to the ribbon literature. That is largely in the three paragraphs starting on line 727.

      One point deserves emphasis because it is central to the authors' ribbon model but not consistent with their data. The ribbon model as they put it, and as commonly stated, holds that a transient phase of release at the onset of depolarization indicates the depletion of the primed SVs, and the subsequent slower rate of release (steady state release in the authors' terms) reflects recruiting, priming, and release of new SVs. The On-transient dendrite GluSnf responses agree with this multiphasic process, but the sustained responses show only an elevation in glutamate without a pronounced initial peak, creating a square-wave-shaped response (Figure 6B). This does not agree with the simple ribbon-based release model. I would expect the signals from the T- and S-on dendrites to have a comparable initial phase, while the sustained phase should be greater in amplitude for the S-on dendrites. More discussion may clarify possible mechanisms.

      Thanks for pointing this out. The example iGluSnFR traces we originally included in the manuscript were not entirely representative in that they did not show much initial transient phase. Note there is a distribution of steady-state amplitudes for proximal dendrites in Fig. 6C; the examples are from ROIs from the upper end of the distribution. In the new Figure 7, we have included some additional examples that show both a clear transient and sustained component. The summary data in Figure 6C shows the distribution of sustained/transient ratios across ROIs.  

      Reviewer #3 (Recommendations For The Authors): 

      (1) It would be interesting to understand the differences in IPSCs in the two RGC types. Perhaps they are small in both types, which would explain their apparent lack of impact on temporal tuning. The authors may already have these data.

      We did make measurements of noise-evoked IPSCs (as well as EPSCs) in a subset of ON-T and ON-S recordings. We have now included this data as Figure S3. There are slight differences in the kinetics of inhibition between RGC types (Fig. S3C) and there is a trend towards stronger inhibition (relative to excitation) in ON-T RGCs compared to ON-S RGCs (Fig. S3E), although there is not a statistically significant difference. In both cases excitatory synaptic currents are as large or larger than inhibitory currents, and this does not include the difference in driving force near spike threshold which will favor excitatory input by a factor of 2-3.  Hence our data suggests that postsynaptic inhibition does not play a major role in generating the differential temporal spiking responses of ON-T and ON-S RGCs. However, additional experiments examining the relative contribution of excitation and inhibition to spiking output in these RGCs would be needed to reach a firm conclusion.

      The pharmacological experiments in which we blocked inhibition (Fig. 5C-F, new Fig. 7) were designed to test the effect of presynaptic inhibition on bipolar cell output (voltage-clamp isolation of excitatory currents in Fig. 5; iGluSnFR measurements of glutamate release in Fig. 7). We do not mean to suggest that postsynaptic inhibition does not have any role in shaping the spiking behavior of these RGC types, but that transient vs. sustained kinetics are already present in the bipolar cell output and that presynaptic inhibition of bipolar cell terminals does not appear to account for this difference.  We have revised the text throughout to be clearer on this point.

      (2) It could be convincing to show transient/sustained differences between RGC types in dim light, where the response would depend on the rod bipolar/AII circuit. In this case, any difference in temporal properties would presumably be explained by differences that localize to the cone bipolar cell axon terminals. Indeed, is that the result in Figure 1B? This seems to be a dim stimulus presented on darkness, which may be driven through the rod bipolar pathway. The authors could then discuss the interpretation of this data in terms of the rod bipolar circuit. 

      Yes, Figure 1B is a dim light step (~30R*/rod/s) presented from darkness and the distinction between cells is clear down at still lower light levels that more effectively isolate signaling through the rod bipolar pathway. Thanks for making this point that observation of distinct temporal responses under scotopic conditions where signals suggests these differences must arise at and/or downstream of cone bipolar cell output. We have included additional text (lines 361-365) in the results describing bipolar cell responses that raise this point.

      (3) Glutamate release was already measured across the full IPL depth by Borghuis et al. (2013) and Franke et al. (2017). It would be appropriate to better motivate the current study based on these existing measurements.

      We have clarified that these important studies provided important motivation for measuring excitatory synaptic input to ON-T vs. ON-S RGCs (lines 165-169).   

      (4) Line 212/213. It would be appropriate to add to the list of papers showing the different stratification of transient vs. sustained responses: Borghuis et al. (2013) and Beaudoin et al. (2019).

      Thank you - these references have been added.  

      (5) Line 635-638. It would be useful to discuss papers by Pottackal et al. (2020, 2021), which suggested that a single presynaptic cell (starburst) can signal with different temporal properties depending on the postsynaptic target (other starburst vs. DSGCs). The mechanism was not completely resolved (i.e., it was not explained by differences in presynaptic Ca channels at the two synapse types), but it at least shows that neurotransmitter release can show different filtering depending on the postsynaptic target from the same presynaptic neuron. (This could also be at play for the type 6 bipolar cell inputs to ON-S vs. ON-T RGCs in the present study.)

      We have added a reference to Pottackal et al 2021 in this section.

      (6) Line 714. Should describe the procedure for embedding the tissue in agarose. 

      We have added more detail regarding agarose embedding for preparation of retinal slices in the methods.

      (7) Line 775. Need a better description of the virus (not the construct), what serotype? Provide the Addgene number if available. 

      This has been added to the methods.

      (8) Line 808. Was the SD for the gaussian really 50%? That would cut off a lot of the distribution, i.e., it would get clipped at 0. 

      Yes, the SD for Gaussian noise was 50%. This high contrast stimulus was used in part to achieve measurable signals from bipolar cells. You are correct that some of the distribution was clipped at 0 (it was also clipped at twice the mean to make sure that the distribution remained symmetrical). The clipping was accounted for during our LN analyses.

      (9) The paper should discuss Swygart et al. (2024) results showing different spatial surround properties of neighboring synapses from a type 6 bipolar cell. Based on this result, it would seem very likely that amacrine cells could play a role in shaping the temporal processing of bipolar cell glutamate release as well. Indeed, spatial and temporal processing will not be completely independent in a typical experiment. For example, with the spot stimulus used in the present study, bipolar cells within the center versus the edge of the spot will have different balances of center/surround activation, which could potentially influence their temporal processing.

      We have included discussion of results from Swygart et al 2024 in the section of the Discussion in which we point out differences in surround inhibition between ON-S and ON-T RGCs (lines 710-714). We agree that spatial and temporal processing are not completely independent. Our results with SR95531/TPMPA indicate ON-T RGCs receive stronger GABAergic surround inhibition than ON-S RGCs (Fig. S8). However, our results in Fig. 5C-D show GABAergic surround inhibition makes ON-T excitation more sustained rather than more transient. So even though bipolar cells presynaptic to ON-T RGCs receive stronger surround inhibition (Fig. S8), this inhibition does not establish the transient kinetics of glutamate release from these bipolar cells (in fact, it works to make release more sustained). Additional iGluSnFR experiments where we used NBQX to block all/most amacrine cell-mediated responses also suggest presynaptic inhibition does not have an important role in establishing differential glutamate release kinetics onto ON-S vs. ON-T RGC dendrites (Fig. 7).

      (10) Cui et al. 2016 described ON-S Alpha as having a divisive suppression mechanism that explained the temporal properties of white-noise response better than a standard LN model. Do the authors think the divisive suppression reflects a property of the excitatory synapses independent of inhibition?

      This is an interesting question, but one for which we don’t have a good answer for now. As mentioned in some of the above responses and as we have tried to clarify in the manuscript, we do not mean to imply that there is no role for presynaptic inhibition in modulating bipolar cell output, including for the divisive suppression described by Cui et al. Rather, our point is that the distinction between transient and sustained excitatory input to ON-T and ON-S RGCs does not require presynaptic inhibition and is more likely an intrinsic property of the bipolar cell synapses. 

      (11) Do the authors mean to imply that the pool size at bipolar cell ribbon synapses could depend on the use of Ames vs. ACSF? 

      For now, we do not have a good answer as to why there are quantitative differences in response kinetics between Ames and ACSF. We have not done any experiments to investigate whether ribbon sizes or ribbon pools are different in the different solutions.

      (12) More generally, different mean luminance levels could drive different levels of baseline glutamate release, which could alter the available pool of vesicles at bipolar cell ribbon synapses. Can we explain varying degrees of transient/sustained in the same cell at different levels of mean luminance based on this mechanism (e.g., Grimes et al., 2014)?

      Yes, the emergence of a transient component of excitatory input to ON-S RGCs at ~100 R*/rod/s versus at scotopic levels (0.5 R*/rod/s) in Grimes et al. (2014) could be due to differences in the number of releasable vesicles (due to different type 6 bipolar cell axon terminal membrane potentials and hence differences in spontaneous release rates) at the different light levels.

      We should note that although ON-T and ON-S RGCs exhibit some changes in transient/sustained kinetics across different light levels, the relative differences between these RGC types are preserved across light levels. We have included a statement about this in the text (lines 361-367).

      (13) Figure 1. Have the authors considered performing the LN analysis of the firing responses, to compare the degree of rectification between the two RGC types?

      This is a good suggestions. From an LN analysis of spiking responses, we do not observe a clear difference between the static nonlinearity component of the model for ON-T and ON-S RGCs. Both RGC types are strongly rectified under our experimental conditions.  

      (14) Figure 5. Do the authors have the pharmacology data for the ON-S cells? There are examples of sustained EPSCs in amacrine cells that become more transient after blocking inhibition, which at least suggests that inhibition can play some role in the transient/sustained nature of glutamate release (Park et al., 2015, Figure 3). Perhaps ON-S cells likewise become more transient with inhibition blocked. 

      (The colored symbols in A were not visible in a printout. It would be useful to indicate the cell type (ON-T) in C and E). 

      As described above in the response to reviewer 1’s recommendation for authors, we were not able to use SR95531/TPMPA for recordings from ON-S RGCs. Bath application of these drugs led to oscillatory bursts of excitatory input to ON-S RGCs. However, the lack of effect of bath-applied NBQX on the kinetics of glutamate release around either ON-T or ON-S RGC dendrites (new Fig. 7) suggests that presynaptic inhibition does not contribute to generating sustained excitation to ON-S RGCs (or transient excitation to ON-T RGCs).  

      We have corrected Fig. 5A to include the referenced colored symbols and have also edited Fig 5C and E to clarify that measurements in Fig. 5C-F are from ON-T RGCs.

      (15) Figure 6 legend. Should be Kcng4-Cre, not KCNG-Cre. Also, it should make clear that this is cre-dependent expression of iGluSnFR. For C, were the statistics based on the number of FOVs? 

      Thanks for catching this, we have corrected Figure 6 legend. The methods section includes a description of how we achieved iGluSnFR expression on alpha RGC dendrites via a cre-dependent viral strategy in Kcng4-Cre mice.  We have also clarified that the statistics are based on ROIs in Figure 6C.

      (16) Figure 7, Flashes were apparently 400% contrast on a dim background. What was the background? Is there a rod component to the response in this case? 

      In Figure 7 (now Figure 8), the same background (~3300 R*/rod/s; 2000 P*/Scone/s) was used as in the Gaussian noise and step response experiments. At this light level, the response should be primarily be mediated by cones.

      (17) Figure S1. The colors here differ from those in previous figures (Here, ON-T, magenta; ON-S, cyan). Is something mislabeled? 

      Thanks for catching this. We mistakenly swapped the labels in the legend for Fig. S1. The figure colors were correct, but we have corrected the legend in the revised manuscript.

      (18) Figure S2. For the LN model for RGC synaptic currents, the ON-S are more rectified than some previous recordings (Cui et al., 2016). Is this perhaps explained by different light levels?

      We aren’t sure why ON-S excitatory currents are more strongly rectified in our recordings compared to Cui et al., 2016. Cui et al. used an ~20-fold higher background light intensity (~40,000 P*/cone/s vs. ~2000 P*/cone/s in our study), so different light levels may be a factor (although we should point out that rectification increases in these RGCs between scotopic to low photopic light levels (see Grimes et al., 2014 and Kuo et al., 2016).

      (19) The study is apparently comparing PV1 and PV2 described in Farrow et al. (2013; see Supplementary information for stratification analysis), which should be cited.

      Thanks, we have corrected this oversight in the revised manuscript. We now cite Farrow et al and mention the connection to PV1 and PV2 in the first paragraph of Results (lines 104-108).

    1. e)
      • ARE 1018459 ED
      • Órgão julgador: Tribunal Pleno
      • Relator(a): Min. GILMAR MENDES
      • Julgamento: 12/09/2023
      • Publicação: 30/10/2023

      • Embargos de declaração em processo paradigma da sistemática da repercussão geral.

      • Direito do Trabalho. Tema 935.

      • Alegação de omissão, contradição ou obscuridade.

      • Efeitos infringentes. Admissão da cobrança da contribuição assistencial prevista no art. 513 da Consolidação das Leis do Trabalho, inclusive aos não filiados ao sistema sindical, assegurado ao trabalhador o direito de oposição.

      • A constitucionalidade das contribuições assistenciais, respeitado o direito de oposição, faculta a trabalhadores e sindicatos instrumento capaz de, ao mesmo tempo, recompor a autonomia financeira do sistema sindical e concretizar o direito à representação sindical sem ferir a liberdade de associação dos trabalhadores.

      • Embargos de declaração conhecidos e providos em parte para retificar a tese da repercussão geral, que passa a ter a seguinte redação: “É constitucional a instituição, por acordo ou convenção coletivos, de contribuições assistenciais a serem impostas a todos os empregados da categoria, <u>ainda que não sindicalizados</u>, desde que <u>assegurado o direito de oposição</u>.

    2. I

      Súmula nº 353/TST - EMBARGOS. AGRAVO. CABIMENTO

      Não cabem embargos para a Seção de Dissídios Individuais de decisão de Turma proferida em agravo, salvo: - a) da decisão que não conhece de agravo de instrumento ou de agravo pela ausência de pressupostos extrínsecos; - b) da decisão que nega provimento a agravo contra decisão monocrática do Relator, em que se proclamou a ausência de pressupostos extrínsecos de agravo de instrumento; - c) para revisão dos pressupostos extrínsecos de admissibilidade do recurso de revista, cuja ausência haja sido declarada originariamente pela Turma no julgamento do agravo; - d) para impugnar o conhecimento de agravo de instrumento; - e) para impugnar a imposição de multas previstas nos arts. 1.021, § 4º, do CPC de 2015 ou 1.026, § 2º, do CPC de 2015 (art. 538, parágrafo único, do CPC de 1973, ou art. 557, § 2º, do CPC de 1973). - f) contra decisão de Turma proferida em agravo em recurso de revista, nos termos do art. 894, II, da CLT.

    3. participação nos lucros

      Súmula nº 451/TST - PARTICIPAÇÃO NOS LUCROS E RESULTADOS. RESCI-SÃO CONTRATUAL ANTERIOR À DATA DA DISTRIBUI-ÇÃO DOS LUCROS. PAGAMENTO PROPORCIONAL AOS MESES TRABALHADOS. PRINCÍPIO DA ISONOMIA. - Fere o princípio da isonomia instituir vantagem mediante acordo coletivo ou norma regulamentar que condiciona a percepção da parcela participação nos lucros e resultados ao fato de estar o contrato de trabalho em vigor na data prevista para a distribuição dos lucros. Assim, inclusive na rescisão contratual <u>antecipada</u>, é devido o pagamento da parcela de forma proporcional aos meses trabalhados, pois o ex-empregado concorreu para os resultados positivos da empresa.

    1. Dlatego jedyna sensowna rada to… regularnie wykonuj kopię bezpieczeństwa konta Google za pomocą Google Takeout. Bo na phishing albo instalację malware zawsze możesz się złapać, tak jak zrobił to Mateusz, człowiek od lat działający zawodowo w branży IT.
      • The article describes the case of a user (Mateusz) whose Google account was hijacked after he ran malware (a stealer) sent from a compromised friend's Discord account.
      • The malware stole the user's active session cookie, not their password. This allowed the attacker to bypass all login protections, including 2-Step Verification (like a YubiKey), because they were able to take over an already-authenticated session without needing to log in.
      • Using this hijacked session, the attacker convinced Mateusz to join a "Family Group" (Google Family Link) and simultaneously changed his account's birth date to an age under 13.
      • This action immediately flagged the account as a "child's account," with the attacker as the "parent/guardian," which locked Mateusz out and triggered a 14-day permanent deletion process.
      • Mateusz is now in a "digital Catch-22": standard account recovery forms do not work for "child accounts," and Google's support (including YouTube and Google Play) has been unhelpful, closing his tickets despite him having proof of ownership.
      • The article criticizes Google for an "astounding oversight" in its business logic that allows an adult account's age to be so easily changed to a child's, creating a major vulnerability.
      • As a result, Mateusz lost 13 years of data (Gmail, Drive, Contacts) and access to all his purchases on Google Play.
      • The article concludes that since 2FA can't stop session hijacking, the only effective way to protect against the data loss from this specific attack is to regularly back up your Google account data using Google Takeout.
    1. formulation

      utilizing

      Clue/Trail/Plex Mark Atomic Terms used for naming

      info-morphic units of information that are high-resolution addressable high fildeilty meaning/intentfully deeply intertwingled named info-morphic-colab-orative interpersonal nterplanetary structures amenable to muassive multiplayer interplays that plays nicely with other structures

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1

      Major comments:

      (comment #1)- It is interesting that TRF2 loss not only fails to increase γH2AX/53BP1 levels but may even slightly reduce them (e.g., Fig. S2c and the IF images). While the main hypothesis is that TRF2 loss does not trigger telomere dysfunction in NSCs, this observation raises the possibility that TRF2 itself contributes to DDR signaling (ATM-P, γH2AX, 53BP1) in these cells and that in its absence, cells are not able to form those foci. To exclude the possibility that telomere-specific DDR is being missed due to an overall dampened DDR response in the absence of TRF2, it would be informative to induce exogenous DSBs in TRF2-depleted cells and test DDR competence (e.g., IF for γH2AX/53BP1). In other words, are those NSC lacking TRF2 even able to form H2AX/53BP1 foci when damaged? In addition, it would be interesting to perform telomere fusion analysis in TRF2 silenced cells (and TRF1 silenced cells as a positive control).

      We acknowledge a slight reduction; however, this difference is not statistically significant (Fig S2c,e). We will quantify the levels of DDR markers upon TRF2 loss and exogenous DSBs and include it in the subsequent revision.

      (comment #2)-A TRF2 ChIP-seq should be performed in NSC as this list of genes (named TAN genes in the text) was determined using a ChIP performed in another cell line (HT1080). For the ChIP-qPCR in the various conditions, primers for negative control regions should be included to show the specific binding of TRF2 to the promoter of the genes associated with neuronal differentiation. For example, an intergenic region and/or promoters of genes that are not associated with neuronal differentiation (or don't contain a potential G4). The same comment goes true for the gene expression analysis: a few genes that are not bound by TRF2 should be included as negative controls to exclude a potential global effect of TRF2 loss on gene expression (ideally a RNA-seq would be performed instead). We have performed NSC-specific TRF2 ChIP-seq for an upcoming manuscript, which confirms TRF2 occupancy at multiple promoters of differentiation-associated genes. These data are provided solely for confidential evaluation by the designated reviewers.

      Regarding the ChIP-qPCR control experiments: We thank reviewer for pointing this out, indeed we included controls in our PCR assays as positive (telomeric) and TRF2-nonbinding loci (GAPDH, RPS18, and ACTB, based on HT1080 TRF2 ChIP-seq data) as negative controls. These results were not included earlier for clarity given that we were presenting several ChIP-PCR figures - in response to the comment we have included this now in the revised version (Fig. S3d,e). Gene expression analyses show selective upregulation of the TAN genes upon TRF2 loss (data normalised to GAPDH); whereas negative control genes lacking TRF2 binding (RPS18, ACTB) remain unchanged, ruling out non-specific effects. (Fig S3f,g,j,k).

      -(comment #3) A co-IP should be performed between the TRF2 PTM mutant K176R or WT TRF2 and REST and PRC2 components to directly show a defect of interaction between them when TRF2 is mutated (a co-IP with DNase/RNase treatment to exclude nucleic-acid bridging). The TRF2 PTM mutant T188N also seems to lead to an increased differentiation (Fig. S5a). Could the author repeat the measure of gene expression and co-IP with REST upon the overexpression of this mutant too?

      We confirm that DNase/RNase is routinely included in our pull-down experiments to exclude nucleic-acid bridging, with detailed methodology now elaborated in the Methods section. Not including this in the manuscript Methods was an oversight from our side. Our data demonstrate that only REST directly interacts with TRF2, while TRF2 engages PRC2 indirectly via REST, as also previously shown by us and others (page 6; ref. [62]; Sharma et al., ref. [15]).

      We thank the reviewer for noting the apparent differentiation in Fig. S5a. However, this observation represents rare spontaneous differentiation event and is not statistically significant (as shown in Fig S5b). Consistently, gene expression analysis of the TRF2-T188N mutant shows no significant change in TRF2-associated neuronal differentiation (TAN) genes. Therefore, Co-IP for TRF2-T188N with REST was not done.

      (comment #4) - The authors show that the G4 ligands SMH14.6 and Bis-indole carboxamide upregulate TAN genes and promote neuronal differentiation, but the underlying mechanism remains unclear. Bis-indole carboxamide is generally considered a G4 stabilizer, while SMH14.6 is less characterized and should be better introduced. The authors should clarify how G4 stabilization would interfere with TRF2 binding, it seems that it would likely be by blocking access. A more detailed discussion, and ideally TRF2 ChIP after ligand treatment and/or G4 helicase treatment, would strengthen the model.

      We clarify that Bis-indole carboxamide acts as a G4 stabilizer, while SMH14.6 is also a noted G4-binding ligand that stabilizes G4s (ref. [15]). The exclusion of TRF2 from G4 motifs in gene promoters by G4-binding ligands has also been documented previously (ref. [18]). In line with these findings, ChIP experiments performed following ligand treatment revealed a decreased occupancy of TRF2 at TAN gene promoters, supporting the proposed mechanism (added Fig. 6h).

      Minor comments:

      • Supp Figures related to the scRNA-seq are difficult to read (blurry).

      Corrected

      • Fig S1h: The red box mentioned in the legend is not visible

      Corrected

      • In the text, the Figures 1 f-g are misannotated as Fig 1m and l

      Corrected

      • The symbol γ of γH2AX is missing in the text

      Corrected

      • Fig.3d, please indicate in the legend that it is done in SH-SY5Y.

      Added SH-SY5Y in the legend of Fig. 3d.

      • Fig. S3b: Please consider replotting this panel with an increased y-axis scale. As currently presented, the TRF2 ChIP-seq peaks at several promoters appear truncated by the scaling.

      Corrected

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      1. For most of the data graphs in the manuscript, there is no indication of the number of independent biological replicates carried out (which should ideally be plotted as individual dots overlaying the column graphs), or what the error bars represent, or what statistical test was used. All the figure legends and methods have now been updated with the corresponding biological replicates per experiment, with error bars as SD/SEM and the corresponding statistical test along with p values.

      Figure S1.1a: needs a marker to show that the tissue is dentate gyrus.

      We acknowledge the reviewers' concern that high-magnification images alone make it difficult to verify whether the fields are taken from the correct anatomical location. The dentate gyrus (DG) of the hippocampus is a well-defined structure. In the revised figure (Fig S1.1a), we now include a low-magnification image showing the entire hippocampus, including the CA fields, along with two high-magnification fields specifically from the DG region. Consistent with our claim, the co-immunostaining demonstrates that Sox2-positive neural stem cells in the DG are also positive for TRF2.

      Figure 1c (and all other flow cytometry panels throughout the manuscript): it is not clear if the expression of any of these proteins, except maybe MAP2, are significantly different in the presence or absence of TRF2. These differences need to be presented more quantitatively, with the results compiled from multiple biological replicates and analysed statistically. I am not sure that flow cytometry is the best way to determine differences in protein expression levels for non-surface proteins, because many of the reported differences are not at all convincing.

      To detect intracellular/nuclear proteins by flow cytometry, cells were permeabilized using pre-chilled 0.2% Triton X-100 for 10 minutes, as described in the Methods section.

      We have revised the figures (Fig 1c,e) and now included statistical analysis from three independent biological replicates for these experiments.(Fig S1.4h-j, S2e, S6d)

      Fig 1d: has TRF2 been effectively silenced in this experiment? There appears to be just as many TRF2+ nuclei in the "TRF2 silenced" panel vs the control, including in the cells with neurite outgrowths.

      Quantification of nuclear levels of TRF2 showing decrease in nuclear TRF2 has been included in supplementary Fig S1g.

      Fig 2a-c: these experiments need a positive control, showing increased expression of these proteins in mNSC and SH-SY5Y cells in response to a DNA damaging agent. Again, flow cytometry may not be the best method for this; immunofluorescence combined with telomere FISH would be more convincing.

      We confirm that doxorubicin induces 53BP1 foci (IF-FISH Sup Fig. S2b) and TRF1 silencing elevates γH2AX (Sup Fig. S2c) validating DDR sensitivity. Unlike TRF2 loss (Fig. 2a-c), no TIFs appear with IF and telomere probes (Fig. 2d, Sup Fig. 2a), and without TIFs, there is no telomeric fusion. Flow cytometry was performed with Triton X- 100 to target nuclear protein. These findings adequately address the concern; therefore, further IF-FISH experiments were not included in the present study.

      To conclude that telomere damage is not occurring, an independent marker of such damage, such as telomere fusions, should also be measured.

      In response to uncapped telomeres, ATM kinase activates the DNA damage response (DDR), recruiting γH2AX and 53BP1 to telomeres, which precedes the end-to-end fusions (Takai et al., 2003; Maciejowski & de Lange, 2015; Takai et al., 2003; d'Adda di Fagagna et al., 2003; Cesare & Reddel, 2010; Hayashi et al., 2012; Sarek et al., 2015). We observe no DDR activation or foci (Fig. 2; Sup. Fig. 2). This absence of a DDR response and TIFs indicates no telomere uncapping, negating the need for direct telomere fusion analysis.

      Figure S2b is lacking a no-doxorubicin control.

      Untreated control has been included Fig. S2b.

      Figures 3a and 3b need a positive control (e.g. TRF2 binding to telomeric DNA) and a negative control (e.g. a promoter that did not show any TRF2 binding in the HT1080 ChiP-seq experiment in Fig S3).

      We have included positive (telomere) and negative (GAPDH) controls (based on HT1080 TRF2 ChIP-seq data) for the TRF2 ChIP assay in Supplementary Fig. S3d,e. Additionally, positive and negative controls for all ChIP experiments conducted in this study are presented in Supplementary Figs. S3d, S3e, S3h, S3i, S4c-h, and S5c-e

      The data in Figure 3 would be more compelling if all experiments were also performed in fibroblasts to confirm the cell-type specificity of the effect.

      Our HT1080 fibrosarcoma ChIP-seq data (ref. [18]; Sup. Fig. 3a,b) show TRF2 binding to TAN gene promoters in a fibroblast-derived model, with enrichment in neurogenesis-related genes (refs. [19,20]). In fibroblasts TRF2 depletion, as expected, induce telomere dysfunction and DDR (Fig. 2d; Sup. Fig. 2a), and eventually cell-cycle arrest and cell death as also reported earlier (van Steensel et al., 1998; Smogorzewska & de Lange, 2002). Therefore, the suggested experiments which would require sustained TRF2-depletion are not possible to perform in fibroblasts. TRF2 occupancy on the promoter of the genes in question in cells other than NSC was noted in HT1080 cells (ref. [18]; Sup. Fig. 3a,b).

      No references are provided for the TRF2 posttranslational modifications on R17, K176, K190 and T188. What is the evidence for these modifications, and is it known if they participate in the telomeric role of TRF2?

      These lines with references have been included in the manuscript (highlighted in blue).

      R17 methylation enhances telomere stability (66). K176/K190 acetylation stabilizes telomeres and is deacetylated by SIRT6 (67). T188 phosphorylation facilitates telomere repair after DSBs(68). These PTMs primarily support telomeric roles.

      The experiments in Fig 5 should also be performed with WT TRF2, to confirm that effects are not due to the overexpression of TRF2.

      WT TRF2 shows no differentiation phenotype and change in TAN gene expression (Fig. 1f,g; 3h, Sup Fig. 5a). Confirming effects are not due to TRF2 overexpression.

      Fig 5c has not been described in the text, and there are multiple technical problems with the TRF2 WT experiment: i) There appears to be significant background binding of REST to the IgG beads, though this blot has such high background it is hard to tell (the REST blot in Fig S4b is also of poor quality), ii) TRF2 is migrating at two different positions in the Input and IP lanes, and the TRF2 band in the K176R blot is at a different position to either, and iii) the relative loading of the Input and IP lanes is not indicated, so it's not clear why K176R appears to be so enriched in the IP.

      We acknowledge the oversight in not citing Fig 5c in the manuscript. This has been corrected, and, highlighted in blue in the revised manuscript.

      i) Multiple optimization attempts were made for the Co-IP experiments, and the presented figure reflects the best achievable result despite REST blot smearing, a pattern also reported previously (Ref. 65). The TRF2-REST interaction is well established, and a similar background was also observed in the cited study

      ii)Variable migration patterns of TRF2 were also noted in the cited study (Ref. 65), consistent with our observations. Our primary emphasis, however, is on the TRF2 K176R mutant, which clearly disrupts its interaction with REST.

      iii)The input loading corresponds to 10% of the total lysate. As the experiments were conducted independently, variations in transfection and pull-down efficiencies may account for observed differences.

      To rule out indirect effects of the G4 ligands on the results in Fig 6g, the binding of BG4 and TRF2 at the promoters of these genes should be measured by ChIP.

      To confirm that G4 ligand effects on TAN gene promoters are direct, TRF2 occupancy was assessed using ChIP. Significantly decreased occupancy of TRF2 was noted at TAN gene promoters, (added Fig. 6h). This implies that ligand-induced changes in TRF2 binding are directly linked to promoter-level G4 stabilization.

      Minor comments:

      1. The size of all the size markers in western blots should be added to the figures. Size has been included in all the western blots

      2. There are several figure panels that are incorrectly referenced in the text, e.g. Fig S1.1 (e-f) should be Fig S1.1 (e-h); Fig. 1m should be Fig. 1f; Figs 5e and 5f have been swapped.

      Corrected.

      1. Fig S1.4 is not referred to in the text. It is not clear what the purpose of Fig S1.4a is.

      The following line has been included in the manuscript highlighted in blue.

      Neurospheres were characterized using PAX6, a NSC marker (Fig S1.4a).

      Are the experiments in Figs 3e, 4a, 4c and 4e using 4-OHT treatment, or siRNA? If the latter, I don't think a control for the effectiveness of the knockdown in this cell type has been included anywhere in the manuscript.

      It is using siRNA, a western blot showing the effectiveness of knockdown is presented in supplementary figure S4c (now S4a).

      The lanes of the western blots in Fig S4c are not labelled.

      Corrected.

      1. Given that the experiments in Fig 5 were carried out on a background of endogenous WT TRF2 expression, presumably the K176R mutant is having a dominant negative effect. To understand the mechanism of this effect (e.g, is it simply due to replacement of endogenous WT TRF2 at its genomic binding sites by a large excess of exogenous K176R, or is dimerisation with WT TRF2 needed?) it would be helpful to know the relative expression levels of endogenous and K176R TRF2.

      To address the query, qRT-PCR with 3′ UTR-specific primers showed no change in endogenous TRF2 mRNA upon K176R expression in SH-SY5Y cells, while primers detecting total TRF2 revealed ~10-fold higher expression of K176R compared to control (Figure below). This indicates the absence of suppression of endogenous TRF2 mRNA. Given that the mutant's DNA binding is intact (Fig. 5f), the dominant-negative effect of K176R likely arises from overexpression of the exogenous mutant.

      For the sentence "...and critical for transcription factor binding including epigenetic functions that are G4 dependent" (bottom of page 3 of the PDF), the authors cite only their own prior papers, but there are examples from others that could be cited.

      We have incorporated citations from other research groups, now included as references 23-26.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This manuscript examines the effects of depletion of the telomeric protein TRF2 in mouse neural stem cells, using mice carrying a floxed allele of TRF2 and inducible Cre recombinase under the control of the stem cell-specific Nestin promoter. The results are also backed up in a human neuroblastoma cell line that has progenitor-like properties. There is no apparent induction of telomere damage in either of these cell types, but there is an increase in expression of neurogenesis genes. This is accompanied by an increase in binding of TRF2 to the relevant promoters, and evidence is provided that this binding involves G-quadruplexes in the promoters.

      On the whole, these core findings of this study are interesting, and reasonably robust. However, the study as a whole is marred by a large number of technical issues and missing controls which should be addressed prior to publication:

      1. For most of the data graphs in the manuscript, there is no indication of the number of independent biological replicates carried out (which should ideally be plotted as individual dots overlaying the column graphs), or what the error bars represent, or what statistical test was used.
      2. Figure S1.1a: needs a marker to show that the tissue is dentate gyrus.
      3. Figure 1c (and all other flow cytometry panels throughout the manuscript): it is not clear if the expression of any of these proteins, except maybe MAP2, are significantly different in the presence or absence of TRF2. These differences need to be presented more quantitatively, with the results compiled from multiple biological replicates and analysed statistically. I am not sure that flow cytometry is the best way to determine differences in protein expression levels for non-surface proteins, because many of the reported differences are not at all convincing.
      4. Fig 1d: has TRF2 been effectively silenced in this experiment? There appears to be just as many TRF2+ nuclei in the "TRF2 silenced" panel vs the control, including in the cells with neurite outgrowths.
      5. Fig 2a-c: these experiments need a positive control, showing increased expression of these proteins in mNSC and SH-SY5Y cells in response to a DNA damaging agent. Again, flow cytometry may not be the best method for this; immunofluorescence combined with telomere FISH would be more convincing.
      6. To conclude that telomere damage is not occurring, an independent marker of such damage, such as telomere fusions, should also be measured.
      7. Figure S2b is lacking a no-doxorubicin control.
      8. Figures 3a and 3b need a positive control (e.g. TRF2 binding to telomeric DNA) and a negative control (e.g. a promoter that did not show any TRF2 binding in the HT1080 ChiP-seq experiment in Fig S3).
      9. The data in Figure 3 would be more compelling if all experiments were also performed in fibroblasts to confirm the cell-type specificity of the effect.
      10. No references are provided for the TRF2 postranslational modifications on R17, K176, K190 and T188. What is the evidence for these modifications, and is it known if they participate in the telomeric role of TRF2?
      11. The experiments in Fig 5 should also be performed with WT TRF2, to confirm that effects are not due to the overexpression of TRF2.
      12. Fig 5c has not been described in the text, and there are multiple technical problems with the TRF2 WT experiment: i) There appears to be significant background binding of REST to the IgG beads, though this blot has such high background it is hard to tell (the REST blot in Fig S4b is also of poor quality), ii) TRF2 is migrating at two different positions in the Input and IP lanes, and the TRF2 band in the K176R blot is at a different position to either, and iii) the relative loading of the Input and IP lanes is not indicated, so it's not clear why K176R appears to be so enriched in the IP.
      13. To rule out indirect effects of the G4 ligands on the results in Fig 6g, the binding of BG4 and TRF2 at the promoters of these genes should be measured by ChIP.

      Minor comments:

      1. The size of all the size markers in western blots should be added to the figures.
      2. There are several figure panels that are incorrectly referenced in the text, e.g. Fig S1.1 (e-f) should be Fig S1.1 (e-h); Fig. 1m should be Fig. 1f; Figs 5e and 5f have been swapped.
      3. Fig S1.4 is not referred to in the text. It is not clear what the purpose of Fig S1.4a is.
      4. Are the experiments in Figs 3e, 4a, 4c and 4e using 4-OHT treatment, or siRNA? If the latter, I don't think a control for the effectiveness of the knockdown in this cell type has been included anywhere in the manuscript.
      5. The lanes of the western blots in Fig S4c are not labelled.
      6. Given that the experiments in Fig 5 were carried out on a background of endogenous WT TRF2 expression, presumably the K176R mutant is having a dominant negative effect. To understand the mechanism of this effect (e.g is it simply due to replacement of endogenous WT TRF2 at its genomic binding sites by a large excess of exogenous K176R, or is dimerisation with WT TRF2 needed?) it would be helpful to know the relative expression levels of endogenous and K176R TRF2.
      7. For the sentence "...and critical for transcription factor binding including epigenetic functions that are G4 dependent" (bottom of page 3 of the PDF), the authors cite only their own prior papers, but there are examples from others that could be cited.

      Significance

      The protein TRF2 was first identified as one of the core proteins that bind to the double-stranded region of telomeric DNA, and its many-faceted role in telomere protection has been well studied over the last 3 decades. More recent data from several labs indicate that TRF2 has additional roles outside the telomere, including in regulating gene expression, but these roles are so far much less characterised. Also, it has recently been shown that mouse ES cells, unexpectedly, do not require TRF2 for telomere protection (references 3 and 4 in this paper).

      The findings of the current findings expand the type of stem cells in which TRF2 is likely to be playing more of a role elsewhere in the genome, and not at telomeres, and hence is likely to be of high interest to both researchers of telomere biology, and those interested in the regulation of stem cell biology and neurogenesis.

      The strengths of the study are its novelty, its use of an inducible system to knock out TRF2 in the mouse neural stem cells of interest, and a thorough analysis of changes in gene expression and promoter occupancy across a range of genes of relevance to neurogenesis. The major weakness of the study, as descibed above, is the large number of technical problems, missing controls and missing indications of biological reproducibility.

    1. Reviewer #3 (Public Review):

      Summary:

      There are two major flaws that fundamentally undermine the value of the study. First, nearly all the central conclusions drawn here rely on the unfounded assumption that the effects observed are direct. No rigorous cause-and-effect relationships are established to support the claims. Second, the quality of the experimental data is substandard. Collectively, these concerns significantly limit any advances that might be gained in our understanding of the UBP1 pathway or Mediator function.

      Weaknesses:

      (1) The decrease in 1,6-hexanediol-treated cells of MED16 is modest, variable, not quantified, and internally inconsistent. For example, in Figure 1A, 1,6-hexanediol treatment should not have an impact on the level of the protein being directly IP. For MED12 (and CDK8 and MED1 to a lesser extent), 1,6-hexanediol treatment alters the level of the target protein in the IP. Along these lines, Figure 1A shows a no 1,6H-D dependent decrease in MED1 or MED12 levels in the CDK8 IP, whereas Figure 1B does show a decrease. Figure 1A shows no 1,6H-D dependent decrease in CDK8 levels in the MED1 IP, whereas Figure 1B shows a dramatic decrease. MED24 levels in the MED12 IP increase upon 1,6H-D in Figure 1A, but decrease in Figure 1B. Internal inconsistencies of this nature persist in the other Figures.

      (2) Undermining the value of Figure 1E/F, UBP1 and TFCP2 may also associate with the small amount of MED16 in the 2MDa fractions. This is not tested, and therefore, the conclusion that they just associate with the dissociable form of MED16 is not supported.

      (3) Domain mapping studies in Figure 2 are overinterpreted. Since the interactions could be indirect, it is not accurate to conclude "Therefore, the N-terminal WDR domain of MED16 is crucial for its integration into the Mediator complex, while the C-terminal αβ-domain is essential for interacting with UBP1-TFCP2. "

      (4) A close examination of Figure 2C undermines confidence in the association studies. The bait protein in lanes 5-8 should be equal. Also, there is significant binding of GST to UBP1 and TFCP2, in roughly the same patterns as they bind to GST-MED16 αβ. The absence of input samples makes the results even more difficult to interpret.

      (5) The domain deletion mutants are utilized throughout the manuscript as evidence of the importance of the UBP1-MED16 interaction. However, in Figure 2F lanes 7 and 8, the delta-S mutant binds MED16 as well as full-length UBP1. This undermines much of the subsequent data and conclusions about specificity.

      (6) Even if the delta-S mutant were defective for MED16 binding, the result in Figure 3B does not "confirm that MED16 is required for the transcriptional activity of UBP1,". Removal of that domain may have other effects.

      (7) As Mediator is critical for the activation of many genes, it is not accurate to assume that the impact of its deletion in Figure 3E/F demonstrates a direct requirement in UBP1-driven transcription. This could easily be an indirect effect.

      (8) Without documenting the relative protein expression levels in Figure 3G/H, conclusions cannot be drawn about the titration experiments, nor the co-expression experiments. These findings are likely the result of squelching or some form of competition that is not directly related to the UBP1-mediated transcription. A great deal of validation would be required in order to support the model that these effects are a result of MED16 overexpression sequestering UBP1 away from holo-Mediator.

      (9) The lack of any documentation of expression levels for the various ectopic proteins in the majority of Figures, renders mechanistic claims meaningless (Figures 3, 4, 5, 6, 7, S2, S3). This is particularly relevant since the model presented for many of the results invokes concentration-dependent competition.

    1. El vibe coding funciona porque hay gente que sabe programar. Un programador que sabe lo que hace puede pedirle a una IA que le haga un código y luego puede revisar y corregir sus inevitables* errores. O puede corregir los errores de las personas que no saben programar pero usaron un chatbot para escribir código. De hecho hay toda una industria de programadores dedicados a hacer estos arreglos. Muchas empresas de software ahora no están contratando a programadores junior, con la idea de que alguien puede producir código à la vibe coding y luego un programador más experto lo puede corregir. ¿Pero qué van a hacer cuando esos programadores expertos se retiren y las empresas pierdan esas habilidades? Por ahora, muchas confían en las promesas de mejoría de la industria de la inteligencia artificial*.

      Lo que me pareció más interesante de este fragmento es cómo muestra que el vibe coding solo funciona porque aún existen personas con verdadero conocimiento en programación. Me sorprende pensar que, si las empresas dejan de formar nuevos programadores y dependen solo de la inteligencia artificial, llegará un momento en que nadie sabrá cómo corregir los errores que la misma IA cometa. Es curioso cómo una herramienta creada para facilitar el trabajo puede terminar debilitando las habilidades humanas que la sostienen. David Ramos

    2. Ya que escribo como trabajo, muchas veces me han preguntado si no creo que seré reemplazado por una inteligencia artificial. Yo creo que no. Aunque seguramente muchas personas usarán estas herramientas para escribir cosas, consideren lo que pasaría si todo el texto del mundo fuera creado por IA: los modelos de lenguaje en los que están basados estas herramientas simplemente regurgitarían infinitamente otros textos, si bien coherentes, de baja calidad y de dudosa verosimilitud ya regurgitados por otra inteligencia artificial. Eventualmente habría un mercado para algún humano que entrara, cuando menos, a revisar, a editar, a hacer algo con el texto. A escribir.

      La IA puede escribir, sí, pero no puede tener algo que decir y mientras exista alguien que piense, cuestione, viva y sienta va a ser necesario que un humano esté ahí, al menos para revisar, reinterpretar y darle alma a lo que se escribe. Sara Sarria

    3. Por su parte, las redes sociales (en un sentido amplio que incluye foros y blogs) atrofiaron nuestro sentido de habitar una realidad común. Pero a cambio nos dieron la posibilidad de cambiar las dinámicas del poder de la información. Ahora “cualquiera” (en el sentido de Ratatouille) puede hacer escuchar su voz, no sólo los guardianes de la información a los que hemos estado acostumbrados. Esto tiene sus cosas buenas y malas, pero sin duda ha cambiado cómo vivimos e interactuamos.

      Lo que plantea sobre las redes es muy cierto, antes unos pocos hablaban y el resto escuchaba, ahora cualquiera puede opinar, pero eso también hizo que cada uno viva en su propia “burbuja”. Ganamos voz pero perdimos un poco el sentido de realidad compartida "No todos estamos viendo el mismo mundo". Sara Sarria

    4. La inteligencia artificial es muy compleja y aún no nos ha demostrado que se justifique para ser inevitable y que sus críticos quedemos como Sócrates. *La industria de la inteligencia artificial argumenta que su producto mejorará tanto que los errores sí llegarán a ser evitables.

      Cierra el texto retomando el paralelo con Sócrates y deja una pregunta abierta sobre el futuro de la IA. Es un buen punto para un momento reflexivo, me gusta cómo el autor termina el texto dandonos como una cierta duda lo cual, nos invita a pensar si la IA realmente será tan necesaria como la escritura, sugiere que tal vez los críticos de la IA no están equivocados del todo, sino que están advirtiendo sobre un cambio que aún no comprendemos del todo. Esta reflexión nos hace pensar en la responsabilidad colectiva que tenemos frente al uso y los límites de la IA.

    5. A Sócrates no le convencía eso de escribir. Su argumento principal era que, al tener las ideas siempre a la mano en un dispositivo externo a la mente humana, esto atrofiaría nuestra memoria: ya no haríamos un esfuerzo por recordar largos poemas épicos, o largas listas de hechos científicos. Pero tampoco haríamos un esfuerzo por recordar nuestros propios argumentos sobre disquisiciones varias. Todo estaría por ahí, en papel o en piedra, listo para consultarse cuando se nos diera la gana.

      Fue interesantes porque Sócrates, el filósofo, no confiaba en la escritura porque pensaba que al externalizar el conocimiento en textos, nuestra memoria se volvería floja y solo tendríamos una simulación del saber, no el conocimiento de verdad. La ironía es que sabemos esto porque Platón, su estudiante, al final lo escribió el, la escritura se impuso a pesar de las críticas, igual que hoy pasa con la inteligencia artificial: hay un montón de gente que le tiene miedo o le hace críticas, pero el texto sugiere que, como con la escritura, la IA probablemente acabará triunfando y cambiándolo todo, aunque ahora nos cueste verlo.

      Maria Gabriela Quiroga B

    6. Pero, a cambio, la escritura nos abrió la posibilidad de conocer mucho más allá de lo que puede guardar una memoria humana individual.

      A veces pienso que la escritura es como una memoria que no se cansa. Las personas olvidan, cambian o se van, pero lo que queda escrito sigue ahí esperando a ser leído, eso me hace pensar que de cierta forma escribir es una manera de vencer al olvido. Sara Sarria.

    7. Un discípulo de Platón, Aristóteles, a veces es descrito como una de las últimas personas que sabían todo lo que había por saber. No porque estuviera al tanto de todo el conocimiento en general, sino porque en su época la escritura aún no era tan popular y la cantidad de conocimiento a la que podía potencialmente tener acceso un individuo seguía siendo muy limitada. Quizás conociera todo lo que había que conocer en su mundo, pero ese mundo era bastante pequeño. Probablemente ignoraba conocimientos de China, o América, pero no podía saber que los ignoraba.

      Esta parte del texto muestra que Aristóteles era visto como alguien que sabía todo lo que se podía saber en su época, pero ese “todo” era muy limitado. En su tiempo, el conocimiento estaba restringido porque la escritura y el contacto con otros pueblos eran escasos, por eso, aunque fuera muy sabio, solo conocía lo que existía dentro de su pequeño mundo, la idea final resalta que el conocimiento siempre depende del contexto y que incluso los más sabios ignoran cosas que ni siquiera saben que existen. Allison Marentes Reyes

    8. Un discípulo de Platón, Aristóteles, a veces es descrito como una de las últimas personas que sabían todo lo que había por saber. No porque estuviera al tanto de todo el conocimiento en general, sino porque en su época la escritura aún no era tan popular y la cantidad de conocimiento a la que podía potencialmente tener acceso un individuo seguía siendo muy limitada. Quizás conociera todo lo que había que conocer en su mundo, pero ese mundo era bastante pequeño. Probablemente ignoraba conocimientos de China, o América, pero no podía saber que los ignoraba. Eso es imposible de sostener ahora. Ninguna persona por sí sola puede tener en su cabeza todo el conocimiento humano. Pero sí tiene acceso, potencialmente, a todo este conocimiento, en internet, en libros, incluso en ChatGPT. Cada formato con sus errores y sesgos.

      Me pareció un texto muy interesante porque reflexiona sobre cómo ha cambiado nuestra relación con el conocimiento a lo largo del tiempo. En la época de Aristóteles, era posible que una persona conociera casi todo lo que se sabía en su entorno, ya que el mundo que se conocía era limitado y la información circulaba de manera más sencilla. Hoy en día, vivimos rodeados de muchísima información a la que podemos acceder fácilmente gracias al internet y la tecnología. Aun así, tener tanta información también tiene su parte negativa, porque no siempre sabemos qué es cierto o qué vale la pena creer. Por eso, lo difícil ahora no es saberlo todo, sino aprender a entender bien lo que encontramos y usarlo de la mejor manera. Stephania Parra

    9. Por su parte, las redes sociales (en un sentido amplio que incluye foros y blogs) atrofiaron nuestro sentido de habitar una realidad común. Pero a cambio nos dieron la posibilidad de cambiar las dinámicas del poder de la información. Ahora “cualquiera” (en el sentido de Ratatouille) puede hacer escuchar su voz, no sólo los guardianes de la información a los que hemos estado acostumbrados. Esto tiene sus cosas buenas y malas, pero sin duda ha cambiado cómo vivimos e interactuamos.

      En este párrafo se reconoce que las redes sociales transformaron nuestra manera de relacionarnos con la información, permitiendo que más voces puedan ser escuchadas, aunque a costa de perder una sensación compartida de realidad. Personalmente, considero que este cambio no puede juzgarse de forma completamente negativa ni positiva. Si bien es cierto que las redes han fragmentado la percepción colectiva y fomentado la desinformación, también democratizaron el acceso a la expresión y al debate público. En conclusión , su valor depende del uso que las personas hagan de ellas: son un reflejo de nuestras dinámicas sociales más que una causa directa de su deterioro.

    10. Una de las críticas que se le suele hacer a la inteligencia artificial generativa (que como conté en otro post, es una sección muy específica de la IA) y que yo mismo hago, es que va a atrofiar nuestra capacidad de hacer y pensar cosas críticamente. Si decides programar usando sólo un chatbot (una práctica llamada “vibe coding” en inglés), vas a delegar constantemente no sólo el trabajo, sino la capacidad de aprender cómo hacerlo. Nunca vas a aprender a programar bien. Ni siquiera vas a saber cómo corregir los errores que salgan de ese vibe coding, porque no vas a saber identificarlos. Lo mismo puede pasar con cualquier actividad humana que se le delegue a una inteligencia artificial: escribir, componer o tocar música, pensar en argumentos, lo que sea.

      La crítica me parece muy cierta porque la inteligencia artificial, aunque ayuda bastante, también puede hacer que dejemos de pensar por nosotros mismos. Si usamos un chatbot para todo, terminamos repitiendo lo que nos da sin entenderlo, y eso nos quita la capacidad de aprender o de resolver problemas por nuestra cuenta. El problema no es la herramienta, sino cuando dejamos que piense por nosotros. El texto invita a reflexionar sobre cómo la tecnología puede afectar nuestra forma de aprender y crear. No se trata de rechazar la inteligencia artificial, sino de usar su potencial sin perder el pensamiento crítico ni la creatividad humana, porque al final lo que nos diferencia es justamente nuestra manera de cuestionar y comprender lo que hacemos.

      Laura Ximena Bolivar Roman

    11. El vibe coding funciona porque hay gente que sabe programar. Un programador que sabe lo que hace puede pedirle a una IA que le haga un código y luego puede revisar y corregir sus inevitables* errores. O puede corregir los errores de las personas que no saben programar pero usaron un chatbot para escribir código. De hecho hay toda una industria de programadores dedicados a hacer estos arreglos. Muchas empresas de software ahora no están contratando a programadores junior, con la idea de que alguien puede producir código à la vibe coding y luego un programador más experto lo puede corregir. ¿Pero qué van a hacer cuando esos programadores expertos se retiren y las empresas pierdan esas habilidades? Por ahora, muchas confían en las promesas de mejoría de la industria de la inteligencia artificial*.

      El texto plantea una preocupación real sobre cómo la inteligencia artificial está cambiando la forma de programar. Hoy en día muchas personas usan chatbots para generar código sin entender realmente cómo funciona, y eso puede ser peligroso a largo plazo. Los programadores con experiencia todavía corrigen esos errores, pero llegará un momento en que esas personas ya no estén. Entonces, ¿quién quedará con el conocimiento? Más que una crítica a la tecnología, el texto nos hace pensar en la importancia de no perder las habilidades humanas detrás del progreso digital. Daniel Camargo

    12. A diferencia de la escritura, no es claro cuál es el beneficio concreto que pueda traernos la inteligencia artificial para que se justifique su eventual omnipresencia (y el atrofiamiento que ella implica). Si absolutamente todos adoptáramos su uso en todas las áreas de la vida, pronto nadie tendría habilidades.

      Estoy de acuerdo con esta parte del texto, es inevitable pensar que la inteligencia artificial va a cambiar muchos aspectos en nuestras vidas, y uno de ellos es nuestra habilidad de pensar criticamente, debido a que esta misma herramienta hará todo por nosotros sin tener que esforzarnos en pensar.

    13. No podemos negar que la inteligencia artificial esté aquí para quedarse. El asunto es cómo va a quedarse. A diferencia de la escritura, no es claro cuál es el beneficio concreto que pueda traernos la inteligencia artificial para que se justifique su eventual omnipresencia (y el atrofiamiento que ella implica). Si absolutamente todos adoptáramos su uso en todas las áreas de la vida, pronto nadie tendría habilidades.

      Estoy de acuerdo con esta afirmación porque aunque la IA ofrece muchas ventajas su uso excesivo puede generar una fuerte dependencia que afecte nuestras capacidades. Si dejamos que la IA piense, escriba, decida y solucione todo por nosotros poco a poco perderemos la habilidad de razonar, analizar y crear por nuestra propia cuenta "la historia demuestra que las herramientas deben complementar al ser humano, no remplazarlo". Ana Sofia Chacón Casasbuenas

    14. la escritura triunfó como tecnología: casi todas las sociedades del planeta la han adoptado y buena parte de nuestro conocimiento, nuestras comunicaciones y nuestra vida en general está basada en esta invención.

      A pesar de que Sócrates no confiaba en la escritura, esta termino siendo parte esencial de la vida humana. Con el tiempo, casi todas las sociedades la adoptaron y gracias a ella hoy podemos guardar y compartir conocimiento, comunicarnos rápidamente y hacernos la vida mas fácil.

    15. tener habilidades humanas es mucho más valioso.

      Me parece muy importante esta frase, es mas valioso tener la fortuna de pensar por nosotros mismos. Siempre he pensado que la IA nunca podrá reemplazarnos, porque es incapaz de crear algo, sirve como motor de búsqueda, crea algo a partir de miles de cosas que ya existen, si, es una gran ventaja y por eso le es tan fácil "crear", pero nada como un cerebro humano investigando, haciendo lluvias de ideas, sacando inspiración de vivencias propias, sintiendo algo con tanta fuerza que pueda sacar una maravillosa pieza artística, grafica o escrita, la IA nunca podrá reemplazar esto. Si bien es posible que algún día aprendan a crear de ceros, nunca será un cerebro humano lleno de recuerdos, sentimientos, motivaciones y razones para pensar. Además recordemos que la IA siempre va a necesitar de un humano que le diga que hacer.

    16. Una de las críticas que se le suele hacer a la inteligencia artificial generativa (que como conté en otro post, es una sección muy específica de la IA) y que yo mismo hago, es que va a atrofiar nuestra capacidad de hacer y pensar cosas críticamente.

      Esta parte del texto atrajo mi atención, durante todo el texto se mencionó la palabra "atrofiar" para referirnos al hecho de perder la capacidad para recordar algo, en sí perder parte de nuestra memoria, desde que tengo memoria el internet ha sido parte fundamental de mi vida, en la mayoría de cosas o actividades; hace unos años para poder investigar algún tema o tener conocimiento de este ingresabas en miles de páginas para poder recolectar información que sea útil, actualmente ya solo utilizas una herramienta para eso, que es la IA, esta palabra atrofiar da mucho de que pensar, que realmente estamos perdiendo la capacidad de poder hacer tareas tan básicas como el resumen de un libro, etc. Con esto quiero decir, de que no tenemos conciencia ni estamos colocando límites, como máquinas que somos de aprender, mejorar, superar nuestras capacidades de aprendizaje, estamos volviendo flojos en todo sentido, para cada situación recurrimos a que las inteligencias artificiales nos hagan las vida aún más fácil en esta época, estamos "atrofiando" todo lo que somos, la cúspide de la inteligencia en la tierra.

      Juan Sebastian Quiroz Arroyo

    17. vas a delegar constantemente no sólo el trabajo, sino la capacidad de aprender cómo hacerlo. Nunca vas a aprender a programar bien. Ni siquiera vas a saber cómo corregir los errores que salgan de ese vibe coding, porque no vas a saber identificarlos. Lo mismo puede pasar con cualquier actividad humana que se le delegue a una inteligencia artificial: escribir, componer o tocar música, pensar en argumentos, lo que sea.

      Ésta parte me parece muy interesante ya que en un par de lineas logra explicar muy detalladamente uno de los principales y más peligrosos problemas del uso de inteligencias artificiales, ya que al resultarnos efectivas el 99% de las veces llegamos a confiar ciegamente en ellas, siendo esto lo más peligroso, perder la habilidad de ver qué es lo que se está diciendo o si es correcto eso que se está diciendo porque "si antes estuvo bien y no me falló, porqué sería diferente ahora?".

      Juan Castellanos

    18. La inteligencia artificial es muy compleja y aún no nos ha demostrado que se justifique para ser inevitable y que sus críticos quedemos como Sócrates.

      No sé si decir que por el momento sus críticos no están como Sócrates en su momento o ahora. Sócrates criticó un aspecto de la escritura (que eso sí, ya extremista lo llevó al punto del rechazo absoluto), la dependencia de un medio para la memorización, pero en ningún momento habla de cómo aun así tiene otro tipo de beneficios, como es la facilidad de compartir información en masa (aunque eso ya iría mucho más en el futuro de parte de Gutenberg y aprovechado por Lutero). Sócrates criticó un aspecto de la escritura, que incluso hoy día me parece razonable, al igual que la crítica hacia las IA, al igual que Sócrates, se juzgó un aspecto en concreto de ese todo en general, ambas lógicas y funcionales hoy día a mi parecer, por lo cual no me parece este símil que se crea en la conclusión final de que aún no se demuestra que el crítico, en este caso el autor, quede como Sócrates.

    19. Aunque seguramente muchas personas usarán estas herramientas para escribir cosas, consideren lo que pasaría si todo el texto del mundo fuera creado por IA: los modelos de lenguaje en los que están basados estas herramientas

      Esto en especifico me recuerda a varias cosas que lei sobre como funcionaba la ia con la recopilacion de información y como la ia muchas veces funciona de manera que pueda complacerte o hacerte interesarte en un resultado. La inteligencia artificial activamente se va comiendo informacion de todas partes, las traduce y almacena en todo su ''conocimiento'' virtual, pero, ¿Que pasa cuando este conocimiento se retroalimenta de si mismo?

      Uno de estos temas tocaba el como algunas ia tomaban informacion de articulos escritos por otras ia, ¿Es siempre información tan confiable?

      En mi opinion, si de aqui en un futuro sucediera algo así, estariamos condenados a un mundo todavia más lleno de desinformación y seria más dificil investigar que es real y que no, haciendo que muchos datos que son falsos se tomen como ciertos en la mente colectiva

    20. las personas nos daremos cuenta de que obtener habilidades es mucho más valioso de delegárselas a una máquina.

      Esta parte me hizo tener un momento de verdadera reflexión. El autor afirma una idea que en mi cabeza se ha venido debatiendo de un buen tiempo para acá... Que tan realmente "superior" es la habilidad digital y de la IA, sobre la capacidad humana y sus ya conocidos limites?

      Pienso que es algo que desarrolla bien a lo largo del texto, y que como individuo, es algo que al momento no he logrado aclarar.

    21. Por su parte, las redes sociales (en un sentido amplio que incluye foros y blogs) atrofiaron nuestro sentido de habitar una realidad común. Pero a cambio nos dieron la posibilidad de cambiar las dinámicas del poder de la información. Ahora “cualquiera” (en el sentido de Ratatouille) puede hacer escuchar su voz, no sólo los guardianes de la información a los que hemos estado acostumbrados. Esto tiene sus cosas buenas y malas, pero sin duda ha cambiado cómo vivimos e interactuamos.

      Estoy de acuerdo con la idea de que las redes sociales, incluyendo foros y blogs, han tenido un impacto doble. Por un lado, como se menciona, siento que han debilitado nuestra noción de vivir en una realidad compartida.Sin embargo el cambio que trajeron es radical. Antes, la información y la opinión pública estaban casi totalmente controladas por unos pocos, grandes medios de comunicación, instituciones y expertos.a referencia a Ratatouille es perfecta: "cualquiera puede cocinar", o en este caso, cualquiera puede tener una voz. Un testimonio en un blog, un hilo en Twitter o un video en TikTok pueden visibilizar una injusticia o una idea que los medios tradicionales ignoraban. La democratización de la voz viene acompañada de desinformación y ruido. Pero a pesar de eso, ha cambiado esencialmente cómo nos informamos, nos relacionamos y hasta cómo exigimos responsabilidades a los poder. 1025062039

    22. Aunque seguramente muchas personas usarán estas herramientas para escribir cosas, consideren lo que pasaría si todo el texto del mundo fuera creado por IA

      Este punto me parece muy interesante. Si todo lo escribiera una máquina, perderíamos la creatividad y la originalidad humanas. Las ideas frescas, las emociones reales, los errores que también son parte de escribir... todo eso es difícil que lo reproduzca una IA. Me gusta pensar que todavía vale la pena escribir nosotros mismos.

    23. Esto, habría dicho Sócrates, nos daría una “simulación” del conocimiento, en vez de permitirnos acceder a un “verdadero” conocimiento de las cosas.

      Esta idea me resulta muy familiar en mi vida. Hoy en día muchas veces sentimos que sabemos algo solo porque lo leímos o lo encontramos rápido en internet, pero en realidad no lo entendemos profundamente. Es como aprender de memoria sin comprender. Me pasa con algunos temas en clase: creo que los entiendo porque los vi en un video o en una IA, pero después, cuando tengo que explicarlos, me doy cuenta de que no lo procesé bien.

    24. El vibe coding funciona porque hay gente que sabe programar. Un programador que sabe lo que hace puede pedirle a una IA que le haga un código y luego puede revisar y corregir sus inevitables* errores. O puede corregir los errores de las personas que no saben programar pero usaron un chatbot para escribir código. De hecho hay toda una industria de programadores dedicados a hacer estos arreglos. Muchas empresas de software ahora no están contratando a programadores junior, con la idea de que alguien puede producir código à la vibe coding y luego un programador más experto lo puede corregir. ¿Pero qué van a hacer cuando esos programadores expertos se retiren y las empresas pierdan esas habilidades? Por ahora, muchas confían en las promesas de mejoría de la industria de la inteligencia artificial*.

      "El vibe coding suena bien en el corto plazo, pero se apoya en que todavía hay gente que realmente sabe programar. Si dejamos de formar nuevos programadores porque 'la IA lo hace sola', ¿quién va a entender el código en unos años cuando los expertos se retiren? Es como construir un edificio con piezas prefabricadas sin que nadie sepa cómo funciona la estructura: puede aguantar un tiempo, pero el día que falle algo, ¿quién lo arregla?

    25. A diferencia de la escritura, no es claro cuál es el beneficio concreto que pueda traernos la inteligencia artificial para que se justifique su eventual omnipresencia (y el atrofiamiento que ella implica).

      La inteligencia artificial, más que expandir nuestras capacidades, amenaza con adormecerlas. Nos acostumbra a delegar el pensamiento, el aprendizaje y la creación, hasta el punto de confundir comodidad con progreso. Nos ofrece respuestas inmediatas, pero no comprensión; eficiencia, pero no conocimiento. En lugar de impulsarnos hacia una inteligencia más profunda, corre el riesgo de volvernos dependientes de una ilusión de saber, una versión brillante pero vacía de lo que significa realmente pensar.

    1. 2008 zhoršuje

      Nevím, jestli koukám dobře na graf, ale: když se dejme tomu dostupnost mezi 2008 a 2013 zlešovala, nebylo by lepší říct, že se dostupnost bydlení zhorušuje od 2014?

    1. Between Not Everythingand Not Nothing: Cuts TowardInfrastructural Critique

      chrome-extension://bjfhmglciegochdpefhhlphglcehbmek/pdfjs/web/viewer.html?file=file%3A%2F%2F%2FUsers%2Fprestontaylor%2FDownloads%2FC3.VISHMIDT_Between%2520Not%2520Everything%2520and%2520Not%2520Nothing_FORMER%2520WEST%25202016.pdf

    Annotators

    1. Le Prince d’Aquitaine à la tour abolie

      Eliot’s line “Le prince d’Aquitaine à la tour abolie” which translates to “the prince of Aquitaine, his tower in ruins” is a direct reference to the identical line in Gerard de Nerval’s poem El Desdichado. The “tour abolie” or “tower in ruins” references back to the “falling towers” from earlier in the section, and, thus the “unreal city” referenced in “The Burial of the Dead” and in “A Game of Chess.” In these references, the city, which may seem at first glance to be bustling and full of life, is inverted upon further investigation. With “brown fog” and “tower in ruins,” the images that Eliot portrays of the urban environment is anything but inspiring. The “towers” that make up the city, falling or in ruins, are replicated in the structure of the poem, with the poem itself acting as an autonomous landscape, with the characters, Madame Sosostris, Tiresias, etc, going through the motions of life while surrounded by a world, or words, falling apart. Thus, at the end of The Waste Land, bringing this notion of “la tour abolie” to the forefront, the readers can end the poem, seeing the microcosm of references and urbanity crumbling beneath itself, with no hope of resurrection.

      In his annotation on the same line, Richard Lu compares the “tour abolie” and “seule étoile” (in the following line of El Desdichado) to the tarot deck. He states that “in tarot decks, The Star directly follows The Tower card. The Tower is often called the most dreadfall card as it often implies a sudden disaster which instates a change in your world. This change is extreme. However, after the tower falls, the Star appears, the tarot card of hope. However, Eliot ends with this reference. The Star is dead, there may not be a hope after the destruction of The Tower.” Concluding “The Waste Land,” this line, among others, works to support the desolation laid out since the very first line. Though approached with a dead and barren landscape, throughout the poem one can find a glimmer of hope, an ounce of inspiration between the clever minds and many characters that prolong the narrative. This ending defies all of that. Eliot, through all five parts of the poem, sets up an ending where divinity and faith have no place. There is no hope for the The Star tarot card coming next, it is dead. There is no God, nothing matters. In the wake of destruction, among the “falling towers” in ruins, one finally becomes as desolate and depressing as their surroundings.

    2. DA

      This section depicts an image of drought in India, drawing on the geography of the Ganges River (Ganga) and the Himalayan mountains (Himavant). The "limp leaves" and "sunken" river depict a world parched and waiting for a life-giving force, while the rain clouds gather "far distant," suggesting that salvation or meaning is possible but not yet present. The "jungle crouched, humped in silence" (line 399) adds a sense of primal, coiled tension, as if the entire natural world is holding its breath. This silence is then broken by the thunder, which says "DA." This sound as a three-part command structures the rest of this section; each command is met with a complex response, revealing a deep spiritual inadequacy. The first is "Datta," or "give". The speaker creates a self-interrogation by saying "what have we given?" (line 402) They later claim that "By this, and this only, we have existed." This "truth" is unrecordable though, and won't be found in "memories draped by the beneficent spider / Or under seals broken by the lean solicitor" (line 408). It exists only in the haunting echo of "our empty rooms." The thunder speaks again, saying "Dayadhvam," or "be compassionate." This triggers a scene from Dante's Inferno: "I have heard the key / Turn in the door once and turn once only" (line 412-413). This is the sound of Count Ugolino being locked in the tower to starve, a symbol of irrevocable imprisonment. Eliot then layers this with a philosophical idea from F.H. Bradley, who saw every individual consciousness as a isolated prison. Individuals are all trapped in their subjective selves, "each in his prison" (line 414) and their awareness of the isolation. Lastly the thunder commands Damyata. Unlike the previous two sections, which dwell on failure and isolation, this one offers a glimpse of harmony. It paints a picture of perfect control: a boat responding "gaily" to an "expert" hand. The speaker then poignantly extends this metaphor to a human relationship, saying "your heart would have responded / Gaily, when invited." The conditional tense "would have" is key—it reveals this harmonious control not as a reality, but as a lost opportunity or a poignant "what if." It’s a vision of a relationship that could have been obedient to "controlling hands," a symbiosis that was never achieved.

    3. DA

      I have been tracking agency across the poem in many of my annotations. Something that I think somehow fits with this—also at play across the poem—is perception. Someone touched on this the other day (I think it might have been William, but I can’t quite remember), and it made me think of this connection.

      “Da” means “be self-controlled,” “give,” or “be compassionate.” But how can these three be distinguished between when just “Da” is used (and not Damyata or Datta or Dayadhvam, and if the relationship—to gods, to human beings, or to demons—is not defined)? How does the second voice in the exchange in the poem discern which? Zooming out, readers, in a pause before reading what follows after each “Da,” can interpret it as all three at once, or choose one. Is this desired? How does the effect change?

      The Bradley source clearly speaks very much to perception. How can the exchange between the two figures (one being thunder) here at the end with the three repetitions, and meanings, of “Da” be informed or elucidated by this? At the very least, it seems to exemplify the potential disconnect (and differing interpretation etc.) Bradley lays out. Is this then to spread to other characters and their exchanges in the poem? All of this makes me want to loop back to Madame Sosotris, and the tarot deck…

      On a slightly different note, I think this ending (or at least the beginning of the end; I don’t want to speak too soon) is hopeful. The order of “Da”s goes from human beings to demons to gods, ending with self-control—and a boat responding “Gaily, to the hand expert with sail and oar / the sea was calm…” This seems to lay out the journey, ending on top, and with established human agency. But of course I am now seeing some other ways this could be read…

    4. DA

      “Look at this stuff, isn’t it neat?” she asks, surrounded by objects that don’t speak her language. Ariel’s voice is stolen so she can walk on land, and that is the question of Psalm 137: “How shall we sing the Lord’s song in a strange land?” Eliot asks the same thing, except there’s no sea witch to blame, only silence. The Ganga is sunken, the clouds are distant, and the thunder can barely form a word: DA. The syllable stammers toward meaning. Datta. Dayadhvam. Damyata. Give. Sympathize. Control. Commands that echo the psalm’s plea for song but return only fragments. The captives in Babylon hung their harps on the willows; Eliot’s speakers hang their words on static. The thunder speaks, but its language is splintered, a sacred tongue reduced to consonants. Both rivers are holy, both are broken. The Ganges without rain is as desolate as Babylon without Zion. In both, sound becomes the only remaining form of faith, the echo of what once was music. The psalmist threatens vengeance, but Eliot offers obedience; both are desperate to reclaim voice through rhythm. When the thunder says DA, it is the ghost of a hymn, a cracked psalm vibrating through dry air. The rain hesitates at the edge of speech. What’s left is a choir of lost voices, each trying to sing in a language it no longer believes in. Ariel traded her song for legs; Eliot traded his for survival. Neither ever gets it back.

    5. What you get married for if you don’t want children?

      Between his textual narrative of Lil and his reference to Ophelia, Eliot examines the contrast and connection between love, virginity, purity, and exploitation. Here, the speakers discuss how Lil is not taking care of her appearance and will not be appealing to her husband, then one says “What you get married for if you don’t want children?” This implies that the sole reasons for marriage are sexuality and having children, while the concept of love is not mentioned. The final line of “The Game of Chess” is “Good night, ladies, good night, sweet ladies, good night, / good night,” which is a reference to Ophelia’s farewell in Hamlet prior to her suicide. By ending the passage with Ophelia’s words of distress, it is implied that Ophelia’s situation is very significant to Eliot’s message. Ophelia says, in the excerpt from Hamlet, that “To-morrow is Saint Valentine's day, All in the morning betime, And I a maid at your window, To be your Valentine. Then up he rose, and donn'd his clothes, And dupp'd the chamber-door; Let in the maid, that out a maid Never departed more.” She mentions arriving at Hamlet’s window as a virgin (a maid – older description for a young unmarried virgin), looking to be his Valentine, or to find love. Ophelia leaves this meeting no longer a maid, or a virgin. Afterwards, Ophelia adds, “Before you tumbled me, You promised me to wed. So would I ha' done, by yonder sun, An thou hadst not come to my bed.” Hamlet promised to marry her, which she was ready for, but instead just slept with her and then shunned her. Ophelia seems to feel used and exploited for her body. Lil, having already lived a life of five children, chooses to resist the need to appease her husband with superficial changes to her appearance. It is almost as if Lil lived Ophelia’s life, but continued living with a different mindset, though she is still subject to the same expectations and judgement. The difference between the two women is that Lil experiences this while married, and Ophelia is a young unmarried woman. Considering the time period of the piece, having lost her virginity to a man who decides not to marry her after all, Ophelia is left “ruined” and “dishonored” in society and in future romantic relationships. Essentially, by taking her virginity without marrying her, Hamlet has sentenced Ophelia to a life without the authentic love she originally desired. Left without clear choices and grieving the loss of her father, Ophelia becomes mentally unstable and feels that she has no other option than suicide by drowning. This is significant, because water is most often viewed as spiritually pure, especially as the medium for baptism. At the start of one’s life, they are baptized, and at the end of Ophelia’s life, she drowns. So, at line 170, when the women in the bar say “goonight” to each other, they are just going home for the night. In the final line, however, when the farewells shift to Ophelia’s voice, she is saying goodbye to the “Game of Chess” – the “game” of a woman experiencing sexual exploitation and a loss of pure connection – and transitioning the reader to the next section where water (the River Thames) becomes polluted and “impure,” as well.

    6. DA

      In the final lines of the part (and the poem), we finally hear from the divine. I have established, throughout this part of the poem, an equality and symbiosis between the divine and the natural, whether that is the wind and the chapel or the thunder or the “third walking beside you.” With this in mind, the words from the Thunder, “DA,” stand out as a calling from the divine, perhaps a respite or a sign of holiness, in a waste land, where the lack or disappearance of a god has been repeatedly referenced. These words, from the Thunder to the readers and inhabitants of The Waste Land, perhaps signify an end to the perilous conditions that they have been living under, a savoir, at last, from the human-induced nightmare we have all been living in.

      Ira’s reading of the line “DA” stuck out to me during tonight’s reading. In the narrative of the poem, we are presented with the lines “Then spoke the thunder/DA,” suggesting that this is, finally, “What the Thunder Said.” Interestingly, Ira’s reading of this line splits the term “DA” into the prefix for the three proceeding words, “datta,” “dayahadvam,” and “damyata.” All of which were meant to represent the interpretations of the thunder’s words to the different groups – gods, humans, and demons. What interested me the most about her annotation, however, was the question that she indirectly asks in the second paragraph, do we care more about the interpretations of the thunder’s words than the true words themselves? She further asks “what if “da” were intended to encompass all three meanings?”

      With this in mind, it makes me wonder: To what degree do the Thunder’s words matter at all? Clearly, they acted as a stem, a savoir for the datta,” “dayahadvam,” and “damyatal;” however, the meanings and suffixes of these words were all human creations. Perhaps what Eliot is suggesting is that we don’t need to look for guidance or a savoir from our action-induced reality, perhaps we have the answers within ourselves, though they may seem to be too divine or grand for human creation and understanding. Do we need to look outward to find the solution to The Waste Land?

    7. Damyata: The boat responded

      Datta means "give," dayadhvam means "be compassionate," and damyata means "be self-controlling." Eliot's footnote cites these three Sanskrit words from the Brihadaranyaka Upanishad, where the thunder speaks "DA DA DA" to gods, humans, and demon. The reference to a boat in the damyata section (lines 418-422) deliberately contrasts with earlier water imagery. As Scholar Marisin observes, “Earlier references to 'death by water' and 'drowned sailors' immediately fills the mind with dread at the mere implication of the liquid, yet here the ‘sea was calm” and the hand is an “expert with sail and oar.” This shift from threatening to tranquil water happens because a prophetic voice can finally appear in the Wasteland. When the thunder speaks, it represents authentic divine revelation rather than Madame Sosostris's fraudulent fortune-telling. The spiritual chaos that has defined the poem momentarily stills, just like the sea. The fortune of the sailor takes on new meaning here. Madame Sosostris warned "Fear death by water" (line 55), and we saw Phlebas the Phoenician drowned in Part IV. But the damyata section reframes water entirely. The boat "responded / Gaily" to "controlling hands" (lines 418, 422). This isn't about technological mastery over nature but rather the heart "beating obedient" (line 421) to spiritual discipline. Damyata as self-control leads to harmony rather than drowning.

    1. Note d'information : Militantisme et Esprit Critique

      Synthèse

      Ce document de synthèse analyse les tensions entre l'engagement militant et la rigueur de la pensée critique, en se basant sur les analyses de Laurent Puech, assistant social.

      Il démontre que si le militantisme est essentiel pour le progrès social, une approche axée exclusivement sur la "cause" peut conduire à des dérives méthodologiques, à la manipulation de données et à des résultats contre-productifs.

      À travers deux études de cas approfondies — les violences conjugales et les enfants tués par leurs parents —

      Laurent Puech met en lumière comment certains discours militants, souvent amplifiés par les médias et les institutions, propagent des statistiques alarmistes et factuellement fausses.

      Par exemple, l'idée d'une augmentation des "féminicides" ou le chiffre de "deux enfants tués par jour" sont directement contredits par les données officielles, qui montrent au contraire une baisse significative de ces phénomènes.

      Ce décalage entre la perception et la réalité révèle l'utilisation des chiffres non pas comme des outils de mesure, mais comme des arguments moraux visant à susciter l'émotion et à valider une idéologie préexistante.

      Cette démarche, bien que souvent sincère, entrave une compréhension juste des problèmes, génère une peur infondée et risque de paralyser les victimes que l'on prétend aider.

      En conclusion, Laurent Puech plaide pour un militantisme fondé sur la méthode, la vérification des faits et l'honnêteté intellectuelle, même face aux sujets les plus sensibles.

      1. Profil de l'intervenant : Laurent Puech

      Laurent Puech est un assistant social de formation qui a développé une expertise sur l'application de la pensée critique dans le domaine de l'aide sociale et du militantisme.

      1.1. Parcours professionnel

      Formation et débuts : Après une réorientation professionnelle vers la trentaine, il s'est formé au métier d'assistant de service social.

      Expériences diverses : Son parcours l'a conduit à travailler dans des contextes variés, incluant le milieu scolaire (collèges, lycées), la "polyvalence de secteur" (service social de quartier), et une mise à disposition auprès de la gendarmerie.

      Spécialisation : Ces expériences l'ont rapproché des questions de protection de l'enfance et des personnes, notamment les femmes victimes de violences conjugales.

      Son rôle auprès de la gendarmerie consistait à assister le public en contact avec les forces de l'ordre, sur la base du volontariat.

      1.2. Parcours militant et évolution

      Laurent Puech se définit comme un militant, son parcours étant jalonné d'engagements syndicaux, politiques et associatifs (notamment à l'Association Nationale des Assistants de Service Social - ANAS).

      Il décrit une évolution significative dans sa manière de militer :

      Du militantisme de l'idée... : Dans sa jeunesse (années 80), son engagement était principalement motivé par des "idées" et des grands principes.

      Il cite son adhésion au MRAP (Mouvement contre le Racisme et pour l'Amitié entre les Peuples) comme un exemple de militantisme centré sur la défense de valeurs (égalité, dignité) sans un questionnement approfondi de la méthode.

      ...Au militantisme de la méthode : Aujourd'hui, son militantisme est axé sur la défense d'une méthode basée sur la pensée critique, l'analyse de l'information et la déconstruction des logiques argumentatives.

      Il ne défend plus une "casquette" mais une approche rigoureuse.

      L'influence de sa jeunesse punk a joué un rôle formateur, lui inculquant une défiance envers les autorités non justifiées et un regard critique comme préalable à la reconnaissance de toute autorité.

      2. Le Rôle Central de la Pensée Critique

      L'intérêt de Laurent Puech pour la pensée critique a émergé en dehors du travail social, lors d'une expérience dans le secteur de la diététique en Belgique.

      2.1. L'origine de l'intérêt

      Confronté à des personnes en souffrance utilisant des thérapies dites "alternatives" (par exemple, des cures de vitamines basées sur les conseils d'un astrologue), il a commencé à s'interroger sur l'impact des croyances.

      Il a compris que la sincérité ou la bienveillance d'un praticien (astrologue, gourou) ne suffisait pas à garantir la qualité de sa démarche.

      La lecture d'ouvrages comme "Le paranormal" d'Henri Broch a été un tournant, lui fournissant les outils méthodologiques pour analyser la construction d'une argumentation et la validité des preuves.

      2.2. La Zététique comme méthode

      Il a adopté la démarche de la zététique, définie comme un scepticisme utilisant la méthode scientifique pour mettre à l'épreuve des énoncés par l'investigation, la remontée aux sources et l'expérimentation.

      Il a appliqué cette méthode en déconstruisant les prévisions de l'astrologue Élizabeth Teissier, démontrant qu'elles étaient soit factuellement fausses (dans les dates), soit si vagues qu'elles étaient sujettes à toutes les interprétations.

      Cette analyse a également mis en lumière la complaisance des médias, qui relayaient ses affirmations sans aucun regard critique.

      3. Application au Travail Social : Le Paradoxe de la Protection

      Laurent Puech transpose cette analyse critique à son propre domaine, le travail social, via ses sites SecretPro.fr (sur le secret professionnel) et protections-critiques.org.

      3.1. L'aide sociale comme "effraction"

      Il décrit certaines facettes de l'aide sociale, notamment en protection de l'enfance, comme une "effraction". Lorsqu'une "information préoccupante" est émise, une enquête sociale est déclenchée.

      Une famille ne peut refuser ce contact sans risquer une saisine de l'autorité judiciaire. L'intervention, même si elle n'est pas physiquement forcée, l'est symboliquement.

      Il note une augmentation de ces procédures, ce qui pose la question de l'équilibre entre aide et contrôle, avec une part du contrôle devenant de plus en plus "brutale et violente".

      3.2. L'angle mort du système protecteur

      Le principal risque est que "le protecteur peut devenir maltraitant". Selon lui, les systèmes de protection souffrent d'un angle mort majeur :

      ils sont conçus pour voir la violence chez les autres (les familles) mais peinent à penser leur propre violence potentielle.

      Ce phénomène est renforcé par un vocabulaire qui se veut exclusivement positif :

      Protection : Un concept "horizon", une promesse impossible à atteindre pleinement.

      Déontologie, Respect, Bienveillance : Des termes qui bardent le professionnel de certitudes morales et l'empêchent de questionner les effets potentiellement destructeurs de ses actions.

      Pour Puech, la bienveillance ne se décrète pas au présent ; elle se mesure aux effets produits, donc toujours au passé.

      L'injonction paradoxale faite aux professionnels ("Soyez aidant en contrôlant les gens") achève de brouiller les repères et complique la pratique quotidienne.

      4. Étude de Cas 1 : Les Violences Conjugales

      Laurent Puech applique sa méthode critique à la question très médiatisée des violences conjugales, en analysant les discours militants et les données disponibles.

      4.1. Définitions et données

      Distinction clé : Il rappelle la distinction du rapport Henrion entre le conflit conjugal (où les acteurs sont sur un pied d'égalité, même avec des actes violents) et la violence conjugale, qui se caractérise par une domination de l'un sur l'autre.

      Types de violence : Si la violence physique grave est majoritairement le fait d'hommes sur des femmes, les études (notamment québécoises) montrent une quasi-parité dans les violences psychologiques.

      Manque de données en France : L'enquête Enveff (début des années 2000) ne portait que sur les femmes.

      Le rapport complet de la nouvelle enquête Virage (hommes et femmes), prévu pour 2017, n'est toujours pas publié en 2019.

      4.2. Analyse du discours militant

      Le discours militant actuel se concentre sur les violences physiques des hommes envers les femmes, en utilisant le terme "féminicide". Cette approche présente plusieurs biais :

      Invisibilisation d'une partie du réel : Elle occulte les violences psychologiques, les violences exercées par des femmes, et les hommes victimes.

      Ces derniers sont d'ailleurs confrontés à une incrédulité qui rend leur parole encore plus difficile ("Oh, monsieur ! C'est qui, l'homme à la maison ?").

      Simplification idéologique : En ne retenant que la violence patriarcale (homme sur femme), ce discours met sur le même plan des situations de nature très différente (ex: un homicide violent et une euthanasie de conjoint atteint d'Alzheimer) au seul motif que la victime est une femme.

      Focalisation sur l'agresseur : En se concentrant sur les "féminicides", le discours militant s'intéresse moins aux femmes victimes qu'à prouver que "l'homme est un salaud".

      La preuve en est que les homicides de femmes par des femmes dans un couple ne sont pas comptabilisés par certains collectifs.

      4.3. La réalité des chiffres

      Le discours militant diffuse l'idée d'une augmentation dramatique des "féminicides", en s'appuyant sur des pics statistiques de courte durée (ex: janvier-février 2019) et en ignorant les périodes de baisse.

      Tendance de fond : Les données officielles de la Délégation aux victimes du Ministère de l'Intérieur, collectées depuis 2006, montrent une baisse de 25 % des homicides au sein du couple (hommes et femmes) entre 2006 et 2017.

      Contexte général : Cette baisse s'inscrit dans une tendance plus large de diminution des homicides en France (passés de 1500 à 800 par an en 15 ans).

      Explications : Cette amélioration est le fruit de multiples facteurs : meilleure connaissance du phénomène (grâce, paradoxalement, aux alertes militantes initiales), renforcement de la loi pénale, et création de dispositifs d'aide et d'hébergement.

      Effets du discours alarmiste : En affirmant que "rien n'est fait", le discours militant actuel est jugé "dépressif" et peut "tétaniser les femmes qui vivent de la violence" en leur envoyant le message que la société les abandonne.

      5. Étude de Cas 2 : Les Enfants Tués par leurs Parents

      Un autre sujet où l'émotion anesthésie l'esprit critique est celui des enfants tués par leurs parents.

      5.1. Le mythe des "deux enfants tués par jour"

      Le chiffre de "deux enfants tués par jour" (environ 700 par an) est largement diffusé par des associations, des médias et même des institutions (rapports parlementaires, ministres, etc.).

      Laurent Puech en retrace l'origine, qu'il compare à celle de l'iridologie (une pseudoscience fondée sur une seule anecdote non vérifiée).

      Origine (années 80) : Le chiffre provient d'une extrapolation "insensée" réalisée à partir de données éparses d'un seul service hospitalier.

      Légitimation (années 2000) : Une étude de l'Inserm, portant uniquement sur les enfants de 0 à 1 an, a popularisé une méthode d'extrapolation consistant à multiplier les cas connus par un facteur allant jusqu'à 15 pour estimer les cas cachés (ex: syndrome du bébé secoué).

      Généralisation absurde : Cette méthode, déjà très critiquable pour les nourrissons, a ensuite été appliquée à tous les mineurs, comme s'il était aussi facile de dissimuler le meurtre d'un adolescent de 14 ans que celui d'un bébé.

      5.2. La réalité des chiffres

      Contradiction flagrante : Le chiffre de 700 enfants tués par an était supérieur au nombre total d'homicides enregistrés en France toutes catégories d'âge confondues. Cette absurdité n'a pourtant pas empêché sa diffusion.

      Données réelles : Un travail de recensement rigoureux mené sur la période 2012-2016 a établi le nombre moyen de cas à environ 70 par an, soit dix fois moins que le chiffre militant.

      5.3. Le chiffre comme argument moral

      L'analyse de Laurent Puech montre que, sur ces sujets hautement émotionnels, le chiffre n'est pas utilisé pour décrire le réel, mais pour soutenir une position morale.

      Il sert à dire "j'ai raison" et à disqualifier toute parole dissonante comme étant "immorale".

      Ceux qui contestent le chiffre sont accusés de minimiser la gravité du problème et de se placer "dans le camp du mal", alors même que la critique ne porte pas sur la sincérité des acteurs, mais sur la rigueur de leur méthode et la fiabilité de l'information qu'ils diffusent.

  4. accessmedicina.mhmedical.com accessmedicina.mhmedical.com
    1. McQuaid KR. McQuaid K.R. McQuaid, Kenneth R.Apendicitis. In: Papadakis MA, Rabow MW, McQuaid KR, Gandhi M. Papadakis M.A., & Rabow M.W., & McQuaid K.R., & Gandhi M(Eds.),Eds. Maxine A. Papadakis, et al.eds. Diagnóstico clínico y tratamiento 2025. McGraw Hill Education; 2025. Accessed octubre 18, 2025. https://accessmedicina.mhmedical.com/content.aspx?bookid=3530&sectionid=294839301

      hj

    1. fixada
      • Tarifa = Fixada pela proposta vencedora da licitação;

      • <u>Revisão</u> da tarifa = regras previstas em lei, contrato e edital

  5. www.planalto.gov.br www.planalto.gov.br
    1. reajustamento de preços

      Observação:

      Art. 135. Os preços dos contratos para serviços contínuos com regime de dedicação exclusiva de mão de obra ou com predominância de mão de obra serão repactuados para manutenção do equilíbrio econômico-financeiro, mediante demonstração analítica da variação dos custos contratuais, com data vinculada:

      • I - à da apresentação da proposta, para custos decorrentes do mercado;

      • II - ao acordo, à convenção coletiva ou ao dissídio coletivo ao qual a proposta esteja vinculada, para os custos de mão de obra.

      § 1º A Administração não se vinculará às disposições contidas em acordos, convenções ou dissídios coletivos de trabalho que tratem de matéria não trabalhista, de pagamento de participação dos trabalhadores nos lucros ou resultados do contratado, ou que estabeleçam direitos não previstos em lei, como valores ou índices obrigatórios de encargos sociais ou previdenciários, bem como de preços para os insumos relacionados ao exercício da atividade.

      § 2º É vedado a órgão ou entidade contratante vincular-se às disposições previstas nos acordos, convenções ou dissídios coletivos de trabalho que tratem de obrigações e direitos que somente se aplicam aos contratos com a Administração Pública.

      § 3º A repactuação deverá observar o interregno mínimo de 1 (um) ano, contado da data da apresentação da proposta ou da data da última repactuação.

      § 4º A repactuação poderá ser dividida em tantas parcelas quantas forem necessárias, observado o princípio da anualidade do reajuste de preços da contratação, podendo ser realizada em momentos distintos para discutir a variação de custos que tenham sua anualidade resultante em datas diferenciadas, como os decorrentes de mão de obra e os decorrentes dos insumos necessários à execução dos serviços.

      § 5º Quando a contratação envolver mais de uma categoria profissional, a repactuação a que se refere o inciso II do caput deste artigo poderá ser dividida em tantos quantos forem os acordos, convenções ou dissídios coletivos de trabalho das categorias envolvidas na contratação.

      § 6º A repactuação será precedida de solicitação do contratado, acompanhada de demonstração analítica da variação dos custos, por meio de apresentação da planilha de custos e formação de preços, ou do novo acordo, convenção ou sentença normativa que fundamenta a repactuação.

    2. 50% (cinquenta por cento)
      • Considerando que parcela significativa é aquela que representa no mínimo 4% do valor global da contratação, será admitido a exigência da Administração de comprovação, por meio de certidões e atestados, que há capacidade de execução de, <u>no máximo</u>, 50% da referida significativa parcela.
    3. 4% (quatro por cento) do valor total

      Para fins de habilitação técnica, parcela de maior relevância ou de valor significativo será aquela que representar, ao menos, 4% do valor total da contratação.

  6. learn-us-east-1-prod-fleet02-xythos.content.blackboardcdn.com learn-us-east-1-prod-fleet02-xythos.content.blackboardcdn.com
    1. “pseayysttpdu‘srouaisy]sjodusoo yetyooodseayew0}‘yoaedsinojoUoNDeNppueoimeuot}a8uryo0}ingyosedsojuTaouaTisWOa8IaUIS0}UsEqJOU sey2188nnssno‘usWIOMYORIJoyATUTED-JOQ‘preayoqUdS2d10AMOYL“WarsUseqJOUSavyUSO‘Gsonrunur4UWOdSTUTPe asIOAIP PUL)SoMTUNUTODydeIqUTInq‘sarisPoUNoyUrspunoi3yoeqdSVAWoyUsUIOMJospjoyasnoyaufUTaoxydusyeE)seyyeyJoSuyaquiaurasaiesnooeueaq ABUT 20UDTISS,UZUTOMUOsiseydurostyy“AyoyINE[eyorenned0}vOIssTUNss,UEWIOAJOuSIsay}—,POOyULUIOMJoypoads1481,IsTXasay]seUDdSUdYOSf2dUETIS‘SafISFUTWASUTA“983dpjnom ,pooyuRWIOMJoyooeds1y8y,ouyeuospassaiddnsaq01sem yooads1wyL “yoaadsuMOAur333919pjnomLyempAraqissodjessorddns0)pepusiurasoaseqBuny]eI,JOJPoeataoos]swuaurysrund ayy ‘yoseds poepsemospezrumsa,ou‘spdSuryyedJoyBuTeo,,OUSEM aray, ‘Yoword0}payedaqAepouwiosiyStu [1mBuLAsyaqyeods0]ourpoSeinoouesavyISAAauy ‘hog&u9eq|peH “PHYPIsarpApetnonedasourpue—pyryo ay—oursoua|is0}popusiuyaJo“AAOUL“ssa[pue pawisas yoaadsjo sioeasapJoysjuawysrund oy‘Kes01SS2TP2ON-soyoseds Suryew ‘suonsonb Sunyse A[ssaTPua‘aurIePaidaJIP 10USoMyey

      this basically highlights how girls, especially black girls in this case, and are often punished for speaking out. It points out that the struggle isn't really to speak but to be heard and share opinions.

    1. Relationships charac-terized as Andhra riste were not as binding as those of the Lashkarwala orSheharwala riste. They did not entail rigid responsibilities and obligationsas the guru-cela bond did, nor were they restricted to members of one’s ownlineage or hijra house. The most common of such relationships were thosebetween “sisters” (behen), and that between a “mother” and her “daughter”(ma-beti relationships).

      Reddy distinguishes two different relationships/bindings in the community: the guru-cela relationship (focus on lineage and hierarchy and entails obligations), and the sister and mother-daughter relationship (motivated by mutual affection, focus on love and caring). I note that she didn't discuss how the sister relationship is organized for hijras, instead comparing the guru-cela and mother-daughter as parallel. As she argues that this latter form were not restricted by houses or lineages, will it unsettle the Hijra's rigid kinship structure by houses? How does the kinship incorporate this form of relationality, or even use it to recuperate the need for care, which is sometimes absent in the obligations-bound guru-cela relationship?

    2. Mothers often appeared to have greater affection for daughters thanfor their celas, even though there was no denying the greater significanceand legitimacy of the guru-cela bond over the ma-beti one

      Just a random pondering: how would the celas and daughters of one hijra interact with each other? And does this scale of affection also engender another form of hierarchy and evaluation?

    Annotators

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2024-02830

      Corresponding author(s): Julien, Sage

      1. General Statements

      We thank the Reviewers for a fair review of our work and helpful suggestions. We have significantly revised the manuscript in response to these suggestions. We provide a point-by-point response to the Reviewers below but wanted to highlight in our response a recurring concern related to the strong cell cycle arrest observed upon the acute FAM53C knock-down being different than the limited phenotypes in other contexts, including the knockout mice and DepMap data.

      First, we now show that we can recapitulate the strong G1 arrest resulting from the FAM53C knock-down using two independent siRNAs in RPE-1 cells, supporting the specificity of the effects.

      Second, the G1 arrest that results from the FAM53C knock-down is also observed in cells with inactive p53, suggesting it is not due to a non-specific stress response due to “toxic” siRNAs. In addition, the arrest is dependent on RB, which fits with the genetic and biochemical data placing FAM53C upstream of RB, further supporting a specific phenotype.

      Third, we have performed experiments in other human cells, including cancer cell lines. As would be expected for cancer cells, the G1 arrest is less pronounced but is still significant, indicating that the G1 arrest is not unique to RPE-1 cells.

      Fourth, it is not unexpected that compensatory mechanisms would be activated upon loss of FAM53C during development or in cancer – which may explain the lack of phenotypes in vivo or upon long-term knockout. This has been true for many cell cycle regulators, either because of compensation by other family members that have overlapping functions, or by a larger scale rewiring of signaling pathways.

      2. Point-by-point description of the revisions

      __Reviewer #1 (Evidence, reproducibility and clarity (Required)): __

      Summary:

      Taylar Hammond and colleagues identified new regulators of the G1/S transition of the cell cycle. They did so by screening public available data from the Cancer Dependency Map, and identified FAM53C as a positive regulator of the G1/S transition. Using biochemical assays they then show that FAM53 interacts with the DYRK1A kinase to inhibit its function. DYRK1A in its is known to induce degradation of cyclin D, leading the authors to propose a model in which DYRK1A-dependent cyclin D degradation is inhibited by FAM53C to permit S-phase entry. Finally the authors assess the effect of FAM53C deletion in a cortical organoid model, and in Fam53c knockout mice. Whereas proliferation of the organoids is indeed inhibited, mice show virtually no phenotype.

      Major comments:

      The authors show convincing evidence that FAM53C loss can reduce S-phase entry in cell cultures, and that it can bind to DYRK1A. However, FAM53 has multiple other binding partners and I am not entirely convinced that negative regulation of DYRK1A is the predominant mechanism to explain its effects on S-phase entry. Some of the claims that are made based on the biochemical assays, and on the physiological effects of FAM53C are overstated. In addition, some choices made methodology and data representation need further attention.

      1. The authors do note that P21 levels increase upon FAM53C. They show convincing evidence that this is not a P53-dependent response. But the claim that " p21 upregulation alone cannot explain the G1 arrest in FAM53C-deficient cells (line 138-139) is misleading. A p53-independent p21 response could still be highly relevant. The authors could test if FAM53C knockdown inhibits proliferation after p21 knockdown or p21 deletion in RPE1 cells. The Reviewer raises a great point. Our initial statement needed to be clarified and also need more experimental support. We have performed experiments where we knocked down FAM53C and p21 individually, as well as in combination, in RPE-1 cells. These experiment show that p21 knock-down is not sufficient to negate the cell cycle arrest resulting from the FAM53C knock-down in RPE-1 cells (Figure 4B,C and Figure S4C,D).

      We now extended these experiments to conditions where we inhibited DYRK1A, and we also compared these data to experiments in p53-null RPE-1 cells. Altogether, these experiments point to activation of p53 downstream of DYRK1A activation upon FAM53C knock-down, and indicate that p21 is not the only critical p53 target in the cell cycle arrest observed in FAM53C knock-down cells (Figure 4 and Figure S4).

      The authors do not convincingly show that FAM53C acts as a DYRK1A inhibitor in cells. Figures 4B+C and S4B+C show extremely faint P-CycD1 bands, and tiny differences in ratios. The P values are hovering around the 0.05, so n=3 is clearly underpowered here. Total CycD1 levels also correlate with FAM53C levels, which seems to affect the ratios more than the tiny pCycD1 bands. Why is there still a pCycD1 band visible in 4B in the GFP + BTZ + DYRK1Ai condition? And if I look at the data points I honestly don't understand how the authors can conclude from S4C that knockdown of siFAM53C increases (DYRK1A dependent) increases in pCycD1 (relative to total CycD1). In figure 5C, no blot scans are even shown, and again the differences look tiny. So the authors should either find a way to make these assays more robust, or alter their claims appropriately.

      We appreciate these comments from the Reviewer and have significantly revised the manuscript to address them.

      The analysis of Cyclin D phosphorylation and stability are complicated by the upregulation of p21 upon FAM53C knock-down, in particular because p21 can be part of Cyclin D complexes, which may affect its protein levels in cells (as was nicely showed in a previous study from the lab of Tobias Meyer – Chen et al., Mol Cell, 2013). Instead of focusing on Cyclin D levels and stability, we refocused the manuscript on RB and p53 downstream of FAM53C loss.

      We removed previous panel 4B from the revised manuscript. For panels 4E and S4B (now panels S3J and S3K)), we used a true “immunoassay” (as indicated in the legend – not an immunoblot), which is much more quantitative and avoids error-prone steps in standard immunoblots (“Western blots”). Briefly, this system was developed by ProteinSimple. It uses capillary transfer of proteins and ELISA-like quantification with up to 6 logs of dynamic range (see their web site https://www.proteinsimple.com/wes.html). The “bands” we show are just a representation of the luminescence signals in capillaries. We made sure to further clarify the figure legends in the revised manuscript.

      The representative Western blot images for 5C-D (now 5F-G) in the original submission are shown in Figure 5E, we apologize if this was not clear. The differences are small, which we acknowledge in the revised manuscript. Note that several factors can affect Cyclin D levels in cells, including the growth rate and the stage of the cell cycle. Our FACS analysis shows that normal organoids have ~63% of cells in G1 and ~13% in S phase; the overall lower proportion of S-phase cells in organoids may make the immunoblot difference appear smaller, with fewer cycling cells resulting in decreased Cyclin D phosphorylation.

      Nevertheless, the Reviewer brings up a good point and comments from this Reviewer and the others made us re-think how to best interpret our results. As discussed above, we re-read carefully the Meyer paper and think that FAM53C’s role and DYRK1A activity in cells may be understood when considering levels of both CycD and p21 at the same time in a continuum. While our genetic and biochemical data support a role for FAM53C in DYRK1A inhibition, it is likely that the regulation of cell cycle progression by FAM53C is not exclusively due to this inhibition. As discussed above and below, we noted an upregulation of p21 upon FAM53C knock-down, and activation of p53 and its targets likely contributes significantly to the phenotypes observed. We added new experiments to support this more complex model (Figure 4 and Figure S4, with new model in S4L).

      The experiments to test if DYRK1A inhibition could rescue the G1 arrest observed upon FAM53C knockdown are not entirely convincing either. It would be much more convincing if they also perform cell counting experiments as they have done in Figures 1F and 1G, to complement the flow cytometry assays. I suggest that the authors do these cell counting experiments in RPE1 +/- P53 cells as well as HCT116 cells. In addition, did the authors test if P21 is induced by DYRK1Ai in HCT116 cells?

      We repeated the experiments with the DYRK1A inhibitor and counted the cells. In p53-null RPE-1 cells, we found that cell numbers do not increase in these conditions where we had observed a cell cycle re-entry (Fig. 4E), which was accompanied by apoptotic cell death (Fig. S4I). Thus, cells re-enter the cell cycle but die as they progress through S-phase and G2/M. We note that inhibition of DYRK1A has been shown to decrease expression of G2/M regulators (PMID: 38839871), which may contribute to the inability of cells treated to DYRK1Ai to divide. Because our data in RPE-1 cells showed that p21 knock-down was not sufficient to allow the FAM53C knock-down cells to re-enter the cell cycle, we did not further analyze p21 in HCT-116 cells.

      The data in Figure 5C and 5D are identical, although they are supposed to represent either pCycD1 ratios or p21 levels. This is a problem because at least one of the two cannot be true. Please provide the proper data and show (representative) images of both data types.

      We apologize for these duplicated panels in the original submission. We now replaced the wrong panel with the correct data (Fig. 5F,G).

      Line 246: "Fam53c knockout mice display developmental and behavioral defects." I don't agree with this claim. The mutant mice are born at almost the expected Mendelian ratios, the body weight development is not consistently altered. But more importantly, no differences in adult survival or microscopic pathology were seen. The authors put strong emphasis on the IMPC behavioral analysis, but they should be more cautious. The IMPC mouse cohorts are tested for many other phenotypes related to behavior and neurological symptoms and apparently none of these other traits were changed in the IMPC Famc53c-/- cohort. Thus, the decreased exploration in a new environment could very well be a chance finding. The authors need to take away claims about developmental and behavioral defects from the abstract, results and discussion sections; the data are just too weak to justify this.

      We agree with the Reviewer that, although we observed significant p-values, this original statement may not be appropriate in the biological sense. We made sure in the revised manuscript to carefully present these data.

      Minor comments:

      Can the authors provide a rationale for each of the proteins they chose to generate the list of the 38 proteins in the DepMap analysis? I looked at the list and it seems to me that they do not all have described functions in the G1/S transition. The analysis may thus be biased.

      To address this point, we updated Table S1 (2nd tab) to provide a better rationale for the 38 factors chosen. Our focus was on the canonical RB pathway and we included RB binding proteins whose function had suggested they may also be playing a role in the G1/S transition. We do agree that there is some bias in this selection (e.g., there are more RB binding factors described) but we hope the Reviewer will agree with us that this list and the subsequent analysis identified expected factors, including FAM53C. Future studies using this approach and others will certainly identify new regulators of cell cycle progression.

      Figure 1B is confusing to me. Are these just some (arbitrarily) chosen examples? Consider leaving this heatmap out altogether, of explain in more detail.

      We agree with the Reviewer that this panel was not necessarily useful and possibly in the wrong place, and we removed it from the manuscript. We replaced it with a cartoon of top hits in the screen.

      The y-axes in Figures 2C, 2D, 2E, and 4D are misleading because they do not start at 0. Please let the axis start at 0, or make axis breaks.

      We re-graphed these panels.

      Line 229: " Consequences ... brain development." This subheader is misleading, because the in vitro cortical organoid system is a rather simplistic model for brain development, and far away from physiological brain development. Please alter the header.

      We changed the header to “Consequences of FAM53C inactivation in human cortical organoids in culture”.

      Figure S5F: the gating strategy is not clear to me. In particular, how do the authors know the difference between subG1 and G1 DAPI signals? Do they interpret the subG1 as apoptotic cells? If yes, why are there so many? Are the culturing or harvesting conditions of these organoids suboptimal? Perhaps the authors could consider doing IF stainings on EdU or BrdU on paraffin sections of organoids to obtain cleaner data?

      Thank you for your feedback. The subG1 population in the original Figure S5F represents cells that died during the dissociation step of the organoids for FACS analysis. To address this point, we performed live & dead staining to exclude dead cells and provide clearer data. We refined gating strategy for better clarity in the new S5F panel.

      Figure S6A; the labeling seems incorrect. I would think that red is heterozygous here, and grey mutant.

      We fixed this mistake, thank you.

      __Reviewer #1 (Significance (Required)): __

      The finding that the poorly studied gene FAM53C controls the G1/S transition in cell lines is novel and interesting for the cell cycle field. However, the lack of phenotypes in Famc53-/- mice makes this finding less interesting for a broader audience. Furthermore, the mechanisms are incompletely dissected. The importance of a p53-indepent induction of p21 is not ruled out. And while the direct inhibitory interaction between FAM53C and DYRK1A is convincing (and also reported by others; PMID: 37802655), the authors do not (yet) convincingly show that DYRK1A inhibition can rescue a cell proliferation defect in FAM53C-deficient cells.

      Altogether, this study can be of interest to basic researchers in the cell cycle field.

      I am a cell biologist studying cell cycle fate decisions, and adaptation of cancer cells & stem cells to (drug-induced) stress. My technical expertise aligns well with the work presented throughout this paper, although I am not familiar with biolayer interferometry.

      __Reviewer #2 (Evidence, reproducibility and clarity (Required)): __

      Summary

      In this study Hammond et al. investigated the role of Dual-specificity Tyrosine Phosphorylation regulated Kinase 1A (DYRK1) in G1/S transition. By exploiting Dependency Map portal, they identified a previously unexplored protein FAM53C as potential regulator of G1/S transition. Using RNAi, they confirmed that depletion of FAM53C suppressed proliferation of human RPE1 cells and that this phenotype was dependent on the presence protein RB. In addition, they noted increased level of CDKN1A transcript and p21 protein that could explain G1 arrest of FAM53C-depleted cells but surprisingly, they did not observe activation of other p53 target genes. Proteomic analysis identified DYRK1 as one of the main interactors of FAM53C and the interaction was confirmed in vitro. Further, they showed that purified FAM53C blocked the ability of DYRK1 to phosphorylate cyclin D in vitro although the activity of DYRK1 was likely not inhibited (judging from the modification of FAM53C itself). Instead, it seems more likely that FAM53C competes with cyclin D in this assay. Authors claim that the G1 arrest caused by depletion of FAM53C was rescued by inhibition of DYRK1 but this was true only in cells lacking functional p53. This is quite confusing as DYRK1 inhibition reduced the fraction of G1 cells in p53 wild type cells as well as in p53 knock-outs, suggesting that FAM53C may not be required for regulation of DYRK1 function. Instead of focusing on the impact of FAM53C on cell cycle progression, authors moved towards investigating its potential (and perhaps more complex) roles in differentiation of IPSCs into cortical organoids and in mice. They observed a lower level of proliferating cells in the organoids but if that reflects an increased activity of DYRK1 or if it is just an off target effect of the genetic manipulation remains unclear. Even less clear is the phenotype in FAM53C knock-out mice. Authors did not observe any significant changes in survival nor in organ development but they noted some behavioral differences. Weather and how these are connected to the rate of cellular proliferation was not explored. In the summary, the study identified previously unknown role of FAM53C in proliferation but failed to explain the mechanism and its physiological relevance at the level of tissues and organism. Although some of the data might be of interest, in current form the data is too preliminary to justify publication.

      Major points

      1. Whole study is based on one siRNA to Fam53C and its specificity was not validated. Level of the knock down was shown only in the first figure and not in the other experiments. The observed phenotypes in the cell cycle progression may be affected by variable knock-down efficiency and/or potential off target effects. We thank the Reviewer for raising this important point. First, we need to clarify that our experiments were performed with a pool of siRNAs (not one siRNA). Second, commercial antibodies against FAM53C are not of the best quality and it has been challenging to detect FAM53C using these antibodies in our hands – the results are often variable. In addition, to better address the Reviewer’s point and control for the phenotypes we have observed, we performed two additional series of experiments: first, we have confirmed G1 arrest in RPE-1 cells with individual siRNAs, providing more confidence for the specificity of this arrest (Fig. S1B); second, we have new data indicating that other cell lines arrest in G1 upon FAM53C knock-down (Fig. S1E,F and Fig. 4F).

      Experiments focusing on the cell cycle progression were done in a single cell line RPE1 that showed a strong sensitivity to FAM53C depletion. In contrast, phenotypes in IPSCs and in mice were only mild suggesting that there might be large differences across various cell types in the expression and function of FAM53C. Therefore, it is important to reproduce the observations in other cell types.

      As mentioned above, we have new data indicating that other cell lines arrest in G1 upon FAM53C knock-down (three cancer cell lines) (Fig. S1E,F and Fig. 4F).

      Authors state that FAM53C is a direct inhibitor of DYRK1A kinase activity (Line 203), however this model is not supported by the data in Fig 4A. FAM53C seems to be a good substrate of DYRK1 even at high concentrations when phosphorylations of cyclin D is reduced. It rather suggests that DYRK1 is not inhibited by FAM53C but perhaps FAM53C competes with cyclin D. Further, authors should address if the phosphorylation of cyclin D is responsible for the observed cell cycle phenotype. Is this Cyclin D-Thr286 phosphorylation, or are there other sites involved?

      We revised the text of the manuscript to include the possibility that FAM53C could act as a competitive substrate and/or an inhibitor.

      We removed most of the Cyclin D phosphorylation/stability data from the revised manuscript. As the Reviewers pointed out, some of these data were statistically significant but the biological effects were small. As discussed above in our response to Reviewer #1, the analysis of Cyclin D phosphorylation and stability are complicated by the upregulation of p21 upon FAM53C knock-down, in particular because p21 can be part of Cyclin D complexes, which may affect its protein levels in cells (as was nicely showed in a previous study from the lab of Tobias Meyer – Chen et al., Mol Cell, 2013). Instead of focusing on Cyclin D levels and stability, we refocused the manuscript on RB and p53 downstream of FAM53C loss.

      We note, however, that we used specific Thr286 phospho-antibodies, which have been used extensively in the field. Our data in Figure 1 with palbociclib place FAM53C upstream of Cyclin D/CDK4,6. We performed Cyclin D overexpression experiments but RPE-1 cells did not tolerate high expression of Cyclin D1 (T286A mutant) and we have not been able to conduct more ‘genetic’ studies.

      At many places, information on statistical tests is missing and SDs are not shown in the plots. For instance, what statistics was used in Fig 4C? Impact of FAM53C on cyclin D phosphorylation does not seem to be significant. In the same experiment, does DYRK1 inhibitor prevent modification of cyclin D?

      As discussed above, we removed some of these data and re-focused the manuscript on p53-p21 as a second pathway activated by loss of FAM53C.

      Validation of SM13797 compound in terms of specificity to DYRK1 was not performed.

      This is an important point. We had cited an abstract from the company (Biosplice) but we agree that providing data is critical. We have now revised the manuscript with a new analysis of the compound’s specificity using kinase assays. These data are shown in Fig. S3F-H.

      A fraction of cells in G1 is a very easy readout but it does not measure progression through the G1 phase. Extension of the S phase or G2 delay would indirectly also result in reduction of the G1 fraction. Instead, authors could measure the dynamics of entry to S phase in cells released from a G1 block or from mitotic shake off.

      The Reviewer made a good point. As discussed in our response to Reviewer #1, with p53-null RPE-1 cells, we found that cell numbers do not increase in these conditions where we had observed a cell cycle re-entry (Fig. 4E), which was accompanied by apoptotic cell death (Fig. S4I). Thus, cells re-enter the cell cycle but die as they progress through S-phase and G2/M. We note that inhibition of DYRK1A has been shown to decrease expression of G2/M regulators (PMID: 38839871), which may contribute to the inability of cells treated to DYRK1Ai to divide. Because our data in RPE-1 cells showed that p21 knock-down was not sufficient to allow the FAM53C knock-down cells to re-enter the cell cycle, we did not further analyze p21 in HCT-116 cells. These data indicate that G1 entry by flow cytometry will not always translate into proliferation.

      Other points:

      Fig. 2C, 2D, 2E graphs should begin with 0

      We remade these graphs.

      Fig. 5D shows that the difference in p21 levels is not significant in FAM53C-KO cells but difference is mentioned in the text.

      We replaced the panel by the correct panel; we apologize for this error.

      Fig. 6D comparison of datasets of extremely different sizes does not seem to be appropriate

      We agree and revised the text. We hope that the Reviewer will agree with us that it is worth showing these data, which are clearly preliminary but provide evidence of a possible role for FAM53C in the brain.

      Could there be alternative splicing in mice generating a partially functional protein without exon 4? Did authors confirm that the animal model does not express FAM53C?

      We performed RNA sequencing of mouse embryonic fibroblasts derived from control and mutant mice. We clearly identified fewer reads in exon 4 in the knockout cells, and no other obvious change in the transcript (data not shown). However, immunoblot with mouse cells for FAM53C never worked well in our hands. We made sure to add this caveat to the revised manuscript.

      __Reviewer #2 (Significance (Required)): __

      Main problem of this study is that the advanced experimental models in IPSCs and mice did not confirm the observations in the cell lines and thus the whole manuscript does not hold together. Although I acknowledge the effort the authors invested in these experiments, the data do not contribute to the main conclusion of the paper that FAM53C/DYRK1 regulates G1/S transition.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      This paper identifies FAM53C as a novel regulator of cell cycle progression, particularly at the G1/S transition, by inhibiting DYRK1A. Using data from the Cancer Dependency Map, the authors suggest that FAM53C acts upstream of the Cyclin D-CDK4/6-RB axis by inhibiting DYRK1A.

      Specifically, their experiments suggest that FAM53C Knockdown induces G1 arrest in cells, reducing proliferation without triggering apoptosis. DYRK1A Inhibition rescues G1 arrest in P53KO cells, suggesting FAM53C normally suppresses DYRK1A activity. Mass Spectrometry and biochemical assays confirm that FAM53C directly interacts with and inhibits DYRK1A. FAM53C Knockout in Human Cortical Organoids and Mice leads to cell cycle defects, growth impairments, and behavioral changes, reinforcing its biological importance.

      Strength of the paper:

      The study introduces a novel cell cycle control signalling module upstream of CDK4/6 in G1/S regulation which could have significant impact. The identification of FAM53C using a depmap correlation analysis is a nice example of the power of this dataset. The experiments are carried out mostly in a convincing manner and support the conclusions of the manuscript.

      Critique:

      1) The experiments rely heavily on siRNA transfections without the appropriate controls. There are so many cases of off-target effects of siRNA in the literature, and specifically for a strong phenotype on S-phase as described here, I would expect to see solid results by additional experiments. This is especially important since the ko mice do not show any significant developmental cell cycle phenotypes. Moreover, FAM53C does not show a strong fitness effect in the depmap dataset, suggesting that it is largely non-essential in most cancer cell lines. For this paper to reach publication in a high-standard journal, I would expect that the authors show a rescue of the S-phase phenotype using an siRNA-resistant cDNA, and show similar S-phase defects using an acute knock out approach with lentiviral gRNA/Cas9 delivery.

      We thank the Reviewer for this comment. Please refer to the initial response to the three Reviewers, where we discuss our use of single siRNAs and our results in multiple cell lines. Briefly, we can recapitulate the G1 arrest upon FAM53C knock-down using two independent siRNAs in RPE-1 cells. We also observe the same G1 arrest in p53 knockout cells, suggesting it is not due to a non-specific stress response. In addition, the arrest is dependent on RB, which fits with the genetic and biochemical data placing FAM53C upstream of RB, further supporting a specific phenotype. Human cancer cell lines also arrest in G1 upon FAM53C knock-down, not just RPE-1 cells. Finally, we hope the Reviewer will agree with us that compensatory mechanisms are very common in the cell cycle – which may explain the lack of phenotypes in vivo or upon long-term knockout of FAM53C.

      2) The S-phase phenotype following FAM53C should be demonstrated in a larger variety of TP53WT and mutant cell lines. Given that this paper introduces a new G1/S control element, I think this is important for credibility. Ideally, this should be done with acute gRNA/Cas9 gene deletion using a lentiviral delivery system; but if the siRNA rescue experiments work and validate an on-target effect, siRNA would be an appropriate alternative.

      We now show data with three cancer cell lines (U2OS, A549, and HCT-116 – Fig. S1E,F and Fig. 4F), in addition to our results in RPE-1 cells and in human cortical organoids. We note that the knock-down experiments are complemented by overexpression data (Fig. 1G-I), by genetic data (our original DepMap screen), and our biochemical data (showing direct binding of FAM53C to DYRK1A).

      3) The western blot images shown in the MS appear heavily over-processed and saturated (See for example S4B, 4A, B, and E). Perhaps the authors should provide the original un-processed data of the entire gels?

      For several of our panels (e.g., 4E and S4B, now panels S3J and S3K)), we used a true “immunoassay” (as indicated in the legend – not an immunoblot), which is much more quantitative and avoids error-prone steps in standard immunoblots (“Western blots”). Briefly, this system was developed by ProteinSimple. It uses capillary transfer of proteins and ELISA-like quantification with up to 6 logs of dynamic range (see their web site https://www.proteinsimple.com/wes.html). The “bands” we show are just a representation of the luminescence signals in capillaries. We made sure to further clarify the figure legends in the revised manuscript.

      Data in 4A are also not a western blot but a radiograph.

      For immunoblots, we will provide all the source data with uncropped blots with the final submission.

      4) A critical experiment for the proposed mechanism is the rescue of the FAM53C S-phase reduction using DYRK1A inhibition shown in Figure 4. The legend here states that the data were extracted from BrdU incorporation assays, but in Figure S4D only the PI histograms are shown, and the S-phase population is not quantified. The authors should show the BrdU scatterplot and quantify the phenotype using the S-phase population in these plots. G1 measurements from PI histograms are not precise enough to allow for conclusions. Also, why are the intensities of the PI peaks so variable in these plots? Compare, for example, the HCT116 upper and lower panels where the siRNA appears to have caused an increase in ploidy.

      We apologize for the confusion and we fixed these errors, for most of the analyses, we used PI to measure G1 and S-phase entry. We added relevant flow cytometry plots to supplemental figures (Fig. S1G, H, I, as well as Fig. S4E and S4K, and Fig. S5F).

      5) There's an apparent contradiction in how RB deletion rescues the G1 arrest (Figure 2) while p21 seems to maintain the arrest even when DYRK1A is inhibited. Is p21 not induced when FAM53C is depleted in RB ko cells? This should be measured and discussed.

      This comment and comments from the two other Reviewers made us reconsider our model. We re-read carefully the Meyer paper and think that DYRK1A activity may be understood when considering levels of both CycD and p21 at the same time in a continuum (as was nicely showed in a previous study from the lab of Tobias Meyer – Chen et al., Mol Cell, 2013). While our genetic and biochemical data support a role for FAM53C in DYRK1A inhibition, it is obvious that the regulation of cell cycle progression by FAM53C is not exclusively due to this inhibition. As discussed above and below, we noted an upregulation of p21 upon FAM53C knock-down, and activation of p53 and its targets likely contributes significantly to the phenotypes observed. We added new experiments to support this more complex model (Figure 4 and Figure S4, with new model in S4L).

      __Reviewer #3 (Significance (Required)): __

      In conclusion, I believe that this MS could potentially be important for the cell cycle field and also provide a new target pathway that could be relevant for cancer therapy. However, the paper has quite a few gaps and inconsistencies that need to be addressed with further experiments. My main worry is that the acute depletion phenotypes appear so strong, while the gene is non-essential in mice and shows only a minor fitness effect in the depmap screens. More convincing controls are necessary to rule out experimental artefacts that misguide the interpretation of the results.

      We appreciate this comment and hope that the Reviewer will agree it is still important to share our data with the field, even if the phenotypes in mice are modest.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This paper identifies FAM53C as a novel regulator of cell cycle progression, particularly at the G1/S transition, by inhibiting DYRK1A. Using data from the Cancer Dependency Map, the authors suggest that FAM53C acts upstream of the Cyclin D-CDK4/6-RB axis by inhibiting DYRK1A.

      Specifically, their experiments suggest that FAM53C Knockdown induces G1 arrest in cells, reducing proliferation without triggering apoptosis. DYRK1A Inhibition rescues G1 arrest in P53KO cells, suggesting FAM53C normally suppresses DYRK1A activity. Mass Spectrometry and biochemical assays confirm that FAM53C directly interacts with and inhibits DYRK1A. FAM53C Knockout in Human Cortical Organoids and Mice leads to cell cycle defects, growth impairments, and behavioral changes, reinforcing its biological importance.

      Strength of the paper:

      The study introduces a novel cell cycle control signalling module upstream of CDK4/6 in G1/S regulation which could have significant impact. The identification of FAM53C using a depmap correlation analysis is a nice example of the power of this dataset. The experiments are carried out mostly in a convincing manner and support the conclusions of the manuscript.

      Critique:

      1. The experiments rely heavily on siRNA transfections without the appropriate controls. There are so many cases of off-target effects of siRNA in the literature, and specifically for a strong phenotype on S-phase as described here, I would expect to see solid results by additional experiments. This is especially important since the ko mice do not show any significant developmental cell cycle phenotypes. Moreover, FAM53C does not show a strong fitness effect in the depmap dataset, suggesting that it is largely non-essential in most cancer cell lines. For this paper to reach publication in a high-standard journal, I would expect that the authors show a rescue of the S-phase phenotype using an siRNA-resistant cDNA, and show similar S-phase defects using an acute knock out approach with lentiviral gRNA/Cas9 delivery.
      2. The S-phase phenotype following FAM53C should be demonstrated in a larger variety of TP53WT and mutant cell lines. Given that this paper introduces a new G1/S control element, I think this is important for credibility. Ideally, this should be done with acute gRNA/Cas9 gene deletion using a lentiviral delivery system; but if the siRNA rescue experiments work and validate an on-target effect, siRNA would be an appropriate alternative.
      3. The western blot images shown in the MS appear heavily over-processed and saturated (See for example S4B, 4A, B, and E). Perhaps the authors should provide the original un-processed data of the entire gels?
      4. A critical experiment for the proposed mechanism is the rescue of the FAM53C S-phase reduction using DYRK1A inhibition shown in Figure 4. The legend here states that the data were extracted from Brad incorporation assays, but in Figure S4D only the PI histograms are shown, and the S-phase population is not quantified. The authors should show the Brad scatterplot and quantify the phenotype using the S-phase population in these plots. G1 measurements from PI histograms are not precise enough to allow for conclusions. Also, why are the intensities of the PI peaks so variable in these plots? Compare, for example, the HCT116 upper and lower panels where the siRNA appears to have caused an increase in ploidy.
      5. There's an apparent contradiction in how RB deletion rescues the G1 arrest (Figure 2) while p21 seems to maintain the arrest even when DYRK1A is inhibited. Is p21 not induced when FAM53C is depleted in RB ko cells? This should be measured and discussed.

      Significance

      In conclusion, I believe that this MS could potentially be important for the cell cycle field and also provide a new target pathway that could be relevant for cancer therapy. However, the paper has quite a few gaps and inconsistencies that need to be addressed with further experiments. My main worry is that the acute depletion phenotypes appear so strong, while the gene is non-essential in mice and shows only a minor fitness effect in the depmap screens. More convincing controls are necessary to rukle out experimental artefacts that misguide the interpretation of the results.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      In this study Hammond et al. investigated the role of Dual-specificity Tyrosine Phosphorylation regulated Kinase 1A (DYRK1) in G1/S transition. By exploiting Dependency Map portal, they identified a previously unexplored protein FAM53C as potential regulator of G1/S transition. Using RNAi, they confirmed that depletion of FAM53C suppressed proliferation of human RPE1 cells and that this phenotype was dependent on the presence protein RB. In addition, they noted increased level of CDKN1A transcript and p21 protein that could explain G1 arrest of FAM53C-depleted cells but surprisingly, they did not observe activation of other p53 target genes. Proteomic analysis identified DYRK1 as one of the main interactors of FAM53C and the interaction was confirmed in vitro. Further, they showed that purified FAM53C blocked the ability of DYRK1 to phosphorylate cyclin D in vitro although the activity of DYRK1 was likely not inhibited (judging from the modification of FAM53C itself). Instead, it seems more likely that FAM53C competes with cyclin D in this assay. Authors claim that the G1 arrest caused by depletion of FAM53C was rescued by inhibition of DYRK1 but this was true only in cells lacking functional p53. This is quite confusing as DYRK1 inhibition reduced the fraction of G1 cells in p53 wild type cells as well as in p53 knock-outs, suggesting that FAM53C may not be required for regulation of DYRK1 function. Instead of focusing on the impact of FAM53C on cell cycle progression, authors moved towards investigating its potential (and perhaps more complex) roles in differentiation of IPSCs into cortical organoids and in mice. They observed a lower level of proliferating cells in the organoids but if that reflects an increased activity of DYRK1 or if it is just an off target effect of the genetic manipulation remains unclear. Even less clear is the phenotype in FAM53C knock-out mice. Authors did not observe any significant changes in survival nor in organ development but they noted some behavioral differences. Weather and how these are connected to the rate of cellular proliferation was not explored. In the summary, the study identified previously unknown role of FAM53C in proliferation but failed to explain the mechanism and its physiological relevance at the level of tissues and organism. Although some of the data might be of interest, in current form the data is too preliminary to justify publication.

      Major points

      1. Whole study is based on one siRNA to Fam53C and its specificity was not validated. Level of the knock down was shown only in the first figure and not in the other experiments. The observed phenotypes in the cell cycle progression may be affected by variable knock-down efficiency and/or potential off target effects.
      2. Experiments focusing on the cell cycle progression were done in a single cell line RPE1 that showed a strong sensitivity to FAM53C depletion. In contrast, phenotypes in IPSCs and in mice were only mild suggesting that there might be large differences across various cell types in the expression and function of FAM53C. Therefore, it is important to reproduce the observations in other cell types.
      3. Authors state that FAM53C is a direct inhibitor of DYRK1A kinase activity (Line 203), however this model is not supported by the data in Fig 4A. FAM53C seems to be a good substrate of DYRK1 even at high concentrations when phosphorylations of cyclin D is reduced. It rather suggests that DYRK1 is not inhibited by FAM53C but perhaps FAM53C competes with cyclin D. Further, authors should address if the phosphorylation of cyclin D is responsible for the observed cell cycle phenotype. Is this Cyclin D-Thr286 phosphorylation, or are there other sites involved?
      4. At many places, information on statistical tests is missing and SDs are not shown in the plots. For instance, what statistics was used in Fig 4C? Impact of FAM53C on cyclin D phosphorylation does not seem to be significant. IN the same experiment, does DYRK1 inhibitor prevent modification of cyclin D?
      5. Validation of SM13797 compound in terms of specificity to DYRK1 was not performed.
      6. A fraction of cells in G1 is a very easy readout but it does not measure progression through the G1 phase. Extension of the S phase or G2 delay would indirectly also result in reduction of the G1 fraction. Instead, authors could measure the dynamics of entry to S phase in cells released from a G1 block or from mitotic shake off.

      Other points

      1. Fig. 2C, 2D, 2E graphs should begin with 0
      2. Fig. 5D shows that the difference in p21 levels is not significant in FAM53C-KO cells but difference is mentioned in the text.
      3. Fig. 6D comparison of datasets of extremely different sizes does not seem to be appropriate
      4. Could there be alternative splicing in mice generating a partially functional protein without exon 4? Did authors confirm that the animal model does not express FAM53C?

      Significance

      Main problem of this study is that the advanced experimental models in IPSCs and mice did not confirm the observations in the cell lines and thus the whole manuscript does not hold together. Although I acknowledge the effort the authors invested in these experiments, the data do not contribute to the main conclusion of the paper that FAM53C/DYRK1 regulates G1/S transition.

    1. Inicial

      Mudou a natureza do recurso em sede de recurso no processo de PC (de autofinanciamento para doação à campanha doadora é a companheira do candidato.

      Não obstante, chama atenção, ainda, o fato de que no mesmo dia (07.10.2024- um dia após a realização das eleições) em que foi feita a suposta doação por sua companheira do valor de R$ 18.345,00 (dezoito mil e trezentos e quarenta e cinco reais), momentos antes, o candidato efetuou um pagamento do mesmo valor (R$ 18.345,00) à empresa NOVA COLOR GRÁFICA E EDITORA LTDA, conforme consta no extrato da prestação de contas do candidato (fl. 190 do PPC em anexo).

      Juntou-se aos autos da prestação e contas, ainda, as notas fiscais da empresa NOVA COLOR GRÁFICA E EDITORA LTDA emitidas em 20 de agosto de 2024 referentes a serviços gráficos prestados ao candidato (confecção de adesivos, bandeiras, santinhos, praguinha etc) no exato valor de R$ 18.345,00 (dezoito mil e trezentos e quarenta e cinco reais), as quais, conforme extratos bancários, somente foram pagas no dia 07.10.24 (dia seguinte à eleição) por meio de recursos transferidos da conta de Amanda Cristina de Azevedo Mozer pelo candidato

      Defesa

      Já de partida, afirmamos que o descumprimento do limite de autofinanciamento por ele estabelecido não configura abuso de poder econômico, devendo ser apreciado unicamente pela ótica do art. 30-A da Lei das Eleições.

      Basta uma mera consulta nos registros oficiais dessa Especializada para verificar que, enquanto, o Investigado teve despesas que montaram à R$30.408,00 (trinta mil, quatrocentos e oito reais)1 , o candidato mais votado no sufrágio proporcional de 2024 - Sr. Raphael Braga - despendeu recursos na monta de R$ 35.000,00 (trinta e cinco mil reais)2 . Como pode se afirmar que o excesso de gastos impactou nas eleições municipais se o candidato, ora investigado, sequer ficou a frente dos demais candidatos, tendo que, efetivamente, se digladiar com os demais para que seu intento fosse bem sucedido?

      E assim se passa, pois, com a criação e fixação do teto global de gastos, há reconhecimento prévio, pelo sistema normativo, que o emprego de recursos além o limite máximo permitido não enseja quebra de normalidade e de legitimidade das eleições, não possuindo, pois, gravidade apta a macular o pleito, caso se leve em consideração a distribuição de votos entre os candidatos eleitos nas proporcionais. Com efeito, para justificar a suposta ocorrência de abuso de poder econômico, pelo descumprimento do limite de autofinanciamento durante o período eleitoral, o parquet assevera unicamente o quantitativo gasto, sem se ater à realidade fática das eleições proporcionais do município de Armação dos Búzios Justamente por isso, a jurisprudência do TSE firmou o entendimento de que não basta o descumprimento de normas de arrecadação para as graves sanções do art. 30-A da Lei nº 9.504/97, sendo necessária afetação da normalidade das eleições

      Alegações finais MP

      No entanto, merece destaque que o Município de Armação dos Búzios teve o total de 27.506 (vinte e sete mil quinhentos e seis) eleitores que compareceram às urnas nas eleições municipais de 2024 e que o candidato se elegeu com 1.206 (mil duzentos e seis votos) pelo partido MDB pelo quociente partidário, sendo que o segundo candidato de seu partido MDB (Victor Santos – eleito por média) obteve 1.078 votos e o terceiro candidato mais votado do seu partido ( Gugu de Nair) obteve 957 votos, ou seja, uma diferença de apenas 128 votos para o segundo candidato e 249 votos para o terceiro candidato. Por análise primária e lógica, obviamente se o Representado tivesse extrapolado minimamente o valor do autofinanciamento, apesar de ter cometido irregularidade eleitoral, poderia não ter impactado no resultado das eleições. No entanto, ele extrapolou em mais de 300% (trezentos por cento) do limite do autofinanciamento, valor que claramente possui o condão para comprometer a integridade do pleito, especialmente se tratando de Comarca com número reduzido de eleitores e com diferença pequena de votos entre os candidatos, conforme demonstrado acima.

      A jurisprudência massiva do TSE, inclusive as colacionadas pelo Representado na sua defesa vem entendendo que a extrapolação do limite de autofinanciamento, por si só, não é capaz de ensejar o reconhecimento e aplicação do abuso de poder econômico, sendo necessária a aplicação dos princípios da proporcionalidade e razoabilidade exigindo-se a presença cumulativa de três requisitos para afastar o abuso do poder econômico: (i) percentual de irregularidade inferior a 10% do total movimentado; (ii) ausência de má-fé; e (iii) irrelevância do valor absoluto.

      Dessa forma, evidente que a prática do Representado configura abuso do poder econômico, visto que, sob o parâmetro da própria jurisprudência do TSE, quando analisado o caso concreto sob a ótica dos princípios da proporcionalidade e razoabilidade, ao extrapolar o limite de autofinanciamento (essencial para evitar vantagens indevidas e proteger o princípio da isonomia no pleito eleitoral - art. 27, § 4º, da Resolução TSE nº 23.607/2019) o Representado praticou ato de abuso de poder econômico que causou desequilíbrio no pleito.

      PAREI ALEGAÇÕES FINAIS réu ID 32649352

    1. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Major Comments:

      It is interesting case study but the main problem with the study is the use of an unsuitable tardigrade model species. It was shown in the past that Hypsibius exemplaris is not a good model species to test tardigrade survival under extreme stress. Of course, results of Hypsibius exemplaris can be published but from the entire manuscript all general comments that tardigrades react in this or in different way need to be removed. This is characteristic only to Hypsibius exemplaris species which is a poor model for studies focused on environmental stressTo present general conclusions use few different tardigrade species or at least a correct tardigrade species with confirmed high resilience for different kind of stress like Milnesium, Ramazzottius, Paramacrobiotus or similar must be tested. Based on present study I can only propose to publish this manuscript as a case study for one poorly stress resistant eutardigrade species, without any general conclusions about other tardigrades. See: Poprawa, I., Bartylak, T., Kulpla, A., Erdmann, W., Roszkowska, M., Chajec, Ł., Kaczmarek, Ł., Karachitos, A. & Kmita, H. (2022) Verification of Hypsibius exemplaris Gąsiorek et al., 2018 (Eutardigrada; Hypsibiidae) application in anhydrobiosis research. PLoS ONE 17(3): e0261485.

      Minor comments:

      1. General comment to entire manuscript. Please do not start sentences with abbreviations, i.e. The DNA instead of DNA, Caenorhabditis instead of C. etc. In bibliography many doin numbers for publications are lacking, you have a different styles of citations, do not use capital letters for words inside the article title e.g. "Tardigrades as a Potential Model Organism in Space Research.", change it to "Tardigrades as a potential model organism in space research." Or use capital letters in all citations. Use italics for Latin names of the species and genera. On figures please try to put all of them like this that specimens ill be situated horizontally and in the middle of figure.
      2. Introduction, Lines 80-96: I do not understand why this section is in Introduction. This is description of the results of the studies could be minimal and details could be moved to proper chapters.
      3. Results: In this section are mixed results with methods. Please put all parts to the correct chapters.
      4. Line 227 and 235: Based on what you interpreted: "fully-grown adults" and "juveniles" that they were adult and fully grown? Please explain in the text.
      5. Line 315: You wrote "These findings demonstrate that even a transient exposure to zeocin causes irreversible DNA damage, leading to delayed mortality." but not to all specimens as you marked above.
      6. Line 461-462: You wrote: "In this study, we probed why tardigrades-despite their impressive DNA repair capacity and extremotolerance-still succumb to genotoxic stress." But only one tardigrade species with poor resilience to stress conditions has been tested in this study. What if more repair mechanisms are activated in tardigrades when tardigrades leaving the state of anhydrobiosis? Authors tested only active animals and in such mechanisms maybe not activated or are activated on lower level. What is even more problematic, and what I marked this in one of the first comments, the species used in study is incorrect because is not very resilient to extreme conditions. This species is also a poor anhydrobiotic species with almost zero ability to anhydrobiosis (during which repair mechanisms are activated).
      7. Line 609: "..actively searching for food.." - How you know that they were looking for food? What was a difference between normal crawling around and looking for food?
      8. Line 635: "In sum, tardigrades illustrate that..." - Only in case of Hypsibius. This is not characteristic for tardigrades. See my previous comments. This conclusion is too strong without adequate proof.
      9. Lines 666-667: "Adults measured {greater than or equal to}240 μm in length, while juveniles ranged between 120-180 μm." - Why such measurements? It was connected with something or is it arbitrary? Please explain.
      10. Lines: 673-677: "For each timepoint, fertility was calculated by dividing the total number of eggs laid by the number of live animals at that time (using the last recorded number of live animals when all animals had died). In Fig. 5A-B, fertility is presented as the mean cumulative number of eggs laid per animal over time; in Fig. S9, it is shown as the mean number of eggs laid per animal at each timepoint." - This method of calculating fertility may be valid only if you know that all the females laid the same number of eggs. It is obvious that some females produced less and some others more eggs. Hence, fertility can not be accurately calculated in this way.

      Significance

      Studies described in the manuscript are very interesting for many potential readers, however manuscript need to be modified as case study for one tardigrades species without generalization of the results for all tardigrades. It is very important to not suggest that all tardigrades react in the same way especially that species used is not a good candidate for this type of studies (see my major comments).

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We would like to thank all the reviewers for their valuable comments and criticisms. We have thoroughly revised the manuscript and the resource to address all the points raised by the reviewers. Below, we provide a point-by-point response for the sake of clarity.

      Reviewer #1

      __Evidence, reproducibility and clarity __

      Summary: This manuscript, "MAVISp: A Modular Structure-Based Framework for Protein Variant Effects," presents a significant new resource for the scientific community, particularly in the interpretation and characterization of genomic variants. The authors have developed a comprehensive and modular computational framework that integrates various structural and biophysical analyses, alongside existing pathogenicity predictors, to provide crucial mechanistic insights into how variants affect protein structure and function. Importantly, MAVISp is open-source and designed to be extensible, facilitating reuse and adaptation by the broader community.

      Major comments: - While the manuscript is formally well-structured (with clear Introduction, Results, Conclusions, and Methods sections), I found it challenging to follow in some parts. In particular, the Introduction is relatively short and lacks a deeper discussion of the state-of-the-art in protein variant effect prediction. Several methods are cited but not sufficiently described, as if prior knowledge were assumed. OPTIONAL: Extend the Introduction to better contextualize existing approaches (e.g., AlphaMissense, EVE, ESM-based predictors) and clarify what MAVISp adds compared to each.

      We have expanded the introduction on the state-of-the-art of protein variant effects predictors, explaining how MAVISp departs from them.

      - The workflow is summarized in Figure 1(b), which is visually informative. However, the narrative description of the pipeline is somewhat fragmented. It would be helpful to describe in more detail the available modules in MAVISp, and which of them are used in the examples provided. Since different use cases highlight different aspects of the pipeline, it would be useful to emphasize what is done step-by-step in each.

      We have added a concise, narrative description of the data flow for MAVISp, as well as improved the description of modules in the main text. We will integrate the results section with a more comprehensive description of the available modules, and then clarify in the case studies which modules were applied to achieve specific results.

      OPTIONAL: Consider adding a table or a supplementary figure mapping each use case to the corresponding pipeline steps and modules used.

      We have added a supplementary table (Table S2) to guide the reader on the modules and workflows applied for each case study

      We also added Table S1 to map the toolkit used by MAVISp to collect the data that are imported and aggregated in the webserver for further guidance.

      - The text contains numerous acronyms, some of which are not defined upon first use or are only mentioned in passing. This affects readability. OPTIONAL: Define acronyms upon first appearance, and consider moving less critical technical details (e.g., database names or data formats) to the Methods or Supplementary Information. This would greatly enhance readability.

      We revised the usage of acronyms following the reviewer’s directions of defying them at first appearance.

      • The code and trained models are publicly available, which is excellent. The modular design and use of widely adopted frameworks (PyTorch and PyTorch Geometric) are also strong points. However, the Methods section could benefit from additional detail regarding feature extraction and preprocessing steps, especially the structural features derived from AlphaFold2 models. OPTIONAL: Include a schematic or a table summarizing all feature types, their dimensionality, and how they are computed.

      We thank the reviewer for noticing and praising the availability of the tools of MAVISp. Our MAVISp framework utilizes methods and scores that incorporate machine learning features (such as EVE or RaSP), but does not employ machine learning itself. Specifically, we do not use PyTorch and do not utilize features in a machine learning sense. We do extract some information from the AlphaFold2 models that we use (such as the pLDDT score and their secondary structure content, as calculated by DSSP), and those are available in the MAVISp aggregated csv files for each protein entry and detailed in the Documentation section of the MAVISp website.

      • The section on transcription factors is relatively underdeveloped compared to other use cases and lacks sufficient depth or demonstration of its practical utility. OPTIONAL: Consider either expanding this section with additional validation or removing/postponing it to a future manuscript, as it currently seems preliminary.

      We have removed this section and included a mention in the conclusions as part of the future directions.

      Minor comments: - Most relevant recent works are cited, including EVE, ESM-1v, and AlphaFold-based predictors. However, recent methods like AlphaMissense (Cheng et al., 2023) could be discussed more thoroughly in the comparison.

      We have revised the introduction to accommodate the proper space for this comparison.

      • Figures are generally clear, though some (e.g., performance barplots) are quite dense. Consider enlarging font sizes and annotating key results directly on the plots.

      We have revised Figure 2 and presented only one case study to simplify its readability. We have also changed Figure 3, whereas retained the other previous figures since they seemed less problematic.

      • Minor typographic errors are present. A careful proofreading is highly recommended. Below are some of the issues I identified: Page 3, line 46: "MAVISp perform" -> "MAVISp performs" Page 3, line 56: "automatically as embedded" -> "automatically embedded" Page 3, line 57: "along with to enhance" -> unclear; please revise Page 4, line 96: "web app interfaces with the database and present" -> "presents" Page 6, line 210: "to investigate wheatear" -> "whether" Page 6, lines 215-216: "We have in queue for processing with MAVISp proteins from datasets relevant to the benchmark of the PTM module." -> unclear sentence; please clarify Page 15, line 446: "Both the approaches" -> "Both approaches" Page 20, line 704: "advantage of multi-core system" -> "multi-core systems"

      We have done a proofreading of the entire article, including the points above

      Significance

      General assessment: the strongest aspects of the study are the modularity, open-source implementation, and the integration of structural information through graph neural networks. MAVISp appears to be one of the few publicly available frameworks that can easily incorporate AlphaFold2-based features in a flexible way, lowering the barrier for developing custom predictors. Its reproducibility and transparency make it a valuable resource. However, while the technical foundation is solid and the effort substantial, the scientific narrative and presentation could be significantly improved. The manuscript is dense and hard to follow in places, with a heavy use of acronyms and insufficient explanation of key design choices. Improving the descriptive clarity, especially in the early sections, would greatly enhance the impact of this work.

      Advance

      to the best of my knowledge, this is one of the first modular platforms for protein variant effect prediction that integrates structural data from AlphaFold2 with bioinformatic annotations and even clinical data in an extensible fashion. While similar efforts exist (e.g., ESMfold, AlphaMissense), MAVISp distinguishes itself through openness and design for reusability. The novelty is primarily technical and practical rather than conceptual.

      Audience

      this study will be of strong interest to researchers in computational biology, structural bioinformatics, and genomics, particularly those developing variant effect predictors or analyzing the impact of mutations in clinical or functional genomics contexts. The audience is primarily specialized, but the open-source nature of the tool may diffuse its use among more applied or translational users, including those working in precision medicine or protein engineering.

      Reviewer expertise: my expertise is in computational structural biology, molecular modeling, and (rather weak) machine learning applications in bioinformatics. I am familiar with graph-based representations of proteins, AlphaFold2, and variant effects based on Molecular Dynamics simulations. I do not have any direct expertise in clinical variant annotation pipelines.

      Reviewer #2

      __Evidence, reproducibility and clarity __

      Summary: The authors present a pipeline and platform, MAVISp, for aggregating, displaying and analysis of variant effects with a focus on reclassification of variants of uncertain clinical significance and uncovering the molecular mechanisms underlying the mutations.

      Major comments: - On testing the platform, I was unable to look-up a specific variant in ADCK1 (rs200211943, R115Q). I found that despite stating that the mapped refseq ID was NP_001136017 in the HGVSp column, it was actually mapped to the canonical UniProt sequence (Q86TW2-1). NP_001136017 actually maps to Q86TW2-3, which is missing residues 74-148 compared to the -1 isoform. The Uniprot canonical sequence has no exact RefSeq mapping, so the HGVSp column is incorrect in this instance. This mapping issue may also affect other proteins and result in incorrect HGVSp identifiers for variants.

      We would like to thank the reviewer for pointing out these inconsistencies. We have revised all the entries and corrected them. If needed, the history of the cases that have been corrected can be found in the closed issues of the GitHub repository that we use for communication between biocurators and data managers (https://github.com/ELELAB/mavisp_data_collection). We have also revised the protocol we follow in this regard and the MAVISp toolkit to include better support for isoform matching in our pipelines for future entries, as well as for the revision/monitoring of existing ones, as detailed in the Method Section. In particular, we introduced a tool, uniprot2refseq, which aids the biocurator in identifying the correct match in terms of sequence length and sequence identity between RefSeq and UniProt. More details are included in the Method Section of the paper. The two relevant scripts for this step are available at: https://github.com/ELELAB/mavisp_accessory_tools/

      - The paper lacks a section on how to properly interpret the results of the MAVISp platform (the case-studies are helpful, but don't lay down any global rules for interpreting the results). For example: How should a variant with conflicts between the variant impact predictors be interpreted? Are specific indicators considered more 'reliable' than others?

      We have added a section in Results to clarify how to interpret results from MAVISp in the most common use cases.

      • In the Methods section, GEMME is stated as being rank-normalised with 0.5 as a threshold for damaging variants. On checking the data downloaded from the site, GEMME was not rank-normalised but rather min-max normalised. Furthermore, Supplementary text S4 conflicts with the methods section over how GEMME scores are classified, S4 states that a raw-value threshold of -3 is used.

      We thank the reviewer for spotting this inconsistency. This part in the main text was left over from a previous and preliminary version of the pre-print, we have revised the main text. Supplementary Text S4 includes the correct reference for the value in light of the benchmarking therewithin.

      • Note. This is a major comment as one of the claims is that the associated web-tool is user-friendly. While functional, the web app is very awkward to use for analysis on any more than a few variants at once. The fixed window size of the protein table necessitates excessive scrolling to reach your protein-of-interest. This will also get worse as more proteins are added. Suggestion: add a search/filter bar. The same applies to the dataset window.

      We have changed the structure of the webserver in such a way that now the whole website opens as its own separate window, instead of being confined within the size permitted by the website at DTU. This solves the fixed window size issue. Hopefully, this will improve the user experience.

      We have refactored the web app by adding filtering functionality, both for the main protein table (that can now be filtered by UniProt AC, gene name or RefSeq ID) and the mutations table. Doing this required a general overhaul of the table infrastructure (we changed the underlying engine that renders the tables).

      • You are unable to copy anything out of the tables.
      • Hyperlinks in the tables only seem to work if you open them in a new tab or window.

      The table overhauls fixed both of these issues

      • All entries in the reference column point to the MAVISp preprint even when data from other sources is displayed (e.g. MAVE studies).

      We clarified the meaning of the reference column in the Documentation on the MAVISp website, as we realized it had confused the reviewer. The reference column is meant to cite the papers where the computationally-generated MAVISp data are used, not external sources. Since we also have the experimental data module in the most recent release, we have also refactored the MAVISp website by adding a “Datasets and metadata” page, which details metadata for key modules. These include references to data from external sources that we include in MAVISp on a case-by-case basis (for example the results of a MAVE experiment). Additionally, we have verified that the papers using MAVISp data are updated in https://elelab.gitbook.io/mavisp/overview/publications-that-used-mavisp-data and in the csv file of the interested proteins.

      Here below the current references that have been included in terms of publications using MAVISp data:

      SMPD1

      ASM variants in the spotlight: A structure-based atlas for unraveling pathogenic mechanisms in lysosomal acid sphingomyelinase

      Biochim Biophys Acta Mol Basis Dis

      38782304

      https://doi.org/10.1016/j.bbadis.2024.167260

      TRAP1

      Point mutations of the mitochondrial chaperone TRAP1 affect its functions and pro-neoplastic activity

      Cell Death & Disease

      40074754

      https://doi.org/10.1038/s41419-025-07467-6

      BRCA2

      Saturation genome editing-based clinical classification of BRCA2 variants

      Nature

      39779848

      0.1038/s41586-024-08349-1

      TP53, GRIN2A, CBFB, CALR, EGFR

      TRAP1 S-nitrosylation as a model of population-shift mechanism to study the effects of nitric oxide on redox-sensitive oncoproteins

      Cell Death & Disease

      37085483

      10.1038/s41419-023-05780-6

      KIF5A, CFAP410, PILRA, CYP2R1

      Computational analysis of five neurodegenerative diseases reveals shared and specific genetic loci

      Computational and Structural Biotechnology Journal

      38022694

      https://doi.org/10.1016/j.csbj.2023.10.031

      KRAS

      Combining evolution and protein language models for an interpretable cancer driver mutation prediction with D2Deep

      Brief Bioinform

      39708841

      https://doi.org/10.1093/bib/bbae664

      OPTN

      Decoding phospho-regulation and flanking regions in autophagy-associated short linear motifs

      Communications Biology

      40835742

      10.1038/s42003-025-08399-9

      DLG4,GRB2,SMPD1

      Deciphering long-range effects of mutations: an integrated approach using elastic network models and protein structure networks

      JMB

      40738203

      doi: 10.1016/j.jmb.2025.169359

      Entering multiple mutants in the "mutations to be displayed" window is time-consuming for more than a handful of mutants. Suggestion: Add a box where multiple mutants can be pasted in at once from an external document.

      During the table overhaul, we have revised the user interface to add a text box that allows free copy-pasting of mutation lists. While we understand having a single input box would have been ideal, the former selection interface (which is also still available) doesn’t allow copy-paste. This is a known limitation in Streamlit.

      Minor comments

      • Grammar. I appreciate that this manuscript may have been compiled by a non-native English speaker, but I would be remiss not to point out that there are numerous grammar errors throughout, usually sentence order issues or non-pluralisation. The meaning of the authors is mostly clear, but I recommend very thoroughly proof-reading the final version.

      We have done proofreading on the final version of the manuscript

      • There are numerous proteins that I know have high-quality MAVE datasets that are absent in the database e.g. BRCA1, HRAS and PPARG.

      Yes, we are aware of this. It is far from trivial to properly import the datasets from multiplex assays. They often need to be treated on a case-by-case basis. We are in the process of carefully compiling locally all the MAVE data before releasing it within the public version of the database, so this is why they are missing. We are giving priorities to the ones that can be correlated with our predictions on changes in structural stability and then we will also cover the rest of the datasets handling them in batches. Having said this, we have checked the dataset for BRCA1, HRAS, and PPARG. We have imported the ones for PPARG and BRCA1 from ProtGym, referring to the studies published in 10.1038/ng.3700 and 10.1038/s41586-018-0461-z, respectively. Whereas for HRAS, checking in details both the available data and literature, while we did identify a suitable dataset (10.7554/eLife.27810), we struggled to understand what a sensible cut-off for discriminating between pathogenic and non-pathogenic variants would be, and so ended up not including it in the MAVISp dataset for now. We will contact the authors to clarify which thresholds to apply before importing the data.

      • Checking one of the existing MAVE datasets (KRAS), I found that the variants were annotated as damaging, neutral or given a positive score (these appear to stand-in for gain-of-function variants). For better correspondence with the other columns, those with positive scores could be labelled as 'ambiguous' or 'uncertain'.

      In the KRAS case study presented in MAVISP, we utilized the protein abundance dataset reported in (http://dx.doi.org/10.1038/s41586-023-06954-0) and made available in the ProteinGym repository (specifically referenced at https://github.com/OATML-Markslab/ProteinGym/blob/main/reference_files/DMS_substitutions.csv#L153). We adopted the precalculated thresholds as provided by the ProteinGym authors. In this regard, we are not really sure the reviewer is referring to this dataset or another one on KRAS.

      • Numerous thresholds are defined for stabilizing / destabilizing / neutral variants in both the STABILITY and the LOCAL_INTERACTION modules. How were these thresholds determined? I note that (PMC9795540) uses a ΔΔG threshold of 1/-1 for defining stabilizing and destabilizing variants, which is relatively standard (though they also say that 2-3 would likely be better for pinpointing pathogenic variants).

      We improved the description of our classification strategies for both modules in the Documentation page of our website. Also, we explained more clearly the possible sources of ‘uncertain’ annotations for the two modules in both the web app (Documentation page) and main text. Briefly, in the STABILITY module, we consider FoldX and either Rosetta or RaSP to achieve a final classification. We first classify one and the other independently, according to the following strategy:

      If DDG ≥ 3, the mutation is Destabilizing If DDG ≤ −3, the mutation is Stabilizing If −2 We then compare the classifications obtained by the two methods: if they agree, then that is the final classification, if they disagree, then the final classification is Uncertain. The thresholds were selected based on a previous study, in which variants with changes in stability below 3 kcal/mol were not featuring a markedly different abundance at cellular level [10.1371/journal.pgen.1006739, 10.7554/eLife.49138]

      Regarding the LOCAL_INTERACTION module, it works similarly as for the Stability module, in that Rosetta and FoldX are considered independently, and an implicit classification is performed for each, according to the rules (values in kcal/mol)

      If DDG > 1, the mutation is Destabilizing. If DDG Each mutation is therefore classified for both methods. If the methods agree (i.e., if they classify the mutation in the same way), their consensus is the final classification for the mutation; if they do not agree, the final classification will be Uncertain.

      If a mutation does not have an associated free energy value, the relative solvent accessible area is used to classify it: if SAS > 20%, the mutation is classified as Uncertain, otherwise it is not classified.

      Thresholds here were selected according to best practices followed by the tool authors and more in general in the literature, as the reviewer also noticed.

      • "Overall, with the examples in this section, we illustrate different applications of the MAVISp results, spanning from benchmarking purposes, using the experimental data to link predicted functional effects with structural mechanisms or using experimental data to validate the predictions from the MAVISp modules."

      The last of these points is not an application of MAVISp, but rather a way in which external data can help validate MAVISp results. Furthermore, none of the examples given demonstrate an application in benchmarking (what is being benchmarked?).

      We have revised the statements to avoid this confusion in the reader.

      • Transcription factors section. This section describes an intended future expansion to MAVISp, not a current feature, and presents no results. As such, it should be moved to the conclusions/future directions section.

      We have removed this section and included a mention in the conclusions as part of the future directions.

      • Figures. The dot-plots generated by the web app, and in Figures 4, 5 and 6 have 2 legends. After looking at a few, it is clear that the lower legend refers to the colour of the variant on the X-axis - most likely referencing the ClinVar effect category. This is not, however, made clear either on the figures or in the app.

      The reviewer’s interpretation on the second legend is correct - it does refer to the ClinVar classification. Nonetheless, we understand the positioning of the legend makes understanding what the legend refers to not obvious. We also revised the captions of the figures in the main text. On the web app, we have changed the location of the figure legend for the ClinVar effect category and added a label to make it clear what the classification refers to.

      • "We identified ten variants reported in ClinVar as VUS (E102K, H86D, T29I, V91I, P2R, L44P, L44F, D56G, R11L, and E25Q, Fig.5a)" E25Q is benign in ClinVar and has had that status since first submitted.

      We have corrected this in the text and the statements related to it.

      Significance

      Platforms that aggregate predictors of variant effect are not a new concept, for example dbNSFP is a database of SNV predictions from variant effect predictors and conservation predictors over the whole human proteome. Predictors such as CADD and PolyPhen-2 will often provide a summary of other predictions (their features) when using their platforms. MAVISp's unique angle on the problem is in the inclusion of diverse predictors from each of its different moules, giving a much wider perspective on variants and potentially allowing the user to identify the mechanistic cause of pathogenicity. The visualisation aspect of the web app is also a useful addition, although the user interface is somewhat awkward. Potentially the most valuable aspect of this study is the associated gitbook resource containing reports from biocurators for proteins that link relevant literature and analyse ClinVar variants. Unfortunately, these are only currently available for a small minority of the total proteins in the database with such reports. For improvement, I think that the paper should focus more on the precise utility of the web app / gitbook reports and how to interpret the results rather than going into detail about the underlying pipeline.

      We appreciate the interest in the gitbook resource that we also see as very valuable and one of the strengths of our work. We have now implemented a new strategy based on a Python script introduced in the mavisp toolkit to generate a template Markdown file of the report that can be further customized and imported into GitBook directly (​​https://github.com/ELELAB/mavisp_accessory_tools/). This should allow us to streamline the production of more reports. We are currently assigning proteins in batches for reporting to biocurator through the mavisp_data_collection GitHub to expand their coverage. Also, we revised the text and added a section on the interpretation of results from MAVISp. with a focus on the utility of the web-app and reports.

      In terms of audience, the fast look-up and visualisation aspects of the web-platform are likely to be of interest to clinicians in the interpretation of variants of unknown clinical significance. The ability to download the fully processed dataset on a per-protein database would be of more interest to researchers focusing on specific proteins or those taking a broader view over multiple proteins (although a facility to download the whole database would be more useful for this final group).

      While our website only displays the dataset per protein, the whole dataset, including all the MAVISp entries, is available at our OSF repository (https://osf.io/ufpzm/), which is cited in the paper and linked on the MAVISp website. We have further modified the MAVISp database to add a link to the repository in the modes page, so that it is more visible.

      My expertise. - I am a protein bioinformatician with a background in variant effect prediction and large-scale data analysis.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Evidence, reproducibility and clarity:

      Summary:

      The authors present MAVISp, a tool for viewing protein variants heavily based on protein structure information. The authors have done a very impressive amount of curation on various protein targets, and should be commended for their efforts. The tool includes a diverse array of experimental, clinical, and computational data sources that provides value to potential users interested in a given target.

      Major comments:

      Unfortunately I was not able to get the website to work correctly. When selecting a protein target in simple mode, I was greeted with a completely blank page in the app window. In ensemble mode, there was no transition away from the list of targets at all. I'm using Firefox 140.0.2 (64-bit) on Ubuntu 22.04. I would like to explore the data myself and provide feedback on the user experience and utility.

      We have tried reproducing the issue mentioned by the reviewer, using the exact same Ubuntu and Firefox versions, but unfortunately failed to produce it. The website worked fine for us under such an environment. The issue experienced by the reviewer may have been due to either a temporary issue with the web server or a problem with the specific browser environment they were working in, which we are unable to reproduce. It would be useful to know the date that this happened to verify if it was a downtime on the DTU IT services side that made the webserver inaccessible.

      I have some serious concerns about the sustainability of the project and think that additional clarifications in the text could help. Currently is there a way to easily update a dataset to add, remove, or update a component (for example, if a new predictor is published, an error is found in a predictor dataset, or a predictor is updated)? If it requires a new round of manual curation for each protein to do this, I am worried that this will not scale and will leave the project with many out of date entries. The diversity of software tools (e.g., three different pipeline frameworks) also seems quite challenging to maintain.

      We appreciate the reviewer’s concerns about long-term sustainability. It is a fair point that we consider within our steering group, who oversee and plans the activities and meet monthly. Adding entries to MAVISp is moving more and more towards automation as we grow. We aim to minimize the manual work where applicable. Still, an expert-based intervention is really needed in some of the steps, and we do not want to renounce it. We intend to keep working on MAVISp to make the process of adding and updating entries as automated as possible, and to streamline the process when manual intervention is necessary. From the point of view of the biocurators, they have three core workflows to use for the default modules, which also automatically cover the source of annotations. We are currently working to streamline the procedures behind LOCAL_INTERACTION, which is the most challenging one. On the data manager and maintainers' side, we have workflows and protocols that help us in terms of automation, quality control, etc, and we keep working to improve them. Among these, we have workflows to use for the old entries updates. As an example, the update of erroneously attributed RefSeq data (pointed out by reviewer 2) took us only one week overall (from assigning revisions and importing to the database) because we have a reduced version of Snakemake for automation that can act on only the affected modules. Also, another point is that we have streamlined the generation of the templates for the gitbook reports (see also answer to reviewer 2).

      The update of old entries is planned and made regularly. We also deposit the old datasets on OSF for transparency, in case someone needs to navigate and explore the changes. We have activities planned between May and August every year to update the old entries in relation to changes of protocols in the modules, updates in the core databases that we interact with (COSMIC, Clinvar etc). In case of major changes, the activities for updates continue in the Fall. Other revisions can happen outside these time windows if an entry is needed or a specific research project and needs updates too.

      Furthermore, the community of people contributing to MAVISp as biocurators or developers is growing and we have scientists contributing from other groups in relation to their research interest. We envision that for this resource to scale up, our team cannot be the only one producing data and depositing it to the database. To facilitate this we launched a pilot for a training event online (see Event page on the website) and we will repeat it once per year. We also organize regular meetings with all the active curators and developers to plan the activities in a sustainable manner and address the challenges we encounter.

      As stated in the manuscript, currently with the team of people involved, automatization and resources that we have gathered around this initiative we can provide updates to the public database every third month and we have been regularly satisfied with them. Additionally, we are capable of processing from 20 to 40 proteins every month depending also on the needs of revision or expansion of analyses on existing proteins. We also depend on these data for our own research projects and we are fully committed to it.

      Additionally, we are planning future activities in these directions to improve scale up and sustainability:

      • Streamlining manual steps so that they are as convenient as fast as possible for our curators, e.g. by providing custom pages on the MAVISp website
      • Streamline and automatize the generation of useful output, for instance the reports, by using a combination of simple automation and large language models
      • Implement ways to share our software and scripts with third parties, for instance by providing ready made (or close to) containers or virtual machines
      • For a future version 2 if the database grows in a direction that is not compatible with Streamlit, the web data science framework we are currently using, we will rewrite the website using a framework that would allow better flexibility and performance, for instance using Django and a proper database backend. On the same theme, according to the GitHub repository, the program relies on Python 3.9, which reaches end of life in October 2025. It has been tested against Ubuntu 18.04, which left standard support in May 2023. The authors should update the software to more modern versions of Python to promote the long-term health and maintainability of the project.

      We thank the reviewer for this comment - we are aware of the upcoming EOL of Python 3.9. We tested MAVISp, both software package and web server, using Python 3.10 (which is the minimum supported version going forward) and Python 3.13 (which is the latest stable release at the time of writing) and updated the instructions in the README file on the MAVISp GitHub repository accordingly.

      We plan on keeping track of Python and library versions during our testing and updating them when necessary. In the future, we also plan to deploy Continuous Integration with automated testing for our repository, making this process easier and more standardized.

      I appreciate that the authors have made their code and data available. These artifacts should also be versioned and archived in a service like Zenodo, so that researchers who rely on or want to refer to specific versions can do so in their own future publications.

      Since 2024, we have been reporting all previous versions of the dataset on OSF, the repository linked to the MAVISp website, at https://osf.io/ufpzm/files/osfstorage (folder: previous_releases). We prefer to keep everything under OSF, as we also use it to deposit, for example, the MD trajectory data.

      Additionally, in this GitHub page that we use as a space to interact between biocurators, developers, and data managers within the MAVISp community, we also report all the changes in the NEWS space: https://github.com/ELELAB/mavisp_data_collection

      Finally, the individual tools are all available in our GitHub repository, where version control is in place (see Table S1, where we now mapped all the resources used in the framework)

      In the introduction of the paper, the authors conflate the clinical challenges of variant classification with evidence generation and it's quite muddled together. They should strongly consider splitting the first paragraph into two paragraphs - one about challenges in variant classification/clinical genetics/precision oncology and another about variant effect prediction and experimental methods. The authors should also note that they are many predictors other than AlphaMissense, and may want to cite the ClinGen recommendations (PMID: 36413997) in the intro instead.

      We revised the introduction in light of these suggestions. We have split the paragraph as recommended and added a longer second paragraph about VEPs and using structural data in the context of VEPs. We have also added the citation that the reviewer kindly recommended.

      Also in the introduction on lines 21-22 the authors assert that "a mechanistic understanding of variant effects is essential knowledge" for a variety of clinical outcomes. While this is nice, it is clearly not the case as we can classify variants according to the ACMG/AMP guidelines without any notion of specific mechanism (for example, by combining population frequency data, in silico predictor data, and functional assay data). The authors should revise the statement so that it's clear that mechanistic understanding is a worthy aspiration rather than a prerequisite.

      We revised the statement in light of this comment from the reviewer

      In the structural analysis section (page 5, lines 154-155 and elsewhere), the authors define cutoffs with convenient round numbers. Is there a citation for these values or were these arbitrarily chosen by the authors? I would have liked to see some justification that these assignments are reasonable. Also there seems to be an error in the text where values between -2 and -3 kcal/mol are not assigned to a bin (I assume they should also be uncertain). There are other similar seemingly-arbitrary cutoffs later in the section that should also be explained.

      We have revised the text making the two intervals explicit, for better clarity.

      On page 9, lines 294-298 the authors talk about using the PTEN data from ProteinGym, rather than the actual cutoffs from the paper. They get to the latter later on, but I'm not sure why this isn't first? The ProteinGym cutoffs are somewhat arbitrarily based on the median rather than expert evaluation of the dataset, and I'm not sure why it's even worth mentioning them when proper classifications are available. Regarding PTEN, it would be quite interesting to see a comparison of the VAMP-seq PTEN data and the Mighell phosphatase assay, which is cited on page 9 line 288 but is not actually a VAMP-seq dataset. I think this section could be interesting but it requires some additional attention.

      We have included the data from Mighell’s phosphatase assay as provided by MAVEdb in the MAVISp database, within the experimental_data module for PTEN, and we have revised the case study, including them and explaining better the decision of supporting both the ProteinGym and MAVEdb classification in MAVISp (when available). See revised Figure3, Table 1 and corresponding text.

      The authors mention "pathogenicity predictors" and otherwise use pathogenicity incorrectly throughout the manuscript. Pathogenicity is a classification for a variant after it has been curated according to a framework like the ACMG/AMP guidelines (Richards 2015 and amendments). A single tool cannot predict or assign pathogenicity - the AlphaMissense paper was wrong to use this nomenclature and these authors should not compound this mistake. These predictors should be referred to as "variant effect predictors" or similar, and they are able to produce evidence towards pathogenicity or benignity but not make pathogenicity calls themselves. For example, in Figure 4e, the terms "pathogenic" and "benign" should only be used here if these are the classifications the authors have derived from ClinVar or a similar source of clinically classified variants.

      The reviewer is correct, we have revised the terminology we used in the manuscript and refers to VEPs (Variant Effect Predictors)

      Minor comments:

      The target selection table on the website needs some kind of text filtering option. It's very tedious to have to find a protein by scrolling through the table rather than typing in the symbol. This will only get worse as more datasets are added.

      We have revised the website, adding a filtering option. In detail, we have refactored the web app by adding filtering functionality, both for the main protein table (that can now be filtered by UniProt AC, gene name, or RefSeq ID) and the mutations table. Doing this required a general overhaul of the table infrastructure (we changed the underlying engine that renders the tables).

      The data sources listed on the data usage section of the website are not concordant with what is in the paper. For example, MaveDB is not listed.

      We have revised and updated the data sources on the website, adding a metadata section with relevant information, including MaveDB references where applicable.

      Figure 2 is somewhat confusing, as it partially interleaves results from two different proteins. This would be nicer as two separate figures, one on each protein, or just of a single protein.

      As suggested by the reviewer, we have now revised the figure and corresponding legends and text, focusing only on one of the two proteins.

      Figure 3 panel b is distractingly large and I wonder if the authors could do a little bit more with this visualization.

      We have revised Figure 3 to solve these issues and integrating new data from the comparison with the phosphatase assay

      Capitalization is inconsistent throughout the manuscript. For example, page 9 line 288 refers to VampSEQ instead of VAMP-seq (although this is correct elsewhere). MaveDB is referred to as MAVEdb or MAVEDB in various places. AlphaMissense is referred to as Alphamissense in the Figure 5 legend. The authors should make a careful pass through the manuscript to address this kind of issues.

      We have carefully proofread the paper for these inconsistencies

      MaveDB has a more recent paper (PMID: 39838450) that should be cited instead of/in addition to Esposito et al.

      We have added the reference that the reviewer recommended

      On page 11, lines 338-339 the authors mention some interesting proteins including BLC2, which has base editor data available (PMID: 35288574). Are there plans to incorporate this type of functional assay data into MAVISp?

      The assay mentioned in the paper refers to an experimental setup designed to investigate mutations that may confer resistance to the drug venetoclax. We started the first steps to implement a MAVISp module aimed at evaluating the impact of mutations on drug binding using alchemical free energy perturbations (ensemble mode) but we are far from having it complete. We expect to import these data when the module will be finalized since they can be used to benchmark it and BCL2 is one of the proteins that we are using to develop and test the new module.

      Reviewer #3 (Significance (Required)):

      Significance:

      General assessment:

      This is a nice resource and the authors have clearly put a lot of effort in. They should be celebrated for their achievments in curating the diverse datasets, and the GitBooks are a nice approach. However, I wasn't able to get the website to work and I have raised several issues with the paper itself that I think should be addressed.

      Advance:

      New ways to explore and integrate complex data like protein structures and variant effects are always interesting and welcome. I appreciate the effort towards manual curation of datasets. This work is very similar in theme to existing tools like Genomics 2 Proteins portal (PMID: 38260256) and ProtVar (PMID: 38769064). Unfortunately as I wasn't able to use the site I can't comment further on MAVISp's position in the landscape.

      We have expanded the conclusions section to add a comparison and cite previously published work, and linked to a review we published last year that frames MAVISp in the context of computational frameworks for the prediction of variant effects. In brief, the Genomics 2 Proteins portal (G2P) includes data from several sources, including some overlapping with MAVISp such as Phosphosite or MAVEdb, as well as features calculated on the protein structure. ProtVar also aggregates mutations from different sources and includes both variant effect predictors and predictions of changes in stability upon mutation, as well as predictions of complex structures. These approaches are only partially overlapping with MAVISp. G2P is primarily focused on structural and other annotations of the effect of a mutation; it doesn’t include features about changes of stability, binding, or long-range effects, and doesn’t attempt to classify the impact of a mutation according to its measurements. It also doesn’t include information on protein dynamics. Similarly, ProtVar does include information on binding free energies, long effects, or dynamical information.

      Audience:

      MAVISp could appeal to a diverse group of researchers who are interested in the biology or biochemistry of proteins that are included, or are interested in protein variants in general either from a computational/machine learning perspective or from a genetics/genomics perspective.

      My expertise:

      I am an expert in high-throughput functional genomics experiments and am an experienced computational biologist with software engineering experience.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      The authors present a pipeline and platform, MAVISp, for aggregating, displaying and analysis of variant effects with a focus on reclassification of variants of uncertain clinical significance and uncovering the molecular mechanisms underlying the mutations.

      Major comments:

      • On testing the platform, I was unable to look-up a specific variant in ADCK1 (rs200211943, R115Q). I found that despite stating that the mapped refseq ID was NP_001136017 in the HGVSp column, it was actually mapped to the canonical UniProt sequence (Q86TW2-1). NP_001136017 actually maps to Q86TW2-3, which is missing residues 74-148 compared to the -1 isoform. The Uniprot canonical sequence has no exact RefSeq mapping, so the HGVSp column is incorrect in this instance. This mapping issue may also affect other proteins and result in incorrect HGVSp identifiers for variants.
      • The paper lacks a section on how to properly interpret the results of the MAVISp platform (the case-studies are useful, but don't lay down any global rules for interpreting the results). For example: How should a variant with conflicts between the variant impact predictors be interpreted? Are certain indicators considered more 'reliable' than others?
      • In the Methods section, GEMME is stated as being rank-normalised with 0.5 as a threshold for damaging variants. On checking the data downloaded from the site, GEMME was not rank-normalised but rather min-max normalised. Furthermore, Supplementary text S4 conflicts with the methods section over how GEMME scores are classified, S4 states that a raw-value threshold of -3 is used.
      • Note. This is a major comment as one of the claims is that the associated web-tool is user-friendly. While functional, the web app is very awkward to use for analysis on any more than a few variants at once.
        • The fixed window size of the protein table necessitates excessive scrolling to reach your protein-of-interest. This will also get worse as more proteins are added. Suggestion: add a search/filter bar.
        • The same applies to the dataset window.
        • You are unable to copy anything out of the tables.
        • Hyperlinks in the tables only seem to work if you open them in a new tab or window.
        • All entries in the reference column point to the MAVISp preprint even when data from other sources is displayed (e.g. MAVE studies).
        • Entering multiple mutants in the "mutations to be displayed" window is time-consuming for more than a handful of mutants. Suggestion: Add a box where multiple mutants can be pasted in at once from an external document.

      Minor comments

      • Grammar. I appreciate that this manuscript may have been compiled by a non-native English speaker, but I would be remiss not to point out that there are numerous grammar errors throughout, usually sentence order issues or non-pluralisation. The meaning of the authors is mostly clear, but I recommend very thoroughly proof-reading the final version.
      • There are numerous proteins that I know have high-quality MAVE datasets that are absent in the database e.g. BRCA1, HRAS and PPARG.
      • Checking one of the existing MAVE datasets (KRAS), I found that the variants were annotated as damaging, neutral or given a positive score (these appear to stand-in for gain-of-function variants). For better correspondence with the other columns, those with positive scores could be labelled as 'ambiguous' or 'uncertain'.
      • Numerous thresholds are defined for stabilizing / destabilizing / neutral variants in both the STABILITY and the LOCAL_INTERACTION modules. How were these thresholds determined? I note that (PMC9795540) uses a ΔΔG threshold of 1/-1 for defining stabilizing and destabilizing variants, which is relatively standard (though they also say that 2-3 would likely be better for pinpointing pathogenic variants).
      • "Overall, with the examples in this section, we illustrate different applications of the MAVISp results, spanning from benchmarking purposes, using the experimental data to link predicted functional effects with structural mechanisms or using experimental data to validate the predictions from the MAVISp modules."

      The last of these points is not an application of MAVISp, but rather a way in which external data can help validate MAVISp results. Furthermore, none of the examples given demonstrate an application in benchmarking (what is being benchmarked?). - Transcription factors section. This section describes an intended future expansion to MAVISp, not a current feature, and presents no results. As such, it should probably be moved to the conclusions/future directions section. - Figures. The dot-plots generated by the web app, and in Figures 4, 5 and 6 have 2 legends. After looking at a few, it is clear that the lower legend refers to the colour of the variant on the X-axis - most likely referencing the ClinVar effect category. This is not, however, made clear either on the figures or in the app. - "We identified ten variants reported in ClinVar as VUS (E102K, H86D, T29I, V91I, P2R, L44P, L44F, D56G, R11L, and E25Q, Fig.5a)"

      E25Q is benign in ClinVar and has had that status since first submitted.

      Significance

      Platforms that aggregate predictors of variant effect are not a new concept, for example dbNSFP is a database of SNV predictions from variant effect predictors and conservation predictors over the whole human proteome. Predictors such as CADD and PolyPhen-2 will often provide a summary of other predictions (their features) when using their platforms. MAVISp's unique angle on the problem is in the inclusion of diverse predictors from each of its different moules, giving a much wider perspective on variants and potentially allowing the user to identify the mechanistic cause of pathogenicity. The visualisation aspect of the web app is also a useful addition, although the user interface is somewhat awkward. Potentially the most valuable aspect of this study is the associated gitbook resource containing reports from biocurators for proteins that link relevant literature and analyse ClinVar variants. Unfortunately, these are only currently available for a small minority of the total proteins in the database with such reports.

      For improvement, I think that the paper should focus more on the precise utility of the web app / gitbook reports and how to interpret the results rather than going into detail about the underlying pipeline.

      In terms of audience, the fast look-up and visualisation aspects of the web-platform are likely to be of interest to clinicians in the interpretation of variants of unknown clinical significance. The ability to download the fully processed dataset on a per-protein database would be of more interest to researchers focusing on specific proteins or those taking a broader view over multiple proteins (although a facility to download the whole database would be more useful for this final group).

      My expertise.

      • I am a protein bioinformatician with a background in variant effect prediction and large-scale data analysis.
    1. Author response:

      The following is the authors’ response to the original reviews

      Reviewer 1 (Public review):

      This research investigates how the cellular protein quality control machinery influences the effectiveness of cystic fibrosis (CF) treatments across different genetic variants. CF is caused by mutations in the CFTR gene, with over 1,700 known disease-causing variants that primarily work through protein misfolding mechanisms. While corrector drugs like those in Trikafta therapy can stabilize some misfolded CFTR proteins, the reasons why certain variants respond to treatment while others don't remain unclear. The authors hypothesized that the cellular proteostasis network-the machinery that manages protein folding and quality control-plays a crucial role in determining drug responsiveness across different CFTR variants. The researchers focused on calnexin (CANX), a key chaperone protein that recognizes misfolded glycosylated proteins. Using CRISPR-Cas9 gene editing combined with deep mutational scanning, they systematically analyzed how CANX affects the expression and corrector drug response of 234 clinically relevant CF variants in HEK293 cells. 

      In terms of findings, this study revealed that CANX is generally required for robust plasma membrane expression of CFTR proteins, and CANX disproportionately affects variants with mutations in the C-terminal domains of CFTR and modulates later stages of protein assembly. Without CANX, many variants that would normally respond to corrector drugs lose their therapeutic responsiveness. Furthermore, loss of CANX caused broad changes in how CF variants interact with other cellular proteins, though these effects were largely separate from changes in CFTR channel activity. 

      This study has some limitations: the research was conducted in HEK293 cells rather than lung epithelial cells, which may not fully reflect the physiological context of CF. Additionally, the study only examined known diseasecausing variants and used methodological approaches that could potentially introduce bias in the data analysis. 

      We agree that the approaches employed here are not fully physiological, though we would remind the reviewer that we previously benchmarked the results generated by this experimental platform against a variety of other published datasets (PMID: 37253358). Regarding the issue of bias, we outline several pieces of evidence suggesting we retain robust and near-uniform sampling of these variants across these experimental conditions. We hope our comments below address all of these concerns. Overall, we believe deep mutational scanning is actually remarkably unbiased relative to other approaches due to the fact that all measurements are taken from a single dish of cells that is processed in parallel. Moreover, we show the trends are highly reproducible across replicates and users (see Figure S1). 

      How cellular quality control mechanisms influence the therapeutic landscape of genetic diseases is an emerging field. Overall, this work provides important cellular context for understanding CF mutation severity and suggests that the proteostasis network significantly shapes how different CFTR variants respond to corrector therapies. The findings could pave the way for more personalized CF treatments tailored to patients' specific genetic variants and cellular contexts. 

      Strengths: 

      (1) This work makes an important contribution to the field of variant effect prediction by advancing our understanding of how genetic variants impact protein function. 

      (2) The study provides valuable cellular context for CFTR mutation severity, which may pave the way for improved CFTR therapies that are customized to patient-specific cellular contexts. 

      (3) The research provides further insight into the biological mechanisms underlying approved CFTR therapies, enhancing our understanding of how these treatments work. 

      (4) The authors conducted a comprehensive and quantitative analysis, and they made their raw and processed data as well as analysis scripts publicly available, enabling closer examination and validation by the broader scientific community. 

      We are grateful for this broad perspective on the general relevance of this work.

      Weaknesses: 

      (1) The study only considers known disease-causing variants, which limits the scope of findings and may miss important insights from variants of uncertain significance. 

      We agree with this caveat. A more comprehensive library of CFTR variants will undoubtedly be useful for assigning variants of uncertain significance, though we note that such a large library would involve trade-offs in depth/ coverage that will compromise the sensitivity/ precision of the measurements. This will, in turn, make it challenging to compare the effects of CFTR modulators across the spectrum of clinical variants. For this reason, we believe the current library will remain a useful tool for CF variant theratyping.

      (2) The cellular context of HEK293 cells is quite removed from lung epithelia, the primary tissue affected in cystic fibrosis, potentially limiting the clinical relevance of the findings. 

      We concede this limitation, but note that we did carry out functional measurements in FRT monolayers, which are a prevailing model that closely mimics pharmacological outcomes in the clinic (see Fig. 6). 

      (3) Methodological choices, such as the expansion of sorted cell populations before genetic analysis, may introduce possible skew or bias in the data that could affect interpretation. 

      We respectfully disagree with this point. The recombination system we employ in these studies generates millions of recombinant cells per transfection, which corresponds to tens of thousands of clones per variant. Moreover, our sequencing data contain exhaustive coverage of every variant characterized herein within each of the final data sets. Generally, we do not see any evidence to suggest certain variants are lost from the population. We note that, while HEK293T cells are not the most physiological relevant system, they are robust to uniformly express these variants in a manner that provides a precise comparison of their effects and/ or response to CFTR modulators. To address this concern, we added Document S1 to the revised draft, which shows the total number of reads for each variant within each fraction and each experiment.

      (4) While the impact on surface trafficking is convincingly demonstrated, how cellular proteostasis affects CFTR function requires further study, likely within a lung-specific cellular context to be more clinically relevant.

      We agree with this caveat.

      Reviewer 1 (Recommendations for the authors):

      Major Issues

      Cell Growth Bias? After sorting cell populations into quartiles, cells were expanded before genetic analysis - if CFTR variants affect cell doubling time (e.g., severely misfolded variants causing cellular stress), this could skew variant abundance within sorted quartiles and bias results.

      Based on several observations, we do not believe this to be a significant issue. First, we note that we previously benchmarked the quantitative outputs of these experiments against a variety of other investigations and found very good agreement with previous variant classifications and expression levels (PMID: 37253358). If there were significant bias, we believe this would have come up in our efforts to benchmark the assay. Second, we note that we typically create recombinant cell lines that express WT or ΔF508 CFTR only alongside each recombinant cellular library. Importantly, we have never observed any difference in the growth rate of cultures expressing different CFTR variants. Third, even if cells expressing certain variants grow slower, it seems likely this slow growth would consistently occur in the context of each sorted subpopulation. Given that scores are derived from the relative amount of identifications across each subpopulation, we do not suspect this should impact the scoring. Overall, we believe the robustness of this cell line is a key feature that allows us to avoid any such issues related to proteostatic toxicity.

      (1) Please add methodological detail. The data analysis pipeline lacks adequate description beyond referencing prior studies - essential details about what the Plasma Membrane Expression (PME) values represent (fold enrichment vs input library) and calculation methods must be provided.

      We thank the reviewer for this helpful comment. We have added the text below to the revised manuscript in order to provide more detail to the reader:

      “Briefly, low quality reads that likely contain more than one error were first removed from the demultiplexed sequencing data. Unique molecular identifier sequences within the remaining reads were then counted within each sample to track the relative abundance of each variant. To compare read counts across fractions, the collection of reads within each population were then randomly down-sampled to ensure a consistent total read count across each sub-population. The surface immunostaining of each variant was then estimated by calculating the the weighted-average immunostaining intensity for each variant using the following equation:

      where ⟨I⟩<sub>variant</sub> is the weighted-average fluorescence intensity of a given variant, ⟨F⟩<sub>i</sub> is the mean fluorescence intensity associated with cells from the ith FACS quartile, and Ni is the number of variant reads in the i<sup>th</sup> FACS quartile. Variant intensities from each replicate were normalized relative to one another using the mean surface immunostaining intensity of the entire recombinant cell population for each experiment to account for small variations in laser power and/ or detector voltage. Finally, to filter out any noisy scores arising from insufficient sampling, we repeated the down-sampling and scoring process then rejected any variant measurements that exhibit more than X% variation in their intensity scores across the two replicate analyses. The reported intensity values represent the average normalized intensity values from two independent down-sampling iterations across three biologicals replicates.”

      (3) Add detail on library composition. The distribution of CFTR variants within the parental HEK293T library after landing pad insertion needs documentation, including any variant dropout or overrepresentation issues.

      As noted in our previous work (PMID: 37253358), our CF variant library is quite uniform, with each mutant contributing on average, 0.43% of the library with a standard deviation of +/- 0.16%. This corresponds to an average read depth of over 40K reads per variant, per experimental condition in the final analyses. Indeed, the most abundant variant in the pool was ΔF508 (1.67% of total reads). In contrast, the least sampled variant was S549R (1647T>G) was still sampled an average of 3,688 times per replicate, which corresponds to 0.09% of the total reads. See Doc S1.

      (4) Documentation of CFTR variant overlap between parental and CANX KO HEK293T libraries is needed, including whether every variant was present at equivalent input abundance in both libraries.

      We thank the reviewer for this suggestion. Though there are small deviations in the composition of recombinant parental and knockout cell lines, the relative abundances of individual variants within the recombinant populations only differs by an average of 18.5% between the parental and knockout lines. There are no cases in which we observe a single variant increasing by more than 50% in the knockout line relative to the parent. However, there is a single variant, Y563N, that exhibits a 96% decrease in its abundance in the context of the knockout cell line. Nevertheless, even this variant was sampled over 1,000 times, and it’s final score passed all quality control metrics. In the revised draft, we have provided a complete table containing the total number of reads and percent of total reads for each variant for each cell line and condition (see Doc. S1).

      (5) The section reporting CANX impact on functional rescue of CF variants requires clearer logic flow - the conclusion about higher specific activity of CFTR assembled without CANX appears misleading, given later discussion about CANX allowing suboptimally folded CFTR to traffic to the surface.

      We apologize for any confusion. We invoked the term “specific activity” in the enzymological sense, which is to say the proportion of active enzyme (i.e. channel) at the plasma membrane differs in the knockout line. The logic is quite simple- if protein levels are lower while ion conductance remains the same in the knockout cells, then a higher proportion of the mature channels must be inactive in the parental cell line. Thus, we suspect fewer of the channels at the plasma membrane are active in the context of the parental cell line containing CANX. We considered modifications to the text in the discussion, but ultimately feel the current text strikes a reasonable balance between nuance and simplicity.

      (6) In your discussion, consider that HEK293T cellular context differs significantly from lung epithelia, and the hYFP quenching assay may have insufficient dynamic range or high noise for detecting relevant functional differences.

      We modified the following sentence in the discussion to introduce this possibility:

      “While these discrepancies could stem from differences in the dynamic range of the functional assays, they may also suggest the stringency of QC is more finely tuned to ion channel biosynthesis in epithelial monolayers.”

      Minor Issues

      (1) Include immunostaining quartiles as a supplementary figure overlaid on Figure 1A, and clarify whether quartiles were consistent across experiments or adjusted for each sort.

      We added a new figure to demonstrate the gating approach in the revised manuscript (see Fig. S10). We have also added the following text to the Methods section:

      “Sorting gates for surface immunostaining were independently set for each biological replicate and in each condition to ensure that the population was evenly divided into four equal subpopulations.”

      (2) Figure 2C improvements. Flip the figure 180 degrees to position MSD1 and NBD1 on the left, replace the blue-to-red color scale with yellow-to-blue or monochromatic scaling for better intermediate value differentiation.

      Respectfully, we prefer not to do this so that our figures can be easily compared across our previous and forthcoming publications. We chose this rendering because this view depicts certain trends in variant response more clearly. 

      (3) Indicate the location of ECL4 on the protein structure shown in Figure 2C for better reference.

      We appreciate the suggestion. However, most of ECL4 is missing from the experimental cryo-EM models of CFTR due to a lack of density. For this reason, we did not modify the figure. 

      Reviewer 2 (Public review):

      In this work, the authors use deep mutational scanning (DMS) to examine the effect of the endogenous chaperone calnexin (CANX) on the plasma membrane expression (PME) and potential pharmacological stabilization cystic fibrosis disease variants. This is important because there are over 1,700 loss-of-function mutations that can lead to the disease Cystic Fibrosis (CF), and some of these variants can be pharmacologically rescued by small-molecule "correctors," which stabilize the CFTR protein and prevent its degradation. This study expands on previous work to specifically identify which mutations affect sensitivity to CFTR modulators, and further develops the work by examining the effect of a known CFTR interactor-CANX-on PME and corrector response. 

      Overall, this approach provides a useful atlas of CF variants and their downstream effects, both at a basal level as well as in the context of a perturbed proteostasis. Knockout of CANX leads to an overall reduced plasma membrane expression of CFTR with CF variants located at the C-terminal domains of CFTR, which seem to be more affected than the others. This study then repeats their DMS approach, using PME as a readout, to probe the effect of either VX-445 or VX-455 + VX-661-which are two clinically relevant CFTR pharmacological modulators. I found this section particularly interesting for the community because the exact molecular features that confer drug resistance/sensitivity are not clear. When CANX is knocked out, cells that normally respond to VX-445 are no longer able to be rescued, and the DMS data show that these non-responders are CF variants that lie in the VX-445 binding site. Based on computational data, the authors speculate that NBD2 assembly is compromised, but that remains to be experimentally examined. Cells lacking CANX were also resistant to combinatorial treatment of VX-445 + VX-661, showing that these two correctors were unable to compensate for the lack of this critical chaperone. 

      One major strength of this manuscript is the mass spectrometry data, in which 4 CF variants were profiled in parental and CANX KO cells. This analysis provides some explanatory power to the observation that the delF508 variant is resistant to correctors in CANX KO cells, which is because correctors were found not to affect protein degradation interactions in this context. Findings such as this provide potential insights into intriguing new hypothesis, such as whether addition of an additional proteostasis regulators, such as a proteosome inhibitor, would facilitate a successful rescue. Taken together, the data provided can be generative to researchers in the field and may be useful in rationalizing some of the observed phenotypes conferred by the various CF variants, as well as the impact of CANX on those effects. 

      To complete their analysis of CF variants in CANX KO cells, the research also attempted to relate their data, primarily based on PME, to functional relevance. They observed that, although CANX KO results in a large reduction in PME (~30% reduction), changes in the actual activation of CFTR (and resultant quenching of their hYFP sensor) were "quite modest." This is an important experiment and caveat to the PME data presented above since changes in CFTR activity does not strictly require changes in PME. In addition, small molecule correctors also do not drastically alter CFTR function in the context of CANX KO. The authors reason that this difference is due to a sort of compensatory mechanism in which the functionally active CFTR molecules that are successfully assembled in an unbalanced proteostasis system (CANX KO) are more active than those that are assembled with the assistance of CANX. While I generally agree with this statement, it is not directly tested and would be challenging to actually test. 

      The selected model for all the above experiments was HEK293T cells. The authors then demonstrate some of their major findings in Fischer rat thyroid cell monolayers. Specifically, cells lacking CANX are less sensitive to rescue by CFTR modulators than the WT. This highlights the importance of CANX in supporting the maturation of CFTR and the dependence of chemical correctors on the chaperone. Although this is demonstrated specifically for CANX in this manuscript, I imagine a more general claim can be made that chemical correctors depend on a functional/balanced proteostasis system, which is supported by the manuscript data. I am surprised by the discordance between HEK293T PME levels compared to the CTFR activity. The authors offer a reasonable explanation about the increase in specific activity of the mature CFTR protein following CANX loss. 

      For the conclusions and claims relevant to CANX and CF variant surveying of PME/function, I find the manuscript to provide solid evidence to achieve this aim. The manuscript generates a rich portrait of the influence of CF mutations both in WT and CANX KO cells. While the focus of this study is a specific chaperone, CANX, this manuscript has the potential to impact many researchers in the broad field of proteostasis.

      We thank the reviewer for their thoughtful and comprehensive perspectives on the scope and relevance of this work.

      Reviewer 2 (Recommendations for the authors):

      While I did not identify any major weaknesses in this manuscript, I offer some suggestions below, as well as some conclusions to consider:

      (1) Missing period at the end of line 51.

      We thank the reviewer for catching this grammatical error and have added proper punctuation.

      (2)Figure S1 "repre-sent"??

      We have corrected this punctuation error.

      (3) Figure S2 missing parentheses A)

      We have corrected the punctuation error.

      (4) Figure S5, "B) The total ΔRMSD of the active conformation of NBD2 is shown for variants bound to VX-445. Red bars show increasing deviations from the native NBD2 conformation in the mutant models, and blue bars show how much VX-445 suppresses these conformational defects in NBD2."

      VX-445 should not bind/stabilize the G85E from the calculations in Figure S5A. As a confirmation, it would be nice to see the calculated hypothetical effect of VX-445 in the G85E variant as performed for L1077P and N1303K. I also want to point out that G58E is referred to as being non-responsive in S5A, but then in S5D, N103K is referred to as non-responsive, but this variant falls pretty far below the stabilized region calculated in S5A, right?

      We agree that it would be insightful to examine the RMSD changes in a non-responsive variant such as G85E. We added the G85E NBD2 ∆RMSD to Supplemental Figure S5B and a G85E ∆RMSD structure map as an additional subpanel at Supplemental Figure S5C. As the reviewer expected, VX-445 fails to confer any stability to G85E as shown by a lack of significant change in NBD2 ∆RMSD or any visible ∆RMSD throughout the structure.  Finally, we acknowledge that N1303K falls below the stabilized region as calculated in S5A. However, we note that the binding energy only suggests it is likely to interact with the protein- this does not to necessarily mean that binding will allosterically suppress conformational defects in NBD2. Moreover, this is simply an in silico calculation, that does not necessarily capture all of the nuanced interactions in the cell (or lack thereof). We have corrected this in the Figure S5 caption, which reads as follows:

      “Maps of the change in RMSD between N1303K modeled with and without VX-445 shows that few structural regions are stabilized by VX-445 for N1303K, which responds poorly to VX-445 in vitro.”

      (5) "stan-dard" standard?

      We have corrected this punctuation error.

      (6) Line 270, "these variants" is written twice

      We have corrected this typographical error.

      (7) Figure 6 B. What is being compared? The text writes "there are prominent differences in the activity of these variants [those with CANX] (two-way ANOVA, p = 3.8 x 10-27." Does this mean WT vs. delF508, P5L, V232D, T1036N, and I1366N combined? I have not seen a set of 5 variables compared to a single variable. Usually, it would be WT vs. DelF508, WT vs. P5L, WT vs. V232D...right? Maybe this is normal in this specific field. The same goes for the CANX knockout comparison "(two-way ANOVA, p = 0.06).".

      In this instance, the two-way ANOVA test is evaluating whether there are differences in the half-lives of individual variants and/ or systematic differences across the variant measurements in the knockout line relative to the parental cells. The test gives independent p-values for these two variables (variant and cell line). We chose this test because it makes it clear that, when you consider the trends together, one variable has a significant effect while the other does not.

      (8) Why don't the CFTR modulators rescue CFTR activity in the WT FRT monolayers?

      We thank the reviewer for this inquiry. Please note that compared to DMSO, VX-661 does significantly enhance the forskolin-mediated response of WT-CFTR (red asterisk). Treatments with VX-445 alone, VX-661+VX-445, or VX-661+VX-445+VX-770 showed no significant forskolin stimulation of WT-CFTR. These observations could be attributable to the brief period in which WT-CFTR cDNA is transiently transfected. However, it is not necessarily anticipated that modulators would enhance WT-CFTR function. Correctors and potentiators are designed to rescue processing and gating abnormalities, respectively. WT-CFTR channels do not exhibit such defects.

      In both constitutive overexpression systems and primary human airway epithelia, published literature demonstrates that prolonged exposure to CFTR modulators has resulted in variable consequences on WT-CFTR activity. For example, forskolin-mediated responsiveness of WT-CFTR is not altered by chronic application of VX-445 (PMID: 34615919) nor VX-770 (PMID: 28575328, 27402691, 37014818). In contrast, short-circuit current measurements show that forskolin stimulation of WT-CFTR is augmented by chronic treatment with VX-809 (PMID: 28575328), an analog of VX-661. Thus, our findings are congruent with observations reported by other groups.

      (9) General comment: As someone not familiar with the field, it would be nice to see the structures of VX-445 and VX-661 somewhere in the figures or at least in the SI.

      We appreciate this suggestion, but do not feel that we include enough structural analyses to justify a stand-alone figure for these purposes. The structures of these compounds are easily referenced on a variety of internetbased resources.

      (10) Weakness: As an ensemble, the data points CANX as required for plasma membrane expression, particularly those that lie in the C-terminal domain, but when considering individual CF variants, there is no clear trend. Similarly, when looking at the effect of the pharmacological correctors on PME, no variant strays from the linear trend.

      We generally agree that the predominant trend is a uniform decrease in CFTR PME across all variants and that individual variant effects are hard to generalize. Indeed, this latter point has been widely appreciated in the CF community for several decades. Our approach exposes this variability in detail, but we concede that we cannot yet fully interpret the full complexity of the trends.

      (11) Something to consider: Knockout of calnexin, a central ER chaperone, is going to set off the UPR, which in turn will activate the ISR and attenuate translation. From what I can tell, in general, all CF variant PME is decreased. Is this simply because less CF protein is being synthesized?

      The reviewer raises an excellent point. However, to investigate this possibility further, we compared whole-cell proteomic data for the parental and knockout cell lines. Our analysis suggests there is no significant upregulation of proteins associated with UPR activation, as is shown in the graphic to the right. In fact, only proteins associated with the PERK branch of the UPR exhibit any statistically significant changes between these two cell lines across three biological replicates. Based on this consideration, we suspect any wider changes in ER proteostasis must be relatively subtle. 

      Author response image 1.

    1. «Cum ipsi (majores homines) appellabant rem aliquam, et cum secundum earn vocem corpusad aliquid movebant, videbam, et tenebam hoc ab eis vocari rem il-lam, quod sonabant, cum earn vellent ostendere. Hoc autem eos velleex motu corporis aperiebatur: tamquam verbis naturalibus omniumgentium, quae fiunt vultu et nutu oculorum, ceterorumque membro-rum actu, et sonitu vocis indicante affectionem animi in petendis, ha-bendis, rejiciendis, fugiendisqve rebus. Ita verba in variis sententiislocis suis posita, et crebro audita, quarum rerum signa essent,paulatim colhgebam, measque jam voluntates, edomito in eis signisore, per haec enutitiabam»

      “Наблюдая, как взрослые, называя какой-нибудь предмет, поворачивались в его сторону, я постигал, что предмет обозначается произносимыми им звуками, поскольку они указывали на него. А вывод этот я делал из их жестов, этого естественного языка всех народов, языка, который мимикой, движениями глаз, членов тела, звучанием голоса выражает состояние души — когда чего-то просят, получают, отвергают, чуждаются. Так постепенно я стал понимать, какие веши обозначаются теми словами, которые я слышал вновь и вновь произносимыми в определенных местах различных предложений. И когда мои уста привыкли к этим знакам, я научился выражать ими свои желания” (лат.)

    Annotators

    1. Author response:

      The following is the authors’ response to the original reviews.

      Reviewer #1 (Public review):

      (1) The authors should provide a detailed description of the pathogenesis of Haemorrhagic Fever with Renal Syndrome (HFRS) and elaborate on the crucial role of IgG proteins in the disease's progression (line 65).

      As suggested, we have now provided a detailed description of the pathogenesis of HFRS and elaborated on the crucial role of IgG proteins in the disease's progression:

      "Hantaviruses are tri-segmented, single-stranded, negative-sense RNA viruses, whose genomes consist of three regions: large (L), medium (M), and small (S). The glycoproteins Gn and Gc, encoded by the M segment, can infect target cells - primarily vascular endothelial cells - via β3 integrin receptors (Pizarro et al., 2019). Simultaneously, they could also infect other cell types, such as mononuclear macrophages and dendritic cells, leading to systemic viral infection. Although hantavirus replication is thought to occur primarily in the vascular endothelium without direct cytopathic effects, a plethora of innate immune cells mediate host antiviral defenses. These include natural killer cells, neutrophils, monocytes, and macrophages, together with pattern recognition receptors (PRRs), interferons (IFNs), antiviral proteins, and complement activation, e.g., via the pentraxin 3 (PTX3) pathway, which can exacerbate HFRS disease progression leading to immunopathological damage through cytokine/chemokine production, cytoskeletal rearrangements in endothelial cells, ultimately amplifying vascular dysfunction (Tariq & Kim, 2022). Rapid and effective humoral immune responses, however, such as neutralizing antibody responses targeting the glycoproteins Gn/Gc, contribute to rapid recovery from HFRS and are critical for protection from severe disease (Engdahl & Crowe, 2020; Li et al., 2020)." Please see the Introduction (Page 4, lines 65-81).

      (2) An additional discussion on the significance of glycosylation, particularly IgG N-glycosylation, in viral infections should be included in the Introduction section.

      Thank you for the suggestion and we have added an additional discussion on the significance of glycosylation in viral infections in the revised Introduction section.

      "Immunoglobulin G (IgG) N-linked glycosylation mediates critical functions modulating antiviral immunity during viral infection. Changes in the conserved N-linked glycan Asn297 in the Fc region of IgG typically by fucosylation, galactosylation, or sialylation can alter antibody effector function. A reduction in core fucosylation decreases IgG binding to NK cell FcγRIIIa promotes antibody-dependent cellular cytotoxicity (ADCC) necessary for clearance of viruses, including SARS-CoV-2, dengue and HIV-1 whereas sialylation can attenuate immune responses resulting in immune evasion (Ash et al., 2022; Haslund-Gourley et al., 2024; Hou et al., 2021; Wang et al., 2017). Changes in IgG and other protein N-linked glycosylation profiles therefore shape virus-host interactions and disease progression." (Page 4, lines 82-91).

      (3) In the abstract section, the authors state that HTNV-specific IgG antibody titers were detected and IgG N-glycosylation was analyzed. However, the analysis of plasma IgG N-glycans is described in the Methods section. Therefore, the authors should clarify the glycome analysis process. Was the specific IgG glycome profile similar to the total IgG N-glycome? Given the biological relevance of specific IgG in immunological diseases, characterizing the specific IgG N-glycome profile would be more significant than analyzing the total plasma IgG.

      We are grateful to the reviewer for the comments. Previous studies on viral infections have revealed that the pattern of virus-specific IgG N-glycans may be similar to that of total IgG N-glycome, and we therefore analyzed the total plasma IgG glycosylation profiling in the HFRS patients. However, we have discussed this in the Discussion section.

      "Despite establishing a well-characterized patient cohort and performing systematic IgG glycosylation profiling based on HTNV NP antibody status, this study has several noteworthy limitations. Most notably, while preliminary comparisons suggested similar patterns between virus-specific and total IgG N-glycome, our total plasma IgG analysis may have introduced confounding factors in the observed associations. This methodological constraint could potentially affect the interpretation of certain disease-specific glycosylation signatures." Please see the Discussion (Page 12, lines 274-280). 

      References

      (1) Mads Delbo Larsen, Erik L de Graaf, Myrthe E Sonneveld, et al. Afucosylated IgG characterizes enveloped viral responses and correlates with COVID-19 severity. Science . 2021 Feb 26;371(6532):eabc8378.

      (2) Chakraborty S, Gonzalez J, Edwards K, et al. Proinflammatory IgG Fc structures in patients with severe COVID-19. Nat Immunol. 2021 Jan;22(1):67-73.

      (3) Tea Petrović, Amrita Vijay, Frano Vučković, et al. IgG N-glycome changes during the course of severe COVID-19: An observational study. EBioMedicine. 2022 Jul ;81: 104101. 

      (4) Hou H, Yang H, Liu P, et al. Profile of Immunoglobulin G N-Glycome in COVID-19 Patients: A Case-Control Study. Front Immunol. 2021 Sep 23;12:748566.

      (4) Further details regarding the N-glycome analysis should be provided, including the quantity of IgG protein used and the methodology employed for analyzing IgG N-glycans (lines 286-287).

      As suggested, we have provided further details regarding the N-glycome analysis in the Method section.

      "Briefly, the diluted plasma samples were transferred onto a 96-well protein G monolithic plate (BIA Separations, Slovenia) for the isolation of IgG. The isolated IgG was eluted with 1 mL of 0.1 M formic acid and was immediately neutralized with 170 µL of 1M ammonium bicarbonate.

      The released N-glycans were labelled with 2-aminobenzamide (2-AB) and were then purified from a mixture of 100% acetonitrile and ultrapure water in a 1:1 ratio (v/v). This was then analyzed by hydrophilic interaction liquid chromatography using ultra-performance liquid chromatography (HILIC-UPLC; Walters Corporation, Milford, MA) (Hou et al., 2019). As previously reported, the chromatograms were separated into 24 IgG glycan peaks (GPs) (Menni et al., 2018)." Please see the Method section (Page 15, lines 346-355).

      (5) Additional statistical analyses should be performed, including multiple comparisons with p-value adjustment, false discovery rate (FDR) control, and Pearson correlation (line 291).

      As suggested, we have performed additional statistical analyses and mentioned the results in the revised manuscript.

      "Positive correlations were observed between the ASM subsets and both galactosylation (p=0.017, r<sub>s</sub>=0.418) and sialylation (p=0.008, r<sub>s</sub>=0.458) in the antibody Fc region, as well as between the PB subsets and sialylation (p=0.036, r<sub>s</sub>=0.372) (Figure 4A-C). (Page 8, lines 180-183)"

      "The Benjamini - Hochberg (BH) method was used to adjust the raw p-values from DEG analysis, controlling the false discovery rate (FDR)." Please see the Materials and Methods (Page 16, lines 369-371).

      (6) Quality control should be conducted prior to the IgG N-glycome analysis. Additionally, both biological and technical replicates are essential to assess the reproducibility and robustness of the methods.

      Thank you for the suggestion. We have added descriptions on the biological and technical replicates in the Method section.

      "Our study incorporated both biological and technical replicates to ensure a robust glycomic profiling analysis. Specifically, we analyzed paired acute/convalescent-phase samples from 65 confirmed HFRS patients to assess inter-individual biological variability, while technical reproducibility was validated through comparison with standard chromatographic peak plots (Vučković et al., 2016). This dual-replicate strategy enabled a comprehensive evaluation of both biological heterogeneity and assay precision." (Page 15, lines 356-362).

      (7) Multiple regression analysis should be conducted to evaluate the influence of genetic and environmental factors on the IgG N-glycome.

      As suggested, we have conducted multiple regression analysis to evaluate the influence of genetic and environmental factors on the IgG N-glycome. These results have been provided in the revised Result section.

      "Multivariate linear regression was employed to mitigate potential confounding by genetic and environmental factors in the glycomics analysis. While no significant associations were observed for most glycan models (fucosylation, p=0.526; bisecting GlcNAc, p=0.069; and sialylation, p=0.058), we discovered sex showed a potentially influential effect on galactosylation (p=0.001) (Supplementary files 5-8). These results suggest that while most glycan features appear unaffected by the examined covariates, galactosylation may be subject to sex-specific biological regulation." (Page 7, lines 153-160).

      (8) Line 196. Additional discussions should be included, focusing on the underlying correlation between the differential expression of B-cell glycogenes and the dysregulated IgG N-glycome profile, as well as the potential molecular mechanisms of IgG N-glycosylation in the development of HFRS.

      Thank you for your suggestions. We have added these contents in the Discussion section.

      "Antibody-related glycogenes are significantly activated following Hantaan virus infection. We noted that ribophorin I and II (RPN1 and RPN2) were significantly upregulated in the ASM/IM/PB/RM subsets after Hantaan virus infection, which linked the high mannose oligosaccharides with asparagine residues found in the Asn-X-Ser/Thr consensus motif (Hwang et al., 2025). We speculate that they continuously attach the synthesized glycan chains to the constant region of antibodies during antibody synthesis. Similarly, fucosyltransferase 8 (FUT8) in the ASM subset, catalyzing the alpha1-2, alpha1-3, and alpha1-4 fucose addition (Wang & Ravetch, 2019; Yang et al., 2015), was downregulated in the mRNA translation, and the levels of fucosylated antibodies were naturally lower in the acute HFRS patients. Meanwhile, the beta-1,4-galactosyltransferase (beta4GalT) gene expression was significantly elevated in the ASM subpopulation during the acute phase, which also correlated with increased levels of galactosylated antibodies in serum (Wang & Ravetch, 2019). However, we did not observe significant upward changes in sialyltransferase mRNA expression in the acute HFRS patients, similar with the finding from severe COVID-19 cohorts (Haslund-Gourley et al., 2024). The neuraminidase 1 (NEU1) gene is strikingly upregulated and may potentially explain the decreased sialylation on the secreted HTNV-specific IgG antibodies during convalescence. Overall, the glycosylation of immunoglobulin G is regulated by a large network of B-cell glycogenes during HTNV infection." Please see the Discussion (Page 11, lines 254-273).

      Reviewer #2 (Public review):

      (1) While it is great to reference prior publications in the Materials and Methods section, the current level of detail is insufficient to clearly understand the study design and experimental procedures performed. Readers should not be expected to consult multiple previous papers to grasp the core methodological aspects of the present paper. For instance, the categorization of HFRS patients into different clinical subtypes/ courses, and the methods for measuring Fc glycosylation should be explicitly described in the Materials and Methods section of this manuscript. 

      Many thanks for your comments. We have added more details regarding the study design and experimental procedures in the Materials and Methods section. "Clinical specimens were collected from HFRS patients who were hospitalized in Baoji Central Hospital between October 2019 and January 2022. Patients were categorized into four clinical subtypes (mild, moderate, severe, and critical) based on the diagnostic criteria for HFRS issued by the Ministry of Health (Ma et al., 2015). This study was approved by the ethics committee of the Shandong First Medical University & Shandong Academy of Medical Sciences (R201937). Written informed consent was obtained from each participant or their guardians.

      The clinical course of HFRS is grouped into acute (febrile, hypotensive, and oliguric stages) and convalescent (diuretic and convalescent stages) phases. The acute phase was defined as within 12 days of illness onset, and the convalescent phase was defined as a period of illness lasting 13 days or longer (Tang et al., 2019; Zhang et al., 2022). The earliest sample was selected if there were multiple blood samples available in the acute phase and the last available sample before discharge was selected if there were multiple blood samples in the convalescent phase.

      Briefly, the diluted plasma samples were transferred onto a 96-well protein G monolithic plate (BIA Separations, Slovenia) for the isolation of IgG. The isolated IgG was eluted with 1 mL of 0.1 M formic acid and was immediately neutralized with 170 µL of 1M ammonium bicarbonate.

      The released N-glycans were labelled with 2-aminobenzamide (2-AB) and were then purified from a mixture of 100% acetonitrile and ultrapure water in a 1:1 ratio (v/v). This was then analyzed by hydrophilic interaction liquid chromatography using ultra-performance liquid chromatography (HILIC-UPLC; Walters Corporation, Milford, MA) (Hou et al., 2019). As previously reported, the chromatograms were separated into 24 IgG glycan peaks (GPs) (Menni et al., 2018)." Please see the Materials and Methods (Page 13, lines 290-303, and Page 15, lines 346-355).

      (2) The authors should explain the nature of their cohort in a bit more detail. While it appears that HFRS cases were identified based on IgM ELISA and/or PCR, these are indicators of the Haantan virus infection. My understanding is that not all Haantan virus infections progress to HFRS. Thus, it is unclear whether all patients in the HFRS group actually had hemorrhagic fever. This distinction is critical for interpreting how the results observed relate to disease severity.

      We are sincerely grateful for this valuable suggestion. We have carefully revised Figure 1 and the texts (Page 5, lines 104-107) in the revised manuscript.

      "To characterize the humoral immune profiles in HFRS patients, we enrolled 166 suspected HTNV-infected patients who were admitted to Baoji Central Hospital in Shaanxi Province, China, between October 2019 and January 2022. Among them, 65 met the inclusion criteria and were included in the study (Figure 1)."

      (3) The authors state that: "A 4-fold or greater increase in HTNV-NP-specific antibody titers usually indicates a protective humoral immune response during the acute phase", but they do not cite any references or provide any context that supports this claim. Given that in their own words, one of the most significant findings in the study is changes in glycosylation coinciding with this 4-fold increase, it is important to ground this claim in evidence. Without this, the use of a 4-fold threshold appears arbitrary and weakens the rationale for using this immune state as a proxy for protective immunity.

      Thank you for the suggestion and we have provided relevant references in the Results section (Page 8, lines 171-173).

      According to the Expert Consensus on Prevention and Treatment of Hemorrhagic  Fever with Renal Syndrome (HFRS) (https://ts-cms.jundaodsj.com/file/163823638693909.pdf), a confirmed diagnosis requires, based on a suspected or clinical diagnosis, one of the following: positive serum-specific IgM antibodies, detection of Hantavirus RNA in patient specimens, a four-fold or greater rise in titer of serum-specific IgG antibodies in the convalescent phase compared to the acute phase, or isolation of Hantavirus from patient specimens. A four-fold or greater rise in titer of convalescent serum-specific IgG antibodies compared to the acute phase not only suggests a recent Hantaan virus infection, but also the production of antibodies helping to combat the viral infection. In addition, the antibody glycosylation modifications may thus play a significant role in the antiviral immune response.

      (4) The authors also claim that changes in Fc glycosylation influence recovery from HFRS - a point even emphasized in the manuscript title. However, this conclusion is not well supported by the data for two main reasons. First, the authors appear to measure bulk IgG Fc glycans, not Fc glycans of Hantaan virus-specific antibodies. While reasonable, this is something that should be communicated in the manuscript. Hantaan virus-specific antibodies are likely a very small fraction of total circulating IgG antibodies (perhaps ~1%), even during acute infection. As a result, changes in bulk Fc glycosylation may (or may not) accurately reflect the glycosylation state of Hantaan virus-specific antibodies. Second, even if the bulk Fc glycan shifts do mirror those of Hantaan virus-specific antibodies, it remains unclear whether these changes causally drive recovery or are merely a consequence of the infection being resolved. Thus, while the differences in Fc glycosylation observed are interesting - and it is tempting to speculate on their functional significance - the manuscript treats the observed correlations as causal mechanistic insight without sufficient data or justification.

      Thank you for your valuable comments. This study measured bulk IgG Fc glycans, not Fc glycans of Hantaan virus-specific antibodies. We have described this limitation in the Discussion section (Page 12, lines 274-280). As reported in previous studies (references provided below), the changed pattern of virus-specific IgG N-glycans may reflect the total IgG N-glycome. Nevertheless, more studies are clearly needed to directly measure virus-specific IgGs and to clarify the causal mechanistic insights.

      References

      (1) Mads Delbo Larsen, Erik L de Graaf, Myrthe E Sonneveld, et al. Afucosylated IgG characterizes enveloped viral responses and correlates with COVID-19 severity. Science. 2021 Feb 26;371(6532): eabc8378.

      (2) Chakraborty S, Gonzalez J, Edwards K, et al. Proinflammatory IgG Fc structures in patients with severe COVID-19. Nat Immunol. 2021 Jan;22(1):67-73.

      (3) Tea Petrović, Amrita Vijay, Frano Vučković, et al. IgG N-glycome changes during the course of severe COVID-19: An observational study. EBioMedicine. 2022 Jul ;81: 104101. 

      (4) Hou H, Yang H, Liu P, et al. Profile of Immunoglobulin G N-Glycome in COVID-19 Patients: A Case-Control Study. Front Immunol. 2021 Sep 23;12: 748566.

      (5) Fc glycosylation is known to be influenced by covariates such as age and sex. While it is helpful that the authors stratified the patients by age group and looked for significant differences in glycosylation across them, a more robust approach would be to directly control for these covariates in the statistical analysis - such as by using a linear mixed effects model, in which disease state (e.g., acute vs. convalescent), age, and sex are treated as fixed effects, and subject ID is included as a random effect to account for repeated measures. This would allow the authors to assess whether observed differences in Fc glycosylation remain significant after accounting for potential confounders. This could be important given that some of the reported differences are quite small, for example, 94.29% vs. 94.89% fucosylation.

      Thank you for your valuable suggestion. As suggested, we have conducted multiple regression analysis to evaluate the influence of genetic and environmental factors on the IgG N-glycome, and have provided these results in the revised Result section.

      "Multivariate linear regression was employed to mitigate potential confounding by genetic and environmental factors in the glycomics analysis. While no significant associations were observed for most glycan models (fucosylation, p=0.526; bisecting GlcNAc, p=0.069; and sialylation, p=0.058), we discovered sex showed a potentially influential effect on galactosylation (p=0.001) (Supplementary files 5-8). These results suggest that while most glycan features appear unaffected by the examined covariates, galactosylation may be subject to sex-specific biological regulation." (Page 7, lines 153-160).

      (6) The manuscript states that there are limited studies on antibody glycosylation in the context of HFRS, but does not cite any relevant literature. If prior work exists, it should be cited to contextualize the current study. If no prior studies have been conducted/reported, to the author's knowledge, that should be stated explicitly to show the novelty of the work.

      Thank you for your suggestion. To our knowledge, there has been no prior reports regarding the regulation of IgG glycosylation in HFRS, particularly in relation to seroconversion. We have reworded this sentence in the revised manuscript. "Importantly, there have not been prior studies specifically examining plasma IgG N-glycome profiles derived from chromatographic peak data in HFRS patients, particularly in relation to seroconversion status. This gap in our knowledge motivated our systematic investigation of both total and virus-specific IgG glycosylation dynamics during acute infection." Please see the Introduction (Page 5, lines 92-96).

      Reviewer #2 (Recommendations for the authors):

      Minor points:

      (1) Line 47, 78: The use of the word 'However' appears to be an incorrect expression.

      We have made this correction.

      (2) Line 127: The term 'glycome' should be replaced with 'N-glycome,' and all relevant expressions should be corrected accordingly, such as 'N-glycosylation.

      We have made this correction.

      (3) Line 84-87: The sentence 'A total of 166 HFRS patients...' contains a grammatical error.

      We have made tis correction (Page 5, lines 99-101).

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1

      Summary:

      Miyamoto et al. report that importin α1 is highly enriched in a subfraction of micronuclei (about 40%), which exhibit defective nuclear envelopes and compromised accessibility of factors essential for the damage response associated with homologous recombination DNA repair. The authors suggest that the unequal localization and abnormal distribution of importin α1 within these micronuclei contribute to the genomic instability observed in cancer.


      Major comments:

      1.) It is crucial to quantitatively assess the localization of importin α1 in micronuclei (MN) across non-transformed MCM10A cells compared to transformed cell lines (MC7, HeLa, and MDA-MB-231). This analysis would help determine whether the localization of importin α1 in MN correlates with genomic stability in human cancer cells

      We appreciate the reviewer's thoughtful suggestion to compare non-transformed and transformed cell lines to evaluate importin α1 localization in MN. Given that HeLa cells are derived from cervical cancer rather than the mammary epithelium, we considered it inappropriate to directly compare them with non-transformed mammary epithelial MCF10A cells. Therefore, HeLa cells were analyzed separately to assess the effects of reversine treatment on importin α1 localization. The results indicated no significant difference between the treated and untreated HeLa cells. (Supplemental Fig. S2F in the revised manuscript). Regarding the comparison between MCF10A and the two cancer cell lines, MCF7 and MDA-MB-231, the proportion of importin α1-positive MN did not significantly differ across the cell lines, regardless of reversine treatment (Supplemental Fig. S3B, Untreated: p = 0.9850 and 0.5533; Reversine: p = 0.2218 and 0.9392). These results suggest that there is no clear difference in the localization of importin α1 in MN between the transformed and non-transformed cell lines tested. However, we acknowledge that this does not exclude the possibility that importin α1 localization to MN is linked to genomic instability under specific conditions.

      2.) While the authors provide some evidence indicating partial disruption of nuclear envelopes in MN (Figures 3 and S4), it is noteworthy that this phenomenon also occurs in importin α1-negative MN. Furthermore, according to the figure legends, the data presented in both figures stem from a single experiment. Current literature suggests that compromised nuclear envelope integrity is one of the major contributors to genomic instability, mediated through mechanisms such as chromothripsis and cGAS-STING-mediated inflammation arising from MN. Therefore, a more comprehensive quantification of nuclear envelope integrity-ideally comparing non-transformed MCM10A cells with transformed cell lines (MC7, HeLa, and MDA-MB-231)-is necessary to substantiate the connection between aberrant importin α1 behavior in MN and chromothripsis processes, as well as regulation of the cGAS-STING pathway linked to genomic instability in cancer cells.

      We thank the reviewer for the constructive suggestion to quantify nuclear envelope integrity more comprehensively. In response, we compared laminB1 localization at the MN membrane between importin α1-positive and -negative MN in MCF10A, MCF7, MDA-MB-231, and HeLa cells, and included these results in the revised manuscript (Fig. 4C). For each cell, the laminB1 intensity in the MN was normalized to that of the primary nucleus (PN). This analysis showed that laminB1 intensity was significantly lower in importin α1-positive MN across all cell lines, including non-transformed MCF10A cells. These findings support a close association between aberrant importin α1 accumulation and compromised nuclear envelope integrity, a key factor potentially linking MN to chromothripsis and cGAS-STING-mediated genomic instability.

      3.) The schematic illustration presented in Figure 8 does not adequately summarize all findings from this study nor does it clarify how the localization of importin α1 within MN might hypothetically influence genome stability. Although it is reasonable to propose that "importin α can serve as a molecular marker for characterizing the dynamics of MN" (Line 344), the authors assert (Line 325) that their findings, along with others, have "potential implications for the induction of chromothripsis processes and regulation of the cGAS-STING pathway in cancer cells." However, they fail to provide a clear or even hypothetical explanation regarding how their findings contribute to these molecular events. To address this gap, it would be essential for them to contextualize their results within existing literature that explores and links structural integrity deficits or aberrant DNA replication/damage responses in MN with chromothripsis and inflammation (e.g., PMID: 32601372; PMID: 32494070; PMID: 27918550; PMID: 28738408; PMID: 28759889).

      We agree that the previous schematic illustration (former Fig. 8) did not adequately summarize our findings and may have overstated our conclusions. Accordingly, we have removed this figure from the revised manuscript.

      To address the reviewer's concern, we performed additional analyses and included the results in the new Figure 8. These data show that, in addition to RAD51, both RPA2 and cGAS display mutually exclusive localization with importin α1 in MN. RPA2, a single-stranded DNA-binding protein, stabilizes damaged DNA and enables RAD51 filament assembly during homologous recombination repair. Previous studies have demonstrated that RPA2 accumulates in ruptured MN in a CHMP4B-dependent manner (PMID: 32601372). Likewise, cGAS is a cytosolic DNA sensor that localizes to ruptured MN and activates innate immune signaling through the cGAS-STING pathway, as widely reported (PMID: 28738408; 28759889; see also PMID: 32494070; 27918550).

      Our findings suggest an alternative scenario: even when nuclear envelope rupture occurs, importin α1-positive MN may remain inaccessible to DNA repair and sensing factors such as RPA2 and cGAS. This supports the view that importin α1 defines a distinct MN subset, separate from those characterized by the canonical DNA damage response or innate immune signaling factors. Furthermore, our overexpression experiments with EGFP-importin α1 (Fig. 7G, 7H) raises the possibility that importin α1 enrichment may impede the recruitment of DNA-binding proteins.

      Taken together, these results support the conclusion that importin α1 marks a unique MN state and provides a molecular framework for distinguishing between different MN environments. At the reviewer's suggestion, we have cited all the recommended references (PMID: 32601372, 32494070, 27918550, 28738408, and 28759889) in the revised manuscript to better contextualize our findings. We are grateful for the reviewer's thoughtful suggestions and literature recommendations, which helped us clarify the implications of our findings within the broader context of chromothripsis and cGAS-STING-mediated genomic instability.

      4.) Fig. 4D does not support the idea that importin α1 is euchromatin enriched: H3K9me3, H3K4me3 and H3K37me3 seem to be all deeply blue.

      We sincerely thank the reviewer for pointing out the important limitations of the original version of Fig. 4D, as also raised in minor comment #5. As the reviewer correctly noted, this figure was intended to demonstrate that importin-α1 preferentially localizes to euchromatin regions (H3K4me3 and H3K36me3) rather than heterochromatin (H3K9me3 and H3K27me3). However, we acknowledge that in the original figure, the predominantly blue tone of the heatmap made this interpretation unclear and that the Spearman's correlation coefficient for H3K36me3 was missing. In response, we have substantially revised the figure (now shown as Fig. 5E in the revised manuscript). Specifically, we improved the color scale for better visual distinction, added the missing Spearman's coefficients for H3K36me3, and strengthened the analysis by incorporating ChIP-seq data obtained with two independent antibodies against importin α1 (Ab1 and Ab2). We believe that these revisions provide a clear and more accurate representation of euchromatin enrichment of importin-α1, as originally intended.

      Indeed, the data presented by the authors do not adequately support a direct link between the presence of importin α1 in MN and genomic instability in human cancer cells. While the experimental correlations provided may not substantiate this connection definitively, they do lay a foundation for a grounded hypothesis and suggest the need for further research to explore this topic in greater depth. Additionally, it is worth noting that the evidence contributes to the growing list of nuclear proteins exhibiting abnormal behavior in micronuclei (MN). This highlights the significance of studying such proteins to understand their roles in genomic stability and cancer progression.

      Following the reviewer's suggestion, we carefully revised the manuscript to ensure that our statements are consistent with the scope of the data and do not overstate our conclusions. As part of this effort, we removed the schematic illustration (former Fig. 8), which might have overstated our findings, and refined the relevant text to prevent overinterpretation.

      To our knowledge, this study is the first to report the specific accumulation of importin α in MN. Our results suggest a previously unrecognized function of importin α beyond its canonical transport role and add to the growing list of nuclear proteins that exhibit abnormal behavior in MN. We hope that these findings will provide a conceptual and experimental basis for future studies aimed at clarifying the biological significance of MN heterogeneity and quality control in cancer biology.


      Additional experiments are necessary to quantitatively assess the localization of importin α1 in micronuclei (MN) across non-transformed MCM10A cells and transformed cell lines (MC7, HeLa, MDA-MB-231). This analysis would help determine whether the localization of importin α1 in MN correlates with genomic stability in human cancer cells.

      As part of our response to Major Comment 1, we conducted additional experiments to quantitatively compare importin α1 localization in MN between non-transformed MCF10A cells, breast cancer cell lines (MCF7 and MDA-MB-231), and HeLa cells. These results have been included in the revised manuscript (Supplemental Fig. S2F and Fig. S3B). The analyses showed no significant differences in the proportion of importin α1-positive MN among these cell lines, consistent with the reviewer's request for a more comprehensive evaluation.

      The authors claim that importin α1 preferentially localizes to euchromatic areas rather than heterochromatic regions within MN. While this assertion is supported by the immunofluorescence (IF) images presented in Figures 4A/B and S5A/B, it remains less clear for Figure S5C/B. To strengthen this claim, providing averages of IF distributions from multiple cells across independent experiments would be beneficial to draw more robust conclusions.

      We have quantified the co-localization of importin α1 with the euchromatin marker H3K4me3 and the heterochromatin marker H3K9me3 in micronuclei (MN) across four human cell lines (MCF10A, MCF7, MDA-MB-231, and HeLa). The results of this statistical analysis are included in the revised manuscript in Fig. 5C. These data provide quantitative evidence from independent experiments showing that importin α1 preferentially localizes to euchromatic regions within the MN, thereby supporting our initial observation.

      Furthermore, ChIP-seq data are presented to support the idea that importin α1 preferentially distributes over euchromatin areas in MN. However, as described, the epigenetic chromatin status indicated by these ChIP-seq experiments reflects that of the principal nucleus (PN), not specifically the status within MN in MCF7 cells. Given that MN represent only a small fraction of the cell population under normal culture conditions-likely less than 5% for HeLa cells as shown in Figure S2D-the relevance of this data is limited. Additionally, according to data presented in Figure 1B, importin α1 does not localize or distribute within the PN as it does in MN in MCF7 cells. Therefore, further experiments should be conducted to substantiate that importin α1 preferentially targets euchromatin areas within MN and to compare this distribution with that observed in the principal nucleus. Such studies could reveal potential abnormalities regarding the correlation between epigenetic chromatin status and importin α distribution in MN.

      As noted, these experiments were performed on whole-cell populations of MCF7 cells and therefore reflect the overall chromatin landscape, not specifically that of the MN. We fully acknowledge that MN constitute only a small fraction of the cell population under standard culture conditions (Supplemental Fig. S2D), and thus, the relevance of ChIP-seq data to MN must be interpreted with caution.

      Nevertheless, our intention in presenting these data was to illustrate that importin α1 preferentially associates with euchromatin regions marked by H3K4me3. To examine this more directly, we analyzed importin α1 localization in MN using immunofluorescence with histone modification markers across multiple cell lines. These analyses, together with the quantitative results now included in the revised manuscript (Fig. 5C), confirming that importin α1 preferentially localizes to euchromatic regions within MN.

      Taken together, although the ChIP-seq data were derived from whole-cell populations, the combined results from IF imaging and quantitative analysis support our interpretation that importin α1 retains its euchromatin-associating property within MN. We hope that these additional data will address the reviewer's concerns.

      To support the hypothesis that importin α1 inhibits RAD51 accessibility within MN, Figures 7D and E should be supplemented with thorough quantification and statistical analysis based on at least three independent experiments. This additional data would enhance confidence in their findings regarding RAD51 accessibility inhibition by importin α1.

      Following the reviewer's suggestion, we have added a new graph (Fig. 7F) in the revised manuscript. This figure presents the quantified frequency of RAD51-positive MN among importin α1-negative and importin α1-positive MN, analyzed across six microscopy fields (n = 6) from three independent experiments.

      To improve clarity and consistency, we reorganized the panels: representative RAD51 images are now shown in Fig. 7B, and the Cell #1 (low RAD51) vs. Cell #2 (high RAD51) classification with etoposide responsiveness is summarized in Fig. 7C. As illustrated in Figs. 7D and 7E, importin α1 and RAD51 exhibit mutually exclusive localization in MN. Fig. 7F provides a unified statistical summary at the population level.

      The results showed that the proportion of RAD51-positive MN was significantly lower among importin α1-positive MN than among importin α1-negative MN, providing robust quantitative support for the proposed mutual exclusivity between importin α1 localization and RAD51 accessibility in MN.

      We are grateful to the reviewer for this constructive suggestion, which helped us clarify and better support the central message of our study.


      The additional experiments proposed are controls and direct comparisons using the same techniques and experimental designs used by the authors, so it is reasonable that the authors can carry them out within a realistic timeframe.

      We appreciate the reviewer's thoughtful consideration of the feasibility of the additional experiments.

      Given the importance of reproducibility and the need to evaluate results based on imaging and quantitation, I strongly recommend that the authors include a detailed description of the optical microscopy procedures utilized in their study. This should encompass imaging conditions, acquisition settings, and the specific equipment used. Providing this information will enhance transparency and facilitate reproducibility. For reference, some valuable guidance on essential parameters for reproducibility can be found in Heddleston et al. (2021) (doi:10.1242/jcs.254144). Incorporating these details will not only strengthen the manuscript but also support other researchers in reproducing the findings accurately.

      Following the reviewer's suggestion, we have substantially revised the Materials and Methods sections in the main and supplemental manuscripts to provide detailed descriptions of the optical microscopy procedures, including the specifications of the imaging equipment, acquisition settings, and image processing parameters. These revisions follow the best practices recommended by Heddleston et al. (2021, J. Cell Sci., doi:10.1242/jcs.254144).

      We have also expanded the description of our quantitative image analysis using ImageJ, providing details on the parameters for MN identification and the measurement of colocalization rates between importin α and histone modifications. These additions ensured reproducibility and clarity.

      We believe that these modifications will enhance the reproducibility of our results and increase the value of our study for the research community. We sincerely appreciate the reviewer's helpful suggestions.


      Many of the plots and values in the manuscript lack appropriate statistical analysis, including p-values, which are not detailed in the figures or their legends. Furthermore, the Statistical Analysis section does not provide adequate information regarding the specific statistical tests employed or the criteria used to determine which analyses were applied in each case. To enhance the rigor and clarity of the study, it is essential that these issues be addressed prior to publication. A comprehensive presentation of statistical analysis will improve the reliability of the findings and allow readers to better understand the significance of the results. I recommend that the authors revise this section to include detailed explanations of all statistical methods used, along with corresponding p-values for all relevant comparisons.

      We sincerely appreciate the reviewer's constructive comments highlighting the importance of transparent and rigorous statistical analyses. In response, we have carefully revised all figure panels, figure legends, and the Materials and Methods (Statistical Analysis) section in both the main and the supplementary manuscripts.

      In the revised figure legends, we now provide the number of independent experiments and sample sizes (n), statistical tests applied (e.g., unpaired or paired two-tailed t-test, one-way ANOVA with Tukey's post-hoc test, two-way ANOVA with Sidak's multiple comparisons), data presentation format (mean {plus minus} SD), and corresponding p-values or significance indicators (*, **, ***). The Statistical Analysis section was also expanded to explain the rationale for selecting each statistical test, the criteria for significance, and the reporting of the replicates. These revisions ensure clarity, reproducibility, and transparency throughout the manuscript, directly addressing the reviewers' concerns. We are grateful for this valuable suggestion, which has significantly improved the rigor of our study.

      Minor comments:

      The authors claim that importin α1 exhibits remarkably low mobility in the micronuclei (MN) compared to its mobility in the principal nucleus (PN), as illustrated in Figure 1. However, based on the experimental design, this conclusion may not be appropriate. In the current setup, the FRAP experiment conducted in the PN measures the mobility of importin α1 molecules within the cell nucleus, where the influence of nuclear transport is likely negligible. Conversely, in the MN experiments shown, all molecules of importin α1 are bleached within a given MN. Consequently, what is being measured here primarily reflects the effects of nuclear transport rather than intrinsic molecular mobility. To accurately compare kinetics of nuclear transport, it would be essential to completely bleach the entire PN. If measuring molecular mobility between MN and PN is desired, only a small fraction of either MN or PN area/volume should be bleached during FRAP analysis. Additionally, it would be beneficial to include measurements of mobility for other canonical nuclear transport factors (e.g., RAN, CAS, RCC1) for comparative purposes. This broader context would allow for a more comprehensive understanding of importin α1 behavior relative to other factors involved in nuclear transport. Finally, utilizing cells that exhibit importin α1 signals in both PN and MN could further strengthen comparisons and provide more robust conclusions regarding its mobility dynamics.

      We thank the reviewer for their constructive suggestions regarding our FRAP analysis. To address the concern that the original comparison between PN and the micronuclei (MN) might have been biased by differences in bleaching areas, we performed new experiments in which both PN and MN were fully bleached within the same cells (Fig. 3A, and 3C). This approach allowed for a more direct comparison of importin α1 dynamics under equivalent conditions.

      These experiments revealed a markedly slower fluorescence recovery in MN than in PN, indicating reduced nuclear import and/or recycling efficiency of importin α1 in MN. In addition, we retained our original analysis to further characterize the heterogeneous mobility patterns of importin α1 in MN, identifying three distinct mobility classes: high, intermediate, and low (Fig. 3B, and 3D). Together, these results support our observation that importin α1 mobility is restricted in MN, likely due to altered nuclear transport dynamics.

      As suggested by the reviewer, we attempted partial bleaching of MN to assess intranuclear mobility. However, owing to the small size of MN, partial bleaching is technically challenging and inconsistent, with some MN recovering even during the bleaching process. Therefore, reliable quantification was not possible. For transparency, these data are provided as a Reviewer-only Figure but were not included in the revised manuscript.

      Finally, while we agree that examining other nuclear transport factors (e.g., RAN, CAS, RCC1) would be informative, our study focused on importin α1 dynamics. We consider these additional factors to be important directions for future investigations.


      Prior studies are referenced appropriately in general, but the authors missed some references (PMID: 32601372; PMID: 32494070; PMID: 27918550; PMID: 28738408; PMID: 28759889) that I consider key to put the present findings in frame with previous works which link the lack of structural integrity and/or aberrant DNA replication/damage responses in MN with Cchromothripsis and inflammation.

      We thank the reviewer for carefully pointing out the key references that are highly relevant to framing our findings in the context of previous studies on micronuclear instability, chromothripsis and inflammation. We fully agree with this suggestion.

      In the revised manuscript, we have cited these studies in both the Introduction and Discussion sections. Specifically, we incorporated these studies when discussing the structural fragility of MN, aberrant DNA replication, and the exposure of micronuclear DNA to cytoplasmic sensors, which mechanistically link MN rupture to chromothripsis and cGAS-STING-mediated immune activation. For example, we now refer to the study demonstrating RPA2 recruitment to ruptured MN in a CHMP4B-dependent manner (PMID: 32601372), reports showing defective replication and DNA damage responses in MN (PMID: 32494070; 27918550), and seminal studies establishing cGAS localization to ruptured MN and activation of innate immune signaling (PMID: 28738408; 28759889).

      By incorporating these references, we more clearly position our findings that importin α1 defines a distinct subset of MN lacking access to DNA repair and sensing factors such as RAD51, RPA2, and cGAS. This contextualization emphasizes that our data add to and extend the established view that compromised MN integrity underlies chromothripsis and inflammation by identifying importin α1 as a novel marker of an alternative MN microenvironment. We are grateful for this constructive recommendation, which has allowed us to strengthen the framing of our study in the existing literature.


      The figures presented in the manuscript are clear; however, where plots are included, they require appropriate statistical analysis. It is essential to display p-values on the plots or within their legends to provide readers with information regarding the significance of the results. Including this statistical information will enhance the interpretability of the data and strengthen the overall findings of the study. I recommend that the authors revise these sections accordingly before publication.

      In response, we have revised the relevant figure panels and their legends to clearly display the statistical significance, including p-values, where appropriate. Specifically, we added statistical annotations (p-values or significance markers such as asterisks) directly on the plots or in the corresponding legends, and clarified the number of replicates, statistical tests used, and definitions of error bars (mean {plus minus} SD). We believe that these revisions improve the interpretability and transparency of our results and strengthen the overall presentation of the data.

      __ 1.) In lines 134-135, it is stated that "up to 40% of the MN showed importin α1 accumulation under both standard culture conditions and the reversine treatment (Fig. S2F)." However, Figure S2F only displays percentages for reversine-treated cells, and there is no mention in the text or figures regarding the percentage of importin α1-positive MN determined by immunofluorescence (IF) under standard culture conditions. This discrepancy should be addressed.__

      Following the reviewer's comments, we revised Supplemental Fig. S2F shows a direct comparison of the proportion of importin α1-positive MN between untreated and reversine-treated HeLa cells based on indirect IF analysis. The Results section was updated accordingly (page 8, Lines 148-150): "We then examined whether reversine treatment affected the proportion of importin α1-positive MN. The results revealed that the MN formation rate for either untreated or treated cells was 36.2% {plus minus} 7.8 or 38.3% {plus minus} 8.8, respectively, with no significant difference (Fig. S2F). "

      We believe that this revision addresses the reviewer's concern by providing relevant quantitative data for the untreated condition.

      2.) In line 170, the authors state that "Cells in which overexpressed EGFP-importin α1 localized only in PN were excluded from the analysis (see Fig. 1E, top panels)." It is unclear why this exclusion was made. The authors should clarify whether they are referring to all constructs or only to the wild-type (WT) construct when mentioning EGFP-importin α1 localization solely in PN. This clarification is important as it may affect the results highlighted in line 173.

      In this section, we aimed to clarify that the quantitative analysis focused exclusively on cells harboring MN, as the purpose of the analysis was to compare the localization of EGFP-importin α1 between MN and PN. We excluded cells that contained no MN and showed EGFP-importin α1 localization only in the PN. This criterion was consistently applied to both wild-type and mutant constructs. To avoid confusion, we have removed the sentence "Cells in which overexpressed EGFP-importin α1 localized only in PN were excluded from the analysis (see Fig. 1E, top panels)." from the revised manuscript.

      3.) The statement in line 191 ("However, this antibody could not be further used in this context due to cross-reactivity with highly concentrated importin α1 in MN (Fig. S4)") is somewhat misleading. While it hints at a technical issue, it does not provide additional relevant information for understanding its implications for the rationale of the research. Moreover, Figure S4 is referenced but appears to refer specifically to panels S4D and E, which are not mentioned in the text. I recommend clarifying this point or removing it altogether.

      We agree with the reviewer that the statement "However, this antibody could not be further used in this context due to cross-reactivity with highly concentrated importin α1 in MN (Fig. S4)" was not essential for understanding the rationale of our study and could be misleading. In response, we have removed this sentence from the revised manuscript, along with the corresponding Supplementary Fig. S4.

      4.) Lines 197-199 contain a sentence that could be misleading and would benefit from clearer explanation. Although Figure 3D provides some clarity on this matter, no statistical analysis is included-only a bar plot is presented. A proper statistical analysis should be provided here to enhance understanding.

      In the revised manuscript, we performed one-way ANOVA followed by Holm-Sidak's multiple comparisons test to evaluate the MN localization ratio of EGFP-NES between Imp-α1-negative and Imp-α1-positive MN. This analysis revealed a statistically significant difference (**p

      5.) In lines 218-221, it states that importin α1 associates with euchromatin regions characterized by H3K4me3 and H3K36me3; however, Figure 4D lacks the Spearman's correlation coefficient value for H3K36me3 within the matrix. This omission needs correction.

      We thank the reviewer for this insightful comment. As addressed in response to Major comment #4, we have substantially revised Fig. 5 and added the missing Spearman's correlation coefficient value for H3K36me3 (now shown in Fig. 5E). These revisions, together with the overall improvements to the figure, more clearly illustrate the euchromatin enrichment of importin-α1.

      6.) For consistency in the experimental design aimed at identifying potential importin α1-interacting proteins, it would be more appropriate for Figures 5C/D to show IF data from MCF7 cells rather than HeLa cells.

      We sincerely apologize for the misstatements in the legends of the original Fig. 5C. The correct description is that this experiment was performed using MCF7 cells, and we have revised the legend accordingly in the revised manuscript (now Fig. 6C). In addition, because the original data in Fig. 5D were obtained from HeLa cells, we repeated this experiment using MCF7 cells and replaced the panel with new data (now Fig. 6D).

      7.) To substantiate claims that importin α1 inhibits RAD51 accessibility within MN, Figures 7D and E should include thorough quantitation and statistical analysis based on at least three independent experiments.

      As described above, we addressed this point by adding a new quantification and statistical analysis in Fig. 7F, based on six microscopy fields across three independent experiments. This analysis directly supports our claim that importin α1 inhibits RAD51 accessibility in the MN.

      We would also like to clarify that although the reviewer referred to Figs 7D and 7E, these two panels were designed to illustrate the same phenomenon-the mutually exclusive localization of importin α1 and RAD51 to distinct MN-shown in different contexts. Specifically, Fig. 7D presents examples from separate cells, each with MN containing either importin α1 or RAD51, while Fig. 7E shows a single cell containing two distinct MN, one enriched with importin α1 and the other with RAD51. Because both panels serve as illustrative examples of the same phenomenon, it would not be meaningful to quantify them independently as parallel datasets. Instead, we integrated the statistical analysis into a unified graph (Fig. 7F), which summarizes the frequency of RAD51-positive MN in relation to importin α1 status across the cell population, thereby supporting our interpretation that importin α1-positive MN represent a distinct subset that is less accessible to RAD51.

      8.) The meaning of lines 336-338-"Therefore, the enrichment of importin α1 in MN, along with its interaction with chromatin, may regulate the accessibility of RAD51 to DNA/chromatin fibers in MN and protect its activity"-is unclear. I suggest rephrasing this sentence for improved clarity and comprehension.

      We appreciate the reviewer's comment regarding the clarity of our statement in the Discussion (former lines 336-338). We agree that the original phrasing is ambiguous. To improve clarity and align with our results, we revised this section to emphasize that importin α1-positive MN represent a restricted environment from which DNA repair and sensing factors are excluded. Specifically, RAD51, RPA2, and cGAS showed mutually exclusive localization with importin α1, indicating that these MN are largely inaccessible to DNA-binding proteins (pages 20-21). This rephrasing removes the unclear phrase "protect its activity" and directly reflects our experimental findings, presenting a clearer interpretation that is consistent with the Results.

      9.) Fig. 1D: Numbers on the y-axis are missing, x-axis labeling is too small

      We appreciate the reviewer's careful examination of the figure. In the revised manuscript, we added numerical tick labels to both the x- and y-axes and increased the label font size to ensure clear readability, as shown in Fig. 1D. We also applied the same improvements to other fluorescence intensity plots, including Figs. 4A, 4B, 5A, 5B, 7H, and Supplemental Fig. S4C and S5A-S5F to ensure consistency in readability across the manuscript. We thank the reviewer for helping us improve the clarity and accuracy of our figure presentations.

      10.) Fig. 1F: As the PN/MN values of the three experiments are seemingly identical (third column) the distribution of the three individual data of the PN (first column) should mirror the distribution of the three individual data of the MN (second column). The authors might want to check why this is not the case.

      Upon re-examination of the source data, we identified and corrected a minor calculation error in one subset and regenerated the panel. After correction, the three independent PN/MN ratios were 3.1%, 2.9%, and 2.6%, rather than being identical. These corrected values were proportional to the corresponding PN and MN measurements and preserved the expected relationship between their distributions. Although the numerical differences were small, they demonstrated high reproducibility across independent experiments. These corrections do not alter the interpretation of Fig. 1F, and the distribution of PN/MN values is now consistent with the paired PN and MN data presented in the revised manuscript.

      Significance Micronuclei (MN) primarily arise from defects in mitotic progression and chromatin segregation, often associated with chromatin bridges and/or lagging chromosomes. MN frequently exhibit DNA replication defects and possess a rupture-prone nuclear envelope, which has been linked to genomic instability. The nuclear envelope of MN is notably deficient in crucial factors such as lamin B and nuclear pore complexes (NPCs). This deficiency may be attributed to the influence of microtubules and the gradient of Aurora B activity at the mitotic midzone, which inhibits the recruitment of proper nuclear envelope components. Additionally, several other factors may contribute to this process: for instance, PLK1 controls the assembly of NPC components onto lagging chromosomes; chromosome size and gene density positively correlate with the membrane stability of MN; and abnormal accumulation of the ESCRT complex on MN exacerbates DNA damage within these structures, triggering pro-inflammatory pathways.

      The work presented by Dr. Miyamoto and colleagues reveals the abnormal behavior of importin α1 in MN during interphase. According to their findings, it is reasonable to consider importin α1 as a molecular marker for characterizing MN dynamics. Furthermore, it could serve as a potential clinical marker if the authors provide additional experiments demonstrating significantly different localization patterns of importin α1 in transformed cells (e.g., MC7, HeLa, MDA-MB-231) compared to non-transformed cells (e.g., MCM10A).

      While the authors present some evidence indicating partial disruption of nuclear envelopes in MN (Figures 3 and S4), it is noteworthy that this phenomenon also occurs in importin α1-negative MN. Moreover, according to the figure legends, data for both figures originate from a single experiment. As such, convincing evidence linking the aberrant behavior of importin α1 in MN with chromothripsis processes or regulation of the cGAS-STING pathway-and its implications for genomic instability in cancer cells-remains lacking.

      Overall, it is not entirely clear what significance this advance holds for the field; while there are conceptual contributions made by this work, they do not appear sufficiently robust at this time. Further research is needed to clarify these connections and strengthen their conclusions regarding importin α1's role in MN dynamics and genomic instability.

      We sincerely appreciate the reviewer's thoughtful and constructive evaluation of the significance of our study. We agree that in the original submission, the conceptual contribution was not fully supported by sufficient evidence. In the revised manuscript, we have substantially strengthened our findings by incorporating new data on RPA2 and cGAS, in addition to RAD51. These results consistently show that importin α1-positive MN are largely inaccessible to multiple DNA-recognizing proteins-including DNA repair factors (RAD51 and RPA2) and the innate immune sensor cGAS-whereas importin α1-negative MN readily recruit these proteins. This broader dataset reinforces the concept that importin α defines a distinct and restricted MN subset, extending beyond our initial observation of RAD51 exclusion.

      By framing importin α as a molecular marker that discriminates between functionally distinct MN environments, our study conceptually advances the understanding of MN heterogeneity. This adds to the prior literature showing that defective nuclear envelope integrity underlies chromothripsis and cGAS-STING activation and positions importin α as a new marker for identifying MN that are refractory to these DNA repair and sensing pathways. While we agree that further work is necessary to directly link importin α enrichment to downstream genomic instability or inflammation in cancer, we believe that our revised data now provide a robust foundation for future investigations.

      Taken together, the revised manuscript presents a clearer and more comprehensive conceptual advance: importin α-positive MN represents a previously unrecognized molecular environment distinct from MN characterized by canonical DNA repair or sensing factors. We are grateful to the reviewer, whose constructive comments greatly improved the clarity, robustness, and overall impact of our study. We believe that these findings will be of particular interest to researchers studying the mechanisms of genomic instability, chromothripsis, and cancer biology.


      Reviewer #2

      Summary:

      The authors have shown that Importin α1, a nuclear transport factor, is enriched in subsets of micronuclei (MN) of cancer cells (MCF7 and HeLa) and, using FRAP, has an altered dynamics in MN. Moreover, the authors have shown that these levels of Importin α1 in the MN are likely not due to its traditional role for signal-dependent protein transport, as suggested by immunofluorescence of other factors important for this function. Additionally, cargo dynamics carrying NLS or NES signals were disrupted in Importin α1-positive micronuclei. Importin α1-positive micronuclei also appear to have a disrupted nuclear envelope, potentially explaining some of these cargo disruptions. The authors also demonstrated that Importin α colocalizes with proteins important for DNA replication, and p53 signaling using RIME, followed by immunofluorescence. Lastly, the authors show that Importin α and RAD51 have mutual exclusivity in the micronuclei.

      Major comments:

      1) A key issue is there are very few statistical tests used in this study. It is crucial to the interpretation of the data. We strongly urge the authors to re-analyze the data using appropriate statistical analyses. Along those lines, in many figures 1 or 2 images are shown without stating how many biological or technical replicates this is representative of or showing quantification of the anlyses. In general, the authors' statements would be strengthened by showing more examples and/or stating "N" in the figure legends or supplement.

      We sincerely thank the reviewer for emphasizing the importance of including sufficient statistical analyses and replication information. As noted in our response to Reviewer #1, we have carefully revised the manuscript to enhance statistical rigor and transparency throughout. Specifically, we expanded the Statistical Analysis section in the Materials and Methods section to provide a clear description of the statistical approaches used. In addition, all figure legends have been revised to explicitly state the number of biological replicates, sample sizes, statistical tests applied, and corresponding p-values or significance indicators. Representative images are consistently accompanied by quantitative analyses derived from multiple independent experiments.

      We believe that these comprehensive revisions directly address the reviewer's concerns and substantially improve the rigor, clarity, and interpretability of our manuscript.

      2) Using RIME and immunofluorescence, the authors identify factors that co-localize with Importin α1 in subsets of micronuclei (Figure 5), which is interesting, but there is no functional data associated with this result. Are the authors stating that these differences account for altered DNA damage or replication? It is unclear what the conclusion is beyond "some MN are different than others." Could the authors knockdown/knockout these factors to determine if they recruit Importin α1 into MN or the reciprocal? For many of these factors, they appear to be broadly present throughout the entire primary nucleus as well, indicating there is nothing unique about their MN localization.

      We agree that our original RIME and indirect IF analyses were primarily descriptive and lacked functional validation. To strengthen this aspect, we added new IF and quantification data (now presented in Fig. 8) showing that importin α1-positive MN are largely mutually exclusive with DNA repair and sensing factors such as RAD51, RPA2, and cGAS, whereas importin α1 frequently co-localizes with chromatin regulators identified by RIME, such as PARP1 and SUPT16H/FACT. These findings indicate that importin α1-positive MN define a distinct molecular environment enriched in replication- and chromatin-associated regulators but inaccessible to canonical DNA repair and sensing proteins.

      This combination of mutual exclusivity with DNA repair/sensing factors and frequent co-localization with chromatin regulators underscores the biological significance of importin α1 localization in MN, as it may contribute to localized chromatin stabilization through association with chromatin regulators while simultaneously restricting access to DNA repair and sensing factors. Thus, importin α1-positive MN represent a restricted subset with potential implications for genome stability and immune signaling, going beyond the descriptive notion that "some MN are different than others."

      Moreover, many chromatin regulators identified by RIME contain classical nuclear localization signals (NLSs), raising the possibility that importin α1 interacts with these proteins via their NLS sequences. We fully agree with the reviewer that knockdown or knockout experiments would be highly valuable to clarify whether such interactions actively recruit importin α1 into MN or occur reciprocally, and we regard this as an important direction for future investigations.

      3) In line 274, the authors state that MN highly enriched for Importin α1 inhibits RAD51 accessibility but this is an overstatement of the data. Instead, the authors show that RAD51 and importin α1 do not colocalize in micronuclei, albeit without quantification which weakens their argument. Also, the consequence of this "mutual exclusivity" is unclear. Can the authors inhibit or knockdown Importin α1 and show that RAD51 goes to all micronuclei? And how is this different than the data shown for factors in Figure 5? Some of those show colocalization with Importin α1-positive micronuclei and others do not. Could you perform live imaging of labeled Importin a1 and RAD51 and show that as Importin α1 accumulates in MN that RAD51 or other DNA repair factors are exported? An alternative experiment would be to show that the C-mutant, which is defective in nuclear export, now colocalizes with RAD51 in MN. Please reconcile this or show experiments to prove the statement above.

      We agree that our original wording "inhibits RAD51 accessibility" was not sufficiently supported by direct evidence, as it was based solely on the immunofluorescence data. Therefore, we have removed this statement from the Results section of the revised manuscript. To strengthen this point, we added a quantitative analysis (Fig. 7F) showing that RAD51 signals were significantly reduced in importin α1-enriched MN.

      Regarding the suggestion to perform knockdown experiments, we note that the depletion of KPNA2 (gene name of importin α1) has been reported to cause severe cell-cycle arrest (Martinez-Olivera et al, 2018; Wang et al, 2012). Consistent with these reports, we also found that siRNA-mediated knockdown of KPNA2 in our system strongly reduced MN induction upon reversine treatment, making it technically unfeasible to analyze RAD51 localization under these conditions. We also sincerely thank the reviewer for suggesting the live imaging experiments. We fully agree that such experiments would provide valuable mechanistic insights, and we regard this as an important direction for future research.

      In addition, to address the reviewer's concern about other DNA repair factors, we added new data (Fig. 8) showing that importin α1-positive MN are mutually exclusive with RPA2 and cGAS. RPA2 is a canonical single-strand DNA (ssDNA)-binding protein that stabilizes exposed ssDNA and facilitates RAD51 recruitment. It has been reported to accumulate in ruptured MN in a CHMP4B-dependent manner (Vietri et al, 2020). cGAS is a cytosolic DNA sensor that detects ruptured MN and activates innate immune signaling via the cGAS-STING pathway. Together with our RAD51 results, these data show that importin α1-positive MN are consistently segregated from multiple DNA-recognizing factors, including RAD51. Simultaneously, importin α1 co-localizes with chromatin regulators identified by RIME, such as PARP1 and SUPT16H/FACT. These findings support the view that importin α1-positive MN define a distinct molecular environment enriched in chromatin regulators but largely inaccessible to DNA repair and sensing factors. While the precise mechanism remains unclear, one possibility is that importin α1-associated chromatin interactions limit the access of DNA repair and sensing proteins. However, this interpretation is speculative and requires further investigation.

      4) In the Discussion, line 343-344 states that "importin α1 is uniquely distributed and alters the nuclear/chromatin status when enriched in MN," however this is not currently supported by the present data. The data presented shows correlation (albeit weak) between euchromatic modifications and Importin α1, and it does not definitively show that importin α1 is sufficient to alter the nuclear-chromatin status when enriched in the MN. More substantial experiments would be required to show whether Importin α1 plays an active role in these modifications.

      Following the reviewer's suggestion, in the revised manuscript, we removed this overstatement and rephrased the relevant sections of the Discussion. Rather than implying a causal role, we now describe the mutually exclusive localization of importin α1 with DNA repair and sensing factors (RAD51, RPA2, and cGAS), emphasize its preferential association with euchromatin regions marked by H3K4me3, and note its frequent co-localization with chromatin regulators identified by RIME, such as PARP1 and SUPT16H/FACT. These findings suggest that importin α1-positive MN define a distinct subset characterized by limited accessibility to DNA repair and sensing proteins, whereas cGAS-positive ruptured MN exemplify a state in which these proteins can accumulate.

      We also added a concluding statement that frames importin α1 as defining a previously unrecognized MN subset that is distinct from conventional ruptured MN. This revision provides a more accurate and appropriately cautious interpretation of our data while underscoring the conceptual advance of our study by clarifying how importin α1 localization reveals MN heterogeneity.

      Minor Comments

      1) Summary statement (page 3 Line 40): The use of "their" is confusing. Whose microenvironment are you referring to?

      We have rephrased the sentence as follows: The accumulation of importin α in micronuclei, followed by modulation of the microenvironment of the micronuclei, suggests the non-canonical function of importin α in genomic instability and cancer development. Thank you for this useful suggestion.

      2) In Abstract and introduction (page 4, Line 44 and page 5, line 59) it states that MN are membrane enclosed structures, but this is not always the case (see https://doi.org/10.1038/nature23449 as one example).

      While MN are typically surrounded by a nuclear envelope at the time of their formation during mitosis, we agree that this envelope can later rupture or fail to assemble completely, thereby exposing micronuclear DNA to the cytoplasm. To clarify this point, we revised the Introduction to explicitly acknowledge that MN may lose nuclear envelope integrity, which can have important consequences for genomic instability and immune activation inflammation. Specifically, we have added the following sentence to the Introduction (page 4, lines 77-80): "The nuclear envelope of MN can be partially or completely disrupted, allowing cytoplasmic DNA sensors, such as cyclic GMP-AMP synthase (cGAS), to access micronuclear DNA and trigger innate immune responses via the cGAS-STING pathway (Harding et al, 2017; Li & Chen, 2018; Mackenzie et al, 2017). "

      We hope this addition appropriately addresses the concerns raised by Reviewer #2 while incorporating the valuable suggestions from Reviewer #1 without altering the overall structure and flow of the manuscript.

      3) Given the fact that the RIME result identified proteins involved in DNA replication to be enriched with Importin α1, are these MN enriched in factors described in Fig. 5 simply localizing to MN that are in S phase, as described previously (doi: 10.1038/nature10802)?

      We sincerely thank the reviewer for raising this constructive perspective regarding the potential relationship between importin α1 enrichment in micronuclei (MN) and the S phase. Our RIME analysis identified chromatin-associated proteins, such as PARP1 and SUPT16H/FACT, which are often activated during replication stress and frequently function in the S phase. However, importin α1-positive MN were not exclusively associated with S-phase-specific molecules, and our data do not indicate that these MN are restricted to the S phase.

      Previous studies [e.g., (Crasta et al, 2012)] have established that MN are prone to replication defects and represent hotspots of genomic instability. The recovery of replication stress-responsive molecules, such as PARP1 and FACT, by RIME is therefore consistent with the biology of MN. Based on this valuable suggestion, we have revised the Discussion (page 19) to explicitly mention the potential involvement of replication-related proteins in importin α1-positive MN, as well as the possibility that importin α1 accumulation may contribute to replication defects in these structures. We are grateful to the reviewer for raising this important perspective, which has enabled us to place our findings in a broader mechanistic context.

      We are grateful to the reviewer for this important comment, which has allowed us to place our findings in a broader mechanistic context and outline directions for future research, including testing the relationship between importin α1-positive MN and established S-phase markers such as PCNA.

      4) The FRAP data is not very compelling. While it is clear there are differences between the PN and MN dynamics, what is driving these differences? Are these differences meaningful to the biology of the MN or PN? It is unclear what this data is contributing to the conclusions of the paper. Also, if the mobility of the MN is plotted on the same graph as the PN, the differences in MN mobility might not look as compelling.

      We respectfully emphasize that FRAP analysis is a key component of our study, as it provides important insights into the distinct dynamics of importin α1 in MN compared to PN.

      In the revised manuscript, we included new experiments (now shown in Fig. 3A and 3C) that directly compare the recovery kinetics of importin α1 in PN and MN in the same cells. By plotting the PN and MN recovery curves side by side, we aimed to improve clarity and provide a direct visualization of the pronounced differences in importin α1 dynamics between these compartments.

      Our FRAP results showed that importin α1 accumulated in both PN and MN but exhibited markedly reduced mobility in MN. These findings suggest that, unlike in the PN, canonical nucleocytoplasmic recycling of importin α1 is impaired in MN. Furthermore, the reduced mobility indicates that importin α1 is stably associated with chromatin or chromatin-associated factors in MN, consistent with our additional biochemical and imaging data showing preferential association with euchromatin (e.g., H3K4me3) and chromatin regulators.

      Taken together, the FRAP data provide functional evidence that complements our structural and molecular analyses, supporting our central conclusion that importin α1 accumulation in MN defines a restricted chromatin environment that influences the accessibility of DNA repair and sensing factors.

      5) In Results (line 117), you state that "the cytoplasm of those cell lines emitted quite strong signals" for Importin α1, but that phrasing is a little confusing. Yes, Importin α1 is in present the cytoplasm in most cells, but it appears you are referring to the enrichment in MN. I would recommend re-phrasing this statement to make your intent clearer.

      As the reviewer rightly noted, the original phrasing, "the cytoplasm of those cell lines emitted quite strong signals," was misleading, as it could suggest a broad cytoplasmic distribution of importin α1. Our observations showed that importin α1 accumulated specifically in MN located within the cytoplasm, but not in the cytoplasmic regions. To clarify this, we revised the Results section (page 7, lines 125-127) to read: " Next, we performed indirect immunofluorescence (IF) analysis on human cancer cell lines, including MCF7 and HeLa cells. Notably, we found that importin α1 accumulated prominently in MN located within the cytoplasm (MCF7 cells, Fig. 1B; HeLa cells, Fig. 1C; yellow arrowhead). " .

      We believe that this revised wording more accurately reflects our findings and addresses the reviewer's concerns.

      6) In Results (line 135, Figure S2E,F), the ratio of high, low or no Importin α1 intensity is confusing. Is this percentage relative to the total number of MN? It Is unclear what is meant by "whole number" of MN. Is Importin α1 intensity quantified or is it subjective?

      We apologize for the confusing terminology used in the original manuscript for Supplemental Fig. S2 and thank the reviewer for pointing it out. Although the reviewer did not specifically comment on the classification of importin α1 signal intensity as "high" or "low," we recognized that this approach relied on subjective visual assessment and lacked clearly defined thresholds. To improve clarity and objectivity, we have removed this classification and now analyze importin α1 localization in MN as simply positive or negative (revised Supplemental Fig. S2E). The previous graph (original Fig. S2F) was deleted. In addition, the frequency of Importin α1-positive MN has been reported in the Results section of the main text (page 8). We believe that these revisions have improved the clarity and reproducibility of our data presentation.

      7) Figure 2C is confusing. Are you counting MN with co-localization of Importin α1 and these factors? Please clarify.

      Figure 2C shows the percentage of importin α1-positive MN that displayed localization of importin β1, CAS, or Ran based on IF analysis. In other words, it represents the co-localization rates of these transport factors specifically within the subset of MN positive for importin α1. To improve clarity, we revised the y-axis label in Fig. 2C to "Localization in Impα1-positive MN (%)" and modified the figure legend accordingly. We have clarified this point in the Results section (page 9). We believe that these revisions resolve the confusion and clarify the scope of the analysis.

      8) Figure S3D quantification is very confusing and unclear. Also, how is this normalized? Are you controlling for total signal in each cell? And can the results of this experiment give you any mechanistic insight as to what is regulating MN localization beyond the interpretation of "MN localization is distinct from PN localization"? The "C-mutant" appears quite a bit different than the others. What might that indicate about the role of CAS/CSE1L in MN enrichment?

      We apologize for the confusion caused by the quantification in the Supplemental Fig. S3D (now revised as Fig. S4D). This figure shows the relative enrichment of EGFP-importin α1 in MN compared with that in PN for wild-type and mutant constructs. To control for nuclear size, fluorescence intensity was measured using a fixed circular ROI (1.5-2.0 µm in diameter) placed in both the MN and PN of the same cell, and MN/PN intensity ratios were directly plotted for individual cells (n = 8 per condition). This procedure is described in detail in the Results section (page 10).

      Regarding the C-mutant, the reduced MN/PN ratio primarily reflects increased importin α1 accumulation in the PN rather than a reduced retention in the MN. As discussed in the revised manuscript (page 18), this suggests that CAS/CSE1L-mediated nuclear export is active in the PN but may be impaired or uncoupled in the MN, possibly due to differences in nuclear envelope integrity or chromatin context. We believe that this clarification addresses the reviewer's concerns and highlights the mechanistic implications of the C-mutant phenotype.

      9) For Figures 3A,B and S4, are these images of single z-slices or projections? It would be helpful to clarify for your interpretations as to whether they are truly partial or diffuse or the membrane is in another z-plane. Also, how does the localization of Importin α1 different or similar to other factors that localize to MN with a compromised nuclear envelope, such as cGAS? If it is based on epigenetic marks, it should be different than cGAS, which primarily binds non-chromatinized DNA.

      We thank the reviewer for this valuable suggestion. All images shown in Figs 3A, 3B, and S4 in the original manuscript (now revised as Fig. 4A and 4B, with the original Fig. S4 omitted) were derived from single optical sections rather than projections. We would like to emphasize that similar discontinuities in signals for lamin proteins (including laminB1 and laminA/C) were consistently observed across multiple cells and independent experiments, indicating that these observations are not due to an artifact of image acquisition or a missing z-plane, but rather reflect a genuine partial loss of the MN membrane.

      In contrast to cGAS, which predominantly binds non-chromatinized DNA in ruptured MN, our data indicate that importin α1 preferentially localizes to MN regions enriched in euchromatin-associated histone modifications, such as H3K4me3. The new data presented in Fig. 8 further strengthen this point by directly comparing importin α1 with DNA-recognizing proteins such as cGAS and RPA2, which preferentially localize to MN lacking importin α1. Together, these results highlight that importin α1-positive MN constitute a distinct subset characterized by chromatin-associated localization and reduced accessibility to DNA repair and sensing proteins.

      10) In Results, it is unclear how Fig. 7B was calculated. Are the authors qualitatively assessing if RAD51 is there or looking for MN enrichment relative to PN? Additionally, in Fig. 7C, RAD51 localization is diffuse. It should be enriched in foci. I would recommend the authors repeat this experiment using pre-extraction then quantify RAD51 foci number and/or intensity.

      For the quantification shown in Fig. 7B of the original manuscript, we acquired images containing approximately 15-50 cells per condition and counted all the micronuclei (MN) in those fields. The percentage of RAD51-positive MN relative to the total MN was calculated. In the revised manuscript, we further refined this analysis by classifying RAD51-positive MN into two categories based on signal intensity: weak (Cell #1 type) and strong (Cell #2 type). For each condition, nine independent fields were analyzed (302 MN in untreated cells and 213 MN in etoposide-treated cells). This quantification revealed that etoposide treatment preferentially increased the proportion of MN with strong RAD51 accumulation (Fig. 7C, right panels), indicating enhanced DNA damage in MN. Thus, our analysis was quantitative rather than qualitative, based on systematic counting across multiple fields.

      Regarding the reviewer's suggestion of pre-extraction, we believe that this approach is technically difficult because MN are structurally fragile. Importantly, in the subset of MN with strong RAD51 accumulation, RAD51 was clearly present in foci rather than diffuse signals, as shown in the high-magnification images (Fig. 7E).

      Finally, in response to Reviewer #1, we performed a new quantitative analysis (Fig. 7F) focusing on the frequency of strongly RAD51-positive MN in relation to importin α1 status. This analysis confirmed the mutually exclusive relationship between RAD51 and importin α1 in MN and further strengthened our conclusions.

      11) In line 264, "notably" is misspelled.

      Thank you for pointing this out. We have corrected the spelling.

      12) In line 303, "scenarios" should be changed to the singular form.

      Thank you for this confirmation. We have corrected this to "scenario".

      13) In Figure legend, line 571-582, H3K27me3 is shown in Figure 4D, but the written legend does not mention this mark.

      We have added the marks in the legend for Fig. 5E.


      Significance: Overall, this paper shows compelling evidence for micronuclear localization of regulators of nuclear export, notably Importin α1. Of note, this occurs in subsets of MN that lack an intact nuclear envelope. And while it has been appreciated that compromised micronuclear envelopes lead to genomic instability, this is one of the first that demonstrate alteration in the nuclear envelope may disrupt import or export of nuclear proteins into micronuclei.

      A limitation of the study is that much of the work is based on immunofluorescence and lacks mechanism. While there is much correlative data showing that Importin α1 localizes to micronuclei with compromised envelopes, it is unclear whether Importin α1 drives micronuclear collapse or it is downstream of this process. Additionally, Importin α1 micronuclear localization anti-correlates with RAD51 but does colocalize with other DNA replication factors, yet it is unclear whether their localization is dependent on Importin α1 or its role in nuclear export. Currently, the audience for this manuscript would be focused to those interested in micronuclei. If these concerns about an active role for Importin α1 in micronuclear export are resolved, it would greatly increase the impact of this manuscript to those interested more broadly in genomic instability, DNA repair, and cancer.

      We thank the reviewer for positively evaluating our study and highlighting the importance of defining the biological significance of our findings. In the revised manuscript, we incorporated new data (Fig. 8) demonstrating that importin α1-positive MN are mutually exclusive not only with RAD51 but also with RPA2 and cGAS. These results clearly establish importin α1-positive MN as a distinct subset, defined by the enrichment of chromatin-associated proteins, while being largely inaccessible to canonical DNA repair and DNA-sensing factors.

      Consistent with this, our FRAP experiments and analysis of the CAS/CSE1L-binding mutant (C-mut) further indicated that the recycling dynamics of importin α1 were altered in MN compared to PN. In addition, importin α1 was enriched in lamin-deficient areas of MN, where electron microscopy revealed a fragile nuclear envelope morphology. Together with prior evidence, as discussed in the revised manuscript that recombinant importin α can inhibit nuclear envelope assembly in Xenopus egg extracts (Hachet et al, 2004), these findings raise the possibility that high local concentrations of importin α1 may actively contribute to impaired nuclear envelope formation or stability in MN.

      Such a distinct MN state may have important biological consequences. By limiting the access of DNA repair and DNA-sensing proteins, importin α1 accumulation may influence chromothripsis and immune activation, which, in turn, could play a role in tumor progression and genome instability. We believe that the identification of importin α1 as a marker defining such a restricted MN environment represents a conceptual advance that extends the relevance of our study beyond the MN field to the broader areas of genome instability, DNA repair, and cancer biology. We are grateful to the reviewer for encouraging us to strengthen the framing of our work, which has helped us clarify the novelty and impact of our findings.

      Reviewer #3

      Summary:

      This study reports that importin alpha isoforms enrich strongly in a subset of micronuclei in cancer cells and uses mutagenesis and immunostaining to define how this localization relates to importin alpha's nuclear transport function. This enrichment occurs even though importin-alpha-positive micronuclei also contain Ran and the importin alpha export factor CSE1L, indicating that importin a enrichment is not simply a consequence of the absence of components of the nuclear transport machinery that control its localization. Mutagenesis of importin a indicates that Mn enrichment persists even when the importin beta binding and NLS binding capacities of imp a are impaired. Potential importin alpha interacting proteins are identified by proteomics, although the relationship of these potential binding partners to micronucleus localization is unclear.


      1. In Figure S3, the authors show that mutagenesis of importin alpha's CSE1L binding domain decreases the ratiometric enrichment in Mn vs. Pn. However, is this effect occurring because th CSE1L binding mutant decreases Mn enrichment, or increases Pn enrichment? It seems that the latter is possible based on the images shown. If the Pn specifically becomes brighter on average in cells expressing the C-mut, while Mn remain similar in fluorescence intensity, that might suggest that CSE1L has less of an effect on importin alpha export in Mn compared to Pn.

      We appreciate the reviewer's insightful observations. In the revised analysis (now presented in Supplemental Fig. S4D), we quantified EGFP-importin α1 intensities in both PN and MN using fixed circular regions of interest. This revealed that the reduced MN/PN ratio observed in the CSE1L-binding mutant (C-mut) was mainly due to an increase in the PN signal rather than a decrease in the MN signal. These results are consistent with the reviewer's suggestion and indicate that CSE1L-mediated nuclear export is functional in PN but has a limited impact on MN.

      Importantly, this interpretation is supported by our FRAP experiments (Fig. 3), which show that importin α1 recycles normally in the PN but exhibits markedly reduced mobility in the MN. Together with our proteomic and colocalization analyses (Fig. 6), which identified importin α1 association with chromatin regulators such as PARP1 and SUPT16H/FACT, these findings suggest that importin α1 accumulates in MN not only because the recycling machinery is uncoupled but also because it forms stable interactions with chromatin-associated proteins. As discussed in the revised manuscript, this dual mechanism provides a plausible explanation for the persistent retention of importin α1 in MN and its role in defining a distinct MN environment.

      It is unclear from the text or the methods whether RIME identification of importin-alpha binding partners is performed in reversine-treated cells, which would increase the proportion of importin alpha in Mn, or in untreated cells. In either case, it seems likely that the majority of interactors identified would be cargoes that rely on importin alpha for import into the Pn. The rationale for linking these potential interactions to the Mn is unclear. While some of these factors are indeed shown enriched in Mn in Figure 5, the significance of this is also unclear. These points should be clarified.

      We thank the reviewer for raising this important point. The RIME assay was performed using whole-cell extracts from untreated wild-type MCF7 cells, which primarily identified importin α1-associated nuclear cargo proteins. To assess their potential relevance to MN, we screened the RIME candidates using immunofluorescence data provided by the Human Protein Atlas database and experimentally validated those showing clear MN localization by colocalization with importin α1. This two-step approach enabled us to highlight importin α1 interactors that are functionally relevant to MN biology rather than general nuclear cargoes.

      In response to the reviewer's concerns, we revised the Results section to clarify this rationale. Specifically, we added the explanation that "As importin α1 interactors are typically nuclear proteins, it is plausible that they reside not only in the primary nucleus but also in the MN. To test this possibility, we screened the identified candidates for MN localization using immunofluorescence images provided by the Human Protein Atlas (HPA) database (Pontén et al, 2008; Thul et al, 2017)." (page 14, lines 294-297).

      This is consistent with the idea that a wide range of nuclear proteins carrying NLS motifs can recruit importin α1 into the micronuclei, where they reside. This protein-driven enrichment of importin α1 may create a restricted microenvironment in which canonical DNA repair and sensing proteins, including RAD51, RPA2, and cGAS, are excluded, thereby defining a distinct subset of micronuclei with limited genome surveillance capacity.

      In Figure 6, the authors perform FRAP of importin alpha in Mn and show that it recovers much more slowly in Mn than in Pn. However, it appears from the images shown that the entire Mn was photobleached in each FRAP experiment. It thus is unclear whether the slow FRAP recovery is limited by slow diffusion of importin alpha within Mn/on Mn chromatin or impaired trafficking of importin alpha into and out of Mn. These distinct outcomes have distinct implications: either importin alpha is immobilized on Mn (eu)chromatin, or alternatively importin alpha is poorly transported into / out of Mn. This ambiguity could be resolved by bleaching a portion of a Mn and testing whether importin alpha diffuses within a single Mn.

      We thank the reviewer for this insightful comment regarding the interpretation of FRAP data. As the reviewer rightly pointed out, the original FRAP design-where the entire MN was photobleached-does not allow for a clear discrimination between the intranuclear immobilization of importin α1 and impaired trafficking into or out of the MN.

      In line with a similar suggestion from Reviewer #1, we attempted partial photobleaching of MN to evaluate whether importin α1 can diffuse within MN independently of nucleocytoplasmic transport. However, due to the small size of MN, precise targeting is technically challenging and recovery is often unreliable, with some MN even exhibiting partial recovery during the bleaching process itself. These data were not included in the revised figures; however, we provide representative examples as reviewer-only figures to illustrate these technical limitations.

      To further clarify the nuclear transport dynamics of importin α1, we redesigned our FRAP experiments to fully photobleach both the PN and MN within the same cells under identical conditions. These results, presented in revised Fig. 3A and 3C, demonstrate a markedly slower recovery of importin α1 in MN compared to PN, strongly suggesting that nucleocytoplasmic recycling of importin α1 is impaired in MN. Moreover, the reduced mobility of importin α1 in the MN is consistent with stable chromatin binding, limiting its ability to diffuse freely within the nuclear space.

      We believe that this additional analysis, prompted by the reviewer's comment, significantly strengthens the mechanistic interpretation of our FRAP data.

      References

      Crasta K, Ganem NJ, Dagher R, Lantermann AB, Ivanova EV, Pan Y, Nezi L, Protopopov A, Chowdhury D, Pellman D (2012) DNA breaks and chromosome pulverization from errors in mitosis. Nature 482: 53-58

      Hachet V, Kocher T, Wilm M, Mattaj IW (2004) Importin α associates with membranes and participates in nuclear envelope assembly in vitro. EMBO J 23: 1526-1535

      Martinez-Olivera R, Datsi A, Stallkamp M, Köller M, Kohtz I, Pintea B, Gousias K (2018) Silencing of the nucleocytoplasmic shuttling protein karyopherin a2 promotes cell-cycle arrest and apoptosis in glioblastoma multiforme. Oncotarget 9: 33471-33481

      Vietri M, Schultz SW, Bellanger A, Jones CM, Petersen LI, Raiborg C, Skarpen E, Pedurupillay CRJ, Kjos I, Kip E, Timmer R, Jain A, Collas P, Knorr RL, Grellscheid SN, Kusumaatmaja H, Brech A, Micci F, Stenmark H, Campsteijn C (2020) Unrestrained ESCRT-III drives micronuclear catastrophe and chromosome fragmentation. Nat Cell Biol 22: 856-867

      Wang CI, Chien KY, Wang CL, Liu HP, Cheng CC, Chang YS, Yu JS, Yu CJ (2012) Quantitative proteomics reveals regulation of karyopherin subunit alpha-2 (KPNA2) and its potential novel cargo proteins in nonsmall cell lung cancer. Mol Cell Proteomics 11: 1105-1122

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      The authors have shown that Importin α1, a nuclear transport factor, is enriched in subsets of micronuclei (MN) of cancer cells (MCF7 and HeLa) and, using FRAP, has an altered dynamics in MN. Moreover, the authors have shown that these levels of Importin α1 in the MN are likely not due to its traditional role for signal-dependent protein transport, as suggested by immunofluorescence of other factors important for this function. Additionally, cargo dynamics carrying NLS or NES signals were disrupted in Importin α1-positive micronuclei. Importin α1-positive micronuclei also appear to have a disrupted nuclear envelope, potentially explaining some of these cargo disruptions. The authors also demonstrated that Importin α colocalizes with proteins important for DNA replication, and p53 signaling using RIME, followed by immunofluorescence. Lastly, the authors show that Importin α and RAD51 have mutual exclusivity in the micronuclei.

      Major comments:

      1. A key issue is there are very few statistical tests used in this study. It is crucial to the interpretation of the data. We strongly urge the authors to re-analyze the data using appropriate statistical analyses. Along those lines, in many figures 1 or 2 images are shown without stating how many biological or technical replicates this is representative of or showing quantification of the anlyses. In general, the authors' statements would be strengthened by showing more examples and/or stating "N" in the figure legends or supplement.
      2. Using RIME and immunofluorescence, the authors identify factors that co-localize with Importin α1 in subsets of micronuclei (Figure 5), which is interesting, but there is no functional data associated with this result. Are the authors stating that these differences account for altered DNA damage or replication? It is unclear what the conclusion is beyond "some MN are different than others." Could the authors knockdown/knockout these factors to determine if they recruit Importin α1 into MN or the reciprocal? For many of these factors, they appear to be broadly present throughout the entire primary nucleus as well, indicating there is nothing unique about their MN localization.
      3. In line 274, the authors state that MN highly enriched for Importin α1 inhibits RAD51 accessibility but this is an overstatement of the data. Instead, the authors show that RAD51 and importin α1 do not colocalize in micronuclei, albeit without quantification which weakens their argument. Also, the consequence of this "mutual exclusivity" is unclear. Can the authors inhibit or knockdown Importin α1 and show that RAD51 goes to all micronuclei? And how is this different than the data shown for factors in Figure 5? Some of those show colocalization with Importin α1-positive micronuclei and others do not. Could you perform live imaging of labeled Importin a1 and RAD51 and show that as Importin α1 accumulates in MN that RAD51 or other DNA repair factors are exported? An alternative experiment would be to show that the C-mutant, which is defective in nuclear export, now colocalizes with RAD51 in MN. Please reconcile this or show experiments to prove the statement above.
      4. In the Discussion, line 343-344 states that "importin α1 is uniquely distributed and alters the nuclear/chromatin status when enriched in MN," however this is not currently supported by the present data. The data presented shows correlation (albeit weak) between euchromatic modifications and Importin α1, and it does not definitively show that importin α1 is sufficient to alter the nuclear-chromatin status when enriched in the MN. More substantial experiments would be required to show whether Importin α1 plays an active role in these modifications.

      Minor Comments

      1. Summary statement (page 3 Line 40): The use of "their" is confusing. Whose microenvironment are you referring to?
      2. In Abstract and introduction (page 4, Line 44 and page 5, line 59) it states that MN are membrane enclosed structures, but this is not always the case (see https://doi.org/10.1038/nature23449 as one example).
      3. Given the fact that the RIME result identified proteins involved in DNA replication to be enriched with Importin α1, are these MN enriched in factors described in Fig. 5 simply localizing to MN that are in S phase, as described previously (doi: 10.1038/nature10802)?
      4. The FRAP data is not very compelling. While it is clear there are differences between the PN and MN dynamics, what is driving these differences? Are these differences meaningful to the biology of the MN or PN? It is unclear what this data is contributing to the conclusions of the paper. Also, if the mobility of the MN is plotted on the same graph as the PN, the differences in MN mobility might not look as compelling.
      5. In Results (line 117), you state that "the cytoplasm of those cell lines emitted quite strong signals" for Importin α1, but that phrasing is a little confusing. Yes, Importin α1 is in present the cytoplasm in most cells, but it appears you are referring to the enrichment in MN. I would recommend re-phrasing this statement to make your intent clearer.
      6. In Results (line 135, Figure S2E,F), the ratio of high, low or no Importin α1 intensity is confusing. Is this percentage relative to the total number of MN? It Is unclear what is meant by "whole number" of MN. Is Importin α1 intensity quantified or is it subjective?
      7. Figure 2C is confusing. Are you counting MN with co-localization of Importin α1 and these factors? Please clarify.
      8. Figure S3D quantification is very confusing and unclear. Also, how is this normalized? Are you controlling for total signal in each cell? And can the results of this experiment give you any mechanistic insight as to what is regulating MN localization beyond the interpretation of "MN localization is distinct from PN localization"? The "C-mutant" appears quite a bit different than the others. What might that indicate about the role of CAS/CSE1L in MN enrichment?
      9. For Figures 3A,B and S4, are these images of single z-slices or projections? It would be helpful to clarify for your interpretations as to whether they are truly partial or diffuse or the membrane is in another z-plane. Also, how does the localization of Importin α1 different or similar to other factors that localize to MN with a compromised nuclear envelope, such as cGAS? If it is based on epigenetic marks, it should be different than cGAS, which primarily binds non-chromatinized DNA.
      10. In Results, it is unclear how Fig. 7B was calculated. Are the authors qualitatively assessing if RAD51 is there or looking for MN enrichment relative to PN? Additionally, in Fig. 7C, RAD51 localization is diffuse. It should be enriched in foci. I would recommend the authors repeat this experiment using pre-extraction then quantify RAD51 foci number and/or intensity.
      11. In line 264, "notably" is misspelled.
      12. In line 303, "scenarios" should be changed to the singular form.
      13. In Figure legend, line 571-582, H3K27me3 is shown in Figure 4D, but the written legend does not mention this mark.

      Significance

      Overall, this paper shows compelling evidence for micronuclear localization of regulators of nuclear export, notably Importin α1. Of note, this occurs in subsets of MN that lack an intact nuclear envelope. And while it has been appreciated that compromised micronuclear envelopes lead to genomic instability, this is one of the first that demonstrate alteration in the nuclear envelope may disrupt import or export of nuclear proteins into micronuclei.

      A limitation of the study is that much of the work is based on immunofluorescence and lacks mechanism. While there is much correlative data showing that Importin α1 localizes to micronuclei with compromised envelopes, it is unclear whether Importin α1 drives micronuclear collapse or it is downstream of this process. Additionally, Importin α1 micronuclear localization anti-correlates with RAD51 but does colocalize with other DNA replication factors, yet it is unclear whether their localization is dependent on Importin α1 or its role in nuclear export. Currently, the audience for this manuscript would be focused to those interested in micronuclei. If these concerns about an active role for Importin α1 in micronuclear export are resolved, it would greatly increase the impact of this manuscript to those interested more broadly in genomic instability, DNA repair, and cancer.

      Reviewer's areas of expertise: Genomic instability, cancer epigenetics, and mitosis

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      Provide a short summary of the findings and key conclusions (including methodology and model system(s) where appropriate). Miyamoto et al. report that importin α1 is highly enriched in a subfraction of micronuclei (about 40%), which exhibit defective nuclear envelopes and compromised accessibility of factors essential for the damage response associated with homologous recombination DNA repair. The authors suggest that the unequal localization and abnormal distribution of importin α1 within these micronuclei contribute to the genomic instability observed in cancer.

      Major comments:

      Are the key conclusions convincing?

      The conclusions drawn by the authors would benefit from additional supportive experiments and a more detailed explanation. 1. It is crucial to quantitatively assess the localization of importin α1 in micronuclei (MN) across non-transformed MCM10A cells compared to transformed cell lines (MC7, HeLa, and MDA-MB-231). This analysis would help determine whether the localization of importin α1 in MN correlates with genomic stability in human cancer cells 2. While the authors provide some evidence indicating partial disruption of nuclear envelopes in MN (Figures 3 and S4), it is noteworthy that this phenomenon also occurs in importin α1-negative MN. Furthermore, according to the figure legends, the data presented in both figures stem from a single experiment. Current literature suggests that compromised nuclear envelope integrity is one of the major contributors to genomic instability, mediated through mechanisms such as chromothripsis and cGAS-STING-mediated inflammation arising from MN. Therefore, a more comprehensive quantification of nuclear envelope integrity-ideally comparing non-transformed MCM10A cells with transformed cell lines (MC7, HeLa, and MDA-MB-231)-is necessary to substantiate the connection between aberrant importin α1 behavior in MN and chromothripsis processes, as well as regulation of the cGAS-STING pathway linked to genomic instability in cancer cells. 3. The schematic illustration presented in Figure 8 does not adequately summarize all findings from this study nor does it clarify how the localization of importin α1 within MN might hypothetically influence genome stability. Although it is reasonable to propose that "importin α can serve as a molecular marker for characterizing the dynamics of MN" (Line 344), the authors assert (Line 325) that their findings, along with others, have "potential implications for the induction of chromothripsis processes and regulation of the cGAS-STING pathway in cancer cells." However, they fail to provide a clear or even hypothetical explanation regarding how their findings contribute to these molecular events. To address this gap, it would be essential for them to contextualize their results within existing literature that explores and links structural integrity deficits or aberrant DNA replication/damage responses in MN with chromothripsis and inflammation (e.g., PMID: 32601372; PMID: 32494070; PMID: 27918550; PMID: 28738408; PMID: 28759889). 4. Fig. 4D does not support the idea that importin α1 is euchromatin enriched: H3K9me3, H3K4me3 and H3K37me3 seem to be all deeply blue.

      Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

      Indeed, the data presented by the authors do not adequately support a direct link between the presence of importin α1 in MN and genomic instability in human cancer cells. While the experimental correlations provided may not substantiate this connection definitively, they do lay a foundation for a grounded hypothesis and suggest the need for further research to explore this topic in greater depth. Additionally, it is worth noting that the evidence contributes to the growing list of nuclear proteins exhibiting abnormal behavior in micronuclei (MN). This highlights the significance of studying such proteins to understand their roles in genomic stability and cancer progression.

      Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.

      Additional experiments are necessary to quantitatively assess the localization of importin α1 in micronuclei (MN) across non-transformed MCM10A cells and transformed cell lines (MC7, HeLa, MDA-MB-231). This analysis would help determine whether the localization of importin α1 in MN correlates with genomic stability in human cancer cells. The authors claim that importin α1 preferentially localizes to euchromatic areas rather than heterochromatic regions within MN. While this assertion is supported by the immunofluorescence (IF) images presented in Figures 4A/B and S5A/B, it remains less clear for Figure S5C/B. To strengthen this claim, providing averages of IF distributions from multiple cells across independent experiments would be beneficial to draw more robust conclusions.

      Furthermore, ChIP-seq data are presented to support the idea that importin α1 preferentially distributes over euchromatin areas in MN. However, as described, the epigenetic chromatin status indicated by these ChIP-seq experiments reflects that of the principal nucleus (PN), not specifically the status within MN in MCF7 cells. Given that MN represent only a small fraction of the cell population under normal culture conditions-likely less than 5% for HeLa cells as shown in Figure S2D-the relevance of this data is limited. Additionally, according to data presented in Figure 1B, importin α1 does not localize or distribute within the PN as it does in MN in MCF7 cells. Therefore, further experiments should be conducted to substantiate that importin α1 preferentially targets euchromatin areas within MN and to compare this distribution with that observed in the principal nucleus. Such studies could reveal potential abnormalities regarding the correlation between epigenetic chromatin status and importin α distribution in MN. To support the hypothesis that importin α1 inhibits RAD51 accessibility within MN, Figures 7D and E should be supplemented with thorough quantification and statistical analysis based on at least three independent experiments. This additional data would enhance confidence in their findings regarding RAD51 accessibility inhibition by importin α1.

      Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.

      The additional experiments proposed are controls and direct comparisons using the same techniques and experimental designs used by the authors, so it is reasonable that the authors can carry them out within a realistic timeframe.

      Are the data and the methods presented in such a way that they can be reproduced?

      Given the importance of reproducibility and the need to evaluate results based on imaging and quantitation, I strongly recommend that the authors include a detailed description of the optical microscopy procedures utilized in their study. This should encompass imaging conditions, acquisition settings, and the specific equipment used. Providing this information will enhance transparency and facilitate reproducibility. For reference, some valuable guidance on essential parameters for reproducibility can be found in Heddleston et al. (2021) (doi:10.1242/jcs.254144). Incorporating these details will not only strengthen the manuscript but also support other researchers in reproducing the findings accurately.

      Are the experiments adequately replicated and statistical analysis adequate?

      Many of the plots and values in the manuscript lack appropriate statistical analysis, including p-values, which are not detailed in the figures or their legends. Furthermore, the Statistical Analysis section does not provide adequate information regarding the specific statistical tests employed or the criteria used to determine which analyses were applied in each case. To enhance the rigor and clarity of the study, it is essential that these issues be addressed prior to publication. A comprehensive presentation of statistical analysis will improve the reliability of the findings and allow readers to better understand the significance of the results. I recommend that the authors revise this section to include detailed explanations of all statistical methods used, along with corresponding p-values for all relevant comparisons.

      Minor comments:

      Specific experimental issues that are easily addressable.

      The authors claim that importin α1 exhibits remarkably low mobility in the micronuclei (MN) compared to its mobility in the principal nucleus (PN), as illustrated in Figure 1. However, based on the experimental design, this conclusion may not be appropriate. In the current setup, the FRAP experiment conducted in the PN measures the mobility of importin α1 molecules within the cell nucleus, where the influence of nuclear transport is likely negligible. Conversely, in the MN experiments shown, all molecules of importin α1 are bleached within a given MN. Consequently, what is being measured here primarily reflects the effects of nuclear transport rather than intrinsic molecular mobility. To accurately compare kinetics of nuclear transport, it would be essential to completely bleach the entire PN. If measuring molecular mobility between MN and PN is desired, only a small fraction of either MN or PN area/volume should be bleached during FRAP analysis. Additionally, it would be beneficial to include measurements of mobility for other canonical nuclear transport factors (e.g., RAN, CAS, RCC1) for comparative purposes. This broader context would allow for a more comprehensive understanding of importin α1 behavior relative to other factors involved in nuclear transport. Finally, utilizing cells that exhibit importin α1 signals in both PN and MN could further strengthen comparisons and provide more robust conclusions regarding its mobility dynamics.

      Are prior studies referenced appropriately?

      Prior studies are referenced appropriately in general, but the authors missed some references (PMID: 32601372; PMID: 32494070; PMID: 27918550; PMID: 28738408; PMID: 28759889) that I consider key to put the present findings in frame with previous works which link the lack of structural integrity and/or aberrant DNA replication/damage responses in MN with Cchromothripsis and inflammation.

      Are the text and figures clear and accurate?

      The figures presented in the manuscript are clear; however, where plots are included, they require appropriate statistical analysis. It is essential to display p-values on the plots or within their legends to provide readers with information regarding the significance of the results. Including this statistical information will enhance the interpretability of the data and strengthen the overall findings of the study. I recommend that the authors revise these sections accordingly before publication.

      Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      1. In lines 134-135, it is stated that "up to 40% of the MN showed importin α1 accumulation under both standard culture conditions and the reversine treatment (Fig. S2F)." However, Figure S2F only displays percentages for reversine-treated cells, and there is no mention in the text or figures regarding the percentage of importin α1-positive MN determined by immunofluorescence (IF) under standard culture conditions. This discrepancy should be addressed.
      2. In line 170, the authors state that "Cells in which overexpressed EGFP-importin α1 localized only in PN were excluded from the analysis (see Fig. 1E, top panels)." It is unclear why this exclusion was made. The authors should clarify whether they are referring to all constructs or only to the wild-type (WT) construct when mentioning EGFP-importin α1 localization solely in PN. This clarification is important as it may affect the results highlighted in line 173.
      3. The statement in line 191 ("However, this antibody could not be further used in this context due to cross-reactivity with highly concentrated importin α1 in MN (Fig. S4)") is somewhat misleading. While it hints at a technical issue, it does not provide additional relevant information for understanding its implications for the rationale of the research. Moreover, Figure S4 is referenced but appears to refer specifically to panels S4D and E, which are not mentioned in the text. I recommend clarifying this point or removing it altogether.
      4. Lines 197-199 contain a sentence that could be misleading and would benefit from clearer explanation. Although Figure 3D provides some clarity on this matter, no statistical analysis is included-only a bar plot is presented. A proper statistical analysis should be provided here to enhance understanding.
      5. In lines 218-221, it states that importin α1 associates with euchromatin regions characterized by H3K4me3 and H3K36me3; however, Figure 4D lacks the Spearman's correlation coefficient value for H3K36me3 within the matrix. This omission needs correction.
      6. For consistency in the experimental design aimed at identifying potential importin α1-interacting proteins, it would be more appropriate for Figures 5C/D to show IF data from MCF7 cells rather than HeLa cells.
      7. To substantiate claims that importin α1 inhibits RAD51 accessibility within MN, Figures 7D and E should include thorough quantitation and statistical analysis based on at least three independent experiments.
      8. The meaning of lines 336-338-"Therefore, the enrichment of importin α1 in MN, along with its interaction with chromatin, may regulate the accessibility of RAD51 to DNA/chromatin fibers in MN and protect its activity"-is unclear. I suggest rephrasing this sentence for improved clarity and comprehension.
      9. Fig. 1D: Numbers on the y-axis are missing, x-axis labeling is too small
      10. Fig. 1F: As the PN/MN values of the three experiments are seemingly identical (third column) the distribution of the three individual data of the PN (first column) should mirror the distribution of the three individual data of the MN (second column). The authors might want to check why this is not the case.

      Significance

      • Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.
      • Place the work in the context of the existing literature (provide references, where appropriate).

      Micronuclei (MN) primarily arise from defects in mitotic progression and chromatin segregation, often associated with chromatin bridges and/or lagging chromosomes. MN frequently exhibit DNA replication defects and possess a rupture-prone nuclear envelope, which has been linked to genomic instability. The nuclear envelope of MN is notably deficient in crucial factors such as lamin B and nuclear pore complexes (NPCs). This deficiency may be attributed to the influence of microtubules and the gradient of Aurora B activity at the mitotic midzone, which inhibits the recruitment of proper nuclear envelope components. Additionally, several other factors may contribute to this process: for instance, PLK1 controls the assembly of NPC components onto lagging chromosomes; chromosome size and gene density positively correlate with the membrane stability of MN; and abnormal accumulation of the ESCRT complex on MN exacerbates DNA damage within these structures, triggering pro-inflammatory pathways. The work presented by Dr. Miyamoto and colleagues reveals the abnormal behavior of importin α1 in MN during interphase. According to their findings, it is reasonable to consider importin α1 as a molecular marker for characterizing MN dynamics. Furthermore, it could serve as a potential clinical marker if the authors provide additional experiments demonstrating significantly different localization patterns of importin α1 in transformed cells (e.g., MC7, HeLa, MDA-MB-231) compared to non-transformed cells (e.g., MCM10A). While the authors present some evidence indicating partial disruption of nuclear envelopes in MN (Figures 3 and S4), it is noteworthy that this phenomenon also occurs in importin α1-negative MN. Moreover, according to the figure legends, data for both figures originate from a single experiment. As such, convincing evidence linking the aberrant behavior of importin α1 in MN with chromothripsis processes or regulation of the cGAS-STING pathway-and its implications for genomic instability in cancer cells-remains lacking. Overall, it is not entirely clear what significance this advance holds for the field; while there are conceptual contributions made by this work, they do not appear sufficiently robust at this time. Further research is needed to clarify these connections and strengthen their conclusions regarding importin α1's role in MN dynamics and genomic instability. - State what audience might be interested in and influenced by the reported findings.

      Scientist and health care professionals that research on mechanism of genomic instability and cancer - Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      Mitosis, mitotic chromatin decondensation, nuclear reformation, hematopoietic cancers, light microscopy, image analysis.

    1. Question n°12 : : Comment intègre-t-on une boucle d’asservissement en intensité. Et quel rôle a-t-elle

      Cette question est intéressante mais sans connaissance avant qu'il faut implémenter un capteur ça peut être rude à commencer

    1. C’est la représentation dans le plan complexe de H(jω) lorsque ω varie de 0 à l’infini. Image tikz Tracés dans le lieu de Nyquist \caption{Exemple de tracés dans le lieu de Nyquist} \label{exemple_nyquist}

      Il y a un problème d'affichage ici ainsi que dans la partie suivante

    1. La parte más grande del clítoris está oculta; apenas podemos ver una pequeña punta

      El cuerpo femenino casi siempre se ha entendido a través de un esquema visual patriarcal (solo existe para fines reproductivos, estéticos y / o médicos) y lp que queda fuera de esta mirada se vuelve inexistente. Esto no es casual es totalmente político ya que hay una relación entre lo que no se representa y lo que no se puede experimentar. El que haya sido invisibilizado durante tanto tiempo es una forma de controlar el placer y de negar la anatomía propia del cuerpo. Esto igual funciona como una metáfora para los sistemas de poder en general en donde todo lo que sale de la norma se tapa, se borra o se vuelve una amenaza. Desprogramar entonces sería una forma de reconfigurar nuestro conocimiento a través de lo que se nos ha negado o escondido.

    2. Aprender haciendo, repitiendo para el disfrute, construyendo de la experiencia, explorando siempre sin otro fin más que seguir disfrutando.

      En una sociedad que mide todo en términos de productividad el hacer algo simplemente por placer es un acto de resistencia. Desprogramar el clímax también sería una forma de desarmar la manera en la que el capitalismo entiende el cuerpo humano (como una máquina funcional). Así se podría habitar un cuerpo que existe sin objetivos externos.

    3. No hace falta porque fuera todo está resuelto, porque alguien lo ha resuelto para mi ‘comodidad’.

      Esto implica que las respuestas a nuestras preguntas estén muy condicionadas. Un buen ejemplo es el SEM (Search Engine Marketing): el orden y visibilidad de los resultados no dependen simplemente de la relevancia, sino del presupuesto que puede invertir cada página. Así, "nuestras respuestas" ya llega mediado por intereses económicos, no por nuestra curiosidad o libertad de exploración.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      __Reviewer #1 (Evidence, reproducibility and clarity (Required)): __

      This study explores chromatin organization around trans-splicing acceptor sites (TASs) in the trypanosomatid parasites Trypanosoma cruzi, T. brucei and Leishmania major. By systematically re-analyzing MNase-seq and MNase-ChIP-seq datasets, the authors conclude that TASs are protected by an MNase-sensitive complex that is, at least in part, histone-based, and that single-copy and multi-copy genes display differential chromatin accessibility. Altogether, the data suggest a common chromatin landscape at TASs and imply that chromatin may modulate transcript maturation, adding a new regulatory layer to an unusual gene-expression system.

      I value integrative studies of this kind and appreciate the careful, consistent data analysis the authors implemented to extract novel insights. That said, several aspects require clarification or revision before the conclusions can be robustly supported. My main concerns are listed below, organized by topic/result section.

      TAS prediction * Why were TAS predictions derived only from insect-stage RNA-seq data? Restricting TAS calls to one life stage risks biasing predictions toward transcripts that are highly expressed in that stage and may reduce annotation accuracy for lowly expressed or stage-specific genes. Please justify this choice and, if possible, evaluate TAS robustness using additional transcriptomes or explicitly state the limitation.

      TAS predictions derived only from insect-stage RNA-seq data because in a previous study it was shown that there are no significant differences between stages in the 5’UTR procesing in T. cruzi life stages (https://doi.org/10.3389/fgene.2020.00166) We are not testing an additional transcriptome here, because the robustness of the software was already probed in the original article were UTRme was described (Radio S, 2018 doi:10.3389/fgene.2018.00671).

      Results - "There is a distinctive average nucleosome arrangement at the TASs in TriTryps": * You state that "In the case of L. major the samples are less digested." However, Supplementary Fig. S1 suggests that replicate 1 of L. major is less digested than the T. brucei samples, while replicate 2 of L. major looks similarly digested. Please clarify which replicates you reference and correct the statement if needed.

      The reviewer has a good point. We made our statement based on the value of the maximum peak of the sequenced DNA molecules, which in general is a good indicative of the extension of the digestion achieved by the sample (Cole H, NAR, 2011).

      As the reviewer correctly points, we should have also considered the length of the DNA molecules in each percentile. However, in this case both, T. brucei’s and L major’s samples were gel purified before sequencing and it is hard to know exactly what fragments were left behind in each case. Therefore, it is better not to over conclude on that regard.

      We have now comment on this in the main manuscript, and we have clarified in the figure legends which data set we used in each case.

      * It appears you plot one replicate in Fig. 1b and the other in Suppl. Fig. S2. Please indicate explicitly which replicate is in each plot. For T. brucei, the NDR upstream of the TAS is clearer in Suppl. Fig. S2 while the TAS protection is less prominent; based on your digestion argument, this should correspond to the more-digested replicate. Please confirm.

      The replicates used for the construction of each figure are explicitly indicated in Table S1. Although we have detailed in the table the original publication, the project and accession number for each data set, the reviewer is correct that in this case it was still not completely clear to which length distribution heatmap was each sample associated with. To avoid this confusion, we have now added the accession number for each data set to the figure legends and also clarified in Table S1. Regarding the reviewer’s comment on the correspondence between the observed TAS protection and the extent of samples digestion, he/she is correct that for a more digested sample we would expect a clearer NDR. In this case, the difference in the extent of digestion between these two samples is minor, as observed the length of the main peak in the length distribution histogram for sequenced DNA molecules is the same. These two samples GSM5363006, represented in Fig1 b, and GSM5363007, represented in S2, belong to the same original paper (Maree et al 2017), and both were gel purified before sequencing. Therefore, any difference between them could not only be the result of a minor difference in the digestion level achieved in each experiment but could be also biased by the fragments included or not during gel purification. Therefore, I would not over conclude about TAS protection from this comparison. We have now included a brief comment on this, in the figure discussion

      * The protected region around the TAS appears centered on the TAS in T. brucei but upstream in L. major. This is an interesting difference. If it is technical (different digestion or TAS prediction offset), explain why; if likely biological, discuss possible mechanisms and implications.

      We appreciate the reviewer suggestion. We cannot assure if it is due to technical or biological reasons, but there is evidence that L. major ‘s genome has a different dinucleotide content and it might have an impact on nucleosome assembly. We have now added a comment about this observation in the final discussion of the manuscript.

      Results - "An MNase sensitive complex occupies the TASs in T. brucei": * The definition of "MNase activity" and the ordering of samples into Low/Intermediate/High digestion are unclear. Did you infer digestion levels from fragment distributions rather than from controlled experimental timepoints? In Suppl. Fig. S3a it is not obvious how "Low digestion" was defined; that sample's fragment distribution appears intermediate. Please provide objective metrics (e.g., median fragment length, fraction 120-180 bp) used to classify digestion levels.

      As the reviewer suggests, the ideal experiment would be to perform a time course of MNase reaction with all the samples in parallel, or to work with a fixed time point adding increasing amounts of MNase. However, even when making controlled experimental timepoints, you need to check the length distribution histogram of sequenced DNA molecules to be sure which level of digestion you have achieved.

      In this particular case, we used public available data sets to make this analysis. We made an arbitrary definition of low, intermediate and high level of digestion, not as an absolute level of digestion, but as a comparative output among the tested samples. We based our definition on the comparison of __the main peak in length distribution heatmaps because this parameter is the best metric to estimate the level of digestion of a given sample. It represents the percentage of the total DNA sequenced that contains the predominant length in the sample tested. __Hence, we considered:

      low digestion: when the main peak is longer than the expected protection for a nucleosome (longer than 150 bp). We expect this sample to contain additional longer bands that correspond to less digested material.

      intermediate digestion, when the main peak is the expected for the nucleosome core-protection (˜146-150bp).

      high digestion, when the main peak is shorter than that (shorter than 146 bp). This case, is normally accompanied by a bigger dispersion in fragment sizes.

      To do this analysis, we chose samples that render different MNase protection of the TAS when plotting all the sequenced DNA molecules relative to this point and we used this protection as a predictor of the extent of sample digestion (Figure 2). To corroborate our hypothesis, that the degree of TAS protection was indeed related to the extent of the MNase digestion of a given sample, we looked at the length distribution histogram of the sequenced DNA molecules in each case. It is the best measurement of the extent of the digestion achieved, especially, when sequencing the whole sample without any gel purification and representing all the reads in the analysis as we did. The only caveat is with the sample called “intermediate digestion 1” that belongs to the original work of Mareé 2017, since only this data set was gel purified.

      Whether the sample used in Figure 1 (from Mareé 2017) is also from the same lab and is an MNase-seq. Strictly speaking, there is no methodological difference between MNase-seq and the input of a native MNase-ChIP-seq, since the input does not undergo the IP.

      * Several fragment distributions show a sharp cutoff at ~100-125 bp. Was this due to gel purification or bioinformatic filtering? State this clearly in Methods. If gel purification occurred, that can explain why some datasets preserve the MNase-sensitive region.

      The sharp cutoff is neither due to gel purification or bioinformatic filtering, it is just due to the length of the paired-end read used in each case. In earlier works the most common was to sequence only 50bp, with the improvement of technologies it went up to 75,100 or 125 bp. We have now clarified in Table S1 the length of the paired-reads used in each case when possible.

      * Please reconcile cases where samples labeled as more-digested contain a larger proportion of >200 bp fragments than supposedly less-digested samples; this ordering affects the inference that digestion level determines the loss/preservation of TAS protection. Based on the distributions I see, "Intermediate digestion 1" appears most consistent with an expected MNase curve - please confirm and correct the manuscript accordingly.

      As explained above, it's a common observation in MNase digestion of chromatin that more extensive digestion can still result in a broad range of fragment sizes, including some longer fragments. This seemingly counter-intuitive result is primarily due to the non-uniform accessibility of chromatin and the sequence preference of the MNase enzyme, which has a preference for AT reach sequences.

      The rationale of this is as follows: when you digest chromatin with MNase and the objective is to map nucleosomes genome-wide, the ideal situation would be to get the whole material contained in the mononucleosome band. Given that MNase is less efficient to digest protected DNA but, if the reaction proceeds further, it always ends up destroying part of it, the result is always far from perfect. The better situation we can get, is to obtain samples were ˜80% of the material is contained in the mononucloesome band. __And here comes the main point: __even in the best scenario, you always get some additional longer bands, such as those for di or tri nucleosomes. If you keep digesting, you will get less than 80 % in the nucleosome band and, those remaining DNA fragments that use to contain di and tri nucleosomes start getting digested as well, originating a bigger dispersion in fragments sizes. How do we explain persistence of Long Fragments? The longest fragments (di-, tri-nucleosomes) that persist in a highly digested sample are the ones that were originally most highly protected by proteins or higher-order structure, or by containing a poor AT sequence content, making their linker DNA extremely resistant to initial cleavage. Once the majority of the genome is fragmented, these few resistant longer fragments become a more visible component of the remaining population, contributing to a broader size dispersion. Hence, you end up observing a bigger dispersion in length distributions in the final material. Bottom line, it is not a good practice to work with under or over digested samples. Our main point, is to emphasize that especially when comparing samples, it important to compare those with comparable levels of digestion. Otherwise, a different sampling of the genome will be represented in the remaining sequenced DNA.

      Results - "The MNase sensitive complexes protecting the TASs in T. brucei and T. cruzi are at least partly composed of histones": * The evidence that histones are part of the MNase-sensitive complex relies on H3 MNase-ChIP signal in subnucleosomal fragment bins. This seems to conflict with the observation (Fig. 1) that fragments protecting TASs are often nucleosome-sized. Please reconcile these points: are H3 signals confined to subnucleosomal fragments flanking the TAS while the TAS itself is depleted of H3? Provide plots that compare MNase-seq and H3 ChIP signals stratified by consistent fragment-size bins to clarify this.

      What we learned from other eukaryotic organisms that were deeply studied, such as yeast, is that NDRs are normally generated at regulatory points in the genome. In this sense, yeast tRNA genes have a complex with a bootprint smaller than a nucleosome formed by TFIIIC-TFIIB (Nagarajavel, doi: 10.1093/nar/gkt611). On the other hand, many promotor regions have an MNase-sensitive complex with a nucleosome-size footprint, but it does not contain histones (Chereji, et al 2017, doi:10.1016/j.molcel.2016.12.009). The reviewer is right that from Figure 1 and S2 we could observe that the footprint of whatever occupies the TAS region, especially in T. brucei, is nucleosome-size. However, it only shows the size, but it doesn’t prove the nature of its components. Nevertheless, those are only MNase-seq data sets. Since it does not include a precipitation with specific antibodies, we cannot confirm the protecting complex is made up by histones. In parallel, a complementary study by Wedel 2017, from Siegel’s lab, shows that using a properly digested sample and further immunoprecipitating with a-H3 antibody, the TAS is not protected by nucleosomes at least not when analyzing nucleosome size-DNA molecules. Besides, Briggs et. al 2018 (doi: 10.1093/nar/gky928) showed that at least at intergenic regions H3 occupancy goes down while R-loops accumulation increases. We have now added a supplemental figure associated to Figure 3 (new Suplemental 5) replotting R-loops and MNase-ChIP-seq for H3 relative to our predicted TAS showing this anti-correlation and how it partly correlates with MNase protection as well. As a control we show that Rpb9 trends resembles H3 as Siegel’s lab have shown in Wedel 2018.

      * Please indicate which datasets are used for each panel in Suppl. Fig. S4 (e.g., Wedel et al., Maree et al.), and avoid calling data from different labs "replicates" unless they are true replicates.

      In most of our analysis we used real replicated experiments. Such is the case MNase-seq data used in Figure 1, with the corresponding replicate experiments used in Figure S2; T. cruzi MNase-ChIP-seq data used in Figure 3b and 4a with the respective replicate used in Figures S4 and S5 (now S6 in the revised manuscript). The only case in which we used experiments coming from two different laboratories, is in the case of MNase-ChIP-seq for H3 from T. brucei. Unfortunately, there are only two public data sets coming each of them from different laboratories. The samples used in Fig 3 (from Siegel’s lab) whether the IP from H3 represented in S4 and S5 (S6 n the updated version) comes from another lab (Patterton’s). To be more rigorous, we now call them data 1 and 2 when comparing these particular case.

      The reviewer is right that in this particular case one is native chromatin (Pattertons’) while the other one is crosslinked (Siegel’s). We have now clarified it in the main text that unfortunately we do not count on a replicate but even under both condition the result remains the same, and this is compatible with my own experience, were crosslinking does not affect the global nucleosome patterns (compared nucleosome organization from crosslinked chromatin MNAse-seq inputs Chereji, Mol Cell, 2017 doi: 10.1016/j.molcel.2016.12.009 and native MNase-seq from Ocampo, NAR, 2016 doi: 10.1093/nar/gkw068).

      * Several datasets show a sharp lower bound on fragment size in the subnucleosomal range (e.g., ~80-100 bp). Is this a filtering artifact or a gel-size selection? Clarify in Methods and, if this is an artifact, consider replotting after removing the cutoff.

      We have only filtered adapter dimmer or overrepresented sequences when needed. In Figures 2 and S3 we represented all the sequenced reads. In other figures when we sort fragments sizes in silico, such as nucleosome range, dinucleosome or subnucleosome size, we make a note in the figure legends. What the reviewer points is related to the length of the sequence DNA fragment in each experiment. As we explained above, the older data-sets were performed with 50 bp paired-end reads, the newer ones are 75, 100 or 125bp. This is information is now clarified in Table S1.

      __Results - "The TASs of single and multi-copy genes are differentially protected by nucleosomes": __

      __ __* Please include T. brucei RNA-seq data in Suppl. Fig. S5b as you did for T. cruzi.

      We have shown chromatin organization for T. brucei in S5b to show that there is a similar trend. Unfortunately, we did not get a robust list of multi-copy genes for T. brucei as we did get for T. cruzi, therefore we do not want to over conclude showing the RNA-seq for these subsets of genes. The limitation is related to the fact that UTRme restrict the search and is extremely strict when calling sites at repetitive regions.

      * Discuss how low or absent expression of multigene families affects TAS annotation (which relies on RNA-seq) and whether annotation inaccuracies could bias the observed chromatin differences.

      The mapping of occurrence and annotations that belong to repetitive regions has great complexity. UTRme is specially designed to avoid overcalling those sites. In other words, there is a chance that we could be underestimating the number of predicted TASs at multi-copy genes. Regarding the impact on chromatin analysis, we cannot rule out that it might have an impact, but the observation favors our conclusion, since even when some TASs at multi-copy genes can remain elusive, we observe more nucleosome density at those places.

      * The statement that multi-copy genes show an "oscillation" between AT and GC dinucleotides is not clearly supported: the multi-copy average appears noisier and is based on fewer loci. Please tone down this claim or provide statistical support that the pattern is periodic rather than noisy.

      We have fixed this now in the preliminary revised version

      * How were multi-copy genes defined in T. brucei? Include the classification method in Methods.

      This classification was done the same way it was explained for T. cruzi

      Genomes and annotations: * If transcriptomic data for the Y strain was used for T. cruzi, please explain why a Y strain genome was not used (e.g., Wang et al. 2021 GCA_015033655.1), or justify the choice. For T. brucei, consider the more recent Lister 427 assembly (Tb427_2018) from TriTrypDB. Use strain-matched genomes and transcriptomes when possible, or discuss limitations.

      The most appropriate way to analyze high throughput data, is to aline it to the same genome were the experiments were conducted. This was clearly illustrated in a previous publication from our group were we explained how should be analyzed data from the hybrid CL Brener strain. A common practice in the past was to use only Esmeraldo-like genome for simplicity, but this resulted in output artifacts. Therefore, we aligned it to CL Brener genome, and then focused the main analysis on the Esmeraldo haplotype (Beati Plos ONE, 2023). Ideally, we should have counted on transcriptomic data for the same strain (CL Brener or Esmeraldo). Since this was not the case at that moment, we used data from Y strain that belongs to the same DTU with Esmeraldo.

      In the case of T. brucei, when we started our analysis and the software code for UTRme was written, the previous version of the genome was available. Upon 2018 version came up, we checked chromatin parameters and observed that it did not change the main observations. Therefore, we continue working with our previous setups.

      Reproducibility and broader integration: * Please share the full analysis pipeline (ideally on GitHub/Zenodo) so the results are reproducible from raw reads to plots.

      We are preparing a full pipeline in GitHub. We will make it available before manuscript full revision

      * As an optional but helpful expansion, consider including additional datasets (other life stages, BSF MNase-seq, ATAC-seq, DRIP-seq) where available to strengthen comparative claims.

      We are now including a new suplemental figure including DRIP-seq and Rp9 ChIP-seq (revised S5). Additionally, we added a new panel c to figure 4, representing FAIRE-seq data for T. cruzi fore single and multi-copy genes

      We are working on ATAC-seq analysis and BSF MNase-seq

      Optional analyses that would strengthen the study: * Stratify single-copy genes by expression (high / medium / low) and examine average nucleosome occupancy at TASs for each group; a correlation between expression and NDR depth would strengthen the functional link to maturation.

      We have now included a panel in suplemental figure 5 (now revised S6), showing the concordance for chromatin organization of stratified genes by RNA-seq levels relative to TAS.

      __Minor / editorial comments: __ * In the Introduction, the sentence "transcription is initiated from dispersed promoters and in general they coincide with divergent strand switch regions" should be qualified: such initiation sites also include single transcription start regions.

      We have clarified this in the preliminary revised version

      * Define the dotted line in length distribution plots (if it is not the median, please clarify) and consider placing it at 147 bp across plots to ease comparison.

      The dotted line is just to indicate where the maximum peak is located. It is now clarified in figure legends.

      * In Suppl. Fig. 4b "Replicate2" the x-axis ticks are misaligned with labels - please fix.

      We have now fixed the figure. Thanks for noticing this mistake.

      * Typo in the Introduction: "remodellingremodeling" → "remodeling

      Thanks for noticing this mistake, it is fixed in the current version of the manuscript

      **Referee cross-commenting** Comment 1: I think Reviewer #2 and Reviewer #3 missed that they authors of this manuscript do cite and consider the results from Wedel at al. 2017. They even re-analysed their data (e.g. Figure 3a). I second Reviewer #2 comment indicating that the inclusion of a schematic figure to help readers visualize and better understand the findings would be an important addition.

      Comment 2: I agree with Reviewer #3 that the use of different MNase digestion procedures in the different datasets have to be considered. On the other hand, I don't think there is a problem with figure 1 showing an MNase-protected TAS for T. brucei as it is based on MNase-seq data and reproduces the reported results (Maree et al. 2017). What the Siegel lab did in Wedel et al. 2017 was MNase-ChIPseq of H3 showing nucleosome depletion at TAS, but both results are not necessary contradictory: There could still be something else (which does not contain H3) sitting on the TAS protecting it from MNase digestion.

      Reviewer #1 (Significance (Required)):

      This study provides a systematic comparative analysis of chromatin landscapes at trans-splicing acceptor sites (TASs) in trypanosomatids, an area that has been relatively underexplored. By re-analyzing and harmonizing existing MNase-seq and MNase-ChIP-seq datasets, the authors highlight conserved and divergent features of nucleosome occupancy around TASs and propose that chromatin contributes to the fidelity of transcript maturation. The significance lies in three aspects: 1. Conceptual advance: It broadens our understanding of gene regulation in organisms where transcription initiation is unusual and largely constitutive, suggesting that chromatin can still modulate post-transcriptional processes such as trans-splicing. 2. Integrative perspective: Bringing together data from T. cruzi, T. brucei and L. major provides a comparative framework that may inspire further mechanistic studies across kinetoplastids. 3. Hypothesis generation: The findings open testable avenues about the role of chromatin in coordinating transcript maturation, the contribution of DNA sequence composition, and potential interactions with R-loops or RNA-binding proteins. Researchers in parasitology, chromatin biology, and RNA processing will find it a useful resource and a stimulus for targeted experimental follow-up.

      My expertise is in gene regulation in eukaryotic parasites, with a focus on bioinformatic analysis of high-throughput sequencing data

      __Reviewer #2 (Evidence, reproducibility and clarity (Required)): __

      Siri et al. perform a comparative analysis using publicly available MNase-seq data from three trypanosomatids (T. brucei, T. cruzi, and Leishmania), showing that a similar chromatin profile is observed at TAS (trans-splicing acceptor site) regions. The original studies had already demonstrated that the nucleosome profile at TAS differs from the rest of the genome; however, this work fills an important gap in the literature by providing the most reliable cross-species comparison of nucleosome profiles among the tritryps. To achieve this, the authors applied the same computational analysis pipeline and carefully evaluated MNase digestion levels, which are known to influence nucleosome profiling outcomes.

      In my view, the main conclusion is that the profiles are indeed similar-even when comparing T. brucei and T. cruzi. This was not clear in previous studies (and even appeared contradictory, reporting nucleosome depletion versus enrichment) largely due to differences in chromatin digestion across these organisms. The manuscript could be improved with some clarifications and adjustments:

      1. The authors state from the beginning that available MNase data indicate altered nucleosome occupancy around the TAS. However, they could also emphasize that the conclusions across the different trypanosomatids are inconsistent and even contradictory: NDR in T. cruzi versus protection-in different locations-in T. brucei and Leishmania.

      We start our manuscript by referring to the first MNase-seq data sets publicly available for each TriTryp and we point that one of the main observations, in each of them, is the occurrence of a change in nucleosome density or occupancy at intergenic regions. In T. cruzi, in a previous publication from our group, we stablished that this intergenic drop in nucleosome density occurs near the trans-splicing acceptor site. In this work, we extend our study to the other members of TriTryps: T. brucei and L. major.

      In T. brucei the papers from Patterton’s lab and Siegel’s lab came out almost simultaneously in 2017. Hence, they do not comment on each other’s work. The first one claims the presence of a well-positioned nucleosome at the TAS by using MNase-seq, while the second one, shows an NDR at the TAS by using MNase-ChIP-seq. However, we do not think they are contradictory, or they have inconsistency. We brought them together along the manuscript because we think these works can provide complementary information.

      On one hand, we infer data from Pattertons lab is slightly less digested than the sample from Siegel’s lab. Therefore, we discuss that this moderate digestion must be the reason why they managed to detect an MNase protecting complex sitting at the TAS (Figure 1). On the other hand, Sigel’s lab includes an additional step by performing MNase-ChIP-seq, showing that when analyzing nucleosome size fragments, histones are not detected at the TAS. Here, we go further in this analysis on figure 3, showing that only when looking at subnucleosome-size fragments, we are able to detect histone H3. And this is also true for T. cruzi.

      By integrating every analysis in this work and the previous ones, we propose that TASs are protected by an MNase-sensitive complex (probed in Figure 2). This complex most likely is only partly formed by histones, since only when analyzing sub-nucleosomes size DNA molecules we can detect histone H3 (Figure 3). To be absolutely sure that the complex is not entirely made up by histones, future studies should perform an MNse-ChIP-seq with less digested samples. However, it was previously shown that R-loops are enriched at those intergenic NDRs (Briggs, 2018 doi: 10.1093/nar/gky928) and that R-loops have plenty of interacting proteins (Girasol, 2023 10.1093/nar/gkad836). Therefore, most likely, this MNase-sensitive complexed have a hybrid nature made up by H3 and some other regulatory molecules, possibly involved in trans-splicing. We have now added a new figure S5 showing R-loop co-localization with the NDR.

      Regarding the comparison between different organisms, after explaining the sensitivity to MNase of the TAS protecting complex, we discuss that when comparing equally digested samples T. cruzi and T. brucei display a similar chromatin landscape with a mild NDR at the TAS (See T. cruzi represented in Figure 1 compared to T. brucei represented in Intermediate digestion 2 in Figure 2, intermediate digestion in the revised manuscript). Unfortunately, we cannot make a good comparison with L. major, since we do not count on a similar level of digestion.

      Another point that requires clarification concerns what the authors mean in the introduction and discussion when they write that trypanosomes have "...poorly organized chromatin with nucleosomes that are not strikingly positioned or phased." On the other hand, they also cite evidence of organization: "...well-positioned nucleosome at the spliced-out region.. in Leishmania (ref 34)"; "...a well-positioned nucleosome at the TASs for internal genes (ref37)"; "...a nucleosome depletion was observed upstream of every gene (ref 35)." Aren't these examples of organized chromatin with at least a few phased nucleosomes? In addition, in ref 37, figure 4 shows at least two (possibly three to four) nucleosomes that appear phased. In my opinion, the authors should first define more precisely what they mean by "poorly organized chromatin" and clarify that this interpretation does not contradict the findings highlighted in the cited literature.

      For a better understanding of nucleosome positioning and phasing I recommend the review: Clark 2010 doi:10.1080/073911010010524945, Figure 4. Briefly, in a cell population there are different alternative positions that a given nucleosome can adopt. However, some are more favorable. When talking about favorable positions, we refer to the coordinates in the genome that are most likely covered by a nucleosome and are predominant in the cell population. Additionally, nucleosomes could be phased or not. This refers not only the position in the genome, but to the distance relative to a given point. In yeast, or in highly transcribed genes of more complex eukaryotes, nucleosomes are regularly spaced and phased relative to the transcription start site (TSS) or to the +1 nucleosome (Ocampo, NAR, 2016, doi:10.1093/nar/gkw068). In trypanosomes, nucleosomes have some regular distribution when making a browser inspection but, given that they are not properly phased with respect to any point, it is almost impossible to make a spacing estimation from paired-end data. This is also consistent with a chromatin that is transcribed in an almost constitutive manner.

      As the reviewer mention, we do site evidence of organization. We think the original observations are correct, but we do not fully agree with some of the original statements. In this manuscript our aim is to take the best we learned from their original works and to make a constructive contribution adding to the original discussions. In this regard, in trypanosomes there are some conserved patterns in the chromatin landscape, but their nucleosomes are far from being well-positioned or phased. For a better understanding, compare the variations observed in the y axis when representing av. nucleosome occupancy in yeast with those observed in trypanosomes and you will see that the troughs and peaks are much more prominent in yeast than the ones observed in any TryTryp member.

      Following the reviewer’s suggestion we have now clarified this in the main text

      The paper would also benefit from the inclusion of a schematic figure to help readers visualize and better understand the findings. What is the biological impact of having nucleosomes, di-nucleosomes, or sub-nucleosomes at TAS? This is not obvious to readers outside the chromatin field. For example, the following statement is not intuitive: "We observed that, when analyzing nucleosome-size (120-180 bp) DNA molecules or longer fragments (180-300 bp), the TASs of either T. cruzi or T. brucei are mostly nucleosome-depleted. However, when representing fragments smaller than a nucleosome-size (50-120 bp) some histone protection is unmasked (Fig. 3 and Fig. S4). This observation suggests that the MNase sensitive complex sitting at the TASs is at least partly composed of histones." Please clarify.

      We appreciate the reviewer’s suggestion to make a schematic figure. We are working on this and will be added to the manuscript upon final revision.

      Regarding the biological impact of having mono, di or subnucleosome fragments, it is important to unveil the fragment size of the protected DNA to infer the nature of the protecting complex. In the case of tRNA genes in yeast, at pol III promoters they found footprints smaller than a nucleosome size that ended up being TFIIB-TFIIC (Nagarajavel, doi: 10.1093/nar/gkt611). Therefore, detecting something smaller than a nucleosome might suggest the binding of trans-acting factors different than histones or involving histones in a mixed complex. These mixed complexes are also observed, and that is the case of the centromeric nucleosome which has a very peculiar composition (Ocampo and Clark, Cells Reports, 2015). On the other hand, if instead we detect bigger fragments, it could be indicative of the presence of bigger protecting molecules or that those regions are part of higher order chromatin organization still inaccessible for MNase linker digestions.

      Here we show on 2Dplots, that complex or components protecting the TAS have nucleosome size, but we cannot assure they are entirely made up by histones, since, only when looking at subnucleosome-size fragments, we are able to detect histone H3. We have now added part of this explanation to the discussion.

      By integrating every analysis in this work and the previous ones, we propose that the TAS is protected by an MNase-sensitive complex (Figure 2). This complex most likely is only partly formed by histones, since only when analyzing sub-nucleosomes size DNA molecules we can detect histone H3 (Figure 3). As explained above, to be absolutely sure that the complex is not entirely made up by histones, future studies should perform an MNse-ChIP-seq with less digested samples. However, it was previously shown that R-loops are enriched at those intergenic NDRs (Briggs 2018) and that R-loops have plenty of interacting proteins (Girasol, 2023). Therefore, most likely, this MNase-sensitive complexed have a hybrid nature made up by H3 and some other regulatory molecules. We have now added a new S5 figure showing R-loop co-localization.

      Some references are missing or incorrect:

      we will make a thorough revision

      "In trypanosomes, there are no canonical promoter regions." - please check Cordon-Obras et al. (Navarro's group). Thank you for the appropiate suggestion.

      We have now added this reference

      Please, cite the study by Wedel et al. (Siegel's group), which also performed MNase-seq analysis in T. brucei.

      We understand that reviewer number 2# missed that we cited this reference and that we did used the raw data from the manuscript of Wedel et. al 2017 form Siegel’s group. We used the MNase-ChIP-seq data set of histone H3 in our analysis for Figures 3, S4b and S5b (S6c in the revised version), also detailed in table S1. To be even more explicit we have now included the accession number of each data set in the figure legend.

      Figure-specific comments: Fig. S3: Why does the number of larger fragments increase with greater MNase digestion? Shouldn't the opposite be expected?

      This a good observation. As we also explained to reviewer#1:

      It's a common observation in MNase digestion of chromatin that more extensive digestion can still result in a broad range of fragment sizes, including some longer fragments. This seemingly counter-intuitive result is primarily due to the non-uniform accessibility of chromatin and the sequence preference of the MNase enzyme.

      The rationale of this is as follows: when you digest chromatin with MNase and the objective is to map nucleosomes genome-wide, the ideal situation would to get the whole material contained in the mononucleosome band. Given that MNase is less efficient to digest protected DNA but, if the reaction proceeds further, it always ends up destroying part of it, the result is always far from perfect. The better situation we can get, is to obtain samples were ˜80% of the material is contained in the mononucloesome band. __And here comes the main point: __even in the best scenario, you always have some additional longer bands, such as those for di or tri nucleosomes. If you keep digesting, you will get less than 80 % in the nucleosome band and, those remaining DNA fragments that use to contain di and tri nucleosomes start getting digested as well originating a bigger dispersion in fragments sizes. How do we explain persistence of Long Fragments? The longest fragments (di-, tri-nucleosomes) that persist in a highly digested sample are the ones that were originally most highly protected by proteins or higher-order structure, making their linker DNA extremely resistant to initial cleavage. Once the majority of the genome is fragmented, these few resistant longer fragments become a more visible component of the remaining population, contributing to a broader size dispersion. Hence, there you end up having a bigger dispersion in length distributions in the final material. Bottom line, it is not a good practice to work with under or overdigested samples. Our main point is to emphasize that especially when comparing samples, it important to compare those with comparable levels of digestion. Otherwise, a different sampling of the genome will be represented in the remaining sequenced DNA Fig. S5B: Why not use MNase conditions under which T. cruzi and T. brucei display comparable profiles at TAS? This would facilitate interpretation.

      The reviewer made a reasonable observation. The reason why we used MNase-ChIP_seq instead of just MNase to test occupancy at TAS at the subsets of genes, is because we intended to be more certain if we were talking about the presence of histones or something else. By using IP for histone H3 we can see that at multi-copy genes this protein is present when looking at nucleosome-size fragments. Additionally, as shown in figure S4b, length distribution histograms are also similar for the compared IPs.

      Minor points:

      There are several typos throughout the manuscript.

      Thanks for the observation. We will check carefully.

      Methods: "Dinucelotide frecuency calculation."

      We will add a code in GitHub

      Reviewer #2 (Significance (Required)):

      In my view, the main conclusion is that the profiles are indeed similar-even when comparing T. brucei and T. cruzi. This was not clear in previous studies (and even appeared contradictory, reporting nucleosome depletion versus enrichment) largely due to differences in chromatin digestion across these organisms. Audience: basic science and specialized readers.

      Expertise: epigenetics and gene expression in trypanosomatids.

      __Reviewer #3 (Evidence, reproducibility and clarity (Required)): __

      The authors analysed publicly accessible MNase-seq data in TriTryps parasites, focusing on the chromatin structure around trans-splicing acceptor sites (TASs), which are vital for processing gene transcripts. They describe a mild nucleosome depletion at the TAS of T. cruzi and L. major, whereas a histone-containing complex protects the TASs of T. brucei. In the subsequent analysis of T. brucei, they suggest that a Mnase-sensitive complex is localised at the TASs. For single-copy versus multi-copy genes, the authors show different di-nucleotide patterns and chromatin structures. Accordingly, they propose this difference could be a novel mechanism to ensure the accuracy of trans-splicing in these parasites.

      Before providing an in- depth review of the manuscript, I note that some missing information would have helped in assessing the study more thoroughly; however, in the light of the available information, I provide the following comments for consideration.

      The numbering of the figures, including the figure legends, is missing in the PDF file. This is essential for assessing the provided information.

      We apologized for not including the figure numbers in the main text, although they are located in the right place when called in the text. The omission was unwillingly made when figure legends were moved to the bottom of the main text. This is now fixed in the updated version of the manuscript.

      The publicly available Mnase- seq data are manyfold, with multiple datasets available for T. cruzi, for example. It is unclear from the manuscript which dataset was used for which figure. This must be clarified.

      This was detailed in Table S1. We have now replaced the table by an improved version, and we have also included the accession number of each data set used in the figure legends.

      Why do the authors start in figure 1 with the description of an MNase- protected TAS for T.brucei, given that it has been clearly shown by the Siegel lab that there is a nucleosome depletion similar to other parasites?

      We did not want to ignore the paper from Patterton’s lab because it was the first one to map nucleosomes genome-wide in T. brucei and the main finding of that paper claimed the existence of a well-positioned nucleosome at intergenic regions, what we though constitutes a point worth to be discussed. While Patterton’s work use MNase-seq from gel-purified samples and provides replicated experiments sequenced in really good depth; Siegel’s lab uses MNase-ChIP-seq of histone H3 but performs only one experiment and its input was not sequenced. So, each work has its own caveats and provides different information that together contributes to make a more comprehensive study. We think that bringing up both data sets to the discussion, as we have done in Figures 1 and 3, helps us and the community working in the field to enrich the discussion.

      If the authors re- analyse the data, they should compare their pipeline to those used in the other studies, highlighting differences and potential improvements.

      We are working on this point. We will provide a more detail description in the final revision.

      Since many figures resemble those in already published studies, there seems little reason to repeat and compare without a detailed comparison of the pipelines and their differences.

      Following the reviewer advice, we are now working on highlighting the main differences that justify analyzing the data the way we did and will be added in the finally revised method section.

      At a first glance, some of the figures might look similar when looking at the original manuscripts comparing with ours. However, with a careful and detailed reading of our manuscripts you can notice that we have added several analyses that allow to unveil information that was not disclosed before.

      First, we perform a systematic comparison analyzing every data set the same way from beginning to end, being the main difference with previous studies the thorough and precise prediction of TAS for the three organisms. Second, we represent the average chromatin organization relative to those predicted TASs for TriTryps and discuss their global patterns. Third, by representing the average chromatin into heatmaps, we show for the very first time, that those average nucleosome landscape are not just an average, they keep a similar organization in most of the genome. These was not done in any of the previous manuscripts except for our own (Beati, PLOS One 2023). Additionally, we introduce the discussion of how the extension of MNase reaction can affect the output of these experiments and we show 2D-plots and length distribution heatmaps to discuss this point (a point completely ignored in all the chromatin literature for trypanosomes). Furthermore, we made a far-reaching analysis by considering the contributions of each publish work even when addressed by different techniques. Finally, we discuss our findings in the context of a topic of current interest in the field, such as TriTryp’s genome compartmentalization.

      Several previous Mnase- seq analysis studies addressing chromatin accessibility emphasized the importance of using varying degrees of chromatin digestion, from low to high digestion (30496478, 38959309, 27151365).

      The reviewer is correct, and this point is exactly what we intended to illustrate in figure number 2. We appreciate he/she suggests these references that we are now citing in the final discussion. Just to clarify, using varying degrees of chromatin digestion is useful to make conclusions about a given organism but when comparing samples, strains, histone marks, etc. It is extremely important to do it upon selection of similar digested samples.

      No information on the extent of DNA hydrolysis is provided in the original Mnase- seq studies. This key information can not be inferred from the length distribution of the sequenced reads.

      The reviewer is correct that “No information on the extent of DNA hydrolysis is provided in the original Mnase-seq studies” and this is another reason why our analysis is so important to be published and discussed by the scientific community working in trypanosomes. We disagree with the reviewer in the second statement, since the level of digestion of a sequenced sample is actually tested by representing the length distribution of the total DNA sequenced. It is true that before sequencing you can, and should, check the level of digestion of the purified samples in an agarose gel and/or in a bioanalyzer. It could be also tested after library preparation, but before sequencing, expecting to observe the samples sizes incremented in size by the addition of the library adapters. But, the final test of success when working with MNase digested samples is to analyze length of DNA molecules by representing the histograms with length distribution of the sequenced DNA molecules. Remarkably, on occasions different samples might look very similar when run in a gel, but they render different length distribution histograms and this is because the nucleosome core could be intact but they might have suffered a differential trimming of the linker DNA associated to it or even be chewed inside (see Cole Hope 2011, section 5.2, doi: 10.1016/B978-0-12-391938-0.00006-9, for a detailed explanation).

      As the input material are selected, in part gel- purified mono- nucleosomal DNA bands. Furthermore the datasets are not directly comparable, as some use native MNase, while others employ MNase after crosslinking; some involve short digestion times at 37 {degree sign} C, while others involve longer digestion at lower temperatures. Combining these datasets to support the idea of an MNase- sensitive complex at the TAS of T. brucei therefore may not be appropriate, and additional experiments using consistent methodologies would strengthen the study's conclusions.

      In my opinion, describing an MNase- sensitive complex based solely on these data is not feasible. It requires specifically designed experiments using a consistent method and well- defined MNase digestion kinetics.

      As the reviewer suggests, the ideal experiment would be to perform a time course of MNase reaction with all the samples in parallel, or to work with a fix time point adding increasing amounts of MNase. However, the information obtained from the detail analysis of the length distribution histogram of sequenced DNA molecules the best test of the real outcome. In fact, those samples with different digestion levels were probably not generated on purpose.

      The only data sets that were gel purified are those from Mareé 2017 (Patterton’s lab), used in Figures 1, S1 and S2 and those from L. major shown in Fig 1. It was a common practice during those years, then we learned that is not necessary to gel purify, since we can sort fragment sizes later in silico when needed.

      As we explained to reviewer #1, to avoid this conflict, we decided to remove this data from figures 2 and S3. In summary, the 3 remaining samples comes from the same lab, and belong to the same publication (Mareé 2022). These sample are the inputs of native MNase ChIp-seq, obtain the same way, totally comparable among each other.

      Reviewer #3 (Significance (Required)):

      Due to the lack of controlled MNase digestion, use of heterogeneous datasets, and absence of benchmarking against previous studies, the conclusions regarding MNase-sensitive complexes and their functional significance remain speculative. With standardized MNase digestion and clearly annotated datasets, this study could provide a valuable contribution to understanding chromatin regulation in TriTryps parasites.

      As we have explained in the previous point our conclusions are valid since we do not compare in any figure samples coming from different treatments. The only exception to this comment could be in figure 3 when talking about MNase-ChIP-seq. We have now added a clear and explicit comment in the section and the discussion that despite having subtle differences in experimental procedures we arrive to the same results. This is the case for T. cruzi IP, run from crosslinked chromatin, compared to T. brucei’s IP, run from native chromatin.

      Along the years it was observed in the chromatin field that nucleosomes are so tightly bound to DNA that crosslinking is not necessary. However, it is still a common practice specially when performing IPs. In our own hands, we did not observe any difference at the global level neither in T. cruzi or in my previous work with yeast.

      ...

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This study explores chromatin organization around trans-splicing acceptor sites (TASs) in the trypanosomatid parasites Trypanosoma cruzi, T. brucei and Leishmania major. By systematically re-analyzing MNase-seq and MNase-ChIP-seq datasets, the authors conclude that TASs are protected by an MNase-sensitive complex that is, at least in part, histone-based, and that single-copy and multi-copy genes display differential chromatin accessibility. Altogether, the data suggest a common chromatin landscape at TASs and imply that chromatin may modulate transcript maturation, adding a new regulatory layer to an unusual gene-expression system.

      I value integrative studies of this kind and appreciate the careful, consistent data analysis the authors implemented to extract novel insights. That said, several aspects require clarification or revision before the conclusions can be robustly supported. My main concerns are listed below, organized by topic/result section.

      TAS prediction:

      • Why were TAS predictions derived only from insect-stage RNA-seq data? Restricting TAS calls to one life stage risks biasing predictions toward transcripts that are highly expressed in that stage and may reduce annotation accuracy for lowly expressed or stage-specific genes. Please justify this choice and, if possible, evaluate TAS robustness using additional transcriptomes or explicitly state the limitation.

      Results

      • "There is a distinctive average nucleosome arrangement at the TASs in TriTryps":
      • You state that "In the case of L. major the samples are less digested." However, Supplementary Fig. S1 suggests that replicate 1 of L. major is less digested than the T. brucei samples, while replicate 2 of L. major looks similarly digested. Please clarify which replicates you reference and correct the statement if needed.
      • It appears you plot one replicate in Fig. 1b and the other in Suppl. Fig. S2. Please indicate explicitly which replicate is in each plot. For T. brucei, the NDR upstream of the TAS is clearer in Suppl. Fig. S2 while the TAS protection is less prominent; based on your digestion argument, this should correspond to the more-digested replicate. Please confirm. The protected region around the TAS appears centered on the TAS in T. brucei but upstream in L. major. This is an interesting difference. If it is technical (different digestion or TAS prediction offset), explain why; if likely biological, discuss possible mechanisms and implications.

      Results

      • "An MNase sensitive complex occupies the TASs in T. brucei":
      • The definition of "MNase activity" and the ordering of samples into Low/Intermediate/High digestion are unclear. Did you infer digestion levels from fragment distributions rather than from controlled experimental timepoints? In Suppl. Fig. S3a it is not obvious how "Low digestion" was defined; that sample's fragment distribution appears intermediate. Please provide objective metrics (e.g., median fragment length, fraction 120-180 bp) used to classify digestion levels.
      • Several fragment distributions show a sharp cutoff at ~100-125 bp. Was this due to gel purification or bioinformatic filtering? State this clearly in Methods. If gel purification occurred, that can explain why some datasets preserve the MNase-sensitive region.
      • Please reconcile cases where samples labeled as more-digested contain a larger proportion of >200 bp fragments than supposedly less-digested samples; this ordering affects the inference that digestion level determines the loss/preservation of TAS protection. Based on the distributions I see, "Intermediate digestion 1" appears most consistent with an expected MNase curve - please confirm and correct the manuscript accordingly. Results - "The MNase sensitive complexes protecting the TASs in T. brucei and T. cruzi are at least partly composed of histones":
      • The evidence that histones are part of the MNase-sensitive complex relies on H3 MNase-ChIP signal in subnucleosomal fragment bins. This seems to conflict with the observation (Fig. 1) that fragments protecting TASs are often nucleosome-sized. Please reconcile these points: are H3 signals confined to subnucleosomal fragments flanking the TAS while the TAS itself is depleted of H3? Provide plots that compare MNase-seq and H3 ChIP signals stratified by consistent fragment-size bins to clarify this.
      • Please indicate which datasets are used for each panel in Suppl. Fig. S4 (e.g., Wedel et al., Maree et al.), and avoid calling data from different labs "replicates" unless they are true replicates.
      • Several datasets show a sharp lower bound on fragment size in the subnucleosomal range (e.g., ~80-100 bp). Is this a filtering artifact or a gel-size selection? Clarify in Methods and, if this is an artifact, consider replotting after removing the cutoff. Results - "The TASs of single and multi-copy genes are differentially protected by nucleosomes":
      • Please include T. brucei RNA-seq data in Suppl. Fig. S5b as you did for T. cruzi.
      • Discuss how low or absent expression of multigene families affects TAS annotation (which relies on RNA-seq) and whether annotation inaccuracies could bias the observed chromatin differences.
      • The statement that multi-copy genes show an "oscillation" between AT and GC dinucleotides is not clearly supported: the multi-copy average appears noisier and is based on fewer loci. Please tone down this claim or provide statistical support that the pattern is periodic rather than noisy.
      • How were multi-copy genes defined in T. brucei? Include the classification method in Methods.

      Genomes and annotations:

      • If transcriptomic data for the Y strain was used for T. cruzi, please explain why a Y strain genome was not used (e.g., Wang et al. 2021 GCA_015033655.1), or justify the choice. For T. brucei, consider the more recent Lister 427 assembly (Tb427_2018) from TriTrypDB. Use strain-matched genomes and transcriptomes when possible, or discuss limitations.

      Reproducibility and broader integration:

      • Please share the full analysis pipeline (ideally on GitHub/Zenodo) so the results are reproducible from raw reads to plots.
      • As an optional but helpful expansion, consider including additional datasets (other life stages, BSF MNase-seq, ATAC-seq, DRIP-seq) where available to strengthen comparative claims. Optional analyses that would strengthen the study:
      • Stratify single-copy genes by expression (high / medium / low) and examine average nucleosome occupancy at TASs for each group; a correlation between expression and NDR depth would strengthen the functional link to maturation.

      Minor / editorial comments:

      • In the Introduction, the sentence "transcription is initiated from dispersed promoters and in general they coincide with divergent strand switch regions" should be qualified: such initiation sites also include single transcription start regions.
      • Define the dotted line in length distribution plots (if it is not the median, please clarify) and consider placing it at 147 bp across plots to ease comparison.
      • In Suppl. Fig. 4b "Replicate2" the x-axis ticks are misaligned with labels - please fix.
      • Typo in the Introduction: "remodellingremodeling" → "remodeling."

      Referee cross-commenting

      Comment 1: I think Reviewer #2 and Reviewer #3 missed that they authors of this manuscript do cite and consider the results from Wedel at al. 2017. They even re-analysed their data (e.g. Figure 3a). I second Reviewer #2 comment indicating that the inclusion of a schematic figure to help readers visualize and better understand the findings would be an important addition.

      Comment 2: I agree with Reviewer #3 that the use of different MNase digestion procedures in the different datasets have to be considered. On the other hand, I don't think there is a problem with figure 1 showing an MNase-protected TAS for T. brucei as it is based on MNase-seq data and reproduces the reported results (Maree et al. 2017). What the Siegel lab did in Wedel et al. 2017 was MNase-ChIPseq of H3 showing nucleosome depletion at TAS, but both results are not necessary contradictory: There could still be something else (which does not contain H3) sitting on the TAS protecting it from MNase digestion.

      Significance

      This study provides a systematic comparative analysis of chromatin landscapes at trans-splicing acceptor sites (TASs) in trypanosomatids, an area that has been relatively underexplored. By re-analyzing and harmonizing existing MNase-seq and MNase-ChIP-seq datasets, the authors highlight conserved and divergent features of nucleosome occupancy around TASs and propose that chromatin contributes to the fidelity of transcript maturation.

      The significance lies in three aspects:

      1. Conceptual advance: It broadens our understanding of gene regulation in organisms where transcription initiation is unusual and largely constitutive, suggesting that chromatin can still modulate post-transcriptional processes such as trans-splicing.
      2. Integrative perspective: Bringing together data from T. cruzi, T. brucei and L. major provides a comparative framework that may inspire further mechanistic studies across kinetoplastids.
      3. Hypothesis generation: The findings open testable avenues about the role of chromatin in coordinating transcript maturation, the contribution of DNA sequence composition, and potential interactions with R-loops or RNA-binding proteins. Researchers in parasitology, chromatin biology, and RNA processing will find it a useful resource and a stimulus for targeted experimental follow-up.

      My expertise is in gene regulation in eukaryotic parasites, with a focus on bioinformatic analysis of high-throughput sequencing data

    1. Does it feel like the typebars are catching in the guide, or something binding deeper in the machine? If it's the guide (this is going to sound crazy), grasp the top of the typebar and sort of wiggle it side to side in the segment. Not overly forceful as you don't want to bend the segment slots, but just enough to allow the spring steel to realign. Test, and adjust if needed. It sounds kind of unhinged, but this is the fix for most American made machines that have bars getting stuck in the guide. I've done this with Royals, Coronas, and Underwoods in front of clients before and they look at me like I'm an idiot wizard. Ha If the top of the bar is seriously bent this won't work and you'll need a repair person to use some side alignment pliers, but if the bar is just tweaked it usually works very well with a bit of practice. My unsubstantiated belief of why this occurs is US companies using the same grade(if not the same company) of steel in their bars that tend to be a little softer than their European competitors. *You DON'T want to try this with an Olympia or other German made machines. 😅 If it's coming from deeper in the machine, check the linkages to make sure they're not tweaked and binding against each other. Hope this helps!

      advice via Nashville Typewriter, a repair person. <br /> https://reddit.com/r/typewriters/comments/1o4qxvn/chasing_problems_stuck_keys/

    1. The rotational eigenfunctions and energy levels of a molecule for which all three principal moments of inertia are distinct (a asymmetric top) can not easily be expressed in terms of the angular momentum eigenstates and the J, M, and K quantum numbers. However, given the three principal moments of inertia Ia, Ib, and Ic, a matrix representation of each of the three contributions to the general rotational Hamiltonian in Equation 4.3.5 can be formed within a basis set of the {|J,M,K⟩} rotation matrix functions. This matrix will not be diagonal because the |J,M,K⟩ functions are not eigenfunctions of the asymmetric top Hr⁢o⁢t. However, the matrix can be formed in this basis and subsequently brought to diagonal form by finding its eigenvectors {C n, J,M,K } and its eigenvalues {En}. The vector coefficients express the asymmetric top eigenstates as ψn⁡(θ,φ,χ)=∑J,M,KCn,J,M,K|J,M,K⟩ Because the total angular momentum J2 still commutes with Hr⁢o⁢t, each such eigenstate will contain only one J-value, and hence Ψn can also be labeled by a J quantum number: ψn,J⁡(θ,φ,χ)=∑M,KCn,J,M,K|J,M,K⟩ To form the only non-zero matrix elements of Hr⁢o⁢t within the |J,M,K⟩ basis, one can use the following properties of the rotation-matrix functions: ⟨j,⟩=⟨j,⟩=1/2<j,⟩=h⁡2⁢[J⁢(J+1)−K⁢2], ⟨j,⟩=h2⁡K2 ⟨j⟩=−⟨j⟩=h2⁡[J⁢(J+1)−K⁢(K±1)]⁢1/2⁢[J⁢(J+1)−(K±1)⁢(K±2)]⁢1/2⁢⟨j⟩=0 Each of the elements of Jc2, Ja2, and Jb2 must, of course, be multiplied, respectively, by 1/2⁢Ic, 1/2⁢Ia, and 1/2⁢Ib and summed together to form the matrix representation of Hr⁢o⁢t. The diagonalization of this matrix then provides the asymmetric top energies and wavefunctions.

      Should be rewritten to make clear degeneracy due to space frame of Mj quantum number.

  7. inst-fs-iad-prod.inscloudgate.net inst-fs-iad-prod.inscloudgate.net
    1. Mecklenburg School Dis-trict in North Carolina, Roslyn Mickelson compared the placements of Black and White high school students who had similar scores on a na-tional standardized achievement test they took in the sixth grade. More than half of the White students who scored in the ninetieth to ninety-ninth percentile on the test were enrolled in high school Advanced Placement (AP) or International Baccalaureate (IB) English, while only 20 percent of the Black students who also scored in the ninetieth to ninety-ninth percentile were enrolled in these more-rigorous courses. Meanwhile, 35 percent of White students whose test scores were below the seventieth percentile were taking AP or IB English.

      As a poli sci major ive gone through study after study that highlights inequalities at a systematic level and it’s in a way amazing to see the lengths that these inequalities can go in order to make sure communities of color are less likely to grow than white communities

    1. Ahora bien, las propuestas presentadas tienen diferencias importantes. En primer lugar, con ELSOC cuenta con información que permite observar con mayores niveles de granularidad y, consecuentemente, identificar con mayor precisión las dimensiones y subdimensiones de la cohesión social en comparación con la propuesta de que se construye para Latinoamérica con LAPOP, cuya perspectiva es más minimalista. Por su parte, en la versión con ELSOC se consideran varias subdimensiones que no están presentes en el índice de Latinoamérica debido a decisiones basadas en inconsistencia teórica y/o estadística, tal como el factor de comportamiento prosocial, el cual intentó integrarse en con datos de WVS y, si bien la consistencia interna del factor era aceptable, no correlacionaba con los demás factores de su dimensión. Finalmente, hay que decir que las propuestas no comparten ningún factor de sus subdimensiones. Si bien ambas integran el índice confianza interpersonal en su medición, ELSOC la considera como un factor de segundo orden, mientras que en la propuesta de LAPOP es un factor de primer orden.

      Esto se va a la siguiente sección

    2. Antecedentes

      Tiene que estrucutrar el documento. Cómo está organizado: COntar que se parte hablando de la primera experiencia de medición de cohesión para américa latina, cómo se conceptualiza la cohesión horizontal; para luego plantear la emdición de cohesión de otro documento. Finalmente, medición de cohesión social longitudinal con elsoc.

    1. Author response:

      The following is the authors’ response to the original reviews.

      Reviewer #1 (Public review): 

      Summary: 

      In this manuscript, the authors describe a good-quality ancient maize genome from 15th-century Bolivia and try to link the genome characteristics to Inca influence. Overall, the manuscript is below the standard in the field. In particular, the geographic origin of the sample and its archaeological context is not well evidenced. While dating of the sample and the authentication of ancient DNA have been evidenced robustly, the downstream genetic analyses do not support the conclusion that genomic changes can be attributed to Inca influence. Furthermore, sections of the manuscript are written incoherently and with logical mistakes. In its current form, this paper is not robust and possibly of very narrow interest. 

      Strengths: 

      Technical data related to the maize sample are robust. Radiocarbon dating strongly evidenced the sample age, estimated to be around 1474 AD. Authentication of ancient DNA has been done robustly. Spontaneous C-to-T substitutions, which are present in all ancient DNA, are visible in the reported sample with the expected pattern. Despite a low fraction of C-to-T at the 1st base, this number could be consistent with the cool and dry climate in which the sample was preserved. The distribution of DNA fragment sizes is consistent with expectations for a sample of this age. 

      Weaknesses: 

      Thank you for all your thoughtful comments. See below for comments on each.

      (1) Archaeological context for the maize sample is weakly supported by speculation about the origin and has unreasonable claims weighing on it. Perhaps those findings would be more convincing if the authors were to present evidence that supports their conclusions: i) a map of all known tombs near La Paz, ii) evidence supporting the stone tomb origins of this assemblage, and iii) evidence supporting non-Inca provenance of the tomb. 

      We believe we are clear about what information we have about context.  First, the intake records from the MSU Museum from 1890 are not as detailed as we would like, but we cannot enhance them. The mummified girl and her accoutrements, including the maize, came from a stone tower or chullpa south of La Paz, in what is now Bolivia. We do not know which stone chullpa, so a map would be of limited use.  The mortuary group is identified as Inca, but as we note the accoutrements do not appear of high status, so it is possible that she is not an elite.  Mud tombs are normally attributed to the local population, and stone towers to Inca or elites. We have clarified at multiple places in the text that the maize is from the period of Inca incursion in this part of Bolivia and have modified text to reflect greater uncertainty of Inca or local origin, but that selection for environmentally favorable characteristics had taken place.  Regardless, there are three 15th c CE or AD AMS ages on the maize, a cucurbita rind, and a camelid fiber.  The maize is almost certainly mid to late 15th century CE.

      (2) Dismissal of the admixture in the reported samples is not evidenced correctly. Population f3 statistic with an outgroup is indeed one of the most robust metrics for sample relatedness; however, it should not be used as a test of admixture. For an admixture test, the population f3 statistic should be used in the form: i) target population, ii) one possible parental population, iii) another possible parental population. This is typically done iteratively with all combinations of possible parental populations. Even in such a form, the population f3 statistic is not very sensitive to admixture in cases of strong genetic drift, and instead population f4 statistic (with an outgroup) is a recommended test for admixture. 

      We have removed “Our admixture f3-statistics test results suggest aBM is not admixed” in our revised manuscript. Since our goal here is to identify which group(s) has(have) the highest relatedness with aBM, so population f3 statistic with an outgroup is the most robust metric to do the test and to support our conclusion here.

      (3) The geographic placement of the sample based on genetic data is not robust. To make use of the method correctly, it would be necessary to validate that genetic samples in this region follow the assumption of the 'isolation-by-distance' with dense sampling, which has not been done. Additionally, the authors posit that "This suggests that aBM might not only be genetically related to the archaeological maize from ancient Peru, but also in the possible geographic location." The method used to infer the location is based on pure genetic estimation. The above conclusion is not supported by this method, and it directly contradicts the authors' suggestion that the sample comes from Bolivia.  

      We understood that it is necessary to validate the assumption of the 'isolation-by-distance' with dense sampling. But we did not do it because: 1) the ancient maize age ranges from ~5000BP to ~100BP and they were found in very different countries at different times. 2) isolation-by-distance is a population genetic concept and it's often used to test whether populations that are geographically farther apart are also more genetically different. Considering we only have 17 ancient samples in total our sample size is not sufficient for a big population test.

      For "It directly contradicts the authors' suggestion that the sample comes from Bolivia.”, as we described in our manuscript that “Given the provenience of the aBM and its age, it is possible the samples were local or alternatively were introduced into western highland Bolivia from the Inca core area – modern Peru.” The sample recording file did show the aBM sample was found in Bolivia, but we do not know where aBM originally came from before it was found in Bolivia. To answer this question, we used locator.py to predict the potential geographic location that aBM may have originally come from, and our results showed that the predicted location is inside of modern Peru and is also very close to archaeological Peruvian maize.  

      Therefore, our conclusion that "This suggests that aBM might not only be genetically related to the archaeological maize from ancient Peru, but also in the possible geographic location” does not contradict that the sample was found Bolivia.

      (4) The conclusion that Ancient Andean maize is genetically similar to European varieties and hence shares a similar evolutionary history is not well supported. The PCA plot in Figure 4 merely represents sample similarity based on two components (jointly responsible for about 20% of the variation explained), and European samples could be very distant based on other components. Indeed, the direct test using the outgroup f3 statistic does not support that European varieties are particularly closely related to ancient Andean maize. Perhaps these are more closely related to Brazil? We do not know, as this has not been measured. 

      Our conclusion is “We also found that a few types of maize from Europe have a much closer distance to the archaeological maize cluster compared to other modern maize, which indicates maize from Europe might expectedly share certain traits or evolutionary characteristics with ancient maize. It is also consistent with the historical fact that maize spread to Europe after Christopher Columbus's late 15th century voyages to the Americas. But as shown, maize also has diversity inside the European maize cluster. It is possible that European farmers and merchants may have favored different phenotypic traits, and the subsequent spread of specific varieties followed the new global geopolitical maps of the Colonial era”.

      We understood your concerns that two components only explain about 20% of the variation. But as you can see from the Figure 2b in Grzybowski, M.W. et al., 2023 publication, it described that “the first principal component (PC1) of variation for genetic marker data roughly corresponded to the division between domesticated maize and maize wild relatives is only 1.3%”. It shows this is quite common in maize, especially when the datasets include landraces, hybrids, and wild relatives. For our maize dataset, we have archaeological maize data ranging from ~5,000BP to ~100BP, and we also have modern maize, which makes the genetic structure of our data more complicated. Therefore, we think our two components are currently the best explanation currently possible. We also included PCA plot based on component 1 and 3 in Fig4_PCA13.pdf. It does not show that the European samples are very distant.

      For “Perhaps these are more closely related to Brazil?”, thank you for this very good question, but we apologize that we cannot answer this question from our current study because our study focuses on identifying the location where aBM originally came from, establishing and explaining patterns of genetic variability of maize, with a specific focus on maize strains that are related to our current aBM. Thus, we will not explore the story between maize from Brazil and European maize in our current study.

      (5) The conclusion that long branches in the phylogenetic tree are due to selection under local adaptation has no evidence. Long branches could be the result of missing data, nucleotide misincorporations, genetic drift, or simply due to the inability of phylogenetic trees to model complex population-level relationships such as admixture or incomplete lineage sorting. Additionally, captions to Figure S3, do not explain colour-coding.  

      We have removed “aBM tends to have long branches compare to tropicalis maize, which can be explained by adaption for specific local environment by time.” in our revised manuscript.

      We have added the color-coding information under Fig. S3 in our revised manuscript.

      (6) The conclusion that selection detected in aBM sample is due to Inca influence has no support. Firstly, selection signature can be due to environmental or other factors. To disentangle those, the authors would need to generate the data for a large number of samples from similar cultural contexts and from a wide-ranging environmental context, followed by a formal statistical test. Secondly, allele frequency increase can be attributed to selection or demographic processes, and alone is not sufficient evidence for selection. The presented XP-EHH method seems more suitable. Overall, methods used in this paper raise some concerns: i) how accurate are allele-frequency tests of selection when only single individual is used as a proxy for a whole population, ii) the significance threshold has been arbitrary fixed to an absolute number based on other studies, but the standard is to use, for example, top fifth percentile. Finally, linking selection to particular GO terms is not strong evidence, as correlation does not imply causation, and links are unclear anyway. 

      In sum, this manuscript presents new data that seems to be of high quality, but the analyses are frequently inappropriate and/or over-interpreted. 

      Regarding your suggestion that “from similar cultural contexts and from a wide-ranging environmental context, followed by a formal statistical test”, we apologize that this cannot be done in our current study because we could not find other archaeological maize samples/datasets that are from similar cultural contexts.

      For “Secondly, allele frequency increase can be attributed to selection or demographic processes, and alone is not sufficient evidence for selection.” Yes, we agree, and that’s why we said it “inferred” the conclusion instead of “indicated”. Furthermore, we revised the whole manuscript following all reviewers’ comments and reorganized and reduced the part on selection on aBM.

      For “The presented XP-EHH method seems more suitable”, we do not think XP-EHH is the best method that could be used here because we only have one aBM sample, but XP-EHH is more suitable for a population analysis.

      For “Finally, linking selection to particular GO terms is not strong evidence, as correlation does not imply causation, and links are unclear anyway.”, as we described in our manuscript, our results “inferred” instead of “indicated” the conclusion.

      Reviewer #2 (Public review): 

      Summary: 

      The manuscript presents valuable new datasets from two ancient maize seeds that contribute to our growing understanding of the maize evolution and biodiversity landscape in pre-colonial South America. Some of the analyses are robust, but the selection elements are not supported. 

      Strengths: 

      The data collection is robust, and the data appear to be of sufficiently high quality to carry out some interesting analytical procedures. The central finding that aBM maize is closely related to maize from the core Inca region is well supported, although the directionality of dispersal is not supported. 

      Weaknesses: 

      Thank you for your comments and suggestions. See below for responses and explanations.

      The selection results are not justified, see examples in the detailed comments below. 

      (1) The manuscript mentions cultural and natural selection (line 76), but then only gives a couple of examples of selecting for culinary/use traits. There are many examples of selection to tolerate diverse environments that could be relevant for this discussion, if desired. 

      We have added related examples with references supported in our revised manuscript.  

      (2) I would be extremely cautious about interpreting the observations of a Spanish colonizer (lines 95-99) without very significant caveats. Indigenous agriculture and food ways would have been far more nuanced than what could be captured in this context, and the genocidal activities of the Europeans would have impacted food production activities to a degree, and any contemporaneous accounts need to be understood through that lens.  

      We agree with the first part of this comment and have softened our use of this particular textual material such that it is far less central to interpretation.While of interest, we cannot evaluate the impact of colonial European activities or observational bias for purposes of this analysis.

      (3) The f3 stats presented in Figure 2 are not set up to test any specific admixture scenarios, so it is unsupported to conclude that the aBM maize is not admixed on this basis (lines 201-202). The original f3 publication (Patterson et al, 2012) describes some scenarios where f3 characteristics associate with admixture, but in general, there are many caveats to this approach, and it's not the ideal tool for admixture testing, compared with e.g., f4 and D (abba-baba) statistics.  

      You make an important point that f3 stats is not the ideal tool for admixture testing. Since our study goal here is to identify which group(s) has(have) the highest relatedness with aBM, the population f3 statistic with an outgroup is the most robust metrics with which to do the test and to support our conclusion here. We have removed the “Our admixture f3-statistics test results suggest aBM is not admixed” in our revised manuscript.

      (4) I'm a little bit skeptical that the Locator method adds value here, given the small training sample size and the wide geographic spread and genetic diversity of the ancient samples that include Central America. The paper describing that method (Battey et al 2020 eLife) uses much larger datasets, and while the authors do not specifically advise on sample sizes, they caution about small sample size issues. We have already seen that the ancient Peruvian maize has the most shared drift with aBM maize on the basis of the f3 stats, and the Locator analysis seems to just be reiterating that. I would advise against putting any additional weight on the Locator results as far as geographic origins, and personally I would skip this analysis in this case.  

      As we described in our manuscript, we have 17 archaeological samples in total. Please find more detailed information from the “geographical location prediction” section.

      We cannot add more ancient samples because they are all that we could find from all previous publications. We may still want to keep this analysis because f3 stats indicates the genome similarity, but the purpose of locator.py analysis is indicating the predicted location of origin of a genetic sample by comparing it to a set of samples of known geographic origin. 

      (5) The overlap in PCA should not be used to confirm that aBM is authentically ancient, because with proper data handling, PCA placement should be agnostic to modern/ancient status (see lines 224-226). It is somewhat unexpected that the ancient Tehuacan maize (with a major teosinte genomic component) falls near the ancient South American maize, but this could be an artifact of sampling throughout the PCA and the lack of teosinte samples that might attract that individual.  

      We have removed “which supports the authenticity of aBM as archaeological maize” in our revised manuscript. The PCA was only applied for all maize samples, so we did not include any teosinte samples in the analysis.

      (6) What has been established (lines 250-251) is genetic similarity to the Inca core area, not necessarily the directionality. Might aBM have been part of a cultural region supplying maize to the Inca core region, for example? Without a specific test of dispersal directionality, which I don't think is possible with the data at hand, this is somewhat speculative. 

      We added this and re-wrote this part in our revised manuscript.

      (7) Singleton SNPs are not a typical criterion for identifying selection; this method needs some citations supporting the exact approach and validation against neutral expectations (line 278). Without Datasets S2 and S3, which are not included with this submission, it is difficult to assess this result further. However, it is very unexpected that ~18,000 out of ~49,000 SNPs would be unique to the aBM lineage. This most likely reflects some data artifact (unaccounted damage, paralogs not treated for high coverage, which are extremely prevalent in maize, etc). I'm confused about unique SNPs in this context. How can they be unique to the aBM lineage if the SNPs used overlap the Grzybowski set? The GO results do not include any details of the exact method used or a statistical assessment of the results. It is not clear if the GO terms noted are statistically enriched.  

      We have added references 53 and 54 in our revised manuscript, and we also uploaded the Datasets S2 and S3.

      For “I'm confused about unique SNPs in this context. How can they be unique to the aBM lineage if the SNPs used overlap the Grzybowski set?”, as we described in our materials and method part that “To achieve potential unique selection on aBM, we calculated the allele frequency for each SNPs between aBM and other archaeological maize, resulting in allele frequency data for 49,896 SNPs. Of these,18,668 SNPs were unique to aBM.”  Thus, the unique SNPs for aBM came from the comparison between aBM with other archaeological maize, and we did not use any modern maize data from the Grzybowski set.

      For “The GO results do not include any details of the exact method used or a statistical assessment of the results. It is not clear if the GO terms noted are statistically enriched.” We did not do GO Term enrichment, so there are no statistical assessments for the results. What we have done was we retained the GO Terms information for each gene by checking their biological process from MaizeGDB, after that, we summarized the results in Dataset S4.

      (8) The use of XP-EHH with pseudo haplotype variant calls is not viable (line 293). It is not clear what exact implementation of XP-EHH was used, but this method generally relies on phased or sometimes unphased diploid genotype calls to observe shared haplotypes, and some minimum population size to derive statistical power. No implementation of XP-EHH to my knowledge is appropriate for application to this kind of dataset. 

      We used the same XP-EHH as this publication “Sabeti, P.C. et al. Genome-wide detection and characterization of positive selection in human populations. Nature 449, 913-918 (2007).” Specifically in our analysis, the SNP information of modern maize was compared with ancient maize. The code is available in https://doi.org/10.5061/dryad.w6m905qtd.

      XP-EHH is a statistical method used in population genetics to detect recent positive selection in one population compared to another, and it often applied in modern large maize populations in previous research. In our study, we wanted to detect recent positive selection in modern maize compared to ancient maize, thus, we applied XP-EHH here. Although the population size of ancient maize is not big, it is the best method that we can apply for our dataset here to detect recent selection on modern maize.

      Reviewer #3 (Public review): 

      Summary: 

      The authors seek to place archaeological maize samples (2 kernels) from Bolivia into genetic and geographical context and to assess signatures of selection. The kernels were dated to the end of the Incan empire, just prior to European colonization. Genetic data and analyses were used to characterize the distance from other ancient and modern maize samples and to predict the origin of the sample, which was discovered in a tomb near La Paz, Bolivia. Given the conquest of this region by the Incan empire, it is possible that the sample could be genetically similar to populations of maize in Peru, the center of the Incan empire. Signatures of selection in the sample could help reveal various environmental variables and cultural preferences that shaped maize genetic diversity in this region at that time. 

      Strengths: 

      The authors have generated substantial genetic data from these archaeological samples and have assembled a data set of published archaeological and modern maize samples that should help to place these samples in context. The samples are dated to an interesting time in the history of South America during a period of expansion of the Incan empire and just prior to European colonization. Much could be learned from even this small set of samples. 

      Weaknesses: 

      Many thanks for your comments and suggestions.  We have addressed these below and provided further explanation.

      (1) Sample preparation and sequencing: 

      Details of the quality of the samples, including the percentage of endogenous DNA are missing from the methods. The low percentage of mapped reads suggests endogenous DNA was low, and this would be useful to characterize more fully. Morphological assessment of the samples and comparison to morphological data from other maize varieties is also missing. It appears that the two kernels were ground separately and that DNA was isolated separately, but data were ultimately pooled across these genetically distinct individuals for analysis. Pooling would violate assumptions of downstream analysis, which included genetic comparison to single archaeological and modern individuals. 

      We did not do the morphological assessment of the samples and comparison to morphological data from other maize varieties because we only have 2 aBM kernels, and we do not have other archaeological samples that could be used to do comparison.

      For “It appears that the two kernels were ground separately and that DNA was isolated separately, but data were ultimately pooled across these genetically distinct individuals for analysis”, as you can see from our Materials and Methods section that “Whole kernels were crushed in a mortar and pestle”, these two kernels were ground together before sequenced. 

      While morphological assessment of the sample would be interesting, most morphological data reported for maize are from microremains (starch, phytoliths, pollen) and this is beyond the scope of our study. Most studies of macrobotanical remains do not appear to focus solely on individual kernels, but instead on (or in combination with) cob and ear shape, which were not available in the assemblage.

      (2) Genetic comparison to other samples: 

      The authors did not meaningfully address the varying ages of the other archaeological samples and modern maize when comparing the genetic distance of their samples. The archaeological samples were as old as >5000 BP to as young as 70 BP and therefore have experienced varying extents of genetic drift from ancestral allele frequencies. For this reason, age should explicitly be included in their analysis of genetic relatedness. 

      We have changed related part in our revised manuscript.

      (3) Assessment of selection in their ancient Bolivian sample: 

      This analysis relied on the identification of alleles that were unique to the ancient sample and inferred selection based on a large number of unique SNPs in two genes related to internode length. This could be a technical artifact due to poor alignment of sequence data, evidence supporting pseudogenization, or within an expected range of genetic differentiation based on population structure and the age of the samples. More rigor is needed to indicate that these genetic patterns are consistent with selection. This analysis may also be affected by the pooling of the Bolivian archaeological samples.  

      We do not think it is because of poor alignment of sequence data since we used BWA v0.7.17 with disabled seed (-l 1024) and 0 mismatch alignment. Therefore, there are no SNPs that could come from poor alignment. Please see our detailed methods description here “For the archaeological maize samples, adapters were removed and paired reads were merged using AdapterRemoval60 with parameters --minquality 20 --minlength 30. All 5՛ thymine and 3՛ adenine residues within 5nt of the two ends were hard-masked, where deamination was most concentrated. Reads were then mapped to soft-masked B73 v5 reference genome using BWA v0.7.17 with disabled seed (-l 1024 -o 0 -E 3) and a quality control threshold (-q 20) based on the recommended parameter61 to improve ancient DNA mapping”.

      For “More rigor is needed to indicate that these genetic patterns are consistent with selection”, Could you please be more specific about which method or approach we should use here? For example, methods from specific publications that could be referenced? Or which specific tool could be used?

      “This analysis may also be affected by the pooling of the Bolivian archaeological samples.” As we could not prove these two seeds came from two different individual plants, we do not think this analysis was affected by the pooling of the Bolivian archaeological samples.

      (4) Evidence of selection in modern vs. ancient maize: In this analysis, samples were pooled into modern and ancient samples and compared using the XP-EHH statistic. One gene related to ovule development was identified as being targeted by selection, likely during modern improvement. Once again, ancient samples span many millennia and both South, Central, and North America. These, and the modern samples included, do not represent meaningfully cohesive populations, likely explaining the extremely small number of loci differentiating the groups. This analysis is also complicated by the pooling of the Bolivian archaeological samples. 

      Yes, it is possible that ovule development might be a modern improvement. We re-wrote this part in our revised manuscript.

      Reviewer #1 (Recommendations for the authors): 

      My suggestion is to address the comments that outline why the methods used or results obtained are not sufficient to support your conclusions. Overall, I suggest limiting the narrative of Inca influence and framing it as speculation in the discussion section. Presenting conclusions of Inca influence in the title and abstract is not appropriate, given the very questionable evidence. 

      We agree and have changed the title to “Fifteenth century CE Bolivian maize reveals genetic affinities with ancient Peruvian maize”.

      Reviewer #2 (Recommendations for the authors): 

      (1) Line 74: Mexicana is another subspecies of teosinte; the distinction is between ssp. mexicana and ssp. parviglumis (Balsas teosinte), not mexicana and teosinte. 

      We have corrected this in our revised manuscript.

      (2) Line 100-102: This is a bit confusing, it cannot have been a symbol of empire "since its first introduction", since its introduction long predates the formation of imperial politics in the region. Reference 17 only treats the late precolonial Inca context, while ref 22 (which cites maize cultivation at 2450 BC, not 3000 BC) makes no reference to ritual/feasting contexts; it simply documents early phytolith evidence for maize cultivation. As such, this statement is not supported by the references offered.

      lines 100-102. This point is well taken and was poor prose on our part.  We have modified this discussion to reflect both the confusing statement and we have corrected our mistake in age for reference 22. associated prose has been modified accordingly.

      We have corrected them as “Indeed, in the Andes, previous research showed that under the Inca empire, maize was fulfilled multiple contextual roles. In some cases, it operated as a sacred crop” and “…since its first introduction to the region around 2500 BC”.

      (3) Line 161: IntCal is likely not the appropriate calibration curve for this region; dates should probably be calibrated using SHCal.  

      We greatly appreciate this important (and correct) observation. We have completely recalibrated the maize AMS result based on the southern hemisphere calibration curve, discussed the new calibrations, and have also invoked two other AMS dates also subjected to the southern hemisphere calibration on associated material for comparison.We are confident in a 15th century AD/CE age for the maize, most likely mid- to late 15th century.  

      (4) Lines 167-169: The increase of G and A residues shown in Supplementary Figure S1a is just before the 5' end of the read within the reference genome context, and is related to fragmentation bias - a different process from postmortem deamination. Deamination leads to 5' C->T and 3' G->A, resulting in increased T at 5' ends and increased A at 3' ends, and the diagnostic damage curve. The reduction of C/T just before reads begin is not a result of deamination. 

      We have removed the “Both features are indicative of postmortem deamination patterns” in our revised manuscript.

      (5) Lines 187-196 This section presents a lot of important external information establishing hypotheses, and needs some references.  

      We have added the related references here.

      (6) Line 421: This makes it sound like damage masking was done BEFORE read mapping. However, this conflicts with the previous paragraph about map Damage, and Supplementary Figure 1 still shows a slight but perceptible damage curve, which is impossible if all terminal Ts and As are hard-masked. This should be reconciled.  

      The Supplementary Figure 1 shows the raw ancient maize DNA sample before damage masking. Specifically, Step1: We used map Damage to check/estimate if the damage exists, and we made the Supplementary Figure 1. Step 2: Then we used our own code hard-masked the damage bases and did read mapping.

      The purpose of Supplementary Figure 1 is to show the authenticity of aBM as archaeological maize. Therefore, it should show a slight but perceptible damage curve.

      (7) Line 460: PCA method is not given (just the LD pruning and the plotting).  

      The merged dataset of SNPs for archaeological and modern maize was used for PCA analysis by using “plink –pca”.

      (8) "tropicalis" maize is not common usage, it is not clear to me what this refers to. 

      We have changed all “tropicalis maize” as “tropical maize” in our revised manuscript.

      (9) The Figure 4 color palette is not accessible for colorblind/color-deficient vision.  

      We have changed the color of Figure 4. Please find the new colors in our upload Figure 4.

      (10) Datasets S2 and S3 are not included with this submission. 

      Thank you for letting us know and your suggestion. We have included Datasets S2 and S3 here.

    1. A 3-by-4 design has 4 factors each at 3 level

      Not correct, it is 2 factors each with different levels, so one has 3 and one has 4 levels. The number of numbers is amount of factors and the value of the numbers is the amount of levels.

    1. Reviewer #1 (Public review):

      Summary:

      In this study, the authors identified and described the transcriptional trajectories leading to CMs during early mouse development, and characterized the epigenetic landscapes that underlie early mesodermal lineage specification.

      The authors identified two transcriptomic trajectories from a mesodermal population to cardiomyocytes, the MJH and PSH trajectories. These trajectories are relevant to the current model for the First Heart Field (FHF) and the Second Heart Field (SHF) differentiation. Then, the authors characterized both gene expression and enhancer activity of the MJH and PSH trajectories, using a multiomics analysis. They highlighted the role of Gata4, Hand1, Foxf1, and Tead4 in the specification of the MJH trajectory. Finally, they performed a focused analysis of the role of Hand1 and Foxf1 in the MJH trajectory, showing their mutual regulation and their requirement for cardiac lineage specification.

      Strengths:

      The authors performed an extensive transcriptional and epigenetic analysis of early cardiac lineage specification and differentiation which will be of interest to investigators in the field of cardiac development and congenital heart disease. The authors considered the impact of the loss of Hand1 and Foxf1 in-vitro and Hand1 in-vivo.

      Weaknesses:

      The authors used previously published scRNA-seq data to generate two described transcriptomic trajectories.

      (1) Details of the re-analysis step should be added, including a careful characterization of the different clusters and maker genes, more details on the WOT analysis, and details on the time stamp distribution along the different pseudotimes. These details would be important to allow readers to gain confidence that the two major trajectories identified are realistic interpretations of the input data.

      The authors have also renamed the cardiac trajectories/lineages, departing from the convention applied in hundreds of papers, making the interpretation of their results challenging.

      (2) The concept of "reverse reasoning" applied to the Waddington-OT package for directional mass transfer is not adequately explained. While the authors correctly acknowledged Waddington-OT's ability to model cell transitions from ancestors to descendants (using optimal transport theory), the justification for using a "reverse reasoning" approach is missing. Clarifying the rationale behind this strategy would be beneficial.

      (3) As the authors used the EEM cell cluster as a starting point to build the MJH trajectory, it's unclear whether this trajectory truly represents the cardiac differentiation trajectory of the FHF progenitors:<br /> - This strategy infers that the FHF progenitors are mixed in the same cluster as the extra-embryonic mesoderm, but no specific characterization of potential different cell populations included in this cluster was performed to confirm this.

      - The authors identified the EEM cluster as a Juxta-cardiac field, without showing the expression of the principal marker Mab21l2 per cluster and/or on UMAPs.

      - As the FHF progenitors arise earlier than the Juxta-cardiac field cells, it must be possible to identify an early FHF progenitor population (Nkx2-5+; Mab21l2-) using the time stamp. It would be more accurate to use this FHF cluster as a starting point than the EEM cluster to infer the FHF cardiac differentiation trajectory.

      These concerns call into question the overall veracity of the trajectory analysis, and in fact, the discrepancies with prior published heart field trajectories are noted but the authors fail to validate their new interpretation. Because their trajectories are followed for the remainder of the paper, many of the interpretations and claims in the paper may be misleading. For example, these trajectories are used subsequently for annotation of the multiomic data, but any errors in the initial trajectories could result in errors in multiomic annotation, etc, etc.

      (4) As mentioned in the discussion, the authors identified the MJH and PSH trajectories as non-overlapping. But, the authors did not discuss major previously published data showing that both FHF and SHF arise from a common transcriptomic progenitor state in the primitive streak (DOI: 10.1126/science.aao4174; DOI: 10.1007/s11886-022-01681-w). The authors should consider and discuss the specifics of why they obtained two completely separate trajectories from the beginning, how these observations conflict with prior published work, and what efforts they have made at validation.

      (5) Figures 1D and E are confusing, as it's unclear why the authors selected only cells at E7.0. Also, panels 1D 'Trajectory' and 'Pseudotime' suggest that the CM trajectory moves from the PSH cells to the MJH. This result is confusing, and the authors should explain this observation.

      (6) Regarding the PSH trajectory, it's unclear how the authors can obtain a full cardiac differentiation trajectory from the SHF progenitors as the SHF-derived cardiomyocytes are just starting to invade the heart tube at E8.5 (DOI: 10.7554/eLife.30668).

      The above notes some of the discrepancies between the author's trajectory analysis and the historical cardiac development literature. Overall, the discrepancies between the author's trajectory analysis and the historical cardiac development literature are glossed over and not adequately validated.

      (7) The authors mention analyzing "activated/inhibited genes" from Peng et al. 2019 but didn't specify when Peng's data was collected. Is it temporally relevant to the current study? How can "later stage" pathway enrichment be interpreted in the context of early-stage gene expression?

      (8) Motif enrichment: cluster-specific DAEs were analyzed for motifs, but the authors list specific TFs rather than TF families, which is all that motif enrichment can provide. The authors should either list TF families or state clearly that the specific TFs they list were not validated beyond motifs.

      (9) The core regulatory network is purely predictive. The authors again should refrain from language implying that the TFs in the CRN have any validated role.

      Regarding the in vivo analysis of Hand1 CKO embryos, Figures 6 and 7:

      (10) How can the authors explain the presence of a heart tube in the E9.5 Hand1 CKO embryos (Figure 6B) if, following the authors' model, the FHF/Juxta-cardiac field trajectory is disrupted by Hand1 CKO? A more detailed analysis of the cardiac phenotype of Hand1 CKO embryos would help to assess this question.

      (11) The cell proportion differences observed between Ctrl and Hand1 CKO in Figure 6D need to be replicated and an appropriate statistical analysis must be performed to definitely conclude the impact of Hand1 CKO on cell proportions.

      (12) The in-vitro cell differentiations are unlikely to recapitulate the complexity of the heart fields in-vivo, but they are analyzed and interpreted as if they do.

      (13) The schematic summary of Figure 7F is confusing and should be adjusted based on the following considerations:<br /> (a) the 'Wild-type' side presents 3 main trajectories (SHF, Early HT and JCF), but uses a 2-color code and the authors described only two trajectories everywhere else in the article (aka MJH and PSH). It's unclear how the SHF trajectory (blue line) can contribute to the Early HT, when the Early HT is supposed to be FHF-associated only (DOI: 10.7554/eLife.30668). As mentioned previously in Major comment 3., this model suggests a distinction between FHF and JCF trajectories, which is not investigated in the article.<br /> (b) the color code suggests that the MJH (FHF-related) trajectory will give rise to the right ventricle and outflow tract (green line), which is contrary to current knowledge.

      Minor comments:

      (1) How genes were selected to generate Figure 1F? Is this a list of top differentially expressed genes over each pseudotime and/or between pseudotimes?

      (2) Regarding Figure 1G, it's unclear how inhibited signaling can have an increased expression of underlying genes over pseudotimes. Can the authors give more details about this analysis and results?

      (3) How do the authors explain the visible Hand1 expression in Hand1 CKO in Figure S7C 'EEM markers'? Is this an expected expression in terms of RNA which is not converted into proteins?

      (4) The authors do not address the potential presence of doublets (merged cells) within their newly generated dataset. While they mention using "SCTransform" for normalization and artifact removal, it's unclear if doublet removal was explicitly performed.

      Comments on revised version:

      Summary:

      The authors have not addressed the major philosophical problems with the initial submission. They interpret their data without care to conform to years of prior publications in the field. This causes the authors to draw fanciful conclusions that are highly likely to be inaccurate (at best).

      Q1R1: The authors gave more details about the characterization of cell types and the two identified trajectories.

      a) It remains unclear how the authors generated this list. Are they manually selected genes based on relevant literature or an unbiased marker gene identification analysis? Either references should be added, or the bioinformatics explanation should be included in the method section.<br /> b) Revised text satisfies the comment.<br /> c) Revised text satisfies the comment.

      Other comments:

      Figure 1F: left annotation needs to be corrected (two "JCF specific").

      Q2R1: Revised text satisfies the comment.

      Q3R1 (1): Revised text satisfies the comment.

      Q3R1 (2): a) The explanation of how the authors built the JCF trajectory makes sense and the renaming from "MJH" to "JCF" is correct and better represents the identification that was made using time points from E7.5 to E8.5. However, the explanation given does not answer our original question. Our original comment asked about the FHF differentiation trajectory. The authors built the "MJH" trajectory as the combined "FHF/JCF" trajectory, however, it is not directly established whether the FHF and JCF progenitor differentiation trajectories are the same. The authors did not directly try to identify the FHF and JCF trajectories separately using appropriate real time windows but only assumed that they were the same. Every link between JCF and FHF trajectories assuming that they are shared without prior identification of the FHF progenitor differentiation trajectory should be removed from the manuscript (e.g. page 4: "namely the JCF trajectory (the Hand1-expressing early extraembryonic mesoderm - JCF and FHF - CM)").

      b) Adding the Mab21l2 ICA plot satisfies the comment.

      c) The explanation given by the authors regarding the FHF trajectory analysis is missing important details. The authors started the reverse trajectory analysis from E7.75 cardiomyocytes as being the FHF.

      - The authors should be mindful with the distinction between FHF progenitors and FHF-derived cardiomyocytes.<br /> - It is unclear whether cells called after the starting point (E7.75 CMs) in the reverse FHF trajectory, were collected prior E7.75. Can the authors add more details, and a real time point distribution along the FHF pseudotime to their analysis? Also, what cells belong to the FHF trajectory after the E7.75 CMs in the reverse direction? These cells should be shown as in Figure 1A and 1B for the JCF and SHF trajectories.<br /> - As the FHF arises first and differentiates into the cardiac crescent prior to or at the same time the JCF and SHF emerge, it is impossible for late progenitors (JCF and SHF) to contribute to the early FHF progenitor pool. Therefore, the observation that "both JCF and SHF lineages contribute to the early FHF progenitor population" can not be correct. It is also not what Dominguez et al showed. This misinterpretation goes against the current literature (e.g. DOI: 10.1038/ncb3024) and will leads to confusion.

      Q4R1: Revised text and figure satisfy the comment.

      Q5R1: The answer satisfies the comment.

      Q6R1: a) The authors did not address the question and did not change their language in the manuscript. As SHF-derived cardiomyocytes are missing (because they are generated after E8.5), the part of the SHF trajectory going from SHF progenitors to the E8.5 heart tube must be inaccurate.

      b) The authors correctly mentioned, both JCF and SHF will contribute to the four-chamber heart. However, as the dataset used by the authors spans only to E8.5 (which is days before the completion of the four-chamber heart), and all SHF and the vast majority of JCF contributions don't reach the heart until after E8.5, any claims about trajectories from JCF/SHF progenitor pools to cardiomyocytes should be removed because they do not correspond to prior published and accepted work.

      Q7R1: Especially because gene expression levels change over time, the authors might have considered genes as specific and restricted to a pathway based on their expression at a given time (e.g. later time), but at another time (e.g. earlier time), the same genes could have another expression pattern and not be pathway-specific anymore.

      Q8R1: Revised text satisfies the comment.

      Q9R1: Revised text satisfies the comment.

      Q10R1: Thank you for analyzing deeper the cardiac phenotype of the Hand1 cKO embryos.

      Regarding the presence of a heart tube, while, following the authors' model the FHF/JCF trajectory is disrupted:

      - Renaming the "MSH" to "JCF" is more accurate to the data shown by the authors as mainly the EEM is altered after Hand1 cKO.<br /> - The presence of the heart tube suggests that even if the JCF is altered, the FHF can still produce a cardiac crescent and a heart tube (as observed in Hand1-null embryos DOI: 10.1038/ng0398-266). The schematic Figure 7F suggests that only the SHF contribution will allow the formation of the heart tube. This unorthodox idea would need to be assessed by an alternate approach. More likely is that the model simply ignores the FHF contribution (the most important up to E8.5). The schematic is therefore incomplete and inaccurate and should be removed or edited to correspond to the prior literature.

      Q11R1: It is unclear what "replicates" mean in the authors' answer, as if they have been pooled without replicate-specific barcodes they are no longer replicates and should be considered as a single sample. This should be explicitly written in the method section.<br /> Thank you for your IF staining/quantification. If DAPI was used, it should be written in the figure caption.

      Q12R1: Revised text satisfies the comment.

      Q13R1: The answer given by the authors did not satisfy the comment because of the following:

      - The authors investigated two differentiation trajectories (JCF and SHF) in the article but Figure 7F presents three trajectories (JCF, SHF, and Early HT). The "Early HT" is neither mentioned, nor discussed in the manuscript.<br /> - Figure 7F suggests that the "Early HT" trajectory corresponds to a combination of the SHF and JCF trajectories but does not mention the early FHF trajectory. This is going against the current literature. This relates to the comments of Q10R1.<br /> - As the authors rightly point out, the SHF will be contributing to the heart tube, but through a cell invasion of the already differentiated heart tube (10.1016/j.devcel.2023.01.010). Our prior comments did not question the implication of the SHF to the looping and ballooning process but mentioned that the heart tube arises before the invasion from SHF and is FHF-derived. Figure 7F in the context of Hand1-null suggest that the heart tube will form from the SHF lineage, which is confusing as the SHF is known to contribute by invasion of the (already-formed) FHF-derived heart tube. The FHF lineage is missing from the authors' model.<br /> - In the revised manuscript, the FHF trajectory analysis is still unclear and suggests that the JCF and SHF progenitors contribute to the FHF progenitor which is going against current literature. This relates to the comments of Q3R1 (2).

      Overall, the schematic Figure 7F is very confusing as it does not follow already published data without being fully validated and therefore is inaccurate and misleading.

      Minor comments:

      The answers satisfy the minor comments.

    2. Reviewer #3 (Public review):

      In this manuscript, the Xie et al. delineate two cardiac lineage trajectories using pseudo-time and epigenetic analyses, tracing development from E6.5 to E8.5, culminating in cardiomyocytes (CMs). The authors propose that mutual regulation between the transcription factors Hand1 and Foxf1 plays a role in specifying a first cardiac lineage.

      Following the first round of revision, the authors have renamed their EEM-JCF/FHF (MJH) and PM-SHF (PSH) trajectories JCF and SHF. However, their use of this terminology is confusing. The so-called JCF trajectory appears to represent a mixture of JCF and FHF, as Hand1-expressing early extraembryonic mesoderm contributes to FHF-derived cardiomyocytes (e.g., HCN4+, Tbx5+). The authors then argue that JCF arises from Hand1+ cells and is therefore distinct from FHF, yet elsewhere suggest that both JCF and SHF contribute to FHF. This introduces conceptual inconsistencies.

      Furthermore, the expression of Hand1, Foxf1, and Bmp4 in the lateral plate mesoderm complicates the assertion that JCF is distinct from FHF (Development 2015; 142: 3307-3320; Nat Rev Mol Cell Biol, https://www.nature.com/articles/nrm2618; Circ Res 2021, https://doi.org/10.1161/CIRCRESAHA.121.318943). Mab21l2 expression also overlaps with the cardiac crescent. The designation of Tbx20 as a "key JCF-specific gene" is problematic, why should it not equally be considered an FHF-specific marker (https://pmc.ncbi.nlm.nih.gov/articles/PMC10629681)? Perhaps the JCF trajectory represent a subset of FHF. A designation such as "JCF/FHF" may therefore be more appropriate.

      In Figure 1A, the decision to define a single CM state as the endpoint of both trajectories is also problematic. FHF and SHF are known to give rise to distinct CM subtypes, yet in the authors' reconstruction both lineages converge on one CM population. This was the point raised in Question 1 of my initial review. If both trajectories converge on the same CM state, are they truly independent lineages? This interpretation remains unclear and potentially misleading.

    3. Author response:

      The following is the authors’ response to the original reviews

      Reviewer #1 (Public Review):

      Summary:

      In this study, the authors identified and described the transcriptional trajectories leading to CMs during early mouse development, and characterized the epigenetic landscapes that underlie early mesodermal lineage specification.

      The authors identified two transcriptomic trajectories from a mesodermal population to cardiomyocytes, the MJH and PSH trajectories. These trajectories are relevant to the current model for the First Heart Field (FHF) and the Second Heart Field (SHF) differentiation. Then, the authors characterized both gene expression and enhancer activity of the MJH and PSH trajectories, using a multiomics analysis. They highlighted the role of Gata4, Hand1, Foxf1, and Tead4 in the specification of the MJH trajectory. Finally, they performed a focused analysis of the role of Hand1 and Foxf1 in the MJH trajectory, showing their mutual regulation and their requirement for cardiac lineage specification.

      Strengths:

      The authors performed an extensive transcriptional and epigenetic analysis of early cardiac lineage specification and differentiation which will be of interest to investigators in the field of cardiac development and congenital heart disease. The authors considered the impact of the loss of Hand1 and Foxf1 in-vitro and Hand1 in-vivo.

      Weaknesses:

      The authors used previously published scRNA-seq data to generate two described transcriptomic trajectories.

      We agree that a two-route cardiac development model has been described, which is consistent with our analyses. However, the developmental origins and key events by early lineage specification is unclear. Our study provided new insights from the following aspects:

      a) Computational analyses inferred the earliest cardiac fate segregation by E6.75-7.0.

      b) Provided the new-generated E7.0 multi-omics data which revealed the transcriptomic and chromatin accessibility landscape.

      c) Utilized multi-omics and ChIP-seq data to construct a core regulatory network underlying the JCF lineage specification.

      d) Applied in vitro and in vivo analyses, which elucidated the synergistic and different roles of key transcription factors, HAND1 and FOXF1.

      Q1R1: Details of the re-analysis step should be added, including a careful characterization of the different clusters and maker genes, more details on the WOT analysis, and details on the time stamp distribution along the different pseudotimes. These details would be important to allow readers to gain confidence that the two major trajectories identified are realistic interpretations of the input data.

      R1R1: Thank you for the valuable suggestion. In the last version, we characterized the two major trajectories by identifying their common or specific gene sets, and by profiling the expression dynamics along pseudotime (Figure 1F). But we realized a careful description was not provided. In the revised manuscript, we have made the following improvements:

      a) Provided marker gene analyses based on cell types as well as developmental lineages to support the E7.0 progenitor clusters (Figure S1F).

      b) For Figure 1F: revised the text and introduced characteristic genes for the two trajectories.

      c) For WOT analysis: provided more details in the first paragraph of the ‘Results’ section.

      R2R1: The authors have also renamed the cardiac trajectories/lineages, departing from the convention applied in hundreds of papers, making the interpretation of their results challenging.

      R2R1: Agreed. We have changed the MJH as JCF lineage and PSH as SHF lineage.

      Q3R1: The concept of "reverse reasoning" applied to the Waddington-OT package for directional mass transfer is not adequately explained. While the authors correctly acknowledged Waddington-OT's ability to model cell transitions from ancestors to descendants (using optimal transport theory), the justification for using a "reverse reasoning" approach is missing. Clarifying the rationale behind this strategy would be beneficial.

      R3R1: Thank you for pointing out the unclear explanation. As mentioned in R1R1, we have clarified the rationale in the revised manuscript. 

      We would like to provide some additional details: WOT is designed for time-series scRNA-seq data where the time/stage each single cell is given. At any adjacent time points t<sub>i</sub> and t<sub>i+1</sub>, WOT estimates the transition probability of all cells at t<sub>i</sub> to all cells at t<sub>i+1</sub>. One can select a cell set of interest at any time point t<sub>i</sub> to infer their ancestors at t<sub>i-1</sub> or their descendants at t<sub>i+1</sub> by sums of the transition probabilities. As introduced in the original paper, WOT allows for both ‘forward’ and ‘reverse’ inference (DOI: 10.1016/j.cell.2019.01.006).

      Q3R1: As the authors used the EEM cell cluster as a starting point to build the MJH trajectory, it's unclear whether this trajectory truly represents the cardiac differentiation trajectory of the FHF progenitors:

      - This strategy infers that the FHF progenitors are mixed in the same cluster as the extra-embryonic mesoderm, but no specific characterization of potential different cell populations included in this cluster was performed to confirm this.

      To build the MJH trajectory, we performed a two-step analysis:

      (1) Firstly, we used E8.5 CM cells as a starting point to perform WOT computational reverse lineage tracing and identify CM progenitors at each time point.

      (2) Secondly, we selected EEM cells from the E7.5 CM progenitor pool, as a starting point to perform WOT analysis. Cells along this trajectory consist of the JCF lineage (Figure 1B).

      The reason why we chose to use this subset of E7.5 EEM cells was due to its purity. It is distinct from the SHF lineage as suggested by their separation in the UMAP. It is also different from FHF cells as no FHF/CM markers were detected by E7.5. 

      It is admitted that it is infeasible to achieve 100% purity in this single cell omics analysis, but we believe the current strategy of defining the JCF lineage is reasonable. The distinct gene expression dynamics (Figure 1F) and spatial mapping results (Figure 1C), between JCF and SHF lineages, also supported our conclusion.

      - The authors identified the EEM cluster as a Juxta-cardiac field, without showing the expression of the principal marker Mab21l2 per cluster and/or on UMAPs.

      Thank you for your suggestion. We have added Mab21l2 expression plots in the ICA layout (new Figure S1D), showing its transient expression dynamics, consistent with Tyser et al (DOI: 10.1126/science.abb2986).

      - As the FHF progenitors arise earlier than the Juxta-cardiac field cells, it must be possible to identify an early FHF progenitor population (Nkx2-5+; Mab21l2-) using the time stamp. It would be more accurate to use this FHF cluster as a starting point than the EEM cluster to infer the FHF cardiac differentiation trajectory.

      We appreciate your insights. We used the early FHF progenitor population (E7.75 Nkx2-5+; Mab21l2- CM cells) as the starting point and identified its progenitor cells by E7.0 (Figure S2A). Results suggest both JCF and SHF lineages contribute to the early FHF progenitor population, consistent with live imaging-based single cell tracing by Dominguez et al (DOI: 10.1016/j.cell.2023.01.001).

      These concerns call into question the overall veracity of the trajectory analysis, and in fact, the discrepancies with prior published heart field trajectories are noted but the authors fail to validate their new interpretation. Because their trajectories are followed for the remainder of the paper, many of the interpretations and claims in the paper may be misleading. For example, these trajectories are used subsequently for annotation of the multiomic data, but any errors in the initial trajectories could result in errors in multiomic annotation, etc, etc.

      Thank you for your valuable comments. In the revised manuscript, we have added details about the trajectory analysis including the procedure of WOT lineage inference, marker gene expression and early FHF lineage tracing. We also renamed the two trajectories to avoid confusion with prior published heart field trajectories. Generally, our trajectories are consistent with the published evidence about two major lineages contributing to the linear heart tube:

      a) Clonal analysis: two trajectories exist which demonstrate differential contribution to the E8.5 cardiac tube (Meilhac et al, DOI: 10.1016/s1534-5807(04)00133-9).

      b) Live imaging: JCF cells contribute to the forming heart (Tyser et al, DOI: 10.1126/science.abb2986; Dominguez et al, DOI: 10.1016/j.cell.2023.01.001).

      c) Genetic labelling based lineage tracing: early Hand1+ mesodermal cells differentiate and contribute to the cardiac crescent (Zhang et al, DOI: 10.1161/CIRCRESAHA.121.318943).

      Molecular events by the initial segregation of the two lineages were not characterized before, which are the main focus of our paper. Our analyses suggest that the JCF lineage segregates earlier from the nascent/mixed mesoderm status, also consistent with the clonal analysis (Meilhac et al, DOI: 10.1016/s1534-5807(04)00133-9).

      Q4R1: As mentioned in the discussion, the authors identified the MJH and PSH trajectories as nonoverlapping. But, the authors did not discuss major previously published data showing that both FHF and SHF arise from a common transcriptomic progenitor state in the primitive streak (DOI: 10.1126/science.aao4174; DOI: 10.1007/s11886-022-01681-w). The authors should consider and discuss the specifics of why they obtained two completely separate trajectories from the beginning, how these observations conflict with prior published work, and what efforts they have made at validation.

      R4R1: Thank you for the important question. For trajectory analysis, we assigned cells to the trajectory with higher fate probability, resulting in ‘non-overlapping’ cell sets. However, the statement of ‘two non-overlapping trajectories’ is inaccurate. We performed analysis of fate divergence between two trajectories (which was not shown in the first version), which suggests, before E7.0, mesodermal cells have similar probabilities to choose either trajectory (Figure S1E). We agree with you and previously published data that the JCF and SHF arise from a common progenitor pool. Correction has been made in the revised manuscript.

      Q5R1: Figures 1D and E are confusing, as it's unclear why the authors selected only cells at E7.0. Also, panels 1D 'Trajectory' and 'Pseudotime' suggest that the CM trajectory moves from the PSH cells to the MJH. This result is confusing, and the authors should explain this observation.

      R5R1: Thank you for pointing out the confusion. As mentioned in R4R1, trajectory analysis indicates JCFSHF fate segregation by E7.0 and we used Figures 1D and E to characterize the cellular status. By E7.0, JCF progenitors are at EEM or MM status, while SHF progenitors are still at the earlier differentiation stage (NM). This result is consistent with previous clonal analysis (Meilhac et al, DOI: 10.1016/s1534-5807(04)00133-9) which demonstrates an apparent earlier segregation of the first lineage. Our interpretation of the pseudotime analysis is that it represents different levels of differentiation, instead of developmental direction.

      Q6R1: Regarding the PSH trajectory, it's unclear how the authors can obtain a full cardiac differentiation trajectory from the SHF progenitors as the SHF-derived cardiomyocytes are just starting to invade the heart tube at E8.5 (DOI: 10.7554/eLife.30668).

      R6R1.1: We agree with your opinion. Our trajectory analysis covers E8.5 SHF-derived CM cells and progenitors. Cells that differentiate as CM cells after E8.5 were missed.

      The above notes some of the discrepancies between the author's trajectory analysis and the historical cardiac development literature. Overall, the discrepancies between the author's trajectory analysis and the historical cardiac development literature are glossed over and not adequately validated.

      R6R1.2: Historical cardiac development related literature provided evidence, using multiple techniques, which support the existence of two cardiac lineages with common progenitors at the beginning and overlapping contribution of the four-chamber heart. Our trajectory analysis is in agreement with this model and provides more detailed molecular insights about lineage segregation by E7.0. Thank you for pointing out our mistakes describing the observations. We have corrected the text and provided additional data (Figure S1D-F and S2), aiming to resolved the confusions.

      Q7R1: The authors mention analyzing "activated/inhibited genes" from Peng et al. 2019 but didn't specify when Peng's data was collected. Is it temporally relevant to the current study? How can "later stage" pathway enrichment be interpreted in the context of early-stage gene expression?

      R7R1: The gene sets of "activated/inhibited genes" were collected from several published perturbation datasets (Gene Expression Omnibus accession numbers GSE48092, GSE41260, GSE17879, GSE69669, GSE15268 and GSE31544) using mouse ES cells or embryos. For a specific pathway, the gene set is fixed but the gene expression levels, which change over time, reflect the pathway enrichment. This explains the differential pathway enrichment between early and late stages.

      Q8R1: Motif enrichment: cluster-specific DAEs were analyzed for motifs, but the authors list specific TFs rather than TF families, which is all that motif enrichment can provide. The authors should either list TF families or state clearly that the specific TFs they list were not validated beyond motifs.

      R8R1: Thank you for your comment. For the DAE motif analysis, we firstly inferred the motif and TF families, then tested which specific TFs are expressed in the corresponding cell cluster. We have added this information in the legend of Figure 2D.

      Q9R1: The core regulatory network is purely predictive. The authors again should refrain from language implying that the TFs in the CRN have any validated role.

      R9R1: Thank you for your kind suggestion. We have revised the manuscript to avoid any misleading implications, as follows:

      “Through single-cell multi-omics analysis, a predicted core regulatory network (CRN) in JCF is identified, consisting of transcription factors (TFs) GATA4, TEAD4, HAND1 and FOXF1.”

      Q10R1: Regarding the in vivo analysis of Hand1 CKO embryos, Figures 6 and 7:

      How can the authors explain the presence of a heart tube in the E9.5 Hand1 CKO embryos (Figure 6B) if, following the authors' model, the FHF/Juxta-cardiac field trajectory is disrupted by Hand1 CKO? A more detailed analysis of the cardiac phenotype of Hand1 CKO embryos would help to assess this question.

      R10R1: Thank you for your valuable suggestion. In the revised manuscript, we have added detailed analysis of the cardiac phenotype of Hand1 CKO embryo (Figure S8C). Data suggest that by E8.5 when heart looping initiate in control group (14/17), the hearts of Hand1 CKO embryos (3/3) still demonstrate a linear tube morphology. By E9.5 when atrium and ventricle become distinct in WT embryos, heart looping of Hand1 CKO embryos is abnormal. The cardiac defects of our MESP1CRE driven Hand1 conditional KO are consistent with those of Hand1-null mutant mice (Doi: 10.1038/ng0398-266; D oi: 10.1038/ng0398-271).

      Author response image 1.

      The bright field images of E8.5-E9.5 Ctrl and Hand1 CKO mouse embryos. The arrows indicating the embryonic heart (h) and head folds (hf). Scale bars (E8.5): 200 μm; scale bars (E9.5): 500 μm.

      Q11R1: The cell proportion differences observed between Ctrl and Hand1 CKO in Figure 6D need to be replicated and an appropriate statistical analysis must be performed to definitely conclude the impact of Hand1 CKO on cell proportions.

      R11R1: We appreciate your valuable suggestion. As Figure 6D is based on scRNA-seq experiment, where replicates were merged as one single sequencing library, statistical analysis is infeasible. To address potential concerns about cell proportions, we added IF staining experiments of EEM marker gene, Vim, in serial embryo sections (Figure S8D). Statistical analysis indicates a significant decrease of VIM+ EEM cell proportion of Hand1 CKO embryos.

      Q12R1: The in-vitro cell differentiations are unlikely to recapitulate the complexity of the heart fields invivo, but they are analyzed and interpreted as if they do.

      R12R1: We agree with your opinion. In the revised manuscript, we tuned down the interpretation of the invitro cell differentiation data. 

      Previous version:

      I.  “The analysis indicated that HAND1 and FOXF1 could dually regulate MJH specification through directly activating the MJH specific genes and inhibiting the PSH specific genes.”

      II. “Together, our data indicated that mutual regulation between HAND1 and FOXF1 could play a key role in MJH cardiac progenitor specification.”

      III. “Thus, our data further supported the specific and synergistic roles of HAND1 and FOXF1 in MJH cardiac progenitor specification.”

      Revised version:

      I.  “The analysis indicated that HAND1 and FOXF1 were able to directly activate the JCF specific genes.”

      II. “Together, our in vitro experimental data indicated that mutual regulation between HAND1 and FOXF1 could play a key role in activation of JCF specific genes.”

      III. “These results suggest that HAND1 and FOXF1 may cooperatively regulate early cardiac lineage specification by promoting JCF-associated gene expression and suppressing alternative mesodermal programs.”

      Q13R1: The schematic summary of Figure 7F is confusing and should be adjusted based on the following considerations:

      (a) the 'Wild-type' side presents 3 main trajectories (SHF, Early HT and JCF), but uses a 2-color code and the authors described only two trajectories everywhere else in the article (aka MJH and PSH). It's unclear how the SHF trajectory (blue line) can contribute to the Early HT, when the Early HT is supposed to be FHF-associated only (DOI: 10.7554/eLife.30668). As mentioned previously in Major comment 3., this model suggests a distinction between FHF and JCF trajectories, which is not investigated in the article.

      R13R1(a): Thank you for your great insights. The paper you mentioned used Nkx2.5_cre/+; Rosa26tdtomato+/- and _Nkx2.5_eGFP embryos to reconstruct the cardiac morphologies between E7.5 and E8.2. Their 3D models clearly demonstrate the transition from yolk sac to FHF and then SHF (Figure 2A’ and A’’). The location of yolk sac is defined as JCF in later literature (DOI: 10.1126/science.abb2986). However, as _Nkx2.5 mainly marks cells after the entry of the heart tube, it is unable to reflect the lineage contribution by JCF or SHF. As in R3R1, more and more evidence support the contribution of both lineages to the Early HT, which is discussed in a recent review paper (DOI: 0.1016/j.devcel.2023.01.010).

      (b) the color code suggests that the MJH (FHF-related) trajectory will give rise to the right ventricle and outflow tract (green line), which is contrary to current knowledge.

      R13R1(b): Thank you for pointing out the confusion. The coloring of outflow tract is not an indication of JCF lineage contribution. We have changed the color of JCF/SHF trajectory in the revised model.

      Minor comments:

      Q14R1: How genes were selected to generate Figure 1F? Is this a list of top differentially expressed genes over each pseudotime and/or between pseudotimes?

      R14R1: For each trajectory, we ranked genes by the correlation between expression levels and pseudotime.

      Top 1000 genes for each group were selected.

      Q15R1: Regarding Figure 1G, it's unclear how inhibited signaling can have an increased expression of underlying genes over pseudotimes. Can the authors give more details about this analysis and results?

      R15R1: The increased expression of ‘inhibited genes’ could be explained as an indication of decreasing signaling levels or compensation effect by other signaling pathways. We appreciate your kind suggestion. Details about this analysis have been added in the Method section.

      Q16R1: How do the authors explain the visible Hand1 expression in Hand1 CKO in Figure S7C 'EEM markers'? Is this an expected expression in terms of RNA which is not converted into proteins?

      R16R1: Our opinion is that the visible Hand1 expression caused by the imperfect knock-out efficiency by Mesp1-Cre driven system.

      Q17R1: The authors do not address the potential presence of doublets (merged cells) within their newly generated dataset. While they mention using "SCTransform" for normalization and artifact removal, it's unclear if doublet removal was explicitly performed.

      R17R1: We appreciate your kind reminder. Doublet removal was performed using R package ‘DoubletFinder’ (DOI: 10.1016/j.cels.2019.03.003). We have added this information in the revised manuscript.

      Reviewer #2 (Public review):

      Summary of goals:

      The aims of the study were to identify new lineage trajectories for the cardiac lineages of the heart, and to use computational and cell and animal studies to identify and validate new gene regulatory mechanisms involved in these trajectories.

      Strengths:

      The study addresses the long-standing yet still not fully answered questions of what drives the earliest specification mechanisms of the heart lineages. The introduction demonstrates a good understanding of the relevant lineage trajectories that have been previously established, and the significance of the work is well described. The study takes advantage of several recently published data sets and attempts to use these in combination to uncover any new mechanisms underlying early mesoderm/cardiac specification mechanisms. A strength of the study is the use of an in vitro model system (mESCs) to assess the functional relevance of the key players identified in the computational analysis, including innovative technology such as CRISPR-guided enhancer modulations. Lastly, the study generates mesoderm-specific Hand1 LOF embryos and assesses the differentiation trajectories in these animals, which represents a strong complementary approach to the in vitro and computational analysis earlier in the paper. The manuscript is clearly written and the methods section is detailed and comprehensive.

      Comments and Weaknesses:

      Overall: The computational analysis presented here integrates a large number of published data sets with one new data point (E7.0 single cell ATAC and RNA sequencing). This represents an elegant approach to identifying new information using available data. However, the data presentation at times becomes rather confusing, and relatively strong statements and conclusions are made based on trajectory analysis or other inferred mechanisms while jumping from one data set to another. The cell and in vivo work on Hand1 and Foxf1 is an important part of the study. Some additional experiments in both of these model systems could strongly support the novel aspects that were identified by the computational studies leading into the work.

      We appreciate your positive comments and insightful suggestions. In the revised manuscript, we have incorporated additional analyses and experimental validations to address the concerns raised. Specifically, we added RNA velocity analysis to independently support the identification of the MJH and PSH trajectories, performed immunofluorescence staining of mesodermal and cardiac markers in Hand1 and Foxf1 knockout models, and included Vim staining-based quantification in Hand1 CKO embryos to assess developmental outcomes in vivo. Furthermore, we revised potentially overinterpreted conclusions, clarified methodological details of WOT analysis. These revisions have strengthened both the rigor and clarity of the manuscript.

      Q1R2: Definition of MJH and PSH trajectory:

      The study uses previously published data sets to identify two main new differentiation trajectories: the MJH and the PSH trajectory (Figure 1). A large majority of subsequent conclusions are based on in-depth analysis of these two trajectories. For this reason, the method used to identify these trajectories (WTO, which seems a highly biased analysis with many manually chosen set points) should be supported by other commonly used methods such as for example RNA velocity analysis. This would inspire some additional confidence that the MJH and PSH trajectories were chosen as unbiased and rigorous as possible and that any follow-up analysis is biologically relevant.

      R1R2: We appreciate your valuable comments. It is totally agreed that other commonly used methods help strengthen our conclusion about the two main trajectories. To this end, we performed RNA velocity analysis for the cardiac specification. Results support the contribution to CM along the MJH and PSH routes.

      Author response image 2.

      UMAP layout is colored by cell types. Developmental directions, shown as arrows, are inferred by RNA-velocity analysis.

      Actually, several recent studies indicated a convergence cardiac developing model where progenitors reach a myocardial state along two trajectories (DOI: 10.1016/j.devcel.2023.01.010). However, when and how specification between the two routes were unclear. Our data and analysis revealed a clear fate separation by E7.0 from transcriptomic and epigenetic perspectives, where unbiased RNA velocity analysis was performed (Figure 2C).

      We would like to clarify how we performed WOT (DOI: 10.1016/j.cell.2019.01.006) analysis: the only manually chosen cell set was the starting set, which was all cardiomyocyte cells by E8.5, of computational reverse lineage tracing. The ancestor cells were predicted in an unbiased manner among all mesodermal cells.

      Q2R2.1: Identification of MJH and PSH trajectory progenitors:

      The study defines various mesoderm populations from the published data set (Figure 1A-E), including nascent mesoderm, mixed mesoderm, and extraembryonic mesoderm. It further assigns these mesoderm populations to the newly identified MJH/PSH trajectories. Based on the trajectory definition in Figure 1A it appears that both trajectories include all 3 mesoderm populations, albeit at different proportions and it seems thus challenging to assign these as unique progenitor populations for a distinct trajectory, as is done in the epigenetic study by comparing clusters 8 (MJH) and 2 (PSH)(Figure 2). 

      R2R2.1: According to our model, the most significant difference between the two trajectories is their enrichment of EEM and PM cell types (Figure 1B), which represent the middle stages of cardiac development. Both trajectories begin as Mesp1+ Nascent mesoderm cells (Figure 1F), which is supported by Mesp1 lineage tracing (DOI: 10.1161/CIRCRESAHA.121.318943), and ends as cardiomyocytes. Our epigenetic analysis focused on the E7.0 stage when the two trajectories could be clearly separated and when JCF and SHF lineages were at mixed mesoderm and nascent mesoderm states, respectively. However, SHF lineage was predicted to bypass mixed mesoderm state later on.

      Q2R2.2: Along similar lines, the epigenetic analysis of clusters 2 and 8 did not reveal any distinct differences in H3K4m1, H3K27ac, or H3K4me3 at any of the time points analyzed (Figure 2F). While conceptually very interesting, the data presented do not seem to identify any distinct temporal patterns or differences in clones 2 and 8 (Figure 2H), and thus don't support the conclusion as stated: "the combined transcriptome and chromatin accessibility analysis further supported the early lineage segregation of MJH and the epigenetic priming at gastrulation stage for early cardiac genes".

      R2R2.2: In the epigenetic analysis, we delineated the temporal dynamics of E7.0 cluster-specific DAEs by selecting earlier (E6.5) and later (E7.5) time points. DAEs of C8 and C2 represent regulatory elements for the JCF and SHF lineages, respectively. We also included C1 DAEs as a reference to demonstrate the relative activity of C8 and C2. The overall temporal pattern suggests activation of C8 & C2, as their H3K4me1 and H3K27ac levels surpass C1 over time. Between C8 and C2, the following distinctions could be observed:

      a) H3K4me1 levels of C8 are higher by E6.5 and E7.0, with low H3K27ac levels, indicating early priming of C8 DAEs.

      b) By E7.5, H3K4me1 levels of C8 are caught up by C2 in E7.5 anterior mesoderm (E7.5_AM, Figure 2F column 3), where cardiac mesoderm is located.

      c) H3K4me1 and H3K27ac levels of C8 are similar as C1 in the posterior mesoderm (E7.5_P, Figure 2F column 4) and much higher than C2.

      d) From the perspective of chromatin accessibility, hundreds of characteristic DAEs were identified for C2 and C8 (Figure 2D), exemplified by the primed and active enhancers which were predicted to interact with cluster-specific genes (Figure 2H).

      Together with the transcriptomic analyses (Figure 2C), these data are consistent with our conclusion about early lineage segregation and epigenetic priming.

      Q3R2: Function of Hand1 and Foxf1 during early cardiac differentiation:

      The study incorporated some functional studies by generating Hand1 and Foxf1 KO mESCs and differentiated them into mesoderm cells for RNA sequencing. These lines would present relevant tools to assess the role of Hand1 and Foxf1 in mesoderm formation, and a number of experiments would further support the conclusions, which are made for the most part on transcriptional analysis. For example, the study would benefit from quantification of mesoderm cells and subsequent cardiomyocytes during differentiation (via IF, or more quantitatively, via flow cytometry analysis). These data would help interpret any of the findings in the bulk RNAseq data, and help to assess the function of Hand1 and Foxf1 in generating the cardiac lineages. Conclusions such as "the analysis indicated that HAND1 and FOXF1 could dually regulate MJH specification through directly activating the MJH specific genes and inhibiting PSH specific genes" seem rather strong given the data currently provided.

      R3R2: Thank you for your kind suggestions. We added IF staining of mesodermal (Zic3), JCF (Hand1) and cardiac markers (Tnnt2), followed by cell quantification. Results indicate that Hand1 and Foxf1 knockout leads to reduced commitment to the JCF lineage, evidenced by the loss of Hand1 expression, accumulation of undifferentiated Zic3+ mesoderm, and impaired cardiomyocyte formation (Tnnt2+), consistent with the up-regulation of JCF lineage specific genes and the downregulation of SHF lineage specific genes.

      We also revised the conclusion as “These results suggest that HAND1 and FOXF1 may cooperatively regulate early cardiac lineage specification by promoting JCF-associated gene expression and suppressing alternative mesodermal programs.”.

      (4) Analysis of Hand1 cKO embryos:

      Adding a mouse model to support the computational analysis is a strong way to conclude the study. Given the availability of these early embryos, some of the findings could be strengthened by performing a similar analysis to Figure 7B&C and by including some of the specific EEM markers found to be differentially regulated to complement the structural analysis of the embryos.

      R4R2: hank you for your positive comments and help. In the revised manuscript, we performed IF staining of EEM marker Vim in a similar fashion as Figure 7B&C (Figure S8D). In comparison with control embryos, the Hand1 CKO embryos demonstrated significant less number of Vim+ cells, further strengthening the conclusion that Hand1 CKO blocked the developmental progression toward JCF direction.

      Q5R2: Current findings in the context of previous findings:

      The introduction carefully introduces the concept of lineage specification and different progenitor pools. Given the enormous amount of knowledge already available on Hand1 and Foxf1, and their role in specific lineages of the early heart, some of this information should be added, ideally to the discussion where it can be put into context of what the present findings add to the existing understanding of these transcription factors and their role in early cardiac specification.

      R5R2: We appreciate your positive comments and kind reminder. We have added discussion about how our study could be put into the body of findings on Hand1 and Foxf1. Although these two genes have been validated to be functionally important for heart development, it is unclear when and how they affect this process. Using in-vivo and in-vitro models and single cell multi-omics analyses, we provided evidence to fill the gaps from multiple aspects, including cell state temporal dynamics, regulatory network, and epigenetic regulation underlying the very early cardiac lineage specification.

      Reviewer #3 (Public review):

      Q1R3: In Figure 1A, could the authors justify using E8.5 CMs as the endpoint for the second lineage and better clarify the chamber identities of the E8.5 CMs analysed? Why are the atrial genes in Figure 1C of the PSH trajectory not present in Table S1.1, which lists pseudotime-dependent genes for the MJH/PSH trajectories from Figure 1F?

      R1R3: Thank you for your comments. We used E8.5 CMs as the endpoint of the second (SHF) lineage because this stage represents a critical point where SHF-derived cardiomyocytes have begun distinct differentiation, allowing us to capture terminal lineage states reliably. The chamber identities of E8.5 CMs were determined based on known marker genes (DOI: 10.1186/s13059-025-03633-3). The atrial genes shown in Figure 1C reflect cluster-specific markers that may not meet the strict pseudotime-dependency criteria used to generate Table S1.1, which lists genes dynamically changing along the MJH/PSH trajectories.

      Q2R3: Could the authors increase the resolution of their trajectory and genomic analyses to distinguish between the FHF (Tbx5+ HCN4+) and the JCF (Mab21l2+/ Hand1+) within the MJH lineage? Also, clarify if the early extraembryonic mesoderm contributes to the FHF.

      R2R3: Thank you for your great suggestions. To distinguish between the FHF and JCF trajectories, we used early FHF progenitor population (E7.75 Nkx2-5+; Mab21l2- CM cells) as the starting point and performed WOT lineage inference (Figure S2A). Results suggest that both JCF and SHF progenitors contribute to the FHF, consistent with live imaging-based single cell tracing by Dominguez et al (DOI: 10.1016/j.cell.2023.01.001) and lineage tracing results by Zhang et al (DOI: 10.1161/CIRCRESAHA.121.318943). We also analyzed the expression levels of FHF marker genes (Tbx5, Hcn4) and observed their activation along both trajectories (Figure S2B).

      Q3R3: The authors strongly assume that the juxta-cardiac field (JCF), defined by Mab21l2 expression at E7.5 in the extraembryonic mesoderm, contributes to CMs. Could the authors explain the evidence for this? Could the authors identify Mab21l2 expression in the left ventricle (LV) myocardium and septum transversum at E8.5 (see Saito et al., 2013, Biol Open, 2(8): 779-788)? If such a JCF contribution to CMs exists, the extent to which it influences heart development should be clarified or discussed.

      R3R3: Thank you for the important question. For the JCF contribution to the heart tube, several lines of evidence have been published in recent years using micro-dissection of mouse embryonic heart (DOI: 10.1126/science.abb2986), live imaging (DOI: 10.1016/j.cell.2023.01.001) and lineage tracing approaches (DOI: 10.1161/CIRCRESAHA.121.318943). According to Tyser et al (DOI: 10.1126/science.abb2986), Mab21l2 expression is detected in septum transversum at E8.5 and the Mab21l2+ lineage contribute to LV, basically consistent with the literature you mentioned (Saito et al., 2013, Biol Open, 2(8): 779-788). Our lineage inference analyses further support the model and suggest earlier specification by JCF. However, the focus of our work is the transcriptional and epigenetic regulation of underlying the JCF developmental trajectory.

      Q4R3: Could the authors distinguish the Hand1+ pericardium from JCF progenitors in their single-cell data and explain why they excluded other cell types, such as the endocardium/endothelium and pericardium, or even the endoderm, as endpoints of their trajectory analysis? At the NM and MM mesoderm stages, how did the authors distinguish the earliest cardiac cells from the surrounding developing mesoderm?

      R4R3: We appreciate your insightful question. In our other study (DOI: 10.1186/s13059-025-03633-3), we tried to further divide the CM cells as subclusters and it seems that their difference is mainly driven by the segmentation of the heart tube (e.g. LV, RV, OFT etc.). By the E8.5 stage, we are unable to identify the Hand1+ pericardium cluster. 

      Also, it seems infeasible to distinguish endocardium from other endothelium cells only using singlecell data. High resolution spatial transcriptome data is required. Alternatively, we analyzed the E7.0 mesodermal lineages and determined C5/6 as hematoendothelial progenitors. Marker gene analysis indicate that their lineage segregation has started by this stage (Figure S4C and Author response image 3).

      Author response image 3.

      UMAP layout, using scRNA-seq (Reference data) and snRNA-seq (Multiome data), is colored by cell types (left). Expression of hematoendothelial progenitor marker genes is shown (right).

      We did observe the difference between the earliest cardiac cells from the surrounding developing mesoderm. As in Figure 1D, cells belonging to the JCF lineage (Hand1 high/Lefty2 low) were clustered at the EEM/MM end, in contrast to the NM cells.

      Q5R3: Could the authors contrast their trajectory analysis with those of Lescroart et al. (2018), Zhang et al., Tyser et al., and Krup et al.?

      R5R3: Thank you for the valuable suggestion. We compared our model with the suggested ones and summarized as follows:

      (1) Lescroart et al: The JCF and SHF progenitor cells match their DCT2 (Bmp4+) and DCT3 (Foxc2+) clusters, respectively.

      (2) Zhang et al: The JCF lineage matches their EEM-DC (developing CM)-CM trajectory. The SHF lineage is consistent with their NM-LPM (lateral plate mesoderm)-DC (developing CM)-CM trajectory. Notably, their EEM-DC-CM also expressed FHF marker (Tbx5) at later stages.

      (3) Tyser et al: we performed data integration analysis and found the correspondence between JCF progenitors (EEM cells from the cardiac trajectory) and their Me5, as well as SHF progenitors (PM cells from the cardiac trajectory) with Me7. In their model, both Me5 and Me7 contribute to Me4 (representing the FHF), consistent with our results (see Tyser et al., 2021 and Pijuan-Sala et al., 2019).

      (4) Krup et al also performed URD lineage inference, providing a model with CM (12) and Cardiac mesoderm (29) as cardiac end points. Their model did not seem to suggest distinct trajectories between JCF and SHF lineages, as both JCF (Hand1) and SHF (Isl1) markers co-expressed in CM.

      Q6R3: Previous studies suggest that Mesp2 expression starts at E8 in the presomitic mesoderm (Saga et al., 1997). Could the authors provide in situ hybridization or HCR staining to confirm the early E7 Mesp2 expression suggested by the pseudo-time analysis of the second lineage.

      R6R3: We validated the expression of E7 Mesp2 using Geo-seq spatial transcriptome data (Author response image 4, upper). Results suggest the high spatial enrichment of Mesp2 expression in primitive streak (T+) and/or nascent mesoderm (Mesp1+) cells, which correspond to the progenitors of the second lineage.

      In situ hybridization data (PMID: 17360776) also supports the early expression of Mesp2 by E7 (Author response image 4, lower).

      Author response image 4.

      (Upper) E7 Geo-seq data for selected genes: T, Mesp1, and Mesp2. (Lower) Mesp2 expression during early development; image acquired from Morimoto et al. (PMID: 17360776).

      Q7R3: Could the authors also confirm the complementary Hand1 and Lefty2 expression patterns at E7 using HCR or in situ hybridization? Hand1 expression in the first lineage is plausible, considering lineage tracing results from Zhang et al.

      R7R3: Thank you for your great suggestion. We observed spatially complementary expression patterns of Hand1 and Lefty2 in the Geo-seq spatial transcriptomic data. In the mesoderm layer, Hand1 is highly expressed in the proximal end. While Lefty2+ cells exhibit preference toward the distal direction.

      Author response image 5.

      E7 Geo-seq data for selected genes: Hand1 and Lefty2.

      Q8R3: Could the authors explain why Hand1 and Lefty2+ cells are more likely to be multipotent progenitors, as mentioned in the text?

      R8R3: Thank you for your question. Here, we observed E7.0 Mesp1+ and Lefty2+ nascent mesodermal cells assigned to both the JCF and SHF lineages (Figure 1D), indicating their multipotency. On the other hand, we also found low expressions of JCF markers, Hand1 and Msx2, by the early stage of the SHF trajectory (Figure 1F). Thus, we concluded that both Hand1+ and Lefty2+ E7.0 mesodermal cells are likely to be multipotent.

      Q9R3: Could the authors comment on the low Mesp1 expression in the mesodermal cells (MM) of the MJH trajectory at E7 (Figure 1D)? Is Mesp1 transiently expressed early in MJH progenitors and then turned off by E7? Have all FHF/JCF/SHF cells expressed Mesp1?

      R9R3: Thank you for the insightful questions. Zhang et al. (PMID: 34162224) performed scRNA-seq analysis of Mesp1 lineage-traced cells, which indicate the contribution of Mesp1+ cells to FHF, JCF, and SHF. This is also supported by Dominguez et al. utilizing live imaging approaches (PMID: 36736300). Our temporal dynamics analysis suggests that along the JCF trajectory, Mesp1 is turned off as JCF characteristic genes were up regulated (Figure 1F and S1D).

      Q10R3: Could the authors clarify if their analysis at E7 comprises a mixture of embryonic stages or a precisely defined embryonic stage for both the trajectory and epigenetic analyses? How do the authors know that cells of the second lineage are readily present in the E7 mesoderm they analysed (clusters 0, 1, and 2 for the multiomic analysis)?

      R10R3: Thank you for your questions. Although embryos were collected at E7.0, the developmental stages could be variable. As exemplified by Karl Theiler’s book, “The House Mouse: Atlas of Embryonic Development”, mesoderm was visible for some E7.0 egg cylinders but not in others. To test whether cells of the second lineage are present in the E7.0 mesoderm, we analyzed the WOT lineage tracing results and the cell type composition by E7.0 (Author response image 6, left panel). Most cells belong to the nascent mesoderm (NM) or mixed mesoderm (MM), while almost no cells were assigned to the primitive streak (PS). To avoid the possibility that the E7.0 embryos represented later stages, we also analyzed the E6.75 cells of the second lineage (Author response image 6, middle panel). Results suggest that NM cells were still the dominant contributors to the second lineage, although ~22.6% cells were assigned to the PS. The abovementioned analyses were performed using the scRNA-seq data. The embryos of the E7.0 single-cell multi-omics represent similar developmental stages as the scRNAseq data, as suggested by the well-aligned UMAPs (Figure S1D, right panel). Thus, we conclude that for the multi-omics data, the cells of the second lineage are also readily present in the mesoderm.

      Author response image 6.

      (Left and middle) Lineage inference and cell type composition at E7.0 and E6.75. (Right) UMAPs of E7.0 multi-omics and scRNA-seq data.

      Q11R3: Could the authors further comment on the active Notch signaling observed in the first and second lineages, considering that Notch's role in the early steps of endocardial lineage commitment, but not of CMs, during gastrulation has been previously described by Lescroart et al. (2018)?

      R11R3: We appreciate your kind suggestion. As reported by Lescroart et al. (2018), using Notch1CreERT2/Rosa-tdTomato mice and tamoxifen administration at E6.5, early expression of Notch1 mostly marked endocardial cells (ECs, 76.9-83.9%), with minor contribution to the cardiomyocytes (6.0-16.6%) and to the epicardial cells (EPs, 6.0-6.5%). The lineage specificity of Notch1 is consistent with our E7.0 multi-omics data, where its expression was mainly observed in the NM and hematoendothelial progenitors (Author response image 7). Interestingly, expression of other NOTCH receptor genes (Notch2 and Notch3) and ligand genes (Dll1 and Dll3) in the CM lineages. Notch3 demonstrate higher expression in the first lineage, while Dll1 and Dll3 were highly expressed in the second lineage. The study by Lescroart et al. (2018) emphasized the role of Notch1 as an EC lineage marker, while our analyses aimed at the activity of the NOTCH pathway.

      Author response image 7.

      Expression of representative NOTCH genes at E7.0 (multi-omics data).

      Q12R3: In cluster 8, Figure 2D, it seems that levels of accessibility in cluster 8 are relatively high for genes associated with endothelium/endocardium development in addition to MJH genes. Could the authors comment and/or provide further analysis?

      R12R3: Thanks for you for raising this interesting point. To confirm the association of these genes with endothelium (EC) and/or MJH, we analyzed their expression levels by E7.0 (progenitor stage) and E8.0 (differentiated stage) (Author response image 8). Among target genes of MJH-specific DAEs (cluster 3/7/8 in Figure 2D), Pmp22, Mest, Npr1, Pkp2, and Pdgfb were expressed in the hematoendothelial progenitors. The Nrp1 gene and PDGF pathway play critical roles in endothelial development by modulating cell migration (PMID: 15920019 and 28167492), which is also important for MJH cells. In addition, we observed common ATAC-seq peaks in both hematoendothelial and MJH clusters (Author response image 9), indicating shared regulatory elements. Interestingly, Pdgfb is not expressed by CM in vivo, it is actively expressed in the CM of the in vitro system (Author response image 9). These results indicate regulatory and functional closeness between hematoendothelial and MJH cell groups, at early stages of lineage establishment.

      Author response image 8.

      Regulatory connection between MJH and endothelial cells (ECs).

      Author response image 9.

      Representative genome browser snapshots of scATAC-seq (aggregated gene expression and chromatin accessibility for each cluster) and RNA-seq at the Pdgfb locus.

      Q13R3: Can the authors clarify why they state that cluster 8 DAEs are primed before the full activation of their target genes, considering that Bmp4 and Hand1 peak activities seem to coincide with their gene expression in Figure 2G?

      R13R3: Thanks for your great question. The overall analyses indicate low to medium levels of H3K4me1 and H3K27ac by E6.5-7.0 at cluster 8 DAEs, which were fully activated by E7.5 (Figure 2F). Further inspections suggest different epigenetic status of individual DAEs (Figure 3H), which could be active (K4me1+/K27ac+), primed (K4me1+/K27ac-), or inactive (K4me1-/K27ac-). Thus, we concluded that many DAEs could be primed before full activation. The coincidence of enhancer peak activities and gene expression was observed by aggregating single cell clusters at a single stage E7.0, which does not rule out the possibility that these enhancers are epigenetically primed at earlier stages.

      Q14R3: Did the authors extend the multiomic analysis to Nanog+ epiblast cells at E7 and investigate if cardiac/mesodermal priming exists before mesodermal induction (defined by T/Mesp1 onset of expression)?

      R14R3: We appreciate your kind suggestion. We observed low levels of T/Mesp1 expression in the E7.0 Nanog+ epiblast cells (Author response image 10). Interestingly, the T+/Mesp1+ cells were not clustered toward any specific differentiation directions in the UMAP. We also analyzed DAE activities in each single cell by averaging over the C1/C2/C8 DAE sets. The C2 and C8 DAEs were clearly less active than the C1 DAEs. But C2/C8-DAE active cells were observed among the E7.0 Nanog+ epiblast cells. These data indicate the early priming exists in epiblast cells before the commitment to cardiac/mesodermal differentiation.

      Author response image 10.

      Gene expression and DAE activity levels of E7.0 Nanog+ epiblast cells shown in UMAP layout.

      Q15R3: In the absence of duplicates, it is impossible to statistically compare the proportions of mesodermal cell populations in Hand1 wild-type and knockout (KO) embryos or to assess for abnormal accumulation of PS, NM, and MM cells. Could the authors analyse the proportions of cells by careful imaging of Hand1 wild-type and KO embryos instead?

      R15R3: Thank you for your important question. To assess the proportions of mesodermal cell populations in E7.25 wild-type and Hand1-CKO embryos, we analyzed the serial coronal sections of the extraembryonic portions and performed staining of the Vim gene, which marks the extra-embryonic mesodermal (EEM) cells (Figure S8D). We then counted the numbers of mesodermal/Vim+ EEM cells and calculated the relative proportion of Vim+ EEM cells in each section. The proportion of Vim+ EEM cells was statistically lower in the Hand1-CKO embryo, consistent with our model that Hand1 deletion led to blocked MJH specification.

      Q16R3: Could the authors provide high-resolution images for Figure 7 B-C-D as they are currently hard to interpret?

      R16R3: Thank you for your suggestion. We have replaced Figure 7B-C-D with high-resolution images.

      Recommendations for the authors:  

      Reviewing Editor Comments:

      Discussions among reviewers emphasize the importance of better addressing and validating the trajectory analysis by using more common and alternative bioinformatics and spatial approaches. Further discussion on whether there is a common transcriptional progenitor between the two trajectories is also required to enhance the significance of the study. For functional analysis, further validations are needed as the current data only partially support the claims. Please see public reviews for details.

      Reviewer #2 (Recommendations For The Authors):

      Beyond the suggestions made in the public review, below are some minor aspects for consideration:

      The manuscript is well written overall but may benefit from a thorough read-through and editing of some minor grammatical errors.

      We have carefully read through the manuscript and corrected minor grammatical errors to improve clarity and readability.

      Figure 2C: RNA velocity information gets largely lost due to the color choice of EEM and MM (black) on which the direction of arrows can't be appreciated.

      We have updated the color scheme in Figure 2C.

      Figure 6D: sample information is partially cut off in the graph.

      Sample information is completely shown now.

      The last paragraph of the discussion has some formatting issues with the references.

      We have corrected the formatting issues with the references.

      The methods and results section does not comment on if, or how many embryos were pooled for the sequencing analysis performed for this study.

      We have added the numbers of embryos for sequencing analyses in the methods section.

      Reviewer #3 (Recommendations For The Authors):

      Minor:

      In the discussion, authors could reconsider the sentence: "The process of cardiac lineage segregation is a complex one that may involve TF regulatory networks and signaling pathways," as it is not informative.

      We have re-written the sentence as: “Thus, additional regulation must exist and instructs the process of JCF-SHF lineage segregation.”

    1. Podíl výdajů na bydlení na spotřebě

      A budeme to tu tedy nechávat? Já myslím, že jsou dvě možnosti: 1) necháme, dáme odkaz na Evropu v datech a poukážeme na problematiku tohoto indikátoru (máme tam také tu část srovnání s jinými metodikami apod.) nebo to prostě úplně smažeme a můžeme alespoň do metodiky napsat, že tento údaj dále nesledujeme a to z toho a toho důvodu

    1. Reviewer #2 (Public review):

      Summary:

      This study focused on the roles of the nuclear envelope proteins lamin A and C, as well as nesprin-2, encoded by the LMNA and SYNE2 genes, respectively, on gene expression and chromatin mobility. It is motivated by the established role of lamins in tethering heterochromatin to the nuclear periphery in lamina-associated domains (LADs) and modulating chromatin organization. The authors show that depletion of lamin A, lamin A and C, or nesprin-2 results in differential effects of mRNA and lnRNA expression, primarily affecting genes outside established LADs. In addition, the authors used fluorescent dCas9 labeling of telomeric genomic regions combined with live-cell imaging to demonstrate that depletion of either lamin A, lamin A/C, or nesprin-2 increased the mobility of chromatin, suggesting an important role of lamins and nesprin-2 on chromatin dynamics.

      Strengths:

      The major strength of this study is the detailed characterization of changes in transcript levels and isoforms resulting from depletion of either lamin A, lamin A/C, or nesprin-2 in human osteosarcoma (U2OS) cells. The authors use a variety of advanced tools to demonstrate the effect of protein depletion on specific gene isoforms and to compare the effects on mRNA and lncRNA levels.

      The TIRF imaging of dCas9 labeled telomeres allows for high resolution tracking of multiple telomeres per cell, thus enabling the authors to obtain detailed measurements of the mobility of telomeres within living cells and the effect of lamin A/C or nesprin-2 depletion.

      Weaknesses:

      Although the findings presented by the authors overall confirm existing knowledge about the ability of lamins A/C and nesprin to broadly affect gene expression, chromatin organization, and chromatin dynamics, the specific interpretation and the conclusions drawn from the data presented in this manuscript are limited by several technical and conceptual challenges.

      One major limitation is that the authors only assess the knockdown of their target genes on the mRNA level, where they observe reductions of around 70%. Given that lamins A and C have long half-lives, the effect at the protein level might be even lower. This incomplete and poorly characterized depletion on the protein level makes interpretation of the results difficult. Assessing the effect of the knockdown on the protein level would provide more detailed information both on the extent of the actual protein depletion and the effect on specific lamin isoforms. Similarly, given that nesprin-2 has numerous isoforms resulting from alternative splicing and transcription initiation. In the current form of the manuscript, it remains unclear which specific nesprin-2 isoforms where depleted, and by what extent (on the protein level).

      Another substantial limitation of the manuscript is that the current analysis, with exception of the chromatin mobility measurements, is exclusively based on transcriptomic measurements by RNA-seq and qRT-PCR, without any experimental validation of the predicted protein levels or proposed functional consequences. As such, conclusions about the importance of lamin A/C on RNA synthesis and other functions are derived entirely from gene ontology terms and are not sufficiently supported by experimental data. Thus, the true functional consequences of lamin A/C or nesprin depletion remain unclear.

      Another substantial weakness is that the data and analysis presented in the manuscript raise some concerns about the robustness of the findings. Given that the 'shLMNA' construct is expected to deplete both lamin A and C, i.e., its effect encompasses the depletion of lamin A, which is achieved by the 'shLaminA' construct, one would expect a substantial overlap between the DEGs in the shLMNA and shLaminA conditions, with the shLMNA depletion producing a broader effect as it targets both lamin A and C. However, the Venn Diagram in Figure 4a, the genomic loci distribution in Figure 4b, and the correlation analysis in Suppl. Fig. S2 show little overlap between the shLMNA and shLaminA conditions, which is quite surprising. In the mapping of the DEGs shown in Fig. 4b, it is also surprising not to see the gene targeted by the shRNA, LMNA, found on chromosome 1, in the results for the shLMNA and shLamin A depletion.

      The correlation analysis in Suppl. Figure S2 raises further questions. The authors use dox-inducible shRNA constructs to target lamin A (shLaminA), lamin A/C (shLMNA), or nesprin-2 (shSYNE2). Thus, the no-dox control (Ctr) for each of these constructs would be expected to be very similar to the non-target scrambled controls (Ctrl.shScramble and Dox.shScramble). However, in the correlation matrix, each of the no-dox controls clusters more closely with the corresponding dox-induced shRNA condition than with the Ctrl.shScramble or Dox.shScramble conditions, suggesting either a very leaky dox-inducible system, effects from clonal selection (although less likely, giving the pooling of three clones), or substantial batch effects in the processing. Either of these scenarios could substantially affect the interpretation of the findings.

      The premise of the authors that lamins would only affect peripheral chromatin and genes at LADs neglects the fact that lamins A and C are also found in the nuclear interior, where they form stable structure and influence chromatin organization, and the fact that lamins A and C and nesprins additionally interact with numerous transcriptional regulators such as Rb, c-Fos, and beta-catenins, which could further modulate gene expression when lamins or nesprins are depleted.

      The comparison of the identified DEGs to genes contained in LADs might be confounded by the fact that the authors relied on the identification of LADs from a previous study, which used a different human cell type (human skin fibroblasts) instead of the U2OS osteosarcoma cells used in the present study. As LADs are often highly cell type specific, the use of the fibroblast data set could lead to substantial differences in LADs.

      Overall appraisal and context:

      Despite its limitations, the present study further illustrates the important roles the nuclear envelope proteins lamin A, lamin C, and nesprin-2 have in chromatin organization, dynamics, and gene expression. It thus confirms results from previous studies previously reported for lamin A/C depletion. For example, the effect of lamin A/C depletion on increasing mobility of chromatin, had already been demonstrated by several other groups, such as Bronshtein et al. Nature Comm 2015 (PMID: 26299252) and Ranade et al. BMC Mol Cel Biol 2019 (PMID: 31117946). Additionally, the effect of lamin A/C depletion on gene and protein expression has already been extensively studied in a variety of other cell lines and model systems, including detailed proteomic studies (PMIDs 23990565 and 35896617).

      The finding that that lamin A/C or nesprin depletion not only affects genes at the nuclear periphery but also the nuclear interior is not particularly surprising giving the previous studies and the fact that lamins A and C are also founding within the nuclear interior, where they affect chromatin organization and dynamics, and that lamins A/C and nesprins directly interact with numerous transcriptional regulators that could further affect gene expression independent from their role in chromatin organization.

      The isoform specific effects of LMNA depletion on chromatin mobility and gene expression are not entirely surprising, as recent work by the Medalia group identified a lamin A-specific chromatin binding site not present in lamin C (PMID: 40750945). This work should be cited in the manuscript.

      The authors provide a detailed analysis of isoform switching in response to lamin A/C or nesprin-depletion, but the underlying mechanism remains unclear. Similarly, their analysis of the genomic location of the observed DEGs shows the wide-ranging effects of lamin A/C or nesprin depletion, but lets the reader wonder how these effects are mediated. A more in-depth analysis of predicted regulator factors and their potential interaction with lamins A/C or nesprin would be beneficial in gaining more mechanistic insights.

      Additional note regarding the revised manuscript:

      The authors have made several revisions to the manuscript, including the title and abstract. The above comments have been updated to reflect the latest manuscript version.

      These text revisions made by the authors provide some more detailed discussion of context and interpretation of the work, improving the clarity of the manuscript. However, they do not fundamentally alleviate many of the concerns previously expressed regarding the lack of mechanistic insights and various technical aspects of the study, i.e., use of a single shRNA for knockdown, lack of knockdown validation on the protein level, potential off-target effects of the shRNA, batch-effects of the transcriptomic analysis, cell-type specific differences in LADs, etc. Without further experimental data, the manuscript offers a mostly descriptive analysis on the effect of LMNA and SYNE2 depletion on gene expression and telomere mobility. The manuscript might be useful as a reference data sets for comparison with other LMNA or SYNE2 depletion studies, albeit with various caveats regarding its interpretation due to the technical concerns raised by the reviewers.

    2. Author response:

      The following is the authors’ response to the original reviews.

      Reviewer #1 (Public review):

      This manuscript reports a descriptive study of changes in gene expression after knockdown of the nuclear envelope proteins lamin A/C and Nesprin2/SYNE2 in human U2OS cells. The readout is RNA-seq, which is analyzed at the level of gene ontology and focused investigation of isoform variants and non-coding RNAs. In addition, the mobility of telomeres is studied after these knockdowns, although the rationale in relation to the RNA-seq analyses is rather unclear.

      We sincerely thank the reviewer for the thoughtful summary and valuable feedback. Regarding the telomere mobility analyses, our intention was to provide additional evidence supporting the hypothesis that knockdown of lamins and nesprins disrupts nuclear architecture. Although the connection to the RNA-seq data was not explicitly detailed, we believe that the increased telomere mobility may reflect broader changes in chromatin organization, which could contribute to the observed differential gene expression. We have revised the manuscript to clarify this rationale and improve the integration between the two analyses.

      RNA-seq after knockdown of lamin proteins has been reported many times, and the current study does not provide significant new insights that help us to understand how lamins control gene expression. This is particularly because the vast majority of the observed effects on gene expression appear to occur in regions that are not bound by lamin A. It seems likely that these effects are indirect. There is also virtually no overlap between genes affected by laminA/C and by SYNE2, which remains unexplained; for example, it would be good to know whether laminA/C and SYNE2 bind to different genomic regions. The claim in the Title and Abstract that LMNA governs gene expression / acts through chromatin organization appears to be based only on an enrichment of gene ontology terms "DNA conformation change" and "covalent chromatin conformation" in the RNA-seq data. This is a gross over-interpretation, as no experimental data on chromatin conformation are shown in this study. The analyses of transcript isoform switching and ncRNA expression are potentially interesting but lack a mechanistic rationale: why and how would these nuclear envelope proteins regulate these aspects of RNA expression? The effects of lamin A on telomere movements have been reported before; the effects of SYNE2 on telomere mobility are novel (to my knowledge), but should be discussed in the light of previously documented effects of SUN1/2 on the dynamics of dysfunctional telomeres (Lottersberger et al, Cell 2015).

      We sincerely thank the reviewer for this thoughtful and detailed critique. We agree that RNA-seq following knockdown of lamin proteins has been previously reported and appreciate the concern regarding the novelty and mechanistic interpretation of our findings. However, For our study, we revealed novel findings that there is distinct isoform switching and lncRNA affected by lamins and nesprins, which have not been reported yet by previous studies. Furthermore, we also revealed not only lamin A, but also nesprin-2 could also affect chromatin mobility.

      For the analysis of LMNA ChIP-seq data from  human fibroblast (Kohta Ikegami, 2021). Their data revealed that Lamin A/C modulates gene expression through interactions with enhancers. The pathogenesis of disorders associated with LMNA mutations may stem primarily from disruptions in this gene regulatory function, rather than from impaired tethering of chromatin to LADs.

      We acknowledge the reviewer’s concern that gene ontology enrichment related to chromatin conformation alone is insufficient to support claims about chromatin structural changes. We have therefore revised the “Title” and “Abstract” to avoid overstating conclusions and to more accurately reflect the scope of our data.

      Regarding telomere dynamics, while Lamin A's role has indeed been previously documented, our study provides evidence that SYNE2/Nesprin-2 also regulates telomere mobility. We have now expanded the discussion to include prior work, particularly the findings of Lottersberger et al. (Cell, 2015), to better contextualize our results and distinguish the contributions of SYNE2.

      Finally, we appreciate the reviewer’s suggestion about transcript isoform and noncoding RNA expression. While our study primarily provides descriptive data, we agree that further mechanistic investigation is warranted. We have clarified this point in the “Discussion” and framed our findings as a foundation for future studies exploring the broader regulatory roles of nuclear envelope proteins.

      We are grateful for the reviewer’s comments, which have helped us improve the clarity and rigor of our manuscript. Please see the revised highlights in our revised manuscript.

      As indicated below, I have substantial concerns about the experimental design of the knockdown experiments.

      Altogether, the results presented here are primarily descriptive and do not offer a significant advance in our understanding of the roles of LaminA and SYNE2 in gene regulation or chromatin biology, because the results remain unexplained mechanistically and functionally. Furthermore, the RNAseq datasets should be interpreted with caution until off-target effects of the shRNAs can be ruled out.

      We fully acknowledge that the original version of our manuscript lacked sufficient mechanistic insight. In response, we have revised the manuscript to include additional analyses and explanations that clarify the potential functional relevance of our findings. For example, we added following text “These findings further underscore the functional relevance of lamin A in coordinating transcriptional programs through modulation of nuclear architecture. In contrast, LMNA knockdown led to differential expression of genes enriched in pathways related to chromatin organization, suggesting potential disruptions in chromatin regulatory networks. Although direct measurements of chromatin conformation were not performed, these transcriptional changes indicate that LMNA may contribute to maintaining nuclear architecture and genomic stability, which aligns with its established involvement in laminopathies and genome integrity disorders.“ More analyses could be found in the main text.

      Regarding the concern about off-target effects of the shRNA-based knockdowns, we agree that this is an important consideration. While shRNA approaches inherently carry the risk of off-target effects, we have now performed additional analyses that help address this issue. These analyses support the specificity of our observations and suggest that the majority of gene expression changes are likely to be directly related to the targeted knockdown. Nonetheless, we have clearly stated the limitations of the approach in the revised discussion and emphasized the need for future validation using complementary methods.

      We hope that these revisions strengthen the overall impact and interpretability of our study.

      Specific comments:

      (1) Knockdowns were only monitored by qPCR. Efficiency at the protein level (e.g., Western blots) needs to be determined.

      We agree that complementary protein-level validation (e.g., by Western blot) would strengthen the findings, and we are in the process of obtaining suitable reagents to address this point in future experiments. We have now clarified this limitation in the revised manuscript  

      (2) For each knockdown, only a single shRNA was used. shRNAs are infamous for offtarget effects; therefore, multiple shRNAs for each protein, or an alternative method such as CRISPR deletion or degron technology, must be tested to rule out such offtarget effects.

      We fully acknowledge the concern regarding the use of only a single shRNA per knockdown and agree that shRNAs are prone to off-target effects. We recognize the importance of validating our findings using multiple independent shRNAs or alternative knockdown strategies, such as CRISPR deletion or degron-based approaches, to ensure specificity. To address this concern, we have conducted qPCR confirmation the knockdown of target proteins from RNA-seq findings, further supporting the validity of our data. In line with this, we are currently optimizing an auxin-inducible degron system (AtAFB2) for targeted and controlled depletion of lamin C. Our preliminary results indicate approximately a 40% knockdown efficiency after 16 hours of auxin induction, highlighting the necessity for further system optimization (Author response image 1). Future experiments will integrate this improved degron technology alongside multiple independent approaches to rigorously address and mitigate concerns about off-target effects, thereby enhancing the robustness and reproducibility of our data.

      Author response image 1.

      FACS analysis of the lamin C degron system at 0, 1, 3, and 16 hours postinduction with 500 μM indole-3-acetic acid (IAA) (Sigma).

      (3) It is not clear whether the replicate experiments are true biological replicates (i.e., done on different days) or simply parallel dishes of cells done in a single experiment (= technical replicates). The extremely small standard deviations in the RT-qPCR data suggest the latter, which would not be adequate.

      We appreciate the reviewer’s insightful comment regarding the nature of our replicates. The RT-qPCR experiments were indeed performed as true biological replicates, with samples collected on different days and from independently cultured cell batches. We have added this to the manuscript Methods. While we observed some variability in the Scramble control group, the low standard deviations in the shRNAtreated samples likely reflect the consistent and efficient knockdown of target genes.

      For the RNA-seq experiments, samples were collected as two batches during RNA extraction and library preparation. The samples still represent biological replicates, as they were derived from independently prepared cultures in separate experimental setups. This approach was chosen to strike a balance between biological variation and technical consistency, thereby improving the reliability of the RNA-seq results.

      Reviewer #2 (Public review):

      Summary:

      This study focused on the roles of the nuclear envelope proteins lamin A and C, as well as nesprin-2, encoded by the LMNA and SYNE2 genes, respectively, on gene expression and chromatin mobility. It is motivated by the established role of lamins in tethering heterochromatin to the nuclear periphery in lamina-associated domains (LADs) and modulating chromatin organization. The authors show that depletion of lamin A, lamin A and C, or nesprin-2 results in differential effects of mRNA and lncRNA expression, primarily affecting genes outside established LADs. In addition, the authors used fluorescent dCas9 labeling of telomeric genomic regions combined with live-cell imaging to demonstrate that depletion of either lamin A, lamin A/C, or nesprin-2 increased the mobility of chromatin, suggesting an important role of lamins and nesprin2 in chromatin dynamics.

      We sincerely appreciate the reviewer’s thoughtful summary of our study and the key findings. Our work is indeed motivated by the well-established roles of lamin A/C in chromatin tethering at the nuclear periphery and the emerging understanding of their broader influence on chromatin organization and gene regulation. In our study, we aimed to further explore these roles by examining the consequences of depleting lamin A, lamin A/C, and nesprin-2 (SYNE2) on both gene expression and chromatin mobility.

      As the reviewer accurately notes, we observed differential effects on mRNA and lncRNA expression, with many changes occurring outside of previously defined LADs. This finding suggests that lamins and nesprin-2 may also influence transcriptional regulation through mechanisms beyond direct LAD association. Furthermore, using live-cell imaging of fluorescently labeled telomeric regions, we demonstrated that loss of these nuclear envelope components leads to increased chromatin mobility, supporting their role in maintaining chromatin stability and nuclear architecture.

      We thank the reviewer for highlighting these aspects, which we believe contribute to a more nuanced understanding of how nuclear envelope proteins modulate chromatin behavior and gene regulation.

      Strengths:

      The major strength of this study is the detailed characterization of changes in transcript levels and isoforms resulting from depletion of either lamin A, lamin A/C, or nesprin-2 in human osteosarcoma (U2OS) cells. The authors use a variety of advanced tools to demonstrate the effect of protein depletion on specific gene isoforms and to compare the effects on mRNA and lncRNA levels.

      The TIRF imaging of dCas9-labeled telomeres allows for high-resolution tracking of multiple telomeres per cell, thus enabling the authors to obtain detailed measurements of the mobility of telomeres within living cells and the effect of lamin A/C or nesprin-2 depletion.

      We are grateful that the reviewer recognized the comprehensive analysis of transcript and isoform changes upon depletion of lamin A, lamin A/C, or nesprin-2 in U2OS cells. We also thank the reviewer for acknowledging our use of advanced tools to investigate isoform-specific effects and to distinguish between changes in mRNA and lncRNA expression.

      Furthermore, we are pleased that the reviewer highlighted the strength of our TIRF imaging approach using dCas9-labeled telomeres. This technique enabled us to capture high-resolution, multi-locus dynamics within single living cells, and we agree that it is instrumental in revealing the impact of lamin A/C and nesprin-2 depletion on telomere mobility.

      Weaknesses:

      Although the findings presented by the authors overall confirm existing knowledge about the ability of lamins A/C and nesprin to broadly affect gene expression, chromatin organization, and chromatin dynamics, the specific interpretation and the conclusions drawn from the data presented in this manuscript are limited by several technical and conceptual challenges.

      One major limitation is that the authors only assess the knockdown of their target genes on the mRNA level, where they observe reductions of around 70%. Given that lamins A and C have long half-lives, the effect at the protein level might be even lower. This incomplete and poorly characterized depletion on the protein level makes interpretation of the results difficult. The description for the shRNA targeting the LMNA gene encoding lamins A and C given by the authors is at times difficult to follow and might confuse some readers, as the authors do not clearly indicate which regions of the gene are targeted by the shRNA, and they do not make it obvious that lamin A and C result from alternative splicing of the same LMNA gene. Based on the shRNA sequences provided in the manuscript, one can conclude that the shLaminA shRNA targets the 3' UTR region of the LMNA gene specific to prelamin A (which undergoes posttranslational processing in the cell to yield lamin A). In contrast, the shRNA described by the authors as 'shLMNA' targets a region within the coding sequence of the LMNA gene that is common to both lamin A and C, i.e., the region corresponding to amino acids 122-129 (KKEGDLIA) of lamin A and C. The authors confirm the isoform-specific effect of the shLaminA isoform, although they seem somewhat surprised by it, but do not confirm the effect of the shLMNA construct. Assessing the effect of the knockdown on the protein level would provide more detailed information both on the extent of the actual protein depletion and the effect on specific lamin isoforms. Similarly, given that nesprin-2 has numerous isoforms resulting from alternative splicing and transcription initiation. In the current form of the manuscript, it remains unclear which specific nesprin-2 isoforms were depleted, and to what extent (on the protein level).

      We have revised the Methods section to include a clearer and more detailed description of the shRNA design, including the specific regions of the LMNA gene targeted by each construct, as well as the relationship between lamin A and C isoforms resulting from alternative splicing. We agree that this clarification will help prevent confusion for readers.

      Regarding the shLMNA construct, we acknowledge the importance of confirming the knockdown at the protein level, especially given the long half-lives of lamin proteins. In our revised manuscript, we now refer to Supplementary Figure S2, which demonstrates that the shLMNA construct effectively reduces both lamin A and lamin C transcript levels. While we initially focused on mRNA quantification, we recognize that additional proteinlevel validation is valuable and have accordingly emphasized this point in the revised discussion.

      We also appreciate the comment on nesprin-2 isoforms. Given the complexity of nesprin-2 splicing, we are currently working to further characterize the specific isoforms affected and will aim to include protein-level data in a future study. 

      Another substantial limitation of the manuscript is that the current analysis, with the exception of the chromatin mobility measurements, is exclusively based on transcriptomic measurements by RNA-seq and qRT-PCR, without any experimental validation of the predicted protein levels or proposed functional consequences. As such, conclusions about the importance of lamin A/C on RNA synthesis and other functions are derived entirely from gene ontology terms and are not sufficiently supported by experimental data. Thus, the true functional consequences of lamin A/C or nesprin depletion remain unclear. Statements included in the manuscript such as "our findings reveal that lamin A is essential for RNA synthesis, ..." (Lines 79-80) are thus either inaccurate or misleading, as the current data do not show that lamin A is ESSENTIAL for RNA synthesis, and lamin A/C and lamin A deficient cells and mice are viable, suggesting that they are capable of RNA synthesis.

      We agree that our current data do not support the claim that lamin A is essential for RNA synthesis, and we acknowledge the importance of distinguishing between correlation and causal relations in our conclusions. In light of this, we have revised the statement in the manuscript to more accurately reflect our findings:

      “Our findings suggest that lamin A contributes to RNA synthesis, supports chromatin spatial organization through LMNA, and that SYNE2 influences chromatin modifications as reflected in transcript levels.”

      We hope this revision better aligns with the limitations of our dataset and addresses the reviewer’s concerns regarding the interpretation of functional consequences based solely on transcriptomic data.

      Another substantial weakness is that the data and analysis presented in the manuscript raise some concerns about the robustness of the findings. Given that the 'shLMNA' construct is expected to deplete both lamin A and C, i.e., its effect encompasses the depletion of lamin A, which is achieved by the 'shLaminA' construct, one would expect a substantial overlap between the DEGs in the shLMNA and shLaminA conditions, with the shLMNA depletion producing a broader effect as it targets both lamin A and C. However, the Venn Diagram in Figure 4a, the genomic loci distribution in Figure 4b, and the correlation analysis in Supplementary Figure S2 show little overlap between the shLMNA and shLaminA conditions, which is quite surprising. In the mapping of the DEGs shown in Figure 4b, it is also surprising not to see the gene targeted by the shRNA, LMNA, found on chromosome 1,  in the results for the shLMNA and shLamin A depletion.

      We have added the discussion into the revised edition: “Interestingly, although both shLMNA and shLaminA constructs target lamin A, with shLMNA additionally depleting lamin C, the DEGs identified under these two conditions show limited overlap. This unexpected finding suggests that depletion of lamin C in the shLMNA condition may trigger distinct or compensatory transcriptional responses that are not elicited by lamin A knockdown alone. Furthermore, variation in shRNA efficiency or off-target effects may contribute to these differences. Notably, despite directly targeting LMNA, the overlap in DEGs between the two conditions remained limited under our stringent threshold criteria. Together, these observations highlight the complex and non-linear regulatory roles of lamin isoforms in gene expression and underscore the need for further mechanistic studies to dissect their individual and combined contributions [28,29].”

      The correlation analysis in Supplementary Figure S2 raises further questions. The authors use doc-inducible shRNA constructs to target lamin A (shLaminA), lamin A/C (shLMNA), or nesprin-2 (shSYNE2). Thus, the no-dox control (Ctr) for each of these constructs would be expected to be very similar to the non-target scrambled controls (Ctrl.shScramble and Dox.shScramble). However, in the correlation matrix, each of the no-dox controls clusters more closely with the corresponding dox-induced shRNA condition than with the Ctrl.shScramble or Dox.shScramble conditions, suggesting either a very leaky dox-inducible system, strong effects from clonal selection, or substantial batch effects in the processing. Either of these scenarios could substantially affect the interpretation of the findings. For example, differences between different clonal cell lines used for the studies, independent of the targeted gene, could explain the limited overlap between the different shRNA constructs and result in apparent differences when comparing these clones to the scrambled controls, which were derived from different clones.

      We thank the reviewer for this thoughtful observation. We would like to clarify that the samples shown in Supplementary Figure S2 were processed and sequenced in two separate batches, and the data presented in the correlation matrix are unnormalized. As such, batch effects are indeed present and likely contribute to the clustering pattern observed, particularly the closer similarity between the dox-induced and no-dox samples for each individual shRNA construct.

      Importantly, our analyses focus on within-construct comparisons (i.e., doxycyclinetreated vs untreated samples for the same shRNA), rather than direct comparisons across different constructs or scrambled controls. Each experimental pair (dox vs nodox) was processed in parallel within its respective batch to ensure internal consistency. Thus, while the global clustering pattern may reflect batch-related differences or baseline variations between independently derived cell lines, these factors do not affect the main conclusions drawn from the within-construct differential expression analysis.

      The manuscript also contains several factually inaccurate or incorrect statements or depictions. For example, the depiction of the nuclear envelope in Figure 1 shows a single bilipid layer, instead of the actual double bi-lipid layer of the inner and outer nuclear membranes that span the nuclear lumen. The depiction further lacks SUN domain proteins, which, together with nesprins, form the LINC complex essential to transmit forces across the nuclear envelope. The statement in line 214 that "Linker of nucleoskeleton and cytoskeleton (LINC) complex component nesprin-2 locates in the nuclear envelope to link the actin cytoskeleton and the nuclear lamina" is not quite accurate, as nesprin-2 also links to microtubules via dynein and kinesin.

      We sincerely thank the reviewer for pointing out these important inaccuracies. In response, we have revised Figure 1 to accurately depict the nuclear envelope as a double bi-lipid membrane and included SUN domain proteins to better reflect the structural components of the LINC complex. Additionally, we have updated the statement and citations 

      This is the revised part that is incorporated in the manuscript “The linker of nucleoskeleton and cytoskeleton (LINC) complex component nesprin-2 is a nuclear envelope protein that connects the nucleus to the cytoskeleton by interacting not only with actin filaments but also with microtubules through motor proteins such as dynein and kinesin. This structural linkage contributes to cellular architecture and facilitates mechanotransduction between the nuclear interior and the extracellular matrix (ECM) [8,21]

      ”We appreciate the reviewer’s insights, which have helped improve the accuracy and clarity of our manuscript.

      The statement that "Our data show that Lamin A knockdown specifically reduced the usage of its primary isoform, suggesting a potential role in chromatin architecture regulation, while other LMNA isoforms remained unaffected, highlighting a selective effect" (lines 407-409) is confusing, as the 'shLaminA' shRNA specifically targets the 3' UTR of lamin A that is not present in the other isoforms. Thus, the observed effect is entirely consistent with the shRNA-mediated depletion, independent of any effects on chromatin architecture.

      We have rephrased the statement “Our data show that knockdown with shLaminA, which specifically targets the 3' UTR unique to the lamin A isoform, selectively reduced lamin A expression without affecting other LMNA isoforms.”

      The premise of the authors that lamins would only affect peripheral chromatin and genes at LADs neglects the fact that lamins A and C are also found in the nuclear interior, where they form stable structure and influence chromatin organization, and the fact that lamins A and C and nesprins additionally interact with numerous transcriptional regulators such as Rb, c-Fos, and beta-catenins, which could further modulate gene expression when lamins or nesprins are depleted.

      Based on the reviewer’s comment we have added the statement into Discussion part “Beyond their well-established role in tethering heterochromatin at the nuclear periphery through lamina-associated domains (LADs), A-type lamins (lamins A and C) also localize to the nuclear interior, where they contribute to chromatin organization and gene regulation independently of LADs [27,28]. Nuclear lamins can form intranuclear foci that associate with active chromatin and are implicated in supporting transcriptional activity. Additionally, both lamins and nesprins participate in diverse protein-protein interactions that may influence transcriptional regulation. For example, lamin A/C interacts with the retinoblastoma protein (Rb) to modulate E2F-dependent transcription [29], and with c-Fos to regulate its nuclear retention and activity [30]. While βcatenin acts as a co-activator in Wnt signaling relies on nuclear translocation and interaction with transcriptional complexes, and evidence suggests that nuclear architecture and envelope components, including nesprins, can influence this process [31]. Therefore, the observed gene expression changes following depletion of lamins or nesprins are likely not restricted to genes located within lamina-associated domains (LADs), but may also result from broader perturbations in nuclear architecture and transcriptional regulatory networks. This is consistent with our findings that lamins and nesprins influence gene expression in distal, non-LAD regions.”

      The comparison of the identified DEGs to genes contained in LADs might be confounded by the fact that the authors relied on the identification of LADs from a previous study (ref #28), which used a different human cell type (human skin fibroblasts) instead of the U2OS osteosarcoma cells used in the present study. As LADs are often highly cell-type specific, the use of the fibroblast data set could lead to substantial differences in LADs.

      DamID in various mammalian cell types has shown that some LADs are cell-type invariant (constitutive LADs [cLADs]), while others interact with the NL in only certain cell types (facultative LAD [fLADs]) (Bas van Steensel, 2017). We agree that facultative LADs (fLADs), which comprise approximately half of all LADs, are often highly cell-type specific. We acknowledge that this specificity may influence the interpretation of our findings. At present, publicly available LAD datasets for U2OS cells are limited to those associated with LMNB. We concur that generating LMNA-specific LAD maps in U2OS cells would enhance the accuracy and relevance of our analyses, and we view this as an important direction for future research.

      Another limitation of the current manuscript is that, in the current form, some of the figures and results depicted in the figures are difficult to interpret for a reader not deeply familiar with the techniques, based in part on the insufficient labeling and figure legends. This applies, for example, to the isoform use analysis shown in Figure 3d or the GenometriCorr analysis quantifying spatial distance between LADs and DEGs shown in Figure 4c.

      For Figure 3, we added text in the caption to make the figure more readable “Isoform switching analysis reveals differential expression of alternative transcript variants between conditions, highlighting a shift in predominant isoform usage.” For Figure 4c, we added text in the caption “GenometriCorr analysis was used to quantify the spatial relationship between LADs and DEGs, evaluating whether the observed genomic proximity deviates from random expectation through empirical distributionbased statistical testing of pairwise distances between genomic intervals.” And also in the ‘Methods”.

      Overall appraisal and context:

      Despite its limitations, the present study further illustrates the important roles the nuclear envelope proteins lamin A, lamin C, and nesprin-2 have in chromatin organization, dynamics, and gene expression. It thus confirms results from previous studies (not always fully acknowledged in the current manuscript) previously reported for lamin A/C depletion. For example, the effect of lamin A/C depletion on increasing mobility of chromatin had already been demonstrated by several other groups, such as Bronshtein et al. Nature Comm 2015 (PMID: 26299252) and Ranade et al. BMC Mol Cel Biol 2019 (PMID: 31117946). Additionally, the effect of lamin A/C depletion on gene and protein expression has already been extensively studied in a variety of other cell lines and model systems, including detailed proteomic studies (PMIDs 23990565 and 35896617).

      We add more discussions as below “Our findings reinforce the pivotal roles of nuclear envelope proteins lamin A, LMNA and nesprin 2 in regulating chromatin organization, chromatin mobility, and gene expression. These results are consistent with and extend prior studies investigating the consequences of lamin depletion. For instance, increased chromatin mobility following the loss of lamin A/C has been previously demonstrated using live-cell imaging approaches [26,35], supporting our observations of nuclear structural relaxation and chromatin redistribution. Additionally, proteomic profiling following lamin A depletion has been extensively documented across both cellular and mouse models, providing valuable insights into the molecular consequences of nuclear envelope disruption [36,37]. While these earlier studies provide a strong foundation, our work contributes novel insights by integrating isoform-specific perturbations with spatial chromatin measurements. This approach emphasizes contextdependent regulatory mechanisms that involve not only lamina-associated regions but also nesprin-associated domains and distal genomic loci, thereby expanding the current understanding of nuclear envelope protein function in gene regulation.”

      The finding that that lamin A/C or nesprin depletion not only affects genes at the nuclear periphery but also the nuclear interior is not particularly surprising giving the previous studies and the fact that lamins A and C are also founding within the nuclear interior, where they affect chromatin organization and dynamics, and that lamins A/C and nesprins directly interact with numerous transcriptional regulators that could further affect gene expression independent from their role in chromatin organization.

      We have added the following statement into the Discussion part “Beyond their well-established role in tethering heterochromatin at the nuclear periphery through lamina-associated domains (LADs), A-type lamins (lamins A and C) also localize to the nuclear interior, where they contribute to chromatin organization and gene regulation independently of LADs [27,28]. Nuclear lamins can form intranuclear foci that associate with active chromatin and are implicated in supporting transcriptional activity. Additionally, both lamins and nesprins participate in diverse protein-protein interactions that may influence transcriptional regulation. For example, lamin A/C interacts with the retinoblastoma protein (Rb) to modulate E2F-dependent transcription [29], and with c-Fos to regulate its nuclear retention and activity [30]. While β-catenin acts as a co-activator in Wnt signaling relies on nuclear translocation and interaction with transcriptional complexes, and evidence suggests that nuclear architecture and envelope components, including nesprins, can influence this process [31]. Therefore, the observed gene expression changes following depletion of lamins or nesprins are likely not restricted to genes located within lamina-associated domains (LADs), but may also result from broader perturbations in nuclear architecture and transcriptional regulatory networks. This is consistent with our findings that lamins and nesprins influence gene expression in distal, non-LAD regions.”

      The authors provide a detailed analysis of isoform switching in response to lamin A/C or nesprin depletion, but the underlying mechanism remains unclear. Similarly, their analysis of the genomic location of the observed DEGs shows the wide-ranging effects of lamin A/C or nesprin depletion, but lets the reader wonder how these effects are mediated. A more in-depth analysis of predicted regulator factors and their potential interaction with lamins A/C or nesprin would be beneficial in gaining more mechanistic insights.

      We agree that the current findings, while highlighting the broad impact of lamin A/C or nesprin depletion on isoform usage and gene expression, do not fully elucidate the underlying regulatory mechanisms. We acknowledge the importance of identifying upstream regulators and understanding their potential interactions with lamins and nesprins. Future investigations integrating epigenetic approaches, such as ChIP-seq for transcription factors and chromatin-associated proteins, will be essential to clarify how lamins and nesprins contribute to isoform switching and to uncover the mechanistic basis of these regulatory effects.

      Reviewer #3 (Public review):

      Summary:

      This manuscript describes DOX inducible RNAi KD of Lamin A, LMNA coded isoforms as a group, and the LINC component SYNE2. The authors report on differentially expressed genes, on differentially expressed isoforms, on the large numbers of differentially expressed genes that are in iLADs rather than LADs, and on telomere mobility changes induced by 2 of the 3 knockdowns.

      Strengths:

      Overall, the manuscript might be useful as a description for reference data sets that could be of value to the community.

      We acknowledge that the initial version of our manuscript lacked comprehensive comparisons with previous studies. In our revised manuscript, we have included more detailed discussions highlighting how our findings complement and extend existing knowledge. Specifically, our study presents novel insights into the role of lamins and nesprins in regulating non-coding RNAs and isoform switching, areas that have not been extensively explored in prior literatures. We hope these additions will clarify the contribution of our work and demonstrate the potential value to the field.

      Weaknesses:

      The results are presented as a type of data description without formulation of models or explanations of the questions being asked and without follow-up. Thus, conceptually, the manuscript doesn't appear to break new ground.

      In our study, we proposed a conceptual model in which gene expression changes are linked to RNA synthesis, chromatin conformation alterations, and chromatin modifications, potentially mediated by lamin A, LMNA, and nesprin-2 at the transcriptional level. However, we acknowledge that this model remains preliminary and largely unexplored. We agree that additional mechanistic insights and identification of specific regulatory factors are needed to strengthen this framework. Future studies will aim to experimentally validate these hypotheses and clarify the pathways and regulators involved.

      Not discussed is the previous extensive work by others on the nucleoplasmic forms of LMNA isoforms. Also not discussed are similar experiments- for instance, gene expression changes others have seen after lamin A knockdowns or knockouts, or the effect of lamina on chromatin mobility, including telomere mobility - see, for example, a review by Roland Foisner (doi.org/10.1242/jcs.203430) on nucleoplasmic lamina. The authors need to do a thorough search of the literature and compare their results as much as possible with previous work.

      We sincerely thank the reviewer for pointing out the important body of previous work on the nucleoplasmic forms of LMNA isoforms and the impact of lamin A depletion on gene expression and chromatin mobility. In the revised version, we have now included relevant citations. Please see the highlights in the Discussion.

      The authors don't seem to make any attempt to explore the correlation of their findings with any of the previous data or correlate their observed differential gene expression with other epigenetic and chromatin features. There is no attempt to explore the direction of changes in gene expression with changes in nuclear positioning or to ask whether the genes affected are those that interact with nucleoplasmic pools of LMNA isoforms. The authors speculate that the DEG might be related to changing mechanical properties of the cells, but do not develop that further.

      We sincerely appreciate the reviewer’s insightful comments. In our revised manuscript, we have addressed this concern by comparing our telomere mobility results with previously published data (Bronshtein et al., 2015), and we observe consistent findings showing that lamin A depletion leads to increased telomere motility. Furthermore, our study provides novel evidence that nesprin-2 depletion similarly enhances telomere migration, suggesting a broader role for nuclear envelope components in chromatin dynamics.

      We acknowledge the importance of integrating gene expression data with epigenetic and chromatin features. However, to our knowledge, such datasets are currently limited for U2OS cells, particularly in the context of lamin and nesprin perturbation. We agree that understanding the correlation between differentially expressed genes and nuclear positioning or interactions with nucleoplasmic pools of LMNA isoforms is a promising direction. We are actively planning future studies that include chromatin profiling and mechanical perturbation assays to further explore these mechanisms.

      The technical concerns include: 1) Use of only one shRNA per target. Use of additional shRNAs would have reduced concern about possible off-target knockdown of other genes; 2) Use of only one cell clone per inducible shRNA construct. Here, the concern is that some of the observed changes with shRNA KDs might show clonal effects, particularly given that the cell line used is aneuploid. 3) Use of a single, "scrambled" control shRNA rather than a true scrambled shRNA for each target shRNA.

      (1) Regarding the use of a single shRNA per target, we agree that utilizing multiple independent shRNAs would strengthen the conclusions. In our study, we selected validated shRNA sequences with minimal predicted off-targets and confirmed knockdown efficiency at mRNA level (by qPCR).

      (2) As for the use of a single cell clones per inducible construct, we understand the concern that clonal variability, particularly in an aneuploid cell line, could influence the observed phenotypes. To clarify this, we have revised in the manuscript “Multiple independent clones per shRNA were screened for knockdown efficiency using reverse transcription quantitative real-time PCR (RT-qPCR). Three clones demonstrating robust and consistent knockdown were selected and expanded. These clones were subsequently pooled to minimize clonal variability and used for downstream analyses, including RNA-seq”. To mitigate this, we ensured consistent results across biological replicates and used inducible systems to reduce variability introduced by random integration. 

      (3) We also acknowledge that the use of a single scrambled shRNA control, rather than matched scrambled controls for each construct, is a limitation. While we used a standard non-targeting scrambled shRNA commonly applied in similar studies, we understand that distinct scrambled sequences might better control for construct-specific effects. .

      Reviewer #1 (Recommendations for the authors):

      Please make the processed RNA-seq data available for each individual experiment, not only the raw reads and averaged data.

      In response to your suggestion, we have now included the raw count data for each individual experiment in Supplementary Table S5 to enhance transparency and reproducibility.   

      Reviewer #2 (Recommendations for the authors):

      The current text contains numerous typos, and some of the text could benefit from additional editing for clarity and conciseness. In addition, several statements, particularly in the section encompassing lines 321-329, lack supporting references.

      In our revised version, we have carefully edited the text for clarity and conciseness.

      We have included related citations from lines 321-329: “The majority of genes located within LADs tend to be transcriptionally repressed or expressed at low levels. This is because LADs are associated with heterochromatin , a tightly packed form of DNA that is generally inaccessible to the cellular machinery required for gene expression 12,23. Lamin mutations and levels have shown to disrup LAD organization and gene expression that have been implicated in various diseases, including cancer and laminopathies 24,25.”

      The figures would benefit from better labeling, including a clear schematic of which specific regions of the LMNA and SYNE2 genes are targeted by the different shRNA constructs, and by labeling the different isoforms in Figure S1 with the common names. Furthermore, note that lamin A arises from posttranslational processing of prelamin A, not from a different transcript. Likely, the "different LMNA genes" shown in Supplementary Figure S1 are just different annotations, with the exceptions of the splice isoforms lamin C and lamin delta10.

      In the Method, we have clearly denoted the design of corresponding shRNAs as suggested “The shRNA designated as shLMNA targets a region within the coding sequence of LMNA that is shared by both lamin A and lamin C, corresponding to amino acids 122–129 (KKEGDLIA) of lamin A/C (RefSeq: NM_001406985.1). The shRNA against SYNE2 (shSYNE2) targets a sequence encoding amino acids 5133– 5140 (KRYERTEF) of the SYNE2 protein (RefSeq: NM_182914.3).”

      For Figure S1, we have added common isoform names to figure and captions. “lamin A (ENST00000368300.9), LMNA 227 (ENST00000675431.1), pre-lamin A/C (ENST00000676385.2), and lamin C (ENST00000677389.1)."

      Several statements about the novelty of the findings or approach are inaccurate. For example, the authors state in the introduction that "However, whether lamins and nesprins actively govern chromatin remodeling and isoform switching beyond their wellcharacterized functions in mechanotransduction remains an open question", as several previous studies have provided detailed characterization of lamin A/C depletion or mutations on chromatin organization, mobility, and gene expression. The authors should revise these statements and better acknowledge the previous work.

      We have added the citations of previous works and revised the text “While significant progress has been made in understanding the role of lamins in genome organization, the precise mechanisms by which lamins and nesprins regulate gene expression through distal chromatin interactions remain incompletely understood [10,11]. Notably, recent evidence suggests a reciprocal interplay between transcription and chromatin conformation, where gene activity can influence chromatin folding and vice versa [12]. However, whether lamins and nesprins actively govern chromatin remodeling and isoform switching beyond their well-characterized functions in mechanotransduction remains an open question.”

      Reviewer #3 (Recommendations for the authors):

      Overall, the manuscript might be useful as a description for reference data sets that could be of value to the community. Otherwise, I did not derive meaningful biological insights from the manuscript. It was not clear to me also how much might be repeating previous work already reported in the literature (see below). For example, I cited a review on nucleoplasmic lamins by Roland Foisner at the end of the specific comments - scanning it very quickly shows that there are already papers on increased chromatin mobility after lamin perturbations, including telomeres. I know there have also been studies of changes in gene expression after lamin A and B KD. The authors need to do a thorough search of the literature and compare their results as much as possible with previous work.

      We acknowledge that the roles of lamins in regulating chromatin dynamics and gene expression, including the effects of lamin perturbations on chromatin mobility and telomere behavior, have been previously reported. In response, we have revised the manuscript to incorporate relevant citations and to better contextualize our results within the existing literature. Importantly, to our knowledge, the finding that nesprin-2 influences telomere mobility has not been previously reported, and we have highlighted this novel observation in the revised text.

      In response, we have now conducted a more comprehensive literature review and revised the manuscript accordingly to better contextualize our findings. Specifically, we have added comparisons to prior studies reporting chromatin mobility changes following lamin A/C depletion. We also now emphasize the novel aspects of our study, such as the isoform-specific perturbations and the integration of spatial chromatin organization with transcriptomic outcomes.

      We hope these revisions strengthen the manuscript’s contribution as both a useful resource and a mechanistic investigation.

      Not even acknowledged is the previous extensive work on the nucleoplasmic forms of LMNA isoforms - I know Robert Goldman published extensively on this, implicating lamin A, for example, on DNA replication in the nuclear interior as well as transcription. More recently, Roland Foisner worked on this, including with molecular approaches. For example, a 2017 review mentions previous ChIP-seq mapping of lamin A binding to iLAD genes and also describes previous work on chromatin mobility, including telomere mobility. Yet the entire writing in the manuscript seems to only discuss the role of LMNA isoforms in the nuclear lamina per se, explaining the surprise in seeing many iLAD genes differentially expressed after KD.

      We have added related studies as suggested by the reviewer and  added the following statement: “Nucleoplasmic lamins bind to chromatin and have been indicated to regulate chromatin accessibility and spatial chromatin organization [24]. Lamins in the nuclear interior regulate gene expression by dynamically binding to heterochromatic and euchromatic regions, influencing epigenetic pathways and chromatin accessibility. They also contribute to chromatin organization and may mediate mechanosignaling [25]. However, the contribution of nesprins and lamins to isoform switch and chromatin dynamics has not been fully understood [7,10,26]. ”

      Overall, I found a surprising lack of review and citation of previous work (see Specific comments below), including the lack of citations for various declarative statements about previous conclusions in the field about lamin A.

      (1) Introduction:

      "However, the contribution of nesprins and lamins to gene 220 expression has not been fully understood."

      There is a literature about changes in gene expression- at least for lamin KD and KO- both in vitro and in vivo- that the authors could and should review and summarize here.

      To address this, we have now revised the manuscript to include a more comprehensive discussion of the relevant literature and added appropriate citations in the corresponding section. We hope this addition provides better context for our current findings and clarifies the contribution of lamins and nesprins to gene regulation.

      (2) Results:

      "A fragment of shRNA that targeting 3' untranslated region (UTR) in LMNA genes was chosen to knockdown lamin A (shLaminA). A fragment of shRNA that targeting coding sequence (CDS) region in LMNA genes was chosen to knockdown LMNA (shLMNA)". The authors should explain more - does one KD both lamin A and C (shLMNA), versus the other being specific to lamin A but not lamin C? It appears so from later text, but the authors should explicitly explain their targeting strategy right at the beginning to make this clear.

      To make the method clearer, we have clear added the text “The shRNA against lamin A (shLaminA) targets the 3′ untranslated region (UTR) of the LMNA gene, specific to prelamin A, which is post-translationally processed into mature lamin A. The shRNA designated as shLMNA targets a region within the coding sequence of LMNA that is shared by both lamin A and lamin C, corresponding to amino acids 122–129 (KKEGDLIA) of lamin A/C (RefSeq: NM_001406985.1). The shRNA against SYNE2 (shSYNE2) targets a sequence encoding amino acids 5133–5140 (KRYERTEF) of the SYNE2 protein (RefSeq: NM_182914.3).”

      But more importantly, the convention with RNAi is to demonstrate consistent results with at least two different small RNAs. This is to rule out that a physiological result is due to the KD of a non-target gene(s) rather than the target gene. The scrambled shRNA controls are not sufficient for this as they test a general effect of the shRNA culture conditions, including tranfection and dox treatment, etc, rather than a specific KD of a different gene(s) than the target due to off-target RNAi.

      We fully acknowledge the concern regarding the use of only a single shRNA per knockdown and agree that shRNAs are prone to off-target effects. However, we have conducted qPCR confirmation of key RNAseq findings, which strongly supports the specificity and validity of our observed results. Additionally, we recognize the importance of validating our findings using multiple independent shRNAs or alternative knockdown strategies, such as CRISPR deletion or degron-based approaches. To address this rigorously, we are currently optimizing an auxin-inducible degron system (AtAFB2) for targeted depletion of lamin C. Our preliminary data indicate approximately 40% knockdown efficiency after 16 hours of auxin induction, highlighting ongoing optimization efforts (Author response image 1). Future experiments will integrate this improved degron system and multiple independent shRNAs to further substantiate our results and definitively rule out potential off-target effects, thereby enhancing the robustness and reproducibility of our data.

      (3) "Single-cell clones 114 were subsequently isolated and expanded in the presence of 2 μg ml-1 puromycin to 115 establish doxycycline-inducible shRNA-knockdown stable cell lines."

      The authors need to describe explicitly in the Results how exactly they did these experiments. Did they do their analysis using a single clone from each lentivirus shRNA transduction? Did they do analysis - ie RNA-seq- on several clones from the same shRNA transduction and compare? Did they pool clones together?

      In our study, single-cell clones and pooled the three independent clones were mixed following lentiviral transduction with doxycycline-inducible shRNA constructs and selected with 2 μg/ml puromycin. For each shRNA, we screened multiple clones for knockdown efficiency and selected a representative clone exhibiting robust knockdown for downstream experiments, including RNA-seq. We did pool three multiple clones; all functional analyses were performed on pooled clones. We have now revised the Method section to explicitly describe this experimental design: “Multiple independent clones per shRNA were screened for knockdown efficiency using reverse transcription quantitative real-time PCR (RT-qPCR). Three clones demonstrating robust and consistent knockdown were selected and expanded. These clones were subsequently pooled to minimize clonal variability and used for downstream analyses, including RNAseq.”

      One confounding problem is that there are clonal differences among cells cloned from a single cell line. This is particularly true for aneuploid cell lines like U2OS. Ideally, they would use mixed clones, but if not, they should at least explain what they did.

      We added the text to method “Three single-cell clones exhibiting robust knockdown efficiency were individually expanded and subsequently pooled. The pooled clones were maintained in medium containing 2 µg ml ¹ puromycin to establish stable cell lines with doxycycline-inducible shRNA expression. Multiple independent clones per shRNA were screened for knockdown efficiency using reverse transcription quantitative real-time PCR (RT-qPCR). Three clones demonstrating robust and consistent knockdown were selected and expanded. These clones were subsequently pooled to minimize clonal variability and used for downstream analyses, including RNA-seq.”

      (4) I am confused by their shScramble control. This is typically done for each shRNA- ie, a separate scrambled control for each of the different target shRNAs. This is because there are nucleotide composition effects, so the scrambled idea is to keep the nucleotide composition the same.

      However, looking at STable 1 and SFig. 2- shows they used a single scrambled control, thus not controlling for different nucleotide composition among the three shRNAs that they used.

      In our study, we used a single non-targeting shRNA (shScramble) as a control to account for potential effects of the shRNA vector and delivery system. This approach is commonly accepted in the field when the scrambled sequence is validated as non-targeting and does not share significant homology with the genes of interest. While we acknowledge that using separate scrambled controls matched in nucleotide composition for each targeting shRNA can further minimize sequence-dependent effects, we believe that the use of a single validated scramble control is appropriate for the scope of this study.

      (5) In Figure 2 - what is on the x-axis? Number of DEG? Please state this explicitly in the figure legend.

      We have added “Counts” as figure legend, and added the caption “Gene counts are displayed on the x-axis.”

      (6) More importantly, in Figure 2 they only show pathway analysis of DEG. They should show more: a) Fold-change of DEG displayed for all DEG; b) Same for genes in LADs vs iLADs. More explicitly, are the DEG primarily in LADs or iLADs, or a mix? Are the DEGs in LADs biased towards increased expression, as might be expected for LAD derepression? Conversely, what about iLADs - is there a bias towards increased or decreased expression?

      We agree that a more detailed characterization of the differentially expressed genes (DEGs) will strengthen the conclusions. In response we have revised the manuscript as following: “Furthermore, differential expression analysis revealed that the majority of DEGs following depletion of lamins and nesprins were located outside lamina-associated domains (non-LADs). Specifically, for shLaminA knockdown, 8 DEGs within LADs were downregulated and 8 were upregulated, whereas 59 non-LAD DEGs were downregulated and 79 were upregulated. For shLMNA, 7 LAD-associated DEGs were downregulated and 15 were upregulated, with 88 downregulated and 140 upregulated DEGs in non-LAD regions. In the case of shSYNE2 knockdown, 161 LAD DEGs were downregulated and 108 were upregulated, while 2,009 non-LAD DEGs were downregulated and 1,851 were upregulated (Figure 2d). These results indicate that the transcriptional changes resulting from the loss of lamins or nesprins predominantly occur at non-LAD genomic regions.”

      We appreciate the reviewer’s comments, which helped improve the clarity and depth of our analysis.

      (7) Is there a scientific rationale for the authors' focus on DE of isoforms? Is this somehow biologically meaningful and different from the overall DE of all genes? The authors should explain in the Results section what their motivation was in deciding to do this analysis.

      We have add the following statement in response to the reviewer “To uncover transcript-specific regulatory changes, we performed isoform-level differential expression analysis. Many genes produce functionally distinct isoforms, and shifts in their usage can occur without changes in total gene expression, making isoform-level analysis essential for detecting subtle but meaningful transcriptional regulation.  Our analysis demonstrated that depletion of lamins and nesprins induced significant alterations in specific transcript isoforms, indicating regulatory changes in alternative splicing or transcription initiation that are not captured by gene-level differential expression analysis.”

      (8) "Expectedly, the DEGs from 327 depletion of lamin A, LMNA, and SYNE2 seldom intersected with genes in 328 LADs (Figure 4a)."

      Why was this expected? The authors have only cited one review paper. Others have seen significant numbers of genes in LADs that are DE after KD of lamina proteins. What was the fold cutoff used for DE? Was there a cutoff for the level of expression prior to KD? The authors should cite relevant primary literature showing that there are active genes in LADs and that some perturbations of the lamina proteins do result in DE of genes in LADs.

      We acknowledge the reviewer's concerns regarding our statement: "Expectedly, the DEGs from 327 depletion of lamin A, LMNA, and SYNE2 seldom intersected with genes in 328 LADs (Figure 4a)." To clarify, this expectation stems from previous observations that LAD-associated genes are typically transcriptionally silent or expressed at very low levels (Guelen et al., 2008). However, dynamic changes in LADs and gene expression status do occur during cellular differentiation (Peric-Hupkes et al., 2010), and some LAD-resident genes can become active and transcriptionally responsive under specific conditions, such as T cell activation. We applied specific foldchange and baseline expression level thresholds in our analysis, as detailed in the Methods section. We added the following text in the “Method”: “Differential gene expression analysis was performed using thresholds of baseMean > 50, absolute log fold change > 0.5, and p-value < 0.05.”  We agree that additional relevant primary literature demonstrating active gene expression changes within LADs upon perturbation of lamina proteins should be cited and we have added the following statement:

      “LADs exhibit dynamic reorganization and changes in gene expression during cellular differentiation [30]. Although genes within LADs are generally transcriptionally silent or expressed at low levels [31], some LAD-resident genes remain active and can be transcriptionally modulated in response to specific stimuli, such as T cell activation [32].”

      (9) "Expectedly, the DEGs from 327 depletion of lamin A, LMNA, and SYNE2 were seldomly intersected with genes in 328 LADs (Figure 4a)." I disagree with the wording of "seldom" which by definition means rarely. I don't see that this applies to the significant number of genes that are in LADs that are DE as shown in the Venn diagram, Fig. 4a. For example, this includes 57 genes for the shLamin A and ~400 genes for the shSYNE2.

      Is there anything of note about which genes are DE within LADs?

      We have rephrased the text to the following “The Venn diagram analysis revealed limited overlap between DEGs resulting from knockdown of lamin A (shLaminA), LMNA (shLMNA), or SYNE2 (shSYNE2) and genes located within laminaassociated domains (LADs). Specifically, only a small subset of DEGs intersected with LAD-associated genes across all three knockdowns, suggesting that the majority of transcriptional changes occur outside LAD regions”. The DEGs in LADs and non-LADs were shown in supplementary Table S4.

      (10) "The relative distance from DE genes (query features) to LADs (reference feature) is plotted by GenometriCorr package (v 1.1.24). The color depicting deviation from the expected distribution and the line indicating the density of the data at relative distance are shown." The authors should explicitly describe what the reference "expected distribution" was based on. This is all very cryptic right now, so we can't assess the biological possible significance. Third, they should clearly explain what is plotted on the x and y axes of Figure 4C. I really don't have a clue. I assume the x-axis is some measure of "relative distance" but what on earth does that mean? I really don't understand this plot, which is crucial to the whole story. What is on the y-axis? Density of DEGs? What? And they need to explain not only what is plotted on the x and y axes but also provide units.

      We have revised the text to clarify that the GenometriCorr analysis (v1.1.24) was used to assess the spatial association between differentially expressed genes (DEGs, query features) and lamina-associated domains (LADs, reference features). Specifically, this method evaluates whether the observed distances between query and reference genomic intervals significantly deviate from a null distribution generated by random permutation of query features across the genome, while preserving size and chromosomal context.

      In the revised figure legend and main text, we now clarify that the x-axis represents the relative genomic distance between each differentially expressed gene (DEG) and the nearest LAD, scaled between –1 and 1, where values near 0 indicate close proximity, and values approaching –1 or 1 reflect greater distances on either side of the LADs. The y-axis denotes the density (or proportion) of query features (DEGs) at each relative distance bin. The color gradient overlays the plot to indicate deviation from the expected null distribution (based on randomized query positions): red indicates enrichment (closer than expected), while blue indicates depletion (further than expected).

      “GenometriCorr analysis (v1.1.24) was used to assess the spatial relationship between DEGs (query) and LADs (reference) [48]. The x-axis shows the relative genomic distance between each DEG and the nearest LAD, scaled from –1 (far upstream) to 1 (far downstream), with 0 indicating closest proximity. The y-axis represents the density of DEGs at each distance bin. A color gradient indicates deviation from a randomized null distribution: red signifies enrichment (closer than expected), and blue signifies depletion. Statistical significance was determined using the Jaccard test (p < 0.05).”

      Second, to correlate with other features and to give more meaning, the authors should show the chromosome location of the DEGs and scale this by the actual DNA sequence distances. This will be needed to correlate with other features from other studies.

      The genomic positions of DEGs have now been displayed in Figure 4b, with distances shown in base pairs to facilitate cross-reference with other features in future studies.

      Third, they should attempt some kind of analysis themselves to try to understand what might correlate with the DEGs. To begin with, they might try to correlate with lamin A ChiP-seq or other molecular proximity assays. Others in fact have shown that lamin A interacts with 5' regulatory regions of a subset of genes- presumably this is the diffuse nucleoplasmic pool of lamin A that has been studied by others in the past.

      We agree that understanding potential regulatory mechanisms underlying DEG distribution is essential. In response, we have expanded our analysis (Figure 2d) to highlight that a substantial portion of DEGs are located outside of LADs, suggesting potential regulation by the nucleoplasmic pool of lamin A. This is consistent with previous studies showing lamin A interaction with regulatory elements such as 5′ UTRs and enhancers, independent of LAD localization. We have now cited relevant literature to support this hypothesis.

      Fourth, in the table, they should go beyond just giving the fold change in expression. Particularly for genes that are expressed at very low levels, this is not particularly meaningful as it is very sensitive to noise. They should provide a metric related to levels of expression both before and after the KD.

      We acknowledge the reviewer’s concern regarding fold-change interpretation for low-abundance transcripts. To improve clarity and interpretability, we have now included Supplementary Table S4, which provides the raw counts and baseMean values (average normalized expression across all samples) for all DEGs. Additionally, we note that in our differential expression analysis, genes with baseMean < 50 and absolute log<sub>2</sub>fold change > 0.5 were filtered out to reduce potential noise from low-expression genes.

      (11) The figure legend and description in the Results section were completely inadequate. I had little understanding of what was being plotted. It is not sufficient to simply state the name of some software package that they used to measure "XYZ" and to show the results. It has no meaning for the average reader.

      Without some type of explanation of rationale, questions being asked, and conclusions made of biological relevance, this section made zero impact on me.

      Yes- details can be provided in the Methods. But conceptually, the methods and the conceptual underpinnings of the approach and as the question being asked and the rationale for the approach, with the significance of the results, need to be developed in the Results section.

      In response, we have revised the “Results” section to better articulate the rationale behind the analysis, the specific biological questions we aimed to address, and the conceptual relevance of the method used. We have also clarified the meaning of the plotted data and how it supports our conclusions.

      While technical details remain in the “Methods” section, we now provide a more accessible narrative in the Results to guide the reader through the approach and highlight the biological significance of our findings. We hope these revisions make the section more informative and impactful.

      (12) The telomere movement part of the manuscript seems to come out of nowhere. Why telomeres? Where are telomeres normally positioned, particularly relative to the nuclear lamina? Does this change with the KDs - particularly for those that increase motion? The MSD for SYNE2 appears unconstrained- they should explore longer delta time periods to see if it reaches a point of constrained movement.

      If the telomeres are simply tethered at the nuclear lamina, then is that the explanation- that they become untethered? But if they are not typically at the periphery, then where are they relative to other nuclear compartments? And why is there mobility changing? Is it related to the loss of nuclear lamina positioning of adjacent LAD regions to the telomeres? Is it an indirect, secondary effect? What would they see after an acute KD? What about other chromosome regions? Again, there is little explanation for the rationale for these observations. It is one of many possible experiments they could have done. Why did they do this one?

      We added the following explanation “Although telomeres are not uniformly tethered to the nuclear lamina, they can transiently associate with the nuclear periphery, particularly during post-mitotic nuclear reassembly, through interactions involving SUN1 and RAP1 36. Given that lamins and nesprins are key components of the nuclear envelope that regulate chromatin organization and mechanics 37,38, we examined telomere dynamics as a proxy for changes in nuclear architecture. Using EGFP-tagged dCas9 to label telomeric regions in live U2OS cells, we assessed whether knockdown of these proteins leads to increased telomere mobility, reflecting a loss of structural constraint or altered chromatin–nuclear envelope interactions 17.” And “To probe how nuclear envelope components regulate chromatin dynamics, we tracked telomeres as a representative genomic locus whose mobility reflects changes in nuclear mechanics and chromatin organization. Although telomeres are not stably tethered to the nuclear lamina, their motion can be influenced by nuclear architecture and transient peripheral associations [36]. Upon depletion of lamin A, LMNA, or SYNE2, we observed significantly increased telomere mobility and nuclear area explored, quantified by mean square displacement and net displacement (Figure 6b–c, Supplementary Movie S1). These changes likely reflect altered chromatin–lamina interactions or disrupted nuclear mechanical constraints, consistent with prior studies showing that lamins modulate chromatin dynamics and nuclear stiffness [37,38,39]. Thus, our findings support a role for lamins and nesprins in constraining chromatin motion through nuclear structural integrity.”

      (13) "Notably, Lamin A depletion led to enrichment of 392 pathways associated with RNA biosynthesis, supporting its previously suggested role 393 in transcriptional activation and ribonucleotide metabolism."

      There is a literature on this. Say more and cite the references.

      Notably, lamin A depletion led to enrichment of pathways associated with RNA biosynthesis, supporting its previously suggested role in transcriptional activation and ribonucleotide metabolism 45.  

      (14) "This aligns with prior studies indicating that Lamin A contributes to chromatin accessibility and RNA polymerase activity." Again, there is a literature on this. Say more and cite the references.

      This aligns with prior studies indicating that lamin A contributes to chromatin accessibility and RNA polymerase activity 46. These findings further underscore the functional relevance of lamin A in coordinating transcriptional programs through modulation of nuclear architecture.

      (15) "In contrast, LMNA knockdown was linked to alterations in chromatin conformation." No. The authors show gene ontology and implicate perturbed RNA levels for genes implicated in "chromatin conformation". That is not the same thing as measuring chromatin conformation, which is not done, and showing changes in conformation.

      Based on the reviewer’s comment we have revised the text as the following: “In contrast, LMNA knockdown led to differential expression of genes enriched in pathways related to chromatin organization, suggesting potential disruptions in chromatin regulatory networks. Although direct measurements of chromatin conformation were not performed, these transcriptional changes indicate that LMNA may contribute to maintaining nuclear architecture and genomic stability, which aligns with its established involvement in laminopathies and genome integrity disorders.”

      (16) "The findings that DEGs are predominantly located in non-LAD regions highlight a unique regulatory aspect of lamins and nesprins, emphasizing their spatial specificity in gene expression". Is this novel? Can the authors separate direct from indirect effects? Is the percentage of genes in LADs that are altered in expression different from the percentage of genes in iLADs that are altered in expression? There are many more active genes in iLADs, so one expects more DEGs in iLADs even if this is random. Also - how does this correlate with lamin A binding near 5' regulatory regions detected by ChIP-seq? See the following review for references to this question and also previous work on lamin A versus chromatin mobility, including telomeres. J Cell Sci (2017) 130 (13): 2087-2096. https://doi.org/10.1242/jcs.203430

      We appreciate the reviewer’s valuable comments and feedback, we have revised the manuscript as the following to address the feedback. “Furthermore, differential expression analysis revealed that the majority of DEGs following depletion of lamins and nesprins were located outside lamina-associated domains (non-LADs). Specifically, for shLaminA knockdown, 8 DEGs within LADs were downregulated and 8 were upregulated, whereas 59 non-LAD DEGs were downregulated and 79 were upregulated. For shLMNA, 7 LAD-associated DEGs were downregulated and 15 were upregulated, with 88 downregulated and 140 upregulated DEGs in non-LAD regions. In the case of shSYNE2 knockdown, 161 LAD DEGs were downregulated and 108 were upregulated, while 2,009 non-LAD DEGs were downregulated and 1,851 were upregulated (Figure 2d, Supplementary Table S4). These results indicate that the transcriptional changes resulting from the loss of lamins or nesprins predominantly occur at non-LAD genomic regions.

      The percentage of DEGs was consistently higher in non-LADs, which are gene rich and transcriptionally active, whereas LADs, known to be enriched for silent or lowly expressed genes, showed fewer expression changes. These findings are consistent with previous studies demonstrating that active genes are more prevalent in non-LADs and that LAD associated genes are generally repressed or less responsive to perturbation [27,28]. Together, these results support a model in which lamins and nesprins influence gene expression through both structural organization and promoter proximal interactions, particularly within euchromatic nuclear regions [10,26,29].”

    1. XG385, XG752, and XH855) that contained a βgeo fusion gene inserted into intron B of Slain1 (as illustrated Fig. 8A) were obtained from the Mutant Mouse Regional Resource Center (http://www.mmrrc.org/)

      DOI: 10.1016/j.ydbio.2006.01.023

      Resource: Mutant Mouse Regional Resource Center (RRID:SCR_002953)

      Curator: @AleksanderDrozdz

      SciCrunch record: RRID:SCR_002953


      What is this?

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.


      Reply to the Reviewers

      We thank the reviewers for their positive assessments overall and for many helpful suggestions for clarification to make the manuscript more accessible to a broader audience. We made minor text changes and added more labels to the figures to address these comments.

      • *

      __Referee #1

      __

      Summary: In this study, the authors show a genetic interaction of the lipid receptors Lpr-1, Lpr-3 and Scav-2 in C. elegans. They show that Lpr-1 loss-of-function specifically affects aECM localization of Lpr-3 and attribute the lethality of Lpr-1 mutants to this phenotype. The authors performed a mutagenesis screen and identified a third lipid receptor, Scav-2, as a modulating factor: loss of scav-2 partially rescues the Lpr-1 phenotype. The authors created a variety of tools for this study, notably Crispr-Cas9-mediated knock-ins for endogenous tagging of the receptors.

      Major comments:

      1. while the authors provide a nice diagram showing the potential roles and interplay of lpr-1, lpr-3 and scav-2, it remains unclear what their respective cargo is. The nature of interaction between the proteins remains unclear from the data.

      Response

      • We agree that identifying the relevant cargo(s) will be key to understanding the detailed mechanisms involved and that the lack of such information is a limitation of our study. However, the impact of our study is to show that these lipid transporters functionally interact to affect aECM organization, a role that could be relevant to many systems, including humans.

      As an optional (since time-consuming) experiment I would suggest trying more tissue-specific lipidomics.

      Response

      • This would be an interesting future experiment but is outside our current technical capabilities.

      The lipidomics data should be presented in the figures, even if there were no significant changes. Importantly, show the lipid abundance at least of total lipids, better of individual classes, normalized to the material input (e.g. number of embryos, protein).

      Response

      • The reviewer is right to point out that lipid variations could occur at different levels, and that we should exercise caution. However, the unsupervised lipidomics analysis would have detected not only individual lipid variations, but also variations in the total or subgroup lipid content. Indeed, the eggs were weighed prior to extraction and each sample was extracted with the same precise volume of solvent before analysis. Furthermore, the LC-MS/MS injection sequence included blanks and quality control (QC) samples. The blanks were the extraction solvent, which allowed us to control for features unrelated to the biological samples. The QC sample was a mixture of all the samples included in the injection sequence, reflecting the central values of the model. If a subclass of samples, such as the lpr-1 mutant, had been characterized by a decrease in one lipid, a subgroup of lipids, or all lipids, it would have clustered separately. Instead, our PCA showed that the variation between samples of the same genotype (wild type, lpr-1 mutant, or lpr-1; scav-2) was similar to the variation between samples from two different genotypes. This means that we did not detect modifications to lipid quantity specifically or in total. A figure illustrating the lipid contents would show no difference between groups.

      Figure 1g: I do not understand what the lpr3:gfp signal is: the punctae in the overview image? and where are they in the zoom image showing anulli and alae? Also, how where the anulli and alae structures labeled? please provide more information

      Response

      • All of the fluorescent signal shown in this figure panel corresponds to the indicated LPR fusion - no other labelling method was used. SfGFP::LPR-3 labels the matrix structures (alae and annuli) as well as some puncta – the ratio of matrix to puncta changes over developmental stages. We edited the figure legend to make this more clear.

      One point that is not sufficiently adressed is that the authors deduce from the inability of the scav-2 gfp knock in to suppress lpr1 lethality that scav2 function is not impaired. This is quite indirect. Can the authors provide more convincing evidence that scav-2 ki has normal function?

      Response

      • Suppression of lpr-1 (or other aECM mutant) lethality is the only known phenotype caused by loss of scav-2 Therefore, this is the only phenotype for which we can do a rescue experiment to test functionality of the knock-in. The data presented do indicate that the knock-in fusion retains significant function.

      In general, the data is clearly presented and the statistical analyses look sound.

      Response

      • Thank you

      __Minor comments: __

      Please provide page and line numbers!

      Response:

      • done

      Avoid contractions like "don't" in both text and figure legends

      Response:

      • changed one instance of “don’t” to “do not”

      Page 12: I do not understand the meaning of the sentence "This transgene also caused more modest lethality in a wild-type background"

      Response:

      • Wording changed to “This transgene caused very little lethality in a wild-type background (Fig. 6C), indicating it is not generally toxic.”

      Figure 7: what is meant with "Dodt"?

      Response:

      • Dodt gradient contrast imaging is a method for transmitted light imaging similar to DIC and is used on some confocal microscopes. It is now explained in the Methods section. We removed the Dodt label from Figure 7 since it seems to be confusing and it is not really important whether the brightfield image is DIC or Dodt.

        Reviewer #1 (Significance (Required)):

        The study is experimentally sound and uses numerous novel tools, such as endogenously tagged lipid receptors. It is an interesting study for researchers in basic research studying lipid receptors and ECM biology. It provides insights on the genetic interaction of lipid receptors. My expertise is in lipid biochemistry, inter-organ lipid trafficking and imaging. I am not very familiar with C. elegans genetics.

      __Referee #2 __ 1. The manuscript is very well written; the documentation is fine, but some more details are needed for better following the subject for readers not familiar with nematode anatomy.

      For instance, while alae are somehow explained, annuli are not - structures that look abnormal in lpr1 and lpr1-scav2 mutants (Fig. 5B).

      Response

      • Apologies for this oversight. We added annuli labels to Figure 1 and Figure 5 panels and added descriptions of annuli to the Figure 1 legend and the Results text.

      Moreover, the authors show in Fig. 1 the punctae etc in the epidermis, whereas in Fig. 2 the show Lpr3 accumulation or not in the duct and the pore (lpr1). How do they localize in the cells of these structures at high magnification? It is also important to see the Lpr3 localisation in lpr1 mutants shown in Fig. 2A with the quality of the images shown in Fig. 1F. This applies also to Figs. 4 and 5.

      Responses:

      • The embryonic duct and pore cells are very small and we have not reliably seen puncta within them. In Figs 2 and 5, we supplemented the duct and pore images with those from the epidermis, which is a much larger tissue, allowing us to resolve puncta and matrix structures with better resolution.
      • The laser settings in Figs 2,4,5 (as opposed to Fig. 1) were chosen to avoid saturation of the matrix signal so that we could do accurate quantifications as shown. The images are unmodified with respect to brightness and therefore appear relatively dim – but we think they convey the observations very accurately.

      I would like to see punctae in lpr1-scav2 doubles.

      Response:

      • Puncta in this genotype are shown for the epidermis in Figure 5. It has not been possible to see puncta specifically within the embryonic duct and pore.

      Regarding the central mechanism, one possibility is - what the authors describe - that Lpr1 is needed for Lpr3 accumulation in ducts and tubes. Alternatively, Lpr1 is needed for duct and tube expansion, in lack of which Lpr3 is unable to reach its destination that is the lumina. Scav2, in this scenario, might be antagonist of tube and duct expansion, and thereby rescue the Lpr1 mutant phenotype independently. Admittedly, the non-accumulation of Lpr3 in scav2 mutants argues against a lpr1-independent function of scav2.

      Responses:

      • LPR-1 is indeed needed to maintain duct and pore tube integrity as the tubes grow, but in mutants the tubes appear to collapse at a later stage than we imaged here (Stone et al 2009). The ~normal accumulation of LET-4 and LET-653 further argues that the duct and pore tubes are still intact at the 1.5-to-2-fold stages. Therefore, we conclude that the defect in LPR-3 accumulation precedes duct and pore collapse.
      • The changes we document in the epidermis also show that the lpr-1 mutant affects LPR-3 accumulation in another (non-tube) tissue.

      In any case, to underline the aspect of Lpr1-Scav2 dosage relationship, the authors may also have a look at Lpr3 distribution in lpr1 heterozygous, and lpr1-scav2 double heterozygous worms. In this spirit, it would be interesting to see the semi-dominant effects of scav2 on Lpr3 localisation in lpr1 mutants by microscopy.

      Response:

      • Because of the hermaphroditism of C. elegans, it would be technically challenging to confidently identify heterozygous (vs. homozygous) embryos for confocal imaging. We do not think that the results would be informative enough to warrant the effort, given that we’ve already shown that scav-2 heterozygosity can partly suppress lpr-1 The expectation is that LPR-3 levels would be partially restored in the scav-2 het, but it might take a very large sample size to confidently assess that partial effect.

      One word to the overexpression studies: it is surprising that the amounts of Scav2 delivered by the expression through the grl-2 promoter in the lpr1, scav2 background are almost matching those by the opposite effect of scav2 mutations on lpr1 dysfunction.

      Response:

      • The reviewer refers to the transgenic rescue experiment with the grl-2pro::SCAV-2 transgene. Because the scav-2 mutant phenotype being tested is suppression of lpr-1 lethality, the expected result from scav-2 rescue is to restore the lpr-1 lethal phenotype to the strain. This is exactly the result we see. We have revised the text to more clearly explain the logic.

      One issue concerns the localization of scav2-gfp "rarely" in vesicles: what are these vesicles?

      Response

      • Only a handful of vesicles were seen across all the images we collected, and we have not yet identified them. They could be associated with either SCAV-2 delivery or removal from the plasma membrane, as now stated in the text. SCAV-2 trafficking would be an interesting area for further study but is beyond the scope of this paper.

      One comment to the Let653 transgenes/knock-ins: the localization of transgenic Let653-gfp may be normal in lpr1 mutants because there are wild-type copies in the background.

      Response

      • There are wild type copies of LET-653 in the background, but no wild type copies of LPR-1. Even if the untagged LET-653 would be recruiting the tagged LET-653 as the reviewer suggests, we can still conclude that lpr-1 loss does not prevent the untagged LET-653 (and thus also the tagged LET-653) from accumulating in the duct lumen matrix.

      One thought to the model: if Scav2 has a function in a lpr1 background, this means that yet another transporter X delivers the substrate for Scav2, isn't it?

      Response

      • Yes, we completely agree with this interpretation and have revised the discussion and Figure 8 legend to more explicitly make this point.

      A word to the term haploinsifficient that is used in this study: scav2 mutants would be haploinsifficient if the heterozygous worms died in an otherwise wild-type background.

      Response

      • We disagree with this comment. The term “haploinsufficient” simply means that heterozygosity for a deletion or other loss of function allele can cause a mutant phenotype – the term is not restricted to lethal phenotypes.

        Reviewer #2 (Significance (Required)):

        Alexandra C.Belfi and colleagues wrote the manuscript entitled "Opposing roles for lipocalins and a CD36 family scavenger receptor in apical extracellular matrix-dependent protection of narrow tube integrity" in which they report on their findings on the genetic and cell-biological interaction between the lipid transporters Lpr1 and scav2 in the nematode C. elegans. In principle, these two proteins are involved in shaping the apical extracellular matrix (aECM) of ducts by regulating the amounts of Lpr3 in the extracellular space. While seems to act cell autonomously, Lpr1 has a non-cell autonomous effect on Lpr3.


      __Referee #3 __ Summary: Using a powerful combination of genetic and quantitative imaging approaches, Belfi et al., describe novel findings on the roles of several lipocalins-secreted lipid carrier proteins-in the production and organization of the apical extracellular matrix (aECM) required for small diameter tube formation in C. elegans. The work comprises a substantial extension of previous studies carried out by the Sundaram lab, which has pioneered studies into the roles of aECM and accessory proteins in creating the duct-pore excretion tube and which also plays a role in patterning of the epidermal cuticle. One core finding is that the lipocalin LPR-1 does not stably associate with the aECM but is instead required for the incorporation of another lipocalin, LPR-3. A second major finding is that reduction of function in SCAV-2, a SCARB family membrane lipid transporter, suppresses lpr-1 mutant lethality along with associated duct-pore defects and mislocalization of LPR-3. Likewise loss of scav-2 partially suppresses defects in two other aECM proteins and restores defects in LPR-3 localization in one of them (let-653). Additional genetic and protein localization studies lead to the model that LPR-1 and SCAV-2 may antagonistically regulate one or more lipid or lipoprotein factors necessary for LPR-3 localization and duct-pore formation. A role for LPR-1 and LPR-3 at lysosomes is clearly implicated based on co-localization studies, although a specific role for lysosomes (or related organelles) is not defined. Finally, MS data suggests that neither LPR-1 or SCAV-2 grossly affect lipid composition in embryos, consistent with dietary interventions failing to affect mutant phenotypes. Ultimately, a plausible schematic model is presented to explain for much of the data.

      __*Major comments:

      *__

      1. The studies are very thorough, convincing, and generally well described. Conclusions are logical and well grounded. Additional experiments are not required to support the authors major conclusions, and the data and methods are described in a sufficient detail to allow replication. As such my comments are minor and should be addressable at the author's discretion in writing.

      Response

      • Thank you for these positive comments

        __Minor comments: __2) In the abstract, "tissue-specific suppression" made me think that there was going to be a tissue-specific knockdown experiment, which was not the case. Rather scav-2 suppression is specific to the duct-pore, which corresponds to where scav-2 is expressed. Consider rewording this.

      Response

      • Wording was changed to “duct/pore-specific suppression”

        3) Page 5. Suggest wording change to, "Whereas LPR-3 incorporates stably into the precuticle, suggesting a structural role in matrix organization, LPR-1..."

      Response

      • Done

        4) LIMP-2 versus LIMP2. Both are used. Uniprot lists LIMP2, but some papers use LIMP-2. Choose one and be consistent.

      Response

      • Everything changed to LIMP2.

        5) Some of the data for S6 Fig wasn't referred to directly in the text. Namely results regarding pcyt-1 and pld-1. I'd suggest incorporating this into the results section possibly using, "As a control for our lipid supplementation experiments..."

      Response

      • These experiments are now described on page 11.

        6) Page 12 bottom. I understand the use of "oppose", but another way to put it is that SCAV-2 and LPR-1 (antagonistically or collectively) modulate aECM composition. Other terms that might confuse some readers is the use of upstream and downstream, although I OK with its use in the context of this work.

      Response

      • The genetics indicate that lpr-1 and scav-2 have opposite effects on tube shaping and LPR-3 localization, so they do function antagonistically rather than collectively/cooperatively; we decided to keep this terminology.

        7) Page 16. I understand the logic that SCAV-2 is unlikely to directly modulate LPR-3 given its presumed molecular function. But is it possible that LPR-3 levels are already maxed out in the aECM so that loss of SCAV-2 doesn't lead to any increase? Conversely, one could argue that even if acting indirectly, SCAV-2 could have led to increased LPR-3 levels, unless they were already maxed.

      Response

      • This is a good point and the possibility is now mentioned in the Results page 9. We also changed our wording in the Abstract and Discussion to acknowledge the possibility that LPR-3 could be the SCAV-2 cargo, though we still don’t favor this model.

        8) Figure legend 1. I did not see an asterisk in figure 1B.

      Response

      • thanks for catching this error, text removed

        9) Figure 1C. Might want to define the "degree" term in the legend for people outside the field.

      Response

      • We added an explanation to the figure legend.

        10) Fig 1 G. I was just wondering if cuticle autofluorescence was an issue for taking these images.

      Response

      • Cuticle auto fluorescence is generally quite dim in L4s with our settings, and it was not an issue at this mid/late L4 stage, which corresponds to when both LPR fusions are at their brightest. Note that both large panels are MAX projections and yet you can’t see any cuticle auto-fluorescence in the LPR-1 panel.

        11) Fig 2 and others. Please define error bars.

      Response

      • These correspond to the standard deviation; this information is now added to the Methods.

        12) Fig 5. From the images, it looks like lpr-1; scav-2 doubles might have a worse (pre)cuticle defect in LPR-3 localization than lpr-1 singles. If so that would be interesting and would suggest that their relationship with respect to the modulation of LPR-3 is context dependent. Admittedly, the lack of obvious scav-2 expression in the epidermis would not be consistent with an effect (positive or negative).

      Response

      • The lpr-1 scav-2 strain is certainly not improved over lpr-1 but we have not noted any consistent worsening of the phenotype either.

        13) Consider defining Dodt in the first figure legend where it appears.

      Response

      • Dodt gradient contrast imaging is a method of transmitted light imaging similar to DIC and is used on some confocal microscopes. It is now explained in the Methods section. We removed the term from Figure 7 since it seems to be confusing.

        14) For Mander's, is there a reason to report just one of the two findings (M1 or M2) versus both?

      Response

      • We now include the 2nd Manders value in the figure legend and note that value is much lower (0.25) because much of the red signal is lysosomes (where green would be quenched by acidity).

        15) Consider referring to specific panels (A, B...) within references to the supplemental files.

      Response

      • done

        16) Fig S6E. Neither "increasing nor increasing" to "increasing nor decreasing".

      Response

      • fixed

        **Referees cross-commenting**

        I thought that Reviewers 1 and 2 brought up some good points. My sense is that Belfi and colleagues can address most of these in writing, but are of course welcome to add new data as they see fit. I get that it's not a "perfect" paper where everything is explained fully or comes together, but I don't see that as a flaw that needs to be fixed. I think that the manuscript represents a good deal of work (as it is) and provides a sufficient advance while also suggesting an interesting link to disease. It will be up to individual journals to decide if the findings meets their criteria.

        Reviewer #3 (Significance (Required)):

        Significance: The work carried out in this paper, and more generally by the Sundaram lab, always has a ground-breaking element because very few labs in the field have studied in detail the developmental roles and regulation of the aECM, in large part because it can be challenging to dissect. The core findings in this study are rather novel and unexpected, namely the opposing roles of the paralogous LPR-1 and LPR-3 lipocalins and their functional interactions with SCAV-2. The study does stop short of finding specific molecules (lipid or lipoprotein) that would mediate the effects they report, and it wasn't yet clear how the lysosomal co-loc plays a role, but this is not a criticism of the work presented or the forward progress. I was particularly intrigued by the idea, presented in the discussion, that disruption of vascular aECM could potentially account for some of the (complex) observations regarding the role of lipocalins and SCARB proteins in human disease. This would represent a new avenue for researchers to consider and underscores the power of using non-biased approaches in model systems.

        As for all my reviews, this is signed by David Fay.

      • *

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The manuscript is very well written; the documentation is fine, but some more details are needed for better following the subject for readers not familiar with nematode anatomy. For instance, while alae are somehow explained, annuli are not - structures that look abnormal in lpr1 and lpr1-scav2 mutants (Fig. 5B). Moreover, the authors show in Fig. 1 the punctae etc in the epidermis, whereas in Fig. 2 the show Lpr3 accumulation or not in the duct and the pore (lpr1). How do they localize in the cells of these structures at high magnification? It is also important to see the Lpr3 localisation in lpr1 mutants shown in Fig. 2A with the quality of the images shown in Fig. 1F. This applies also to Figs. 4 and 5. I would like to see punctae in lpr1-scav2 doubles. Regarding the central mechanism, one possibility is - what the authors describe - that Lpr1 is needed for Lpr3 accumulation in ducts and tubes. Alternatively, Lpr1 is needed for duct and tube expansion, in lack of which Lpr3 is unable to reach its destination that is the lumina. Scav2, in this scenario, might be antagonist of tube and duct expansion, and thereby rescue the Lpr1 mutant phenotype independently. Admittedly, the non-accumulation of Lpr3 in scav2 mutants argues against a lpr1-independent function of scav2. In any case, to underline the aspect of Lpr1-Scav2 dosage relationship, the authors may also have a look at Lpr3 distribution in lpr1 heterozygous, and lpr1-scav2 double heterozygous worms. In this spirit, it would be interesting to see the semi-dominant effects of scav2 on Lpr3 localisation in lpr1 mutants by microscopy. One word to the overexpression studies: it is surprising that the amounts of Scav2 delivered by the expression through the grl-2 promoter in the lpr1, scav2 background are almost matching those by the opposite effect of scav2 mutations on lpr1 dysfunction.

      One issue concerns the localization of scav2-gfp "rarely" in vesicles: what are these vesicles?

      One comment to the Let653 transgenes/knock-ins: the localization of transgenic Let653-gfp may be normal in lpr1 mutants because there are wild-type copies in the background.

      One thought to the model: if Scav2 has a function in a lpr1 background, this means that yet another transporter X delivers the substrate for Scav2, isn't it?

      A word to the term haploinsifficient that is used in this study: scav2 mutants would be haploinsifficient if the heterozygous worms died in an otherwise wild-type background.

      Significance

      Alexandra C.Belfi and colleagues wrote the manuscript entitled "Opposing roles for lipocalins and a CD36 family scavenger receptor in apical extracellular matrix-dependent protection of narrow tube integrity" in which they report on their findings on the genetic and cell-biological interaction between the lipid transporters Lpr1 and scav2 in the nematode C. elegans. In principle, these two proteins are involved in shaping the apical extracellular matrix (aECM) of ducts by regulating the amounts of Lpr3 in the extracellular space. While seems to act cell autonomously, Lpr1 has a non-cell autonomous effect on Lpr3.

  8. bafybeid7gjtxre33jbpmnzs6avjyvludaufoeubczurkrkrncisabad7w4.ipfs.localhost:8080 bafybeid7gjtxre33jbpmnzs6avjyvludaufoeubczurkrkrncisabad7w4.ipfs.localhost:8080
    1. The same morphic page

      that is the capability and the information as a unit would be different in another browser profile

      As it is the content is available only on my machine Need to add the ability to get a publically shared version to show up

      with the abiity for anyone to make it their own and work on it and share their version back

      In all this the common social sharing is mediated via hypothesis

      I am now using the ready availability of annotations

      even on pages that are constantly being changed

      as a way to

      write on the margins to facilitate the formulative thinking as developing IT to make it all work work

    1. partir de conflictos propiciadospor la expansión de las poblaciones humanas, poniendoen riesgo la biodiversidad de los humedales

      el crecimieno urbanoaltera la situacion actual

    1. In this manuscript, the author provides a historical review of the place of peer review in the scientific ecosystem, including a discussion of the so-called current crisis and a presentation of three important peer review models. I believe this is a non-comprehensive yet useful overview. My main contention is that the structure of the paper could be improved. More specifically, the author could expand on the different goals of peer review and discuss these goals earlier in the paper. This would allow readers to better interpret the different issues plaguing peer review and helps put the costs and benefits of the three models into context. Other than that, I found some claims made in the paper a little too strong. Presenting some empirical evidence or downplaying these claims would improve the manuscript in my opinion. Below, you can find my comments:

      1. In my view, the biggest issue with the current peer review system is the low quality of reviews, but the manuscript only mentions this fleetingly. The current system facilitates publication bias, confirmation bias, and is generally very inconsistent. I think this is partly due to reviewers’ lack of accountability in such a closed peer review system, but I would be curious to hear the author’s ideas about this, more elaborately than they provide them as part of issue 2.

      2. I’m missing a section in the introduction on what the goals of peer review are or should be. You mention issues with peer review, and these are mostly fair, but their importance is only made salient if you link them to the goals of peer review. The author does mention some functions of peer review later in the paper, but I think it would be good to expand that discussion and move it to a place earlier in the manuscript.

      3. Table 1 is intuitive but some background on how the author arrived at these categorizations would be welcome. When is something incremental and when is something radical? Why are some innovations included but not others (e.g., collaborative peer review, see https://content.prereview.org/how-collaborative-peer-review-can-transform-scientific-research/)?

      4. “Training of reviewers through seminars and online courses is part of the strategies of many publishers. At the same time, we have not been able to find statistical data or research to assess the effectiveness of such training.” (p. 5)  There is some literature on this, although not recent. See work by Sara Schroter for example, Schroter et al., 2004; Schroter et al., 2008)

      5. “It should be noted that most initiatives aimed at improving the quality of peer review simultaneously increase the costs.” (p. 7)  This claim needs some support. Please explicate why this typically is the case and how it should impact our evaluations of these initiatives.

      6. I would rephrase “Idea of the study” in Figure 2 since the other models start with a tangible output (the manuscript). This is the same for registered reports where they submit a tangible report including hypotheses, study design, and analysis plan. In the same vein, I think study design in the rest of the figure might also not be the best phrasing.  Maybe the author could use the terminology used by COS (Stage 1 manuscript, and Stage 2 manuscript, see Details & Workflow tab of https://www.cos.io/initiatives/registered-reports). Relatedly, “Author submits the first version of the manuscript” in the first box after the ‘Manuscript (report)’ node maybe a confusing phrase because I think many researchers see the first version of the manuscript as the stage 1 report sent out for stage 1 review.

      7. One pathway that is not included in Figure 2 is that authors can decide to not conduct the study when improvements are required. Relatedly, in the publish-review-curate model, is revising the manuscripts based on the reviews not optional as well? Especially in the case of 3a, authors can hardly be forced to make changes even though the reviews are posted on the platform.

      8. I think the author should discuss the importance of ‘open identities’ more. This factor is now not explicitly included in any of the models, while it has been found to be one of the main characteristics of peer review systems (Ross-Hellauer, 2017). More generally, I was wondering why the author chose these three models and not others. What were the inclusion criteria for inclusion in the manuscript? Some information on the underlying process would be welcome, especially when claims like “However, we believe that journal-independent peer review is a special case of Model 3 (“Publish-Review-Curate”).” are made without substantiation.

      9. Maybe it helps to outline the goals of the paper a bit more clearly in the introduction. This helps the reader to know what to expect.

      10. The Modular Publishing section is not inherently related to peer review models, as you mention in the first sentence of that paragraph. As such, I think it would be best to omit this section entirely to maintain the flow of the paper. Alternatively, you could shortly discuss it in the discussion section but a separate paragraph seems too much from my point of view.

      11. Labeling model 3 as post-publication review might be confusing to some readers. I believe many researchers see post-publication review as researchers making comments on preprints, or submitting commentaries to journals. Those activities are substantially different from the publish-review-curate model so I think it is important to distinguish between these types.

      12. I do not think the conclusions drawn below Table 3 logically follow from the earlier text. For example, why are “all functions of scientific communication implemented most quickly and transparently in Model 3”? It could be that the entire process takes longer in Model 3 (e.g. because reviewers need more time), so that Model 1 and Model 2 lead to outputs quicker. The same holds for the following claim: “The additional costs arising from the independent assessment of information based on open reviews are more than compensated by the emerging opportunities for scientific pluralism.” What is the empirical evidence for this? While I personally do think that Model 3 improves on Model 1, emphatic statements like this require empirical evidence. Maybe the author could provide some suggestions on how we can attain this evidence. Model 2 does have some empirical evidence underpinning its validity (see Scheel, Schijen, Lakens, 2021; Soderberg et al., 2021; Sarafoglou et al. 2022) but more meta-research inquiries into the effectiveness and cost-benefits ratio of registered reports would still be welcome in general.

      13. What is the underlaying source for the claim that openness requires three conditions?

      14. “If we do not change our approach, science will either stagnate or transition into other forms of communication.” (p. 2)  I don’t think this claim is supported sufficiently strongly. While I agree there are important problems in peer review, I think would need to be a more in-depth and evidence-based analysis before claims like this can be made.

      15. On some occasions, the author uses “we” while the study is single authored.

      16. Figure 1: The top-left arrow from revision to (re-)submission is hidden

      17. “The low level of peer review also contributes to the crisis of reproducibility in scientific research (Stoddart, 2016).” (p. 4)  I assume the author means the low quality of peer review.

      18. “Although this crisis is due to a multitude of factors, the peer review system bears a significant responsibility for it.” (p. 4)  This is also a big claim that is not substantiated

      19. “Software for automatic evaluation of scientific papers based on artificial intelligence (AI) has emerged relatively recently” (p. 5)  The author could add RegCheck (https://regcheck.app/) here, even though it is still in development. This tool is especially salient in light of the finding that preregistration-paper checks are rarely done as part of reviews (see Syed, 2023)

      20. There is a typo in last box of Figure 1 (“decicion” instead of “decision”). I also found typos in the second box of Figure 2, where “screns” should be “screens”, and the author decision box where “desicion” should be “decision”

      21. Maybe it would be good to mention results blinded review in the first paragraph of 3.2. This is a form of peer review where the study is already carried out but reviewers are blinded to the results. See work by Locascio (2017), Grand et al. (2018), and Woznyj et al. (2018).

      22. Is “Not considered for peer review” in figure 3b not the same as rejected? I feel that it is rejected in the sense that neither the manuscript not the reviews will be posted on the platform.

      23. “In addition to the projects mentioned, there are other platforms, for example, PREreview12, which departs even more radically from the traditional review format due to the decentralized structure of work.” (p. 11)  For completeness, I think it would be helpful to add some more information here, for example why exactly decentralization is a radical departure from the traditional model.

      24. “However, anonymity is very conditional - there are still many “keys” left in the manuscript, by which one can determine, if not the identity of the author, then his country, research group, or affiliated organization.” (p.11)  I would opt for the neutral “their” here instead of “his”, especially given that this is a paragraph about equity and inclusion.

      25. “Thus, “closeness” is not a good way to address biases.” (p. 11)  This might be a straw man argument because I don’t believe researchers have argued that it is a good method to combat biases. If they did, it would be good to cite them here. Alternatively, the sentence could be omitted entirely.

      26. I would start the Modular Publishing section with the definition as that allows readers to interpret the other statements better.

      27. It would be helpful if the Models were labeled (instead of using Model 1, Model 2, and Model 3) so that readers don’t have to think back what each model involved.

      28. Table 2: “Decision making” for the editor’s role is quite broad, I recommend to specify and include what kind of decisions need to be made.

      29. Table 2: “Aim of review” – I believe the aim of peer review differs also within these models (see the “schools of thought” the author mentions earlier), so maybe a statement on what the review entails would be a better way to phrase this.

      30. Table 2: One could argue that the object of the review’ in Registered Reports is also the manuscript as a whole, just in different stages. As such, I would phrase this differently.

      Good luck with any revision!

      Olmo van den Akker (ovdakker@gmail.com)

      References

      Grand, J. A., Rogelberg, S. G., Banks, G. C., Landis, R. S., & Tonidandel, S. (2018). From outcome to process focus: Fostering a more robust psychological science through registered reports and results-blind reviewing. Perspectives on Psychological Science, 13(4), 448-456.

      Ross-Hellauer, T. (2017). What is open peer review? A systematic review. F1000Research, 6.

      Sarafoglou, A., Kovacs, M., Bakos, B., Wagenmakers, E. J., & Aczel, B. (2022). A survey on how preregistration affects the research workflow: Better science but more work. Royal Society Open Science, 9(7), 211997.

      Scheel, A. M., Schijen, M. R., & Lakens, D. (2021). An excess of positive results: Comparing the standard psychology literature with registered reports. Advances in Methods and Practices in Psychological Science, 4(2), 25152459211007467.

      Schroter, S., Black, N., Evans, S., Carpenter, J., Godlee, F., & Smith, R. (2004). Effects of training on quality of peer review: randomised controlled trial. Bmj, 328(7441), 673.

      Schroter, S., Black, N., Evans, S., Godlee, F., Osorio, L., & Smith, R. (2008). What errors do peer reviewers detect, and does training improve their ability to detect them?. Journal of the Royal Society of Medicine, 101(10), 507-514.

      Soderberg, C. K., Errington, T. M., Schiavone, S. R., Bottesini, J., Thorn, F. S., Vazire, S., ... & Nosek, B. A. (2021). Initial evidence of research quality of registered reports compared with the standard publishing model. Nature Human Behaviour, 5(8), 990-997.

      Syed, M. (2023). Some data indicating that editors and reviewers do not check preregistrations during the review process. PsyArXiv Preprints.

      Locascio, J. J. (2017). Results blind science publishing. Basic and applied social psychology, 39(5), 239-246.

      Woznyj, H. M., Grenier, K., Ross, R., Banks, G. C., & Rogelberg, S. G. (2018). Results-blind review: A masked crusader for science. European Journal of Work and Organizational Psychology, 27(5), 561-576.

    2. Response to the Editors and the Reviewers

      I am sincerely grateful to the editors and peer reviewers at MetaROR for their detailed feedback and valuable comments and suggestions. I have addressed each point below.

      Handling editor

      1. “However, the article’s progression and arguments, along with what it seeks to contribute to the literature need refinement and clarification. The argument for PRC is under-developed due to a lack of clarity about what the article means by scientific communication. Clarity here might make the endorsement of PRC seem like less of a foregone conclusion.”

      The structure of the paper (and discussion) has changed significantly to address the feedback.

      2. “I strongly endorse the main theme of most of the reviews, which is that the progression and underlying justifications for this article’s arguments needs a great deal of work. In my view, this article’s main contribution seems to be the evaluation of the three peer review models against the functions of scientific communication. I say ‘seems to be’ because the article is not very clear on that and I hope you will consider clarifying what your manuscript seeks to add to the existing work in this field. In any case, if that assessment of the three models is your main contribution, that part is somewhat underdeveloped. Moreover, I never got the sense that there is clear agreement in the literature about what the tenets of scientific communication are. Note that scientific communication is a field in its own right.”

      I have implemented a more rigorous approach to argumentation in response. “Scientific communication” was replaced by “scholarly communication.”

      3. “I also agree that paper is too strongly worded at times, with limitations and assumptions in the analysis minimised or not stated. For example, all of the typologies and categories drawn could easily be reorganised and there is a high degree of subjectivity in this entire exercise. Subjective choices should be highlighted and made salient for the reader. Note that greater clarity, rigour, and humility may also help with any alleged or actual bias.”

      I have incorporated the conceptual framework and description of the research methodology. However, the Discussion section reflects my personal perspective in some points, which I have explicitly highlighted to ensure clarity.

      4. “I agree with Reviewer 3 that the ‘we’ perspective is distracting.”

      This has been fixed.

      5. “The paragraph starting with ‘Nevertheless’ on page 2 is very long.”

      The text was restructured.

      6. “There are many points where language could be shortened for readability, for example:

      Page 3: ‘decision on publication’ could be ‘publication decision’.

      Page 5: ‘efficiency of its utilization’ could be ‘its efficiency’.

      Page 7: ‘It should be noted…’ could be ‘Note that…’.”

      I have proofread the text.

      7. “Page 7: ‘It should be noted that..’ – this needs a reference.”

      This statement has been moved to the Discussion section, paraphrased, and reference added

      “It should be also noted that peer review innovations pull in opposing directions, with some aiming to increase efficiency and reduce costs, while others aim to promote rigor and increase costs (Kaltenbrunner et al., 2022).”

      8. “I’m not sure that registered reports reflect a hypothetico-deductive approach (page 6). For instance, systematic reviews (even non-quantitative ones) are often published as registered reports and Cochrane has required this even before the move towards registered reports in quantitative psychology.”

      I have added this clarification.

      9. “I agree that modular publishing sits uneasily as its own chapter.”

      Modular publishing has been combined with registered reports into the deconstructed publication group of models, now Section 5.1.

      10. “Page 14: ‘The "Publish-Review-Curate" model is universal that we expect to be the future of scientific publishing. The transition will not happen today or tomorrow, but in the next 5-10 years, the number of projects such as eLife, F1000Research, Peer Community in, or MetaROR will rapidly increase’. This seems overly strong (an example of my larger critique and that of the reviewers).”

      This part of the text has been rewritten.

      Reviewer 1

      11. “For example, although Model 3 is less chance to insert bias to the readers, it also weakens the filtering function of the review system. Let’s just think about the dangers of machine-generated articles, paper-mills, p-hacked research reports and so on. Although the editors do some pre-screening for the submissions, in a world with only Model 3 peer review the literature could easily get loaded with even more ‘garbage’ than in a model where additional peers help the screening.”

      I think that generated text is better detected by software tools. At the same time, I tried and described the pros and cons of different models in a more balanced way in the concluding section.

      12. “Compared to registered reports other aspects can come to focus that Model 3 cannot cover. It’s the efficiency of researchers’ work. In the care of registered reports, Stage 1 review can still help researchers to modify or improve their research design or data collection method. Empirical work can be costly and time-consuming and post-publication review can only say that ‘you should have done it differently then it would make sense’.”

      Thank you very much for this valuable contribution, I have added this statement at P. 11.

      13. “Finally, the author puts openness as a strength of Model 3. In my eyes, openness is a separate question. All models can work very openly and transparently in the right circumstances. This dimension is not an inherent part of the models.”

      I think that the model, providing peer reviews to all the submissions, ensures maximum transparency. However, I have made effort to make the wording more balanced and distinguish my personal perspective from the literature.

      14. “In conclusion, I would not make verdict over the models, instead emphasize the different functions they can play in scientific communication.”

      This idea has been reflected now in the concluding section.

      15. “A minor comment: I found that a number of statements lack references in the Introduction. I would have found them useful for statements such as ‘There is a point of view that peer review is included in the implicit contract of the researcher.’”

      Thank you for your feedback. I have implemented a more rigorous approach to argumentation in response.

      Reviewer 2

      16. “The primary weakness of this article is that it presents itself as an 'analysis' from which they 'conclude' certain results such as their typology, when this appears clearly to be an opinion piece. In my view, this results in a false claim of objectivity which detracts from what would

      otherwise be an interesting and informative, albeit subjective, discussion, and thus fails to discuss the limitations of this approach.”

      I have incorporated the conceptual framework and description of the research methodology. However, the Discussion section reflects my personal perspective in some points, which I have explicitly highlighted to ensure clarity.

      17. “A secondary weakness is that the discussion is not well structured and there are some imprecisions of expression that have the potential to confuse, at least at first.”

      The structure of the paper (and discussion) has changed significantly.

      18. “The evidence and reasoning for claims made is patchy or absent. One instance of the former is the discussion of bias in peer review. There are a multitude of studies of such bias and indeed quite a few meta-analyses of these studies. A systematic search could have been done here but there is no attempt to discuss the totality of this literature. Instead, only a few specific studies are cited. Why are these ones chosen? We have no idea. To this extent I am not convinced that the references used here are the most appropriate.”

      I have reviewed the existing references and incorporated additional sources. However, the study does not claim to conduct a systematic literature review; rather, it adopts an interpretative approach to literature analysis.

      19. “Instances of the latter are the claim that ‘The most well-known initiatives at the moment are ResearchEquals and Octopus’ for which no evidence is provided, the claim that ‘we believe that journal-independent peer review is a special case of Model 3’ for which no further argument is provided, and the claim that ‘the function of being the "supreme judge" in deciding what is "good" and "bad" science is taken on by peer review’ for which neither is provided.

      Thank you for your feedback. I have implemented a more rigorous approach to argumentation in response.

      20. “A particular example of this weakness, which is perhaps of marginal importance to the overall paper but of strong interest to this reviewer is the rather odd engagement with history within the paper. It is titled "Evolution of Peer Review" but is really focussed on the contemporary state-of-play. Section 2 starts with a short history of peer review in scientific publishing, but that seems intended only to establish what is described as the 'traditional' model of peer review. Given that that short history had just shown how peer review had been continually changing in character over centuries - and indeed Kochetkov goes on to describe further changes - it is a little difficult to work out what 'traditional' might mean here; what was 'traditional' in 2010 was not the same as what was 'traditional' in 1970. It is not clear how seriously this history is being taken. Kochetkov has earlier written that "as early as the beginning of the 21st century, it was argued that the system of peer review is 'broken'" but of course criticisms - including fundamental criticisms - of peer review are much older than this. Overall, this use of history seems designed to privilege the experience of a particular moment in time, that coincides with the start of the metascience reform movement.”

      While the paper addresses some aspects of peer review history, it does not provide a comprehensive examination of this topic. A clarifying statement to this effect has been included in the methodology section.

      “… this section incorporates elements of historical analysis, it does not fully qualify as such because primary sources were not directly utilized. Instead, it functions as an interpretative literature review, and one that is intentionally concise, as a comprehensive history of peer review falls outside the scope of this research”.

      21. “Section 2 also demonstrates some of the second weakness described, a rather loose structure. Having moved from a discussion of the history of peer review to detail the first model, 'traditional' peer review, it then also goes on to describe the problems of this model. This part of the paper is one of the best - and best - evidenced. Given the importance of it to the main thrust of the discussion it should probably have been given more space as a Section all on its own.”

      This section (now Section 4) has been extended, see also previous comment.

      22. “Another example is Section 4 on Modular Publishing, in which Kochetkov notes "Strictly speaking, modular publishing is primarily an innovative approach for the publishing workflow in general rather than specifically for peer review." Kochetkov says "This is why we have placed this innovation in a separate category" but if it is not an innovation in peer review, the bigger question is 'Why was it included in this article at all?'.”

      Modular publishing has been combined with registered reports into the deconstructed publication group of models, now Section 5.1.

      23. “One example of the imprecisions of language is as follows. The author also shifts between the terms 'scientific communication' and 'science communication' but, at least in many contexts familiar to this reviewer, these are not the same things, the former denoting science-internal dissemination of results through publication (which the author considers), conferences and the like (which the author specifically excludes) while the latter denotes the science-external public dissemination of scientific findings to non-technical audiences, which is entirely out of scope for this article.”

      Thank you for your remark. As a non- native speaker, I initially did not grasp the distinction between the terms. However, I believe the phrase ‘scholarly communication’ is the most universally applicable term. This adjustment has now been incorporated into the text.

      24. “A final note is that Section 3, while an interesting discussion, seems largely derivative from a typology of Waltman, with the addition of a consideration of whether a reform is 'radical' or 'incremental', based on how 'disruptive' the reform is. Given that this is inherently a subjective decision, I wonder if it might not have been more informative to consider 'disruptiveness' on a scale and plot it accordingly. This would allow for some range to be imagined for each reform as well; surely reforms might be more or less disruptive depending on how they are implemented. Given that each reform is considered against each model, it is somewhat surprising that this is not presented in a tabular or graphical form.”

      Ultimately, I excluded this metric due to its current reliance on purely subjective judgment. Measuring 'disruptiveness', e.g., through surveys or interviews remains a task for future research.

      25. “Reconceptualize this as an opinion piece. Where systematic evidence can be drawn upon to make points, use that, but don't be afraid to just present a discussion from what is clearly a well-informed author.”

      I cannot definitively classify this work as an opinion piece. In fact, this manuscript synthesizes elements of a literature review, research article, and opinion essay. My idea was to integrate the strengths of all three genres.

      26. “Reconsider the focus on history and 'evolution' if the point is about the current state of play and evaluation of reforms (much as I would always want to see more studies on the history and evolution of peer review).”

      I have revised the title to better reflect the study’s scope and explicitly emphasize its focus on contemporary developments in the field.

      “Peer Review at the Crossroads”

      27. “Consider ways in which the typology might be expanded, even if at subordinate level.”

      I have updated the typology and introduced the third tier, where it is applicable (see Fig.2).

      Reviewer 3

      28. “In my view, the biggest issue with the current peer review system is the low quality of reviews, but the manuscript only mentions this fleetingly. The current system facilitates publication bias, confirmation bias, and is generally very inconsistent. I think this is partly due to reviewers’ lack of accountability in such a closed peer review system, but I would be curious to hear the author’s ideas about this, more elaborately than they provide them as part of issue 2.

      I have elaborated on this issue in the footnote.

      29. “I’m missing a section in the introduction on what the goals of peer review are or should be. You mention issues with peer review, and these are mostly fair, but their importance is only made salient if you link them to the goals of peer review. The author does mention some functions of peer review later in the paper, but I think it would be good to expand that discussion and move it to a place earlier in the manuscript.”

      The functions of peer review are summarized in the first paragraph of Introduction.

      30. “Table 1 is intuitive but some background on how the author arrived at these categorizations would be welcome. When is something incremental and when is something radical? Why are some innovations included but not others (e.g., collaborative peer review, see https://content.prereview.org/how-collaborative-peer-review-can-transform-scientific-research/)?”

      Collaborative peer review, namely, Prereview was mentioned in the context of Model 3 (Publish-Review-Curate). However, I have extended this part of the paper.

      31“‘Training of reviewers through seminars and online courses is part of the strategies of many publishers. At the same time, we have not been able to find statistical data or research to assess the effectiveness of such training.’ (p. 5)  There is some literature on this, although not recent. See work by Sara Schroter for example, Schroter et al., 2004; Schroter et al., 2008)”

      Thank you very much, I have added these studies and a few more recent ones.

      32. “‘It should be noted that most initiatives aimed at improving the quality of peer review simultaneously increase the costs.’ (p. 7) This claim needs some support. Please explicate why this typically is the case and how it should impact our evaluations of these initiatives.”

      I have moved this part to the Discussion section.

      33. “I would rephrase “Idea of the study” in Figure 2 since the other models start with a tangible output (the manuscript). This is the same for registered reports where they submit a tangible report including hypotheses, study design, and analysis plan. In the same vein, I think study design in the rest of the figure might also not be the best phrasing. Maybe the author could use the terminology used by COS (Stage 1 manuscript, and Stage 2 manuscript, see Details & Workflow tab of https://www.cos.io/initiatives/registered-reports). Relatedly, “Author submits the first version of the manuscript” in the first box after the ‘Manuscript (report)’ node maybe a confusing phrase because I think many researchers see the first version of the manuscript as the stage 1 report sent out for stage 1 review.”

      Thank you very much. Stage 1 and Stage 2 manuscripts look like suitable labelling solution.

      34. “One pathway that is not included in Figure 2 is that authors can decide to not conduct the study when improvements are required. Relatedly, in the publish-review-curate model, is revising the manuscripts based on the reviews not optional as well? Especially in the case of

      3a, authors can hardly be forced to make changes even though the reviews are posted on the platform.”

      All the four models imply a certain level of generalization; thus, I tried to avoid redundant details. However, I have added this choice to the PRC model (now, Model 4).

      35. “I think the author should discuss the importance of ‘open identities’ more. This factor is now not explicitly included in any of the models, while it has been found to be one of the main characteristics of peer review systems (Ross-Hellauer, 2017).”

      This part has been extended.

      36. “More generally, I was wondering why the author chose these three models and not others. What were the inclusion criteria for inclusion in the manuscript? Some information on the underlying process would be welcome, especially when claims like ‘However, we believe that journal-independent peer review is a special case of Model 3 (‘Publish-Review-Curate’).’ are made without substantiation.”

      The study included four generalized models of peer review that involved some level of abstraction.

      37. “Maybe it helps to outline the goals of the paper a bit more clearly in the introduction. This helps the reader to know what to expect.”

      The Introduction has been revised including the goal and objectives.

      38. “The Modular Publishing section is not inherently related to peer review models, as you mention in the first sentence of that paragraph. As such, I think it would be best to omit this section entirely to maintain the flow of the paper. Alternatively, you could shortly discuss it in the discussion section but a separate paragraph seems too much from my point of view.”

      Modular publishing has been combined with registered reports into the fragmented publishing group of models, now in Section 5.

      39. “Labeling model 3 as post-publication review might be confusing to some readers. I believe many researchers see post-publication review as researchers making comments on preprints, or submitting commentaries to journals. Those activities are substantially different from the publish-review-curate model so I think it is important to distinguish between these types.”

      The label was changed into Publish- Review-Curate model.

      40. “I do not think the conclusions drawn below Table 3 logically follow from the earlier text. For example, why are “all functions of scientific communication implemented most quickly and transparently in Model 3”? It could be that the entire process takes longer in Model 3 (e.g. because reviewers need more time), so that Model 1 and Model 2 lead to outputs quicker. The same holds for the following claim: ‘The additional costs arising from the independent assessment of information based on open reviews are more than compensated by the emerging opportunities for scientific pluralism.’ What is the empirical evidence for this? While I personally do think that Model 3 improves on Model 1, emphatic statements like this require empirical evidence. Maybe the author could provide some suggestions on how we can attain this evidence. Model 2 does have some empirical evidence underpinning its validity (see Scheel, Schijen, Lakens, 2021; Soderberg et al., 2021; Sarafoglou et al. 2022) but more meta-research inquiries into the effectiveness and cost-benefits ratio of registered reports would still be welcome in general.”

      The Discussion section has been substantially revised to address this point. While I acknowledge the current scarcity of empirical studies on innovative peer review models, I have incorporated a critical discussion of this methodological gap. I am grateful for the suggested literature on RRs, which I have now integrated into the relevant subsection.

      41. “What is the underlaying source for the claim that openness requires three conditions?”

      I have made effort to clarify within the text that this reflects my personal stance.

      42. “‘If we do not change our approach, science will either stagnate or transition into other forms of communication.’ (p. 2) I don’t think this claim is supported sufficiently strongly. While I agree there are important problems in peer review, I think would need to be a more in-depth and evidence-based analysis before claims like this can be made.”

      The sentence has been rephrased.

      43. “On some occasions, the author uses ‘we’ while the study is single authored.”

      This has been fixed.

      44. “Figure 1: The top-left arrow from revision to (re-)submission is hidden”

      I have updated Figure 1.

      45. “‘The low level of peer review also contributes to the crisis of reproducibility in scientific research (Stoddart, 2016).’ (p. 4) I assume the author means the low quality of peer review.”

      This has been fixed.

      46. “‘Although this crisis is due to a multitude of factors, the peer review system bears a significant responsibility for it.’ (p. 4) This is also a big claim that is not substantiated”

      I have paraphrased this sentence as “While multiple factors drive this crisis, deficiencies in the peer review process remain a significant contributor.” and added a footnote.

      47. “‘Software for automatic evaluation of scientific papers based on artificial intelligence (AI) has emerged relatively recently” (p. 5) The author could add RegCheck (https://regcheck.app/) here, even though it is still in development. This tool is especially salient in light of the finding that preregistration-paper checks are rarely done as part of reviews (see Syed, 2023)”

      Thank you very much, I have added this information.

      48. “There is a typo in last box of Figure 1 (‘decicion’ instead of ‘decision’). I also found typos in the second box of Figure 2, where ‘screns’ should be ‘screens’, and the author decision box where ‘desicion’ should be ‘decision’”

      This has been fixed.

      49. “Maybe it would be good to mention results blinded review in the first paragraph of 3.2. This is a form of peer review where the study is already carried out but reviewers are blinded to the results. See work by Locascio (2017), Grand et al. (2018), and Woznyj et al. (2018).”

      Thanks, I have added this (now section 5.2)

      50. “Is ‘Not considered for peer review’ in figure 3b not the same as rejected? I feel that it is rejected in the sense that neither the manuscript not the reviews will be posted on the platform.”

      Changed into “Rejected”

      51. “‘In addition to the projects mentioned, there are other platforms, for example, PREreview12, which departs even more radically from the traditional review format due to the decentralized structure of work.’ (p. 11) For completeness, I think it would be helpful to add some more information here, for example why exactly decentralization is a radical departure from the traditional model.”

      I have extended this passage.

      52. “‘However, anonymity is very conditional - there are still many “keys” left in the manuscript, by which one can determine, if not the identity of the author, then his country, research group, or affiliated organization.’ (p.11) I would opt for the neutral ‘their’ here instead of ‘his’, especially given that this is a paragraph about equity and inclusion.”

      This has been fixed.

      53. “‘Thus, “closeness” is not a good way to address biases.’ (p. 11) This might be a straw man argument because I don’t believe researchers have argued that it is a good method to combat biases. If they did, it would be good to cite them here. Alternatively, the sentence could be

      omitted entirely.

      I have omitted the sentence.

      54. “I would start the Modular Publishing section with the definition as that allows readers to interpret the other statements better.”

      Modular publishing has been combined with registered reports into the deconstructed publication group of models, now in Section 5, general definition added.

      55. “It would be helpful if the Models were labeled (instead of using Model 1, Model 2, and Model 3) so that readers don’t have to think back what each model involved.”

      All the models represent a kind of generalization, which is why non-detailed labels are used. The text labels may vary depending on the context.

      56. “Table 2: ‘Decision making’ for the editor’s role is quite broad, I recommend to specify and include what kind of decisions need to be made.”

      Changed into “Making accept/reject decisions”

      57. “Table 2: ‘Aim of review’ – I believe the aim of peer review differs also within these models (see the ‘schools of thought’ the author mentions earlier), so maybe a statement on what the review entails would be a better way to phrase this.”

      Changed into “What does peer review entail?”

      58. “Table 2: One could argue that the object of the review’ in Registered Reports is also the manuscript as a whole, just in different stages. As such, I would phrase this differently.

      Current wording fits your remark: “Manuscript in terms of study design and execution”

      Reviewer 4

      59. “Page 3: It’s hard to get a feel for the timeline given the dates that are described. We have peer review becoming standard after WWII (after 1945), definitively established by the second half of the century, an example of obligatory peer review starting in 1976, and in crisis by the end of the 20th century. I would consider adding examples that better support this timeline – did it become more common in specific journals before 1976? Was the crisis by the end of the 20th century something that happened over time or something that was already intrinsic to the institution? It doesn’t seem like enough time to get established and then enter crisis, but more details/examples could help make the timeline clear. Consider discussing the benefits of the traditional model of peer review.”

      This section has been extended.

      60. “Table 1 – Most of these are self-explanatory to me as a reader, but not all. I don’t know what a registered report refers to, and it stands to reason that not all of these innovations are familiar to all readers. You do go through each of these sections, but that’s not clear when I initially look at the table. Consider having a more informative caption. Additionally, the left column is “Course of changes” here but “Directions” in text. I’d pick one and go with it for consistency.”

      Table 1 has been replaced by Figure 2. I have also extended text descriptions, added definitions.

      61. “With some of these methods, there’s the ability to also submit to a regular journal. Going to a regular journal presumably would instigate a whole new round of review, which may or may not contradict the previous round of post-publication review and would increase the length of time to publication by going through both types. If someone has a goal to publish in a journal, what benefit would they get by going through the post-publication review first, given this extra time?”

      Some of these platforms, e.g., F1000, Lifecycle Journal, replace conventional journal publishing. Modular publishing allows for step-by-step feedback from peers. An important advantage of RRs over other peer review models lies in their capacity to enhance research efficiency. By conducting peer review at Stage 1, researchers gain the opportunity to refine their study design or data collection protocols before empirical work begins. Other models of review can offer critiques such as "the study should have been conducted differently" without actionable opportunity for improvement. The key motivation for having my paper reviewed in MetaROR is the quality of peer review – I have never received so many comments, frankly! Moreover, platforms such as MetaROR usually have partnering journals.

      62. “There’s a section talking about institutional change (page 14). It mentions that openness requires three conditions – people taking responsibility for scientific communication, authors and reviewers, and infrastructure. I would consider adding some discussion of readers and evaluators. Readers have to be willing to accept these papers as reliable, trustworthy, and respectable to read and use the information in them. Evaluators such as tenure committees and potential employers would need to consider papers submitted through these approaches as evidence of scientific scholarship for the effort to be worthwhile for scientists.”

      I have omitted these conditions and employed the Moore’s Technology Adoption Life Cycle. Thank you very much for your comment!

      63. Based on this overview, which seems somewhat skewed towards the merits of these methods (conflict of interest, limited perspective on downsides to new methods/upsides to old methods), I am not quite ready to accept this effort as equivalent of a regular journal and pre-publication peer review process. I look forward to learning more about the approach and seeing this review method in action and as it develops.

      The Discussion section has been substantially revised to address this point. While I acknowledge the current scarcity of empirical studies on innovative peer review models, I have incorporated a critical discussion of this methodological gap.

    1. Reviewer #2 (Public review):

      Summary:

      HIV-1 infection induces CPSF6 aggregates in the nucleus that contain the viral protein CA. The study of the functions and composition of these nuclear aggregates have raised considerable interest in the field, and they have emerged as sites in which reverse transcription is completed and in the proximity of which viral DNA becomes integrated. In this work, the authors have mutated several regions of the CPSF6 protein to identify the domains important for nuclear aggregation, in addition to the already-known FG-region; they have characterized the kinetics of fusion between CPSF6 aggregates and SC35 nuclear speckles and have determined the role of two nuclear speckle components in this process (SRRM2, SUN2).

      Strengths:

      The work examines systematically the domains of CPSF6 of importance for nuclear aggregate formation in an elegant manner in which these mutants complement an otherwise CPSF6-KO cell line. In addition, this work evidences a novel role for the protein SRRM2 in HIV-induced aggregate formation, overall advancing our comprehension of the components required for their formation and regulation.

    2. Author response:

      The following is the authors’ response to the original reviews.

      Reviewer #1 (Public review): 

      In recent years, our understanding of the nuclear steps of the HIV-1 life cycle has made significant advances. It has emerged that HIV-1 completes reverse transcription in the nucleus and that the host factor CPSF6 forms condensates around the viral capsid. The precise function of these CPSF6 condensates is under investigation, but it is clear that the HIV-1 capsid protein is required for their formation. This study by Tomasini et al. investigates the genesis of the CPSF6 condensates induced by HIV-1 capsid, what other co-factors may be required, and their relationship with nuclear speckels (NS). The authors show that disruption of the condensates by the drug PF74, added post-nuclear entry, blocks HIV-1 infection, which supports their functional role. They generated CPSF6 KO THP-1 cell lines, in which they expressed exogenous CPSF6 constructs to map by microscopy and pull down assays of the regions critical for the formation of condensates. This approach revealed that the LCR region of CPSF6 is required for capsid binding but not for condensates whereas the FG region is essential for both. Using SON and SRRM2 as markers of NS, the authors show that CPSF6 condensates precede their merging with NS but that depletion of SRRM2, or SRRM2 lacking the IDR domain, delays the genesis of condensates, which are also smaller. 

      The study is interesting and well conducted and defines some characteristics of the CPSF6-HIV-1 condensates. Their results on the NS are valuable. The data presented are convincing. 

      I have two main concerns. Firstly, the functional outcome of the various protein mutants and KOs is not evaluated. Although Figure 1 shows that disruption of the CPSF6 puncta by PF74 impairs HIV-1 infection, it is not clear if HIV-1 infection is at all affected by expression of the mutant CPSF6 forms (and SRRM2 mutants) or KO/KD of the various host factors. The cell lines are available, so it should be possible to measure HIV-1 infection and reverse transcription. Secondly, the authors have not assessed if the effects observed on the NS impact HIV-1 gene expression, which would be interesting to know given that NS are sites of highly active gene transcription. With the reagents at hand, it should be possible to investigate this too. 

      We thank the reviewer for her/his valuable feedback on our manuscript. We are pleased to see her/his appreciation of our results, and we did our utmost to address the highlighted points to further improve our work.

      To correctly perform the infectivity assay, we generated stable cell clones—a process that required considerable time, particularly during the selection of clones expressing protein levels comparable to wild-type (WT) cells. To accurately measure infectivity, it was essential to use stable clones expressing the most important deletion mutant, ∆FG CPSF6, at levels similar to those of CPSF6 in WT cells (new Fig.5 A-B). Importantly, we assessed the reproducibility of our experiments by freezing and thawing these clones.

      Regarding SRRM2, in THP-1 cells we were only able to achieve a knockdown, which still retains residual SRRM2 protein, albeit at much lower levels. Due to the essential role of SRRM2 in cell survival, obtaining a complete knockout in this cell line is not feasible, making it difficult to draw definitive conclusions from these experiments.

      In contrast, 293T cells carrying the endogenous SRRM2 deletion mutant (ΔIDR) cannot be infected with replication-competent HIV-1, as they lack expression of CD4 and either CCR4 or CCR5. These cells were instead used to monitor the dynamics of CPSF6 puncta assembly within nuclear speckles. However, they are not a suitable model for studying the impact of the depletion of SRRM2 in viral infection.

      Thus, we performed infectivity assays in a more relevant cell line for HIV-1 infection, THP-1 macrophage-like cells, using both a single-round virus and a replication-competent virus. The new results, shown in Figure 5 C-D, indicate that complete depletion of CPSF6 reduces infectivity, as measured by luciferase expression in a single-round infection (KO: ~65%; ΔFG: ~74%; compared to WT: 100% on average). Notably, a more pronounced defect in viral particle production was observed when WT virus was used for infection (KO: ~21%; ΔFG: ~16%; compared to WT: 100% on average). These findings support the referee’s insightful suggestion that the absence of CPSF6 could also impair HIV-1 gene expression. 

      Reviewer #2 (Public review): 

      Summary: 

      HIV-1 infection induces CPSF6 aggregates in the nucleus that contain the viral protein CA. The study of the functions and composition of these nuclear aggregates have raised considerable interest in the field, and they have emerged as sites in which reverse transcription is completed and in the proximity of which viral DNA becomes integrated. In this work, the authors have mutated several regions of the CPSF6 protein to identify the domains important for nuclear aggregation, in addition to the alreadyknown FG region; they have characterized the kinetics of fusion between CPSF6 aggregates and SC35 nuclear speckles and have determined the role of two nuclear speckle components in this process (SRRM2, SUN2). 

      Strengths: 

      The work examines systematically the domains of CPSF6 of importance for nuclear aggregate formation in an elegant manner in which these mutants complement an otherwise CPSF6-KO cell line. In addition, this work evidences a novel role for the protein SRRM2 in HIV-induced aggregate formation, overall advancing our comprehension of the components required for their formation and regulation. 

      Weaknesses: 

      Some of the results presented in this manuscript, in particular the kinetics of fusion between CPSF6aggregates and SC35 speckles have been published before (PMID: 32665593; 32997983). 

      The observations of the different effects of CPSF6 mutants, as well as SRRM2/SUN2 silencing experiments are not complemented by infection data which would have linked morphological changes in nuclear aggregates to function during viral infection. More importantly, these functional data could have helped stratify otherwise similar morphological appearances in CPSF6 aggregates. 

      Overall, the results could be presented in a more concise and ordered manner to help focus the attention of the reader on the most important issues. Most of the figures extend to 3-4 different pages and some information could be clearly either aggregated or moved to supplementary data. 

      First, we thank the reviewer for her/his appreciation of our study and to give to us the opportunity to better explain our results and to improve our manuscript. We appreciate the reviewer’s positive feedback on our study, and we will do our best to address her/his concerns. In the meantime, we would like to clarify the focus of our study. Our research does not aim to demonstrate an association between CPSF6 condensates (we use the term "condensates" rather than "aggregates," as aggregates are generally non-dynamic (Alberti & Hyman, 2021; Banani et al., 2017; Scoca et al., JMCB 2022), and our work specifically examines the dynamic behavior of CPSF6 puncta formed during infection and nuclear speckles. The association between CPSF6 puncta and NS has already been established in previous studies, as noted in the manuscript (PMID: 32665593; 32997983). The previous studies (PMID: 32665593; 32997983) showed that CPSF6 puncta colocalize with SC35 upon HIV infection and in the submitted study we study their kinetics.

      About the point highlighted by the reviewer: "Kinetics of fusion between CPSF6-aggregates and SC35 speckles have been published before."  

      Our study differs from prior work PMID 32665593 because we utilize a full-length HIV genome, and we did not follow the integrase (IN) fluorescence in trans and its association with CPSF6 but we specifically assess if CPSF6 clusters form in the nucleus independently of NS factors and next to fuse with them. In the current study we evaluated the dynamics of formation of CPSF6/NS puncta, which it has not been explored before. Given this focus, we believe that our work offers a novel perspective on the molecular interactions that facilitate HIV / CPSF6-NS fusion.

      We calculated that 27% of CPSF6 clusters were independent from NS at 6 h post-infection, compared to only 9% at 30 h. This likely reflects a reduction in individual clusters as more become fused with nuclear speckles over time. At the same time, these data suggest that the fusion process can begin even earlier. Indeed, it has been reported that in macrophages, the peak of viral nuclear import occurs before 6 h post-infection (doi: 10.1038/s41564-020-0735-8).

      In addition, we have incorporated new experiments assessing viral infectivity in the absence of CPSF6, or in CPSF6-knockout cells expressing either a CPSF6 mutant lacking the FG peptide or the WT protein. As shown in our new Figure 5, these results demonstrate that the FG peptide is critical for viral replication in THP-1 cells.

      For better clarity, we would like to specify that our study focuses on the role of SON, a scaffold factor of nuclear speckles, rather than SUN2 (SUN domain-containing protein 2), which is a component of the LINC (Linker of Nucleoskeleton and Cytoskeleton) complex.

      As suggested by the reviewer, we have revised the text and combined figures to improve clarity and facilitate reader comprehension. We appreciate the constructive comment of the reviewer.

      Reviewer #3 (Public review): 

      In this study, the authors investigate the requirements for the formation of CPSF6 puncta induced by HIV-1 under a high multiplicity of infection conditions. Not surprisingly, they observe that mutation of the Phe-Gly (FG) repeat responsible for CPSF6 binding to the incoming HIV-1 capsid abrogates CPSF6 punctum formation. Perhaps more interestingly, they show that the removal of other domains of CPSF6, including the mixed-charge domain (MCD), does not affect the formation of HIV-1-induced CPSF6 puncta. The authors also present data suggesting that CPSF6 puncta form individual before fusing with nuclear speckles (NSs) and that the fusion of CPSF6 puncta to NSs requires the intrinsically disordered region (IDR) of the NS component SRRM2. While the study presents some interesting findings, there are some technical issues that need to be addressed and the amount of new information is somewhat limited. Also, the authors' finding that deletion of the CPSF6 MCD does not affect the formation of HIV-1-induced CPSF6 puncta contradicts recent findings of Jang et al. (doi.org/10.1093/nar/gkae769). 

      We thank the reviewer for her/his thoughtful feedback and the opportunity to elaborate on why our findings provide a distinct perspective compared to those of Jang et al. (doi.org/10.1093/nar/gkae769).

      One potential reason for the differences between our findings and those of Jang et al. could be the choice of experimental systems. Jang et al. conducted their study in HEK293T cells with CPSF6 knockouts, as described in Sowd et al., 2016 (doi.org/10.1073/pnas.1524213113). In contrast, our work focused on macrophage-like THP-1 cells, which share closer characteristics with HIV-1’s natural target cells. 

      Our approach utilized a complete CPSF6 knockout in THP-1 cells, enabling us to reintroduce untagged versions of CPSF6, such as wild-type and deletion mutants, to avoid potential artifacts from tagging. Jang et al. employed HA-tagged CPSF6 constructs, which may lead to subtle differences in experimental outcomes due to the presence of the tag.

      Finally, our investigation into the IDR of SRRM2 relied on CRISPR-PAINT to generate targeted deletions directly in the endogenous gene (Lester et al., 2021, DOI: 10.1016/j.neuron.2021.03.026). This approach provided a native context for studying SRRM2’s role.

      We will incorporate these clarifications into the discussion section of the revised manuscript.  

      Reviewer #1 (Recommendations for the authors): 

      (1) Figure 2E: The statistical analysis should be extended to the comparison between the "+HIV" samples. 

      We showed the statistics between only HIV+ cells now new Fig. 2D.  

      (2) Figure 4A top panel is out of focus. 

      We modified the figure now figure 6A.

      Reviewer #2 (Recommendations for the authors): 

      (1) Some of the sentences could be rewritten for the sake of simplicity, also taking care to avoid overstatement. 

      We modified the sentences as best as we could.

      (2) For instance: There is no evidence that "viral genomes in nuclear niches may be contributing to the formation of viral reservoirs" (lines 33-35). 

      We changed the sentence as follows: “Despite antiretroviral treatment, viral genomes can persist in these nuclear niches and reactivate upon treatment interruption, raising the possibility that they could play a role in the establishment of viral reservoirs.”

      (3) Line 53: unclear sentence. "The initial stages of the viral life cycle have been understood....." The authors certainly mean reverse transcription, but as formulated this is not clear. The authors should also bear in mind that reverse transcription starts already in budding/just released virions. 

      We clarified the concept as follows: “the initial stages of the viral life cycle, such as the reverse transcription (the conversion of the viral RNA in DNA) and the uncoating (loss of the capsid), have been understood to mainly occur within the host cytoplasm.”

      (4) Line 124: the results in Figure 1 are not at all explained in the text. PF74 does not act on CPSF6, it acts on CA and this in turn leads to CPS6 puncta disappearance. 

      PF74 binds the same hydrophobic pocket of the viral core as CPSF6. However, when viral cores are located within CPSF6 puncta, treatment with a high dose of PF74 leads to a rapid disassembly of these puncta, while viral cores remain detectable up to 2 hours post-treatment (Ay et al., EMBO J. 2024). Here, we simply describe what we observed by confocal microscopy. Said that HIV-Induced CPSF6 Puncta include both CPSF6 proteins and viral cores as we have now specified.

      (5) Line 130; 'hinges into two key ...' should be 'hinges on'. 

      Thanks we modified it.

      (6) Supplementary Figures are not cited sequentially in the text. 

      We have now modified the numbers of the supplementary figures according to their appearance in the text.

      (7) Line 44: define FG. 

      We defined it.

      Reviewer #3 (Recommendations for the authors): 

      Specific comments that the authors should address are outlined below. 

      (1) As mentioned in the summary above, the authors' findings seem to be in direct contradiction with recent work published by Alan Engelman's lab in NAR. The authors should address the possible reason(s) for this discrepancy. 

      We mention the potential reasons for the differences in the results between our study and Engelman’s lab study in the discussion.

      (2) The major finding here that deletion of the CFSF6 FG repeat prevents the formation of CFSP6 puncta is unsurprising, as the FG repeat is responsible for capsid binding. This has been reported previously and such mutants have been used as controls in other studies. 

      Our study demonstrates that the FG domain is the sole region responsible for the formation of CPSF6 puncta, rather than the LCR or MCD domains. The unique role of the FG domain in CPSF6 that promotes the formation of CPSF6 puncta without the help of the other IDRs during viral infection is a finding particularly novel, as it has not yet been reported in the literature.

      (3) Line 339, the authors state: "incoming viral RNA has been observed to be sequestered in nuclear niches in cells treated with the reversible reverse transcriptase inhibitor, NEV. When macrophage-like cells are infected in the presence of NEV, the incoming viral RNA is held within the nucleus (Rensen et al., 2021; Scoca et al., 2023). This scenario is comparable to what is observed in patients undergoing antiretroviral therapy". In what way is this comparable to what is observed in individuals on ART? I see no basis for this statement. Sequestration of viral RNA in the nucleus is not the basis for maintaining the viral reservoir in individuals on therapy. 

      Thanks, we rephrased the sentence.

      (4) General comment: analyzing single-cell-derived KO clones is very risky because of random clonal variability between individual cells in the population. If single-cell-derived clones are used, phenotypes could be confirmed with multiple, independent clones. 

      We used a clone completely KO for CPSF6 mainly to investigate the role of a specific domain in condensate formation and it will be difficult that clone selection could have introduced artifacts in this context. Other available clones retain residual endogenous protein, which prevents us from accurately assessing CPSF6 cluster formation in the various deletion mutants. A complete CPSF6 knockout is essential for studying puncta formation, as it eliminates potential artifacts arising from protein tags that could alter the phase separation properties of the protein under investigation.

      (5) Line 214. "It is predicted to form two short α helices and a ß strand, arranged as: α helix - FG - ß strand - α helix". What is this based on? No citation is provided and no data are shown. 

      In fact, the statement "It is predicted to form two short α helices and a ß strand, arranged as: α helix - FG - ß strand - α helix" is based on the data shown in Figure 4E presenting data generated by PSIPRED. 

      (6) Figure 1B. "Luciferase values were normalized by total proteins revealed with the Bradford kit". What does this mean? I couldn't find anything explaining how the viral inputs were normalized. 

      The amount of the virus used is the same for all samples, we used MOI 10 as described in the legend of Figure 1. It is important to normalize the RLU (luciferase assay) with the total amount of proteins to be sure that we are comparing similar number of cells. Obviously, the cells were plated on the same amount on each well, the normalization in our case it is just an additional important control.

      (7) I can't interpret what is being shown in the movies. 

      We updated the movie 1B and rephrased the movie legends and we added a new suppl. Fig.4B.

      (8) Figure 5B. The differences seen are very small and of questionable significance. The data suggest that by 6 hpi, around 75% of HIV-induced CPSF6 puncta are already fused with NSs. 

      We calculated that 27% of CPSF6 clusters were independent from NS at 6 h post-infection, compared to only 9% at 30 h. This likely reflects a reduction in individual clusters as more become fused with nuclear speckles over time. At the same time, these data suggest that the fusion process can begin even earlier. Indeed, it has been reported that in macrophages, the peak of viral nuclear import occurs before 6 h post-infection (doi: 10.1038/s41564-020-0735-8).

      (9) Figure 6. Immunofluorescence is not a good method for quantifying KD efficiency. The authors should perform western blotting to measure KD efficiency. This is an important point, because the effect sizes are small, quite likely due to incomplete KD. 

      We performed WB and quantified the results, which correlated with the IF data and their imaging analysis. These new findings have been incorporated into Figure 8A. Of note, deletion of the IDR of SRRM2 does not affect the number of SON puncta (Fig.8C), but significantly reduces the number of CPSF6 puncta in infected cells compared to those expressing full-length SRRM2 (Fig.8D).

      (10) There are a variety of issues with the text that should be corrected. 

      The authors use "RT" to mean both the enzyme (reverse transcriptase) and the process (reverse transcription). This is incorrect and will confuse the reader. RT refers to the enzyme (noun, not verb). 

      The commonly used abbreviation for nevirapine is NVP, not NEV. 

      In line 60, it is stated that the capsid contains 250 hexamers. This number is variable, depending on the size and shape of the capsid. By contrast, the capsid has exactly 12 pentamers. 

      Line 75. Typo: "nuclear niches containing, such as like". 

      Line 82. Typo: "the mechanism behinds". 

      Line 102. Typo: "we aim to elucidate how these HIV-induced CPSF6 form". 

      Line 107. Type: "CPSF6 is responsible for tracking the viral core" ("trafficking the viral core"?). 

      Thanks, we corrected all of them.

    1. Reviewer #1 (Public review):

      Zhu and colleagues used high-density Neuropixel probes to perform laminar recordings in V1 while presenting either small stimuli that stimulated the classical receptive field (CRF) or large stimuli whose border straddled the RF to provide nonclassical RF (nCRF) stimulation. Their main question was to understand the relative contribution of feedforward (FF), feedback (FB), and horizontal circuits to border ownership (B<sub>own</sub> ), which they addressed by measuring cross-correlation across layers. They found differences in cross-correlation between feedback/horizontal (FH) and input layers during CRF and nCRF stimulation.

      Comments on revisions:

      In the revision, the authors have added a paragraph in the Discussion to address the question of layers 2/3 neurons leading layer 4 neurons, and have provided answers to the questions in the public review without making substantial changes in the paper. However, there were several other recommendations, which I am not sure why were not considered. I am adding those again below.

      * For CRF stimulation, the zero lag between 4C and 4A/B with layer 5/6 (Figure 3D last two columns on the right) was surprising to me. I just felt that this could be because layer 6 may also be getting FF inputs. Perhaps better not to club layer 5 with 6, as mentioned earlier also.

      * Interpreting the nCRF delays, with often negative delays, was very challenging for me. For example, 4C -> 5/6 (third column in Figure 3) has a significantly negative peak (although that does not show up in statistical analysis because it seems to be a signed test to just test if the median was greater than zero, not if the median was different from zero; line 285). What is the interpretation here? Are spikes in 5/6 causing spikes in 4C (which, as mentioned earlier, would require anatomical projections from 5/6 to 4C)? On the other hand, if FB inputs arrive in 5/6 but there are no inputs going to 4C, then why should there even be a significant cross-correlation?

      The only explanation I could think of is somehow an alignment of inputs in these two layers such that FH inputs come in Layer 5/6 just before FF inputs arrive in 4C, each causing a spike in a neuron in each layer which are otherwise not anatomically interconnected. But this would require both a very precise temporal coupling between FF and FH inputs arriving in these areas AND neurons in layer 5/6 which very strongly respond to FH stimulation (I thought that FH inputs are mainly modulatory and not as strong). Anyway, it would be good to see some cross correlation functions which have a negative lag (all examples in Fig 3B has positive or zero lag).

      * I think cross-correlation analysis would have been useful if there was data from a feedback area (say V2). In its absence, perhaps latency analysis (by just comparing the PSTH) could have revealed something interesting, given that the hypothesis is about differences in the timings in FH versus FF inputs. Do PSTHs across layers show the type of differences that are being claimed (e.g. in line 295-297)?

      * Line 262-63: "Notably, the rates were nearly identical under the two stimulus conditions" - I would have thought CRF stimulation would produce higher rates. Can the authors explain this?

      * Line 174-175: Isn't the proportion of border ownership cells in layer 4C higher than one would expect under the assumption that nCRF effects are mediated by horizontal and feedback connections which layer 4C does not receive? Can authors explain?

      * Figure 3D: it would also be good to show the heatmaps stacked up in the increasing order of the interelectrode distance of the pairs so that it will be easy to see how the peak lag changes with distance as well.

      * It will be good to show the shift in peak lag and CCG asymmetry between CRF and nCRF conditions for the same pairs, using a violin or bar plot with lines connecting each pair in Figure 3.

      * Line 594, 603, 628 and 630: What procedure was used to determine the size, location of the CRF, and optimal orientation manually online?

      * Line 733-734: Although a reference is cited, please explicitly mention the rationale for keeping the peak lag cutoff at 10 ms.

      * It is unclear why a grating was used for the CRF condition, instead of just having the portion of the stimulus within the RF for the nCRF condition, as the comparisons for FHi with FF are with different FF drives in each case.

      * Figure 5 - the scatter is enormous, can you please provide the R2 values?

    2. Author response:

      The following is the authors’ response to the original reviews.

      Reviewer #1 (Public review): 

      Zhu and colleagues used high-density Neuropixel probes to perform laminar recordings in V1 while presenting either small stimuli that stimulated the classical receptive field (CRF) or large stimuli whose border straddled the RF to provide nonclassical RF (nCRF) stimulation. Their main question was to understand the relative contribution of feedforward (FF), feedback (FB), and horizontal circuits to border ownership (Bown), which they addressed by measuring crosscorrelation across layers. They found differences in cross-correlation between feedback/horizontal (FH) and input layers during CRF and nCRF stimulation. 

      Although the data looks high quality and analyses look mostly fine, I had a lot of difficulty understanding the logic in many places. Examples of my concerns are written below. 

      (1) What is the main question? The authors refer to nCRF stimulation emerging from either feedback from higher areas or horizontal connections from within the same area (e.g. lines 136 to 138 and again lines 223-232). I initially thought that the study would aim to distinguish between the two. However, the way the authors have clubbed the layers in 3D, the main question seems to be whether Bown is FF or FH (i.e., feedback and horizontal are clubbed). Is this correct? If so, I don't see the logic, since I can't imagine Bown to be purely FF. Thus, just showing differences between CRF stimulation (which is mainly expected to be FF) and nCRF stimulation is not surprising to me. 

      We thank the reviewer for their thoughtful comments. As explained in the discussion, we grouped cortical layers to reduce uncertainty in precisely assigning laminar boundaries and to increase statistical power. Consequently, this limits our ability to distinguish the relative contributions of feedback inputs, primarily targeting layers 1 and 6, and horizontal connections, mainly within layers 2/3 and 5. Nevertheless, previous findings, especially regarding the rapid emergence of B<sub>own</sub> signals, suggest that feedback is more biologically plausible than horizontal-based mechanisms.

      Importantly, the emergence of B<sub>own</sub> signals in the primate brain should not be taken for granted. Direct physiological evidence that distinguishes feedforward from feedback/horizontal mechanisms has been lacking. While we agree it is unlikely that B<sub>own</sub> is mediated solely by feedforward processing, we felt it was necessary to test this empirically, particularly using highresolution laminar recordings.

      As discussed, feedforward models of B<sub>own</sub> have been proposed (e.g., Super, Romeo, and Keil, 2010; Saki and Nishimura, 2006). These could, in theory, be supported by more general nCRF modulations arising through early feedforward inhibitions, such as those observed in the retinogeniculate pathway (e.g., Webb, Tinsley, Vincent and Derrington, 2005; Blitz and Regehr, 2005; Alitto and Usrey, 2008). However, most B<sub>own</sub> models rely heavily on response latency, yet very few studies have recorded across layers or areas simultaneously to address this directly. Notably, recent findings in area V4 show that B<sub>own</sub> signals emerge earlier in deep layers than in granular (input) layers, suggesting a non-feedforward origin (Franken and Reynolds, 2021).

      Furthermore, although previous studies have shown that the nCRF can modulate firing rates and the timing of neuronal firing across layers, our findings go beyond these effects. We provide clear evidence that nCRF modulation also alters precise spike timing relationships and interlaminar coordination, and that the magnitude of nCRF modulation depends on these interlaminar interactions. This supports the idea that B<sub>own</sub> , or more general nCRF modulation, involves more than local rate changes, reflecting layer-specific network dynamics consistent with feedback or lateral integration.

      (2) Choice of layers for cross-correlation analysis: In the Introduction, and also in Figure 3C, it is mentioned that FF inputs arrive in 4C and 6, while FB/Horizontal inputs arrive at "superficial" and "deep", which I take as layer 2/3 and 5. So it is not clear to me why (i) layer 4A/B is chosen for analysis for Figure 3D (I would have thought layer 6 should have been chosen instead) and (ii) why Layers 5 and 6 are clubbed. 

      We thank the reviewer for raising this important point. The confusion likely stems from our use of the terms “superficial” and “deep” layers when describing the targets of feedback/horizontal inputs. To clarify, by “superficial” and “deep,” we specifically refer to layers 1–3 and layers 5–6, respectively, as illustrated in Figure 3C. Feedback and horizontal inputs relatively avoid entire layer 4, including both 4C and 4A/B.

      We also emphasize that the classification of layers as feedforward or feedback/horizontal recipients is relative rather than absolute. For example, although layer 6 receives both feedforward and feedback/horizontal inputs, it contains a higher proportion of feedback/horizontal inputs compared to layers 4C and 4A/B. 

      We had addressed this rationale in the Discussion, but recognize it may not have been sufficiently emphasized. We have revised the main text accordingly to clarify this point for readers in the final manuscript version.

      (3) Addressing the main question using cross-correlation analysis: I think the nice peaks observed in Figure 3B for some pairs show how spiking in one neuron affects the spiking in another one, with the delay in cross-correlation function arising from the conduction delay. This is shown nicely during CRF stimulation in Figure 3D between 4C -> 2/3, for example. However, the delay (positive or negative) is constrained by anatomical connectivity. For example, unless there are projections from 2/3 back to 4C which causes firing in a 2/3 layer neuron to cause a spike in a layer 4 neuron, we cannot expect to get a negative delay no matter what kind of stimulation (CRF versus nCRF) is used. 

      We thank the reviewer for the insightful comment. The observation that neurons within FH<sub>i</sub> laminar compartments (layers 2/3, 5/6) can lead those in layer 4 (4C, 4A/B) during nCRF stimulation may indeed seem unexpected. However, several anatomical pathways could mediate the propagation of B<sub>own</sub> signals from FH<sub>i</sub> compartments to layer 4. We have revised the Discussion section in the final version of the manuscript to address this point explicitly.

      In Macaque V1, projections from layers 2/3 to 4A/B have been documented (Blasdel et al., 1985; Callaway and Wiser, 1996), and neurons in 4A/B often extend apical dendrites into layers 2/3 (Lund, 1988; Yoshioka et al., 1994). Although direct projections from layers 2/3 to 4C are generally sparse (Callaway, 1998), a subset of neurons in the lower part of layer 3 can give off collateral axons to 4C (Lund and Yoshioka, 1991). Additionally, some 4C neurons extend dendrites into 4B, enabling potential dendritic integration of inputs from more superficial layers (Somogyi and Cowey, 1981; Mates and Lund, 1983; Yabuta and Callaway, 1998). Sparse connections from 2/3 to layer 4 have also been reported in cat V1 (Binzegger, Douglas and Martin, 2004). Moreover, layers 2/3 may influence 4C neurons disynaptically, without requiring dense monosynaptic connections. 

      Importantly, while CCGs can suggest possible circuit arrangements, functional connectivity may arise through mechanisms not fully captured by traditional anatomical tracing. Indeed, the apparent discrepancy between anatomical and functional data is not uncommon. For example, although 4B is known to receive anatomical input primarily from 4Cα, but not 4Cβ, photostimulation experiments have shown that 4B neurons can also be functionally driven by 4Cβ (Sawatari and Callaway, 1996). Our observation of functional inputs from layers 2/3 to layer 4 is also consistent with prior findings in rodent V1, where CCG analysis (e.g., Figure 7 in Senzai, Fernandez-Ruiz and Buzsaki, 2019) or photostimulation (Xu et al., 2016) revealed similar pathways. 

      Layers 5/6 provide dense projections to layers 4A/B (Lund, 1988; Callaway, 1998). In particular, layer 6 pyramidal neurons, especially the subset classified as Type 1 cells, project substantially to layer 4C (Wiser and Callaway, 1996; Fitzpatrick et al., 1985). 

      Reviewer #2 (Public review): 

      Summary: 

      The authors present a study of how modulatory activity from outside the classical receptive field (cRF) differs from cRF stimulation. They study neural activity across the different layers of V1 in two anesthetized monkeys using Neuropixels probes. The monkeys are presented with drifting gratings and border-ownership tuning stimuli. They find that border-ownership tuning is organized into columns within V1, which is unexpected and exciting, and that the flow of activity from cellto-cell (as judged by cross-correlograms between single units) is influenced by the type of visual stimulus: border-ownership tuning stimuli vs. drifting-grating stimuli. 

      Strengths: 

      The questions addressed by the study are of high interest, and the use of Neuropixels probes yields extremely high numbers of single-units and cross-correlation histograms (CCHs) which makes the results robust. The study is well-described. 

      Weaknesses: 

      The weaknesses of the study are (a) the use of anesthetized animals, which raises questions about the nature of the modulatory signal being measured and the underlying logic of why a change in visual stimulus would produce a reversal in information flow through the cortical microcircuit and (b) the choice of visual stimuli, which do not uniquely isolate feedforward from feedback influences. 

      (1) The modulation latency seems quite short in Figure 2C. Have the authors measured the latency of the effect in the manuscript and how it compares to the onset of the visually driven response? It would be surprising if the latency was much shorter than 70ms given previous measurements of BO and figure-ground modulation latency in V2 and V1. On the same note, it might be revealing to make laminar profiles of the modulation (i.e. preferred - non-preferred border orientation) as it develops over time. Does the modulation start in feedback recipient layers? 

      (2) Can the authors show the average time course of the response elicited by preferred and nonpreferred border ownership stimuli across all significant neurons? 

      We thank the reviewer for the insightful comment—this is indeed an important and often overlooked point. As noted in the Discussion, B<sub>own</sub> modulation differs from other forms of figure-ground modulation (e.g., Lamme et al., 1998) in that it can emerge very rapidly in early visual cortex—within ~10–35 ms after response onset (Zhou et al., 2000; Sugihara et al., 2011). This rapid emergence has been interpreted as evidence for the involvement of fast feedback inputs, which can propagate up to ten times faster than horizontal connections (Girard et al., 2001). Moreover, interlaminar interactions via monosynaptic or disynaptic connections can occur on very short timescales (a few milliseconds), further complicating efforts to disentangle feedback influences based solely on latency.

      Thus, while the early onset of modulation in our data may appear surprising, it is consistent with prior B<sub>own</sub> findings, and likely reflects a combination of fast feedback and rapid interlaminar processing. This makes it challenging to use conventional latency measurements to resolve laminar differences in B<sub>own</sub> modulation. Latency comparisons are well known to be susceptible to confounds such as variability in response onset, luminance, contrast, stimulus size, and other sensory parameters. 

      Although we did not explicitly quantify the latency of B<sub>own</sub> modulation in this manuscript, our cross-correlation analysis provides a more sensitive and temporally resolved measure of interlaminar information flow. We therefore focused on this approach rather than laminar modulation profiles, as it more directly addresses our primary research question.

      (3) The logic of assuming that cRF stimulation should produce the opposite signal flow to borderownership tuning stimuli is worth discussing. I suspect the key difference between stimuli is that they used drifting gratings as the cRF stimulus, the movement of the stimulus continually refreshes the retinal image, leading to continuous feedforward dominance of the signals in V1. Had they used a static grating, the spiking during the sustained portion of the response might also show more influence of feedback/horizontal connections. Do the initial spikes fired in response to the borderownership tuning stimuli show the feedforward pattern of responses? The authors state that they did not look at cross-correlations during the initial response, but if they do, do they see the feedforward-dominated pattern? The jitter CCH analysis might suffice in correcting for the response transient. 

      We thank the reviewer for the insightful comment. As noted in the final Results section, our CRF and nCRF stimulation paradigms differ in respects beyond the presence or absence of nonclassical modulation, including stimulus properties within the CRF.

      We agree with the reviewer’s speculation that drifting gratings may continually refresh the retinal image, promoting sustained feedforward dominance in V1, whereas static gratings might allow greater influence from feedback/horizontal inputs during the sustained response. Likewise, the initial response to the B<sub>own</sub> stimulus could be dominated by feedforward activity before feedback/horizontal influences arrive. 

      This contrast was a central motivation for our experimental design: we deliberately used two stimulus conditions — drifting gratings to emphasize feedforward processing, and B<sub>own</sub> stimuli, which are known to engage feedback modulation — to test whether these two conditions yield different patterns of interlaminar information flow. Our results confirm that they do. While we did not separately analyze the very initial spike period, our focus is on interlaminar information flow during the sustained response, which serves as the primary measure of feedback/horizontal engagement in this study.

      Finally, beyond this direct comparison, we show in Figure 5 that under nCRF stimulation alone, the direction and strength of interlaminar information flow correlate with the magnitude of B<sub>own</sub> modulation, further supporting the idea that our cross-correlation approach reveals functionally meaningful differences in cortical processing.

      (4) The term "nCRF stimulation" is not appropriate because the CRF is stimulated by the light/dark edge. 

      We thank the reviewer for the comment. As noted in the Introduction, nCRF effects described in the literature invariably involve stimulation both inside and outside the CRF. Our use of the term “nCRF stimulation” refers to this experimental paradigm, rather than suggesting that the CRF itself is unstimulated. We hope this clarifies our use of the term.

      Reviewer #3 (Public review): 

      Summary: 

      The paper by Zhu et al is on an important topic in visual neuroscience, the emergence in the visual cortex of signals about figures and ground. This topic also goes by the name border ownership. The paper utilizes modern recording techniques very skillfully to extend what is known about border ownership. It offers new evidence about the prevalence of border ownership signals across different cortical layers in V1 cortex. Also, it uses pairwise cross-correlation to study signal flow under different conditions of visual stimulation that include the border ownership paradigm. 

      Strengths: 

      The paper's strengths are its use of multi-electrode probes to study border ownership in many neurons simultaneously across the cortical layers in V1, and its innovation of using crosscorrelation between cortical neurons -- when they are viewing border-ownership patterns or instead are viewing grating patterns restricted to the classical receptive field (CRF). 

      Weaknesses: 

      The paper's weaknesses are its largely incremental approach to the study of border ownership and the lack of a critical analysis of the cross-correlation data. The paper as it is now does not advance our understanding of border ownership; it mainly confirms prior work, and it does not challenge or revise consensus beliefs about mechanisms. However, it is possible that, in the rich dataset the authors have obtained, they do possess data that could be added to the paper to make it much stronger. 

      Critique: 

      The border ownership data on V1 offered in the paper replicates experimental results obtained by Zhou and von der Heydt (2000) and confirms the earlier results using the same analysis methods as Zhou. The incremental addition is that the authors found border ownership in all cortical layers extending Zhou's results that were only about layer 2/3. 

      The cross-correlation results show that the pattern of the cross-correlogram (CCG) is influenced by the visual pattern being presented. However, the results are not analyzed mechanistically, and the interpretation is unclear. For instance, the authors show in Figure 3 (and in Figure S2) that the peak of the CCG can indicate layer 2/3 excites layer 4C when the visual stimulus is the border ownership test pattern, a large square 8 deg on a side. But how can layer 2/3 excite layer 4C? The authors do not raise or offer an answer to this question. Similar questions arise when considering the CCG of layer 4A/B with layer 2/3. What is the proposed pathway for layer 2/3 to excite 4A/B? Other similar questions arise for all the interlaminar CCG data that are presented. What known functional connections would account for the measured CCGs? 

      We thank the reviewer for raising this important point. As noted in our response to a previous comment, several anatomical pathways could mediate apparent functional inputs from layers 2/3 to 4C and 4A/B. In macaque V1, projections from layers 2/3 to 4A/B have been documented (Blasdel et al., 1985; Callaway and Wiser, 1996), and neurons in 4A/B often extend apical dendrites into layers 2/3 (Lund, 1988; Yoshioka et al., 1994). Although direct projections from layers 2/3 to 4C are generally sparse (Callaway, 1998), a subset of lower layer 3 neurons can give off collateral axons to 4C (Lund and Yoshioka, 1991). Some 4C neurons also extend dendrites into 4B, potentially allowing dendritic integration of inputs from more superficial layers (Somogyi and Cowey, 1981; Mates and Lund, 1983; Yabuta and Callaway, 1998). Sparse connections from 2/3 to layer 4 have also been reported in cat V1 (Binzegger et al., 2004).

      Moreover, layers 2/3 may influence 4C neurons disynaptically, without requiring dense monosynaptic connections. While CCGs suggest possible circuit arrangements, functional connectivity may arise through mechanisms not fully captured by anatomical tracing, and apparent discrepancies between anatomical and functional data are not uncommon. For example, although 4B is known to receive anatomical input primarily from 4Cα, 4B neurons can also be functionally driven by 4Cβ using photostimulation (Sawatari and Callaway, 1996). Our observation of functional inputs from layers 2/3 to layer 4 is also consistent with prior findings in rodent V1, where CCG analysis (e.g., Figure 7 in Senzai, Fernandez-Ruiz and Buzsaki, 2019) or photostimulation (Xu et al., 2016) revealed similar pathways. 

      Layers 5/6 also provide dense projections to layers 4A/B (Lund, 1988; Callaway, 1998). In particular, layer 6 pyramidal neurons, especially the subset classified as Type 1 cells, project substantially to layer 4C (Wiser and Callaway, 1996; Fitzpatrick et al., 1985). 

      We have revised the Discussion section to explicitly address these points and clarify the potential anatomical and functional pathways underlying the measured interlaminar CCGs, highlighting how inputs from layers 2/3 and 5/6 to layer 4 can be mediated via both direct and indirect connections.

      The problems in understanding the CCG data are indirectly caused by the lack of a critical analysis of what is happening in the responses that reveal the border ownership signals, as in Figure 2. Let's put it bluntly - are border ownership signals excitatory or inhibitory? The reason I raise this question is that the present authors insightfully place border ownership as examples of the action of the non-classical receptive field (nCRF) of cortical cells. Most previous work on the nCRF (many papers cited by the authors) reveal the nCRF to be inhibitory or suppressive. In order to know whether nCRF signals are excitatory or inhibitory, one needs a baseline response from the CRF, so that when you introduce nCRF signals you can tell whether the change with respect to the CRF is up or down. As far as I know, prior work on border ownership has not addressed this question, and the present paper doesn't either. This is where the rich dataset that the present authors possess might be used to establish a fundamental property of border ownership. 

      Then we must go back to consider what the consequences of knowing the sign of the border ownership signal would mean for interpreting the CCG data. If the border ownership signals from extrastriate feedback or, alternatively, from horizontal intrinsic connections, are excitatory, they might provide a shared excitatory input to pairs of cells that would show up in the CCG as a peak at 0 delay. However, if the border ownership manuscript signals are inhibitory, they might work by exciting only inhibitory neurons in V1. This could have complicated consequences for the CCG.The interpretation of the CCG data in the present version of the m is unclear (see above). Perhaps a clearer interpretation could be developed once the authors know better what the border ownership signals are. 

      We thank the reviewer for raising this fundamental and thought-provoking question. As noted, B<sub>own</sub> signals arise from nCRF, which has often been associated with suppressive effects. However, Zhang and von der Heydt (2010) provided important insight into this issue by systematically varying the placement of figure fragments outside the CRF while keeping an edge centered within the CRF. They found that contextual fragments on the preferred side of B<sub>own</sub> produce facilitation, while those on the non-preferred side produce suppression. Thus, the nCRF contribution to B<sub>own</sub> reflects both excitatory and inhibitory modulation, depending on the spatial configuration of the figure.

      These effects were well explained by their model in which feedback from grouping cells in higher areas selectively enhances or suppresses V1/V2 neuron responses, depending on their B<sub>own</sub> preference. In this framework, the B<sub>own</sub> signal itself is not inherently excitatory or inhibitory; rather, it results from the net effect of feedback, which can be either facilitative or suppressive. Importantly, it is the input that is modulated — not that the receiving neurons are necessarily inhibitory themselves.

      In the current study, our analysis focused on CCGs showing excessive coincident spiking, i.e., positive peaks, which are typically interpreted as evidence for shared excitatory input or excitatory connections. Due to the limited number of connections, we did not analyze inhibitory interactions, such as anti-correlations or delayed suppression in the CCGs, which would be expected if the reference neuron were inhibitory. Therefore, the CCGs we report here likely reflect the excitatory component of the B<sub>own</sub> signal, and possibly its upstream drive via feedback. While a full separation of excitatory and inhibitory components remains an important goal for future work, our data suggest that B<sub>own</sub> modulation is at least partially mediated through excitatory feedback input.

      My critique of the CCG analysis applies to Figure 5 also. I cannot comprehend the point of showing a very weak correlation of CCG asymmetry with Border Ownership Index, especially when what CCG asymmetry means is unclear mechanistically. Figure 5 does not make the paper stronger in my opinion. 

      We thank the reviewer for this comment. As described in the Results section for Figure 5, the observation that interlaminar information flow correlates with B<sub>own</sub> modulation is important because it demonstrates that these flow patterns are specifically related to the magnitude of B<sub>own</sub> signals, independent of the comparisons between CRF and nCRF stimulation. 

      In Figure 3, the authors show two CCGs that involve 4C--4C pairs. It would be nice to know more about such pairs. If there are any 6--6 pairs, what they look like also would be interesting. The authors also in Figure 3 show CCG's of two 4C--4A/B pairs and it would be quite interesting to know how such CCGs behave when CRF and nCRF stimuli are compared. In other words, the authors have shown us they have many data but have chosen not to analyze them further or to explain why they chose not to analyze them. It might help the paper if the authors would present all the CCG types they have. This suggestion would be helpful when the authors know more about the sign of border ownership signals, as discussed at length above. 

      We thank the reviewer for the insightful comment. The rationale for selecting specific laminar pairs is described in the Results section after Figure 3C and further discussed in the Discussion. In brief, we focused on CCGs computed from pairs in which one neuron resided in laminar compartments receiving feedback/horizontal inputs (layers 2/3 and 5/6) and the other within compartments relatively devoid of these inputs (layers 4C and 4A/B).

      To mitigate uncertainty in defining exact laminar boundaries and to maximize statistical power, we combined some anatomical layers into distinct laminar compartments. This approach allowed us to compare the relative spike timing between neuronal pairs during CRF and nCRF stimulation. If feedback/horizontal inputs contribute more during nCRF than CRF stimulation, we expect this to be reflected in the lead-lag relationships of the CCGs. While other pairs (e.g., 5/6–5/6 or 4C– 4A/B) could in principle be analyzed, the hypothesized patterns for these pairs are less clear, and thus they were not the focus of our study. Nonetheless, these additional pairs represent interesting directions for future work.

    1. Scaling

      The set of questions on a survey cannot be considered a scale unless a scaling process was followed to identify the questions and determine how the responses would be combined. So, just because a set of questions on a survey looks like a scale, it collects data using the same response scale, and it is even analyzed like a scale, it isn’t a real scale unless some type of scaling process was used to create it.

    1. La paix sans les palestiniens est impossible ?

      Parce qu'elle le serait avec eux ?

      J'aime bien les prisonniers capturés par Israël pour "servir de monnaie d'échange". Dont des assassins du pogrome...

      Sinon Hakim a bien compris... Il s'agit bien de continuer de péter la gueule aux assassins islamistes. Malheur aux vaincus.

      L'incapacité des palestiniens à se gérer eux mêmes est ainsi actée. Et c'est très bien.

      La résolution 242 ne reconnait pas l'existence d'Israël mais pas non plus celle de la Palestine, qui n'est pas mentionnée... Tout ce qui s'est basé depuis dessus fut dans les faits abandonné.

      Qui parle au nom des palestiniens ? Personne sauf des corrompus et des assassins.

      La conclusion reste bien qu'il n'y a qu'un seul Etat.

    1. Author response:

      The following is the authors’ response to the original reviews

      We thank all the reviewers for their constructive comments. We have carefully considered your feedback and revised the manuscript accordingly. The major concern raised was the applicability of SegPore to the RNA004 dataset. To address this, we compared SegPore with f5c and Uncalled4 on RNA004, and found that SegPore demonstrated improved performance, as shown in Table 2 of the revised manuscript.

      Following the reviewers’ recommendations, we updated Figures 3 and 4. Additionally, we added one table and three supplementary figures to the revised manuscript:

      · Table 2: Segmentation benchmark on RNA004 data

      · Supplementary Figure S4: RNA translocation hypothesis illustrated on RNA004 data

      · Supplementary Figure S5: Illustration of Nanopolish raw signal segmentation with eventalign results

      · Supplementary Figure S6: Running time of SegPore on datasets of varying sizes

      Below, we provide a point-by-point response to your comments.

      Reviewer #1 (Public review):

      Summary:

      In this manuscript, the authors describe a new computational method (SegPore), which segments the raw signal from nanopore-direct RNA-Seq data to improve the identification of RNA modifications. In addition to signal segmentation, SegPore includes a Gaussian Mixture Model approach to differentiate modified and unmodified bases. SegPore uses Nanopolish to define a first segmentation, which is then refined into base and transition blocks. SegPore also includes a modification prediction model that is included in the output. The authors evaluate the segmentation in comparison to Nanopolish and Tombo, and they evaluate the impact on m6A RNA modification detection using data with known m6A sites. In comparison to existing methods, SegPore appears to improve the ability to detect m6A, suggesting that this approach could be used to improve the analysis of direct RNA-Seq data.

      Strengths:

      SegPore addresses an important problem (signal data segmentation). By refining the signal into transition and base blocks, noise appears to be reduced, leading to improved m6A identification at the site level as well as for single-read predictions. The authors provide a fully documented implementation, including a GPU version that reduces run time. The authors provide a detailed methods description, and the approach to refine segments appears to be new.

      Weaknesses:

      In addition to Nanopolish and Tombo, f5c and Uncalled4 can also be used for segmentation, however, the comparison to these methods is not shown.

      The method was only applied to data from the RNA002 direct RNA-Sequencing version, which is not available anymore, currently, it remains unclear if the methods still work on RNA004.

      Thank you for your comments.

      To clarify the background, there are two kits for Nanopore direct RNA sequencing: RNA002 (the older version) and RNA004 (the newer version). Oxford Nanopore Technologies (ONT) introduced the RNA004 kit in early 2024 and has since discontinued RNA002. Consequently, most public datasets are based on RNA002, with relatively few available for RNA004 (as of 30 June 2025).

      Nanopolish and Tombo were developed for raw signal segmentation and alignment using RNA002 data, whereas f5c and Uncalled4are the only two software supporting RNA004 data.  Since the development of SegPore began in January 2022, we initially focused on RNA002 due to its data availability. Accordingly, our original comparisons were made against Nanopolish and Tombo using RNA002 data.

      We have now updated SegPore to support RNA004 and compared its performance against f5c and Uncalled4 on three public RNA004 datasets.

      As shown in Table 2 of the revised manuscript, SegPore outperforms both f5c and Uncalled4 in raw signal segmentation. Moreover, the jiggling translocation hypothesis underlying SegPore is further supported, as shown in Supplementary Figure S4.

      The overall improvement in accuracy appears to be relatively small.

      Thank you for the comment.

      We understand that the improvements shown in Tables 1 and 2 may appear modest at first glance due to the small differences in the reported standard deviation (std) values. However, even small absolute changes in std can correspond to substantial relative reductions in noise, especially when the total variance is low.

      To better quantify the improvement, we assume that approximately 20% of the std for Nanopolish, Tombo, f5c, and Uncalled4 arises from noise. Using this assumption, we calculate the relative noise reduction rate of SegPore as follows:

      Noise reduction rate = (baseline std − SegPore std) / (0.2 × baseline std) ​​

      Based on this formula, the average noise reduction rates across all datasets are:

      - SegPore vs Nanopolish: 49.52%

      - SegPore vs Tombo: 167.80%

      - SegPore vs f5c: 9.44%

      - SegPore vs Uncalled4: 136.70%

      These results demonstrate that SegPore can reduce the noise level by at least 9% given a noise level of 20%, which we consider a meaningful improvement for downstream tasks, such as base modification detection and signal interpretation. The high noise reduction rates observed in Tombo and Uncalled4 (over 100%) suggest that their actual noise proportion may be higher than our 20% assumption.

      We acknowledge that this 20% noise level assumption is an approximation. Our intention is to illustrate that SegPore provides measurable improvements in relative terms, even when absolute differences appear small.

      The run time and resources that are required to run SegPore are not shown, however, it appears that the GPU version is essential, which could limit the application of this method in practice.

      Thank you for your comment.

      Detailed instructions for running SegPore are provided in github (https://github.com/guangzhaocs/SegPore). Regarding computational resources, SegPore currently requires one CPU core and one Nvidia GPU to perform the segmentation task efficiently.

      We present SegPore’s runtime for typical datasets in Supplementary Figure S6 in the revised manuscript.  For a typical 1 GB fast5 file, the segmentation takes approximately 9.4 hours using a single NVIDIA DGX‑1 V100 GPU and one CPU core.

      Currently, GPU acceleration is essential to achieve practical runtimes with SegPore. We acknowledge that this requirement may limit accessibility in some environments. To address this, we are actively working on a full C++ implementation of SegPore that will support CPU-only execution. While development is ongoing, we aim to release this version in a future update.

      Reviewer #2 (Public review):

      Summary:

      The work seeks to improve the detection of RNA m6A modifications using Nanopore sequencing through improvements in raw data analysis. These improvements are said to be in the segmentation of the raw data, although the work appears to position the alignment of raw data to the reference sequence and some further processing as part of the segmentation, and result statistics are mostly shown on the 'data-assigned-to-kmer' level.

      As such, the title, abstract, and introduction stating the improvement of just the 'segmentation' does not seem to match the work the manuscript actually presents, as the wording seems a bit too limited for the work involved.

      The work itself shows minor improvements in m6Anet when replacing Nanopolish eventalign with this new approach, but clear improvements in the distributions of data assigned per kmer. However, these assignments were improved well enough to enable m6A calling from them directly, both at site-level and at read-level.

      Strengths:

      A large part of the improvements shown appear to stem from the addition of extra, non-base/kmer specific, states in the segmentation/assignment of the raw data, removing a significant portion of what can be considered technical noise for further analysis. Previous methods enforced the assignment of all raw data, forcing a technically optimal alignment that may lead to suboptimal results in downstream processing as data points could be assigned to neighbouring kmers instead, while random noise that is assigned to the correct kmer may also lead to errors in modification detection.

      For an optimal alignment between the raw signal and the reference sequence, this approach may yield improvements for downstream processing using other tools.<br /> Additionally, the GMM used for calling the m6A modifications provides a useful, simple, and understandable logic to explain the reason a modification was called, as opposed to the black models that are nowadays often employed for these types of tasks.

      Weaknesses:

      The work seems limited in applicability largely due to the focus on the R9's 5mer models. The R9 flow cells are phased out and not available to buy anymore. Instead, the R10 flow cells with larger kmer models are the new standard, and the applicability of this tool on such data is not shown. We may expect similar behaviour from the raw sequencing data where the noise and transition states are still helpful, but the increased kmer size introduces a large amount of extra computing required to process data and without knowledge of how SegPore scales, it is difficult to tell how useful it will really be. The discussion suggests possible accuracy improvements moving to 7mers or 9mers, but no reason why this was not attempted.

      Thank you for pointing out this important limitation. Please refer to our response to Point 1 of Reviewer 1 for SegPore’s performance on RNA004 data. Notably, the jiggling behavior is also observed in RNA004 data, and SegPore achieves better performance than both f5c and Uncalled4.

      The increased k-mer size in RNA004 affects only the training phase of SegPore (refer to Supplementary Note 1, Figure 5 for details on the training and testing phases). Once the baseline means and standard deviations for each k-mer are established, applying SegPore to RNA004 data proceeds similarly to RNA002. This is because each k-mer in the reference sequence has, at most, two states (modified and unmodified). While the larger k-mer size increases the size of the parameter table, it does not increase the computational complexity during segmentation. Although estimating the initial k-mer parameter table requires significant time and effort on our part, it does not affect the runtime for end users applying SegPore to RNA004 data.

      Extending SegPore from 5-mers to 7-mers or 9-mers for RNA002 data would require substantial effort to retrain the model and generate sufficient training data. Additionally, such an extension would make SegPore’s output incompatible with widely used upstream and downstream tools such as Nanopolish and m6Anet, complicating integration and comparison. For these reasons, we leave this extension for future work.

      The manuscript suggests the eventalign results are improved compared to Nanopolish. While this is believably shown to be true (Table 1), the effect on the use case presented, downstream differentiation between modified and unmodified status on a base/kmer, is likely limited as during actual modification calling the noisy distributions are usually 'good enough', and not skewed significantly in one direction to really affect the results too terribly.

      Thank you for your comment. While current state-of-the-art (SOTA) methods perform well on benchmark datasets, there remains significant room for improvement. Most SOTA evaluations are based on limited datasets, primarily covering DRACH motifs in human and mouse transcriptomes. However, m6A modifications can also occur in non-DRACH motifs, where current models may underperform. Additionally, other RNA modifications—such as pseudouridine, inosine, and m5C—are less studied, and their detection may benefit from improved signal modeling.

      We would also like to emphasize that raw signal segmentation and RNA modification detection are distinct tasks. SegPore focuses on the former, providing a cleaner, more interpretable signal that can serve as a foundation for downstream tasks. Improved segmentation may facilitate the development of more accurate RNA modification detection algorithms by the community.

      Scientific progress often builds incrementally through targeted improvements to foundational components. We believe that enhancing signal segmentation, as SegPore does, contributes meaningfully to the broader field—the full impact will become clearer as the tool is adopted into more complex workflows.

      Furthermore, looking at alternative approaches where this kind of segmentation could be applied, Nanopolish uses the main segmentation+alignment for a first alignment and follows up with a form of targeted local realignment/HMM test for modification calling (and for training too), decreasing the need for the near-perfect segmentation+alignment this work attempts to provide. Any tool applying a similar strategy probably largely negates the problems this manuscript aims to improve upon.

      We thank the reviewer for this insightful comment.

      To clarify, Nanopolish provides three independent commands: polya, eventalign, and call-methylation.

      - The polya command identifies the adapter, poly(A) tail, and transcript region in the raw signal.

      - The eventalign command aligns the raw signal to a reference sequence, assigning a signal segment to individual k-mers in the reference.

      - The call-methylation command detects methylated bases from DNA sequencing data.

      The eventalign command corresponds to “the main segmentation+alignment for a first alignment,” while call-methylation corresponds to “a form of targeted local realignment/HMM test for modification calling,” as mentioned in the reviewer’s comment. SegPore’s segmentation is similar in purpose to Nanopolish’s eventalign, while its RNA modification estimation component is similar in concept to Nanopolish’s call-methylation.

      We agree the general idea may appear similar, but the implementations are entirely different. Importantly, Nanopolish’s call-methylation is designed for DNA sequencing data, and its models are not trained to recognize RNA modifications. This means they address distinct research questions and cannot be directly compared on the same RNA modification estimation task. However, it is valid to compare them on the segmentation task, where SegPore exhibits better performance (Table 1).

      We infer the reviewer may suggest that because m6Anet is a deep neural network capable of learning from noisy input, the benefit of more accurate segmentation (such as that provided by SegPore) might be limited. This concern may arise from the limited improvement of SegPore+m6Anet over Nanopolish+m6Anet in bulk analysis (Figure 3). Several factors may contribute to this observation:

      (i) For reads aligned to the same gene in the in vivo data, alignment may be inaccurate due to pseudogenes or transcript isoforms.

      (ii) The in vivo benchmark data are inherently more complex than in vitro datasets and may contain additional modifications (e.g., m5C, m7G), which can confound m6A calling by altering the signal baselines of k-mers.

      (iii) m6Anet is trained on events produced by Nanopolish and may not be optimal for SegPore-derived events.

      (iv) The benchmark dataset lacks a modification-free (IVT) control sample, making it difficult to establish a true baseline for each k-mer.

      In the IVT data (Figure 4), SegPore shows a clear improvement in single-molecule m6A identification, with a 3~4% gain in both ROC-AUC and PR-AUC. This demonstrates SegPore’s practical benefit for applications requiring higher sensitivity at the molecule level.

      As noted earlier, SegPore’s contribution lies in denoising and improving the accuracy of raw signal segmentation, which is a foundational step in many downstream analyses. While it may not yet lead to a dramatic improvement in all applications, it already provides valuable insights into the sequencing process (e.g., cleaner signal profiles in Figure 4) and enables measurable gains in modification detection at the single-read level. We believe SegPore lays the groundwork for developing more accurate and generalizable RNA modification detection tools beyond m6A.

      We have also added the following sentence in the discussion to highlight SegPore’s limited performance in bulk analysis:

      “The limited improvement of SegPore combined with m6Anet over Nanopolish+m6Anet in bulk in vivo analysis (Figure 3) may be explained by several factors: potential alignment inaccuracies due to pseudogenes or transcript isoforms, the complexity of in vivo datasets containing additional RNA modifications (e.g., m5C, m7G) affecting signal baselines, and the fact that m6Anet is specifically trained on events produced by Nanopolish rather than SegPore. Additionally, the lack of a modification-free control (in vitro transcribed) sample in the benchmark dataset makes it difficult to establish true baselines for each k-mer. Despite these limitations, SegPore demonstrates clear improvement in single-molecule m6A identification in IVT data (Figure 4), suggesting it is particularly well suited for in vitro transcription data analysis.”

      Finally, in the segmentation/alignment comparison to Nanopolish, the latter was not fitted(/trained) on the same data but appears to use the pre-trained model it comes with. For the sake of comparing segmentation/alignment quality directly, fitting Nanopolish on the same data used for SegPore could remove the influences of using different training datasets and focus on differences stemming from the algorithm itself.

      In the segmentation benchmark (Table 1), SegPore uses the fixed 5-mer parameter table provided by ONT. The hyperparameters of the HHMM are also fixed and not estimated from the raw signal data being segmented. Only in the m6A modification task,  SegPore does perform re-estimation of the baselines for the modified and unmodified states of k-mers. Therefore, the comparison with Nanopolish is fair, as both tools rely on pre-defined models during segmentation.

      Appraisal:

      The authors have shown their method's ability to identify noise in the raw signal and remove their values from the segmentation and alignment, reducing its influences for further analyses. Figures directly comparing the values per kmer do show a visibly improved assignment of raw data per kmer. As a replacement for Nanopolish eventalign it seems to have a rather limited, but improved effect, on m6Anet results. At the single read level modification modification calling this work does appear to improve upon CHEUI.

      Impact:

      With the current developments for Nanopore-based modification largely focusing on Artificial Intelligence, Neural Networks, and the like, improvements made in interpretable approaches provide an important alternative that enables a deeper understanding of the data rather than providing a tool that plainly answers the question of whether a base is modified or not, without further explanation. The work presented is best viewed in the context of a workflow where one aims to get an optimal alignment between raw signal data and the reference base sequence for further processing. For example, as presented, as a possible replacement for Nanopolish eventalign. Here it might enable data exploration and downstream modification calling without the need for local realignments or other approaches that re-consider the distribution of raw data around the target motif, such as a 'local' Hidden Markov Model or Neural Networks. These possibilities are useful for a deeper understanding of the data and further tool development for modification detection works beyond m6A calling.

      Reviewer #3 (Public review):

      Summary:

      Nucleotide modifications are important regulators of biological function, however, until recently, their study has been limited by the availability of appropriate analytical methods. Oxford Nanopore direct RNA sequencing preserves nucleotide modifications, permitting their study, however, many different nucleotide modifications lack an available base-caller to accurately identify them. Furthermore, existing tools are computationally intensive, and their results can be difficult to interpret.

      Cheng et al. present SegPore, a method designed to improve the segmentation of direct RNA sequencing data and boost the accuracy of modified base detection.

      Strengths:

      This method is well-described and has been benchmarked against a range of publicly available base callers that have been designed to detect modified nucleotides.

      Weaknesses:

      However, the manuscript has a significant drawback in its current version. The most recent nanopore RNA base callers can distinguish between different ribonucleotide modifications, however, SegPore has not been benchmarked against these models.

      I recommend that re-submission of the manuscript that includes benchmarking against the rna004_130bps_hac@v5.1.0 and rna004_130bps_sup@v5.1.0 dorado models, which are reported to detect m5C, m6A_DRACH, inosine_m6A and PseU.<br /> A clear demonstration that SegPore also outperforms the newer RNA base caller models will confirm the utility of this method.

      Thank you for highlighting this important limitation. While Dorado, the new ONT basecaller, is publicly available and supports modification-aware basecalling, suitable public datasets for benchmarking m5C, inosine, m6A, and PseU detection on RNA004 are currently lacking. Dorado’s modification-aware models are trained on ONT’s internal data, which is not publicly released. Therefore, it is not currently feasible to evaluate or directly compare SegPore’s performance against Dorado for m5C, inosine, m6A, and PseU detection.

      We would also like to emphasize that SegPore’s main contribution lies in raw signal segmentation, which is an upstream task in the RNA modification detection pipeline. To assess its performance in this context, we benchmarked SegPore against f5c and Uncalled4 on public RNA004 datasets for segmentation quality. Please refer to our response to Point 1 of Reviewer 1 for details.

      Our results show that the characteristic “jiggling” behavior is also observed in RNA004 data (Supplementary Figure S4), and SegPore achieves better segmentation performance than both f5c and Uncalled4 (Table 2).

      Recommendations for the authors:

      Reviewing Editor:

      Please note that we also received the following comments on the submission, which we encourage you to take into account:

      took a look at the work and for what I saw it only mentions/uses RNA002 chemistry, which is deprecated, effectively making this software unusable by anyone any more, as RNA002 is not commercially available. While the results seem promising, the authors need to show that it would work for RNA004. Notably, there is an alternative software for resquiggling for RNA004 (not Tombo or Nanopolish, but the GPU-accelerated version of Nanopolish (f5C), which does support RNA004. Therefore, they need to show that SegPore works for RNA004, because otherwise it is pointless to see that this method works better than others if it does not support current sequencing chemistries and only works for deprecated chemistries, and people will keep using f5C because its the only one that currently works for RNA004. Alternatively, if there would be biological insights won from the method, one could justify not implementing it in RNA004, but in this case, RNA002 is deprecated since March 2024, and the paper is purely methodological.

      Thank you for the comment. We agree that support for current sequencing chemistries is essential for practical utility. While SegPore was initially developed and benchmarked on RNA002 due to the availability of public data, we have now extended SegPore to support RNA004 chemistry.

      To address this concern, we performed a benchmark comparison using public RNA004 datasets against tools specifically designed for RNA004, including f5c and Uncalled4. Please refer to our response to Point 1 of Reviewer 1 for details. The results show that SegPore consistently outperforms f5c and Uncalled4 in segmentation accuracy on RNA004 data.

      Reviewer #2 (Recommendations for the authors):

      Various statements are made throughout the text that require further explanation, which might actually be defined in more detail elsewhere sometimes but are simply hard to find in the current form.

      (1) Page 2, “In this technique, five nucleotides (5mers) reside in the nanopore at a time, and each 5mer generates a characteristic current signal based on its unique sequence and chemical properties (16).”

      5mer? Still on R9 or just ignoring longer range influences, relevant? It is indeed a R9.4 model from ONT.

      Thank you for the observation. We apologize for the confusion and have clarified the relevant paragraph to indicate that the method is developed for RNA002 data by default. Specifically, we have added the following sentence:

      “Two versions of the direct RNA sequencing (DRS) kits are available: RNA002 and RNA004. Unless otherwise specified, this study focuses on RNA002 data.”

      (2) Page 3, “Employ models like Hidden Markov Models (HMM) to segment the signal, but they are prone to noise and inaccuracies.”

      That's the alignment/calling part, not the segmentation?

      Thank you for the comment. We apologize for the confusion. To clarify the distinction between segmentation and alignment, we added a new paragraph before the one in question to explain the general workflow of Nanopore DRS data analysis and to clearly define the task of segmentation. The added text reads:

      “The general workflow of Nanopore direct RNA sequencing (DRS) data analysis is as follows. First, the raw electrical signal from a read is basecalled using tools such as Guppy or Dorado, which produce the nucleotide sequence of the RNA molecule. However, these basecalled sequences do not include the precise start and end positions of each ribonucleotide (or k-mer) in the signal. Because basecalling errors are common, the sequences are typically mapped to a reference genome or transcriptome using minimap2 to recover the correct reference sequence. Next, tools such as Nanopolish and Tombo align the raw signal to the reference sequence to determine which portion of the signal corresponds to each k-mer. We define this process as the segmentation task, referred to as "eventalign" in Nanopolish. Based on this alignment, Nanopolish extracts various features—such as the start and end positions, mean, and standard deviation of the signal segment corresponding to a k-mer. This signal segment or its derived features is referred to as an "event" in Nanopolish.”

      We also revised the following paragraph describing SegPore to more clearly contrast its approach:

      “In SegPore, we first segment the raw signal into small fragments using a Hierarchical Hidden Markov Model (HHMM), where each fragment corresponds to a sub-state of a k-mer. Unlike Nanopolish and Tombo, which directly align the raw signal to the reference sequence, SegPore aligns the mean values of these small fragments to the reference. After alignment, we concatenate all fragments that map to the same k-mer into a larger segment, analogous to the "eventalign" output in Nanopolish. For RNA modification estimation, we use only the mean signal value of each reconstructed event.”

      We hope this revision clarifies the difference between segmentation and alignment in the context of our method and resolves the reviewer’s concern.

      (3) Page 4, Figure 1, “These segments are then aligned with the 5mer list of the reference sequence fragment using a full/partial alignment algorithm, based on a 5mer parameter table. For example, 𝐴𝑗 denotes the base "A" at the j-th position on the reference.”

      I think I do understand the meaning, but I do not understand the relevance of the Aj bit in the last sentence. What is it used for?

      When aligning the segments (output from Step 2) to the reference sequence in Step 3, it is possible for multiple segments to align to the same k-mer. This can occur particularly when the reference contains consecutive identical bases, such as multiple adenines (A). For example, as shown in Fig. 1A, Step 3, the first two segments (μ₁ and μ₂) are aligned to the first 'A' in the reference sequence, while the third segment is aligned to the second 'A'. In this case, the reference sequence AACTGGTTTC...GTC, which contains exactly two consecutive 'A's at the start. This notation helps to disambiguate segment alignment in regions with repeated bases.

      Additionally, this figure and its subscript include mapping with Guppy and Minimap2 but do not mention Nanopolish at all, while that seems an equally important step in the preprocessing (pg5). As such it is difficult to understand the role Nanopolish exactly plays. It's also not mentioned explicitly in the SegPore Workflow on pg15, perhaps it's part of step 1 there?

      We thank the reviewer for pointing this out. We apologize for the confusion. As mentioned in the public response to point 3 of Reviewer 2, SegPore uses Nanopolish to identify the poly(A) tail and transcript regions from the raw signal. SegPore then performs segmentation and alignment on the transcript portion only. This step is indeed part of Step 1 in the preprocessing workflow, as described in Supplementary Note 1, Section 3.

      To clarify this in the main text, we have updated the preprocessing paragraph on page 6 to explicitly describe the role of Nanopolish:

      “We begin by performing basecalling on the input fast5 file using Guppy, which converts the raw signal data into ribonucleotide sequences. Next, we align the basecalled sequences to the reference genome using Minimap2, generating a mapping between the reads and the reference sequences. Nanopolish provides two independent commands: "polya" and "eventalign".
The "polya" command identifies the adapter, poly(A) tail, and transcript region in the raw signal, which we refer to as the poly(A) detection results. The raw signal segment corresponding to the poly(A) tail is used to standardize the raw signal for each read. The "eventalign" command aligns the raw signal to a reference sequence, assigning a signal segment to individual k-mers in the reference. It also computes summary statistics (e.g., mean, standard deviation) from the signal segment for each k-mer. Each k-mer together with its corresponding signal features is termed an event. These event features are then passed into downstream tools such as m6Anet and CHEUI for RNA modification detection. For full transcriptome analysis (Figure 3), we extract the aligned raw signal segment and reference sequence segment from Nanopolish's events for each read by using the first and last events as start and end points. For in vitro transcription (IVT) data with a known reference sequence (Figure 4), we extract the raw signal segment corresponding to the transcript region for each input read based on Nanopolish’s poly(A) detection results.”

      Additionally, we revised the legend of Figure 1A to explicitly include Nanopolish in step 1 as follows:

      “The raw current signal fragments are paired with the corresponding reference RNA sequence fragments using Nanopolish.”

      (4) Page 5, “The output of Step 3 is the "eventalign," which is analogous to the output generated by the Nanopolish "eventalign" command.”

      Naming the function of Nanopolish, the output file, and later on (pg9) the alignment of the newly introduced methods the exact same "eventalign" is very confusing.

      Thank you for the helpful comment. We acknowledge the potential confusion caused by using the term “eventalign” in multiple contexts. To improve clarity, we now consistently use the term “events” to refer to the output of both Nanopolish and SegPore, rather than using "eventalign" as a noun. We also added the following sentence to Step 3 (page 6) to clearly define what an “event” refers to in our manuscript:

      “An "event" refers to a segment of the raw signal that is aligned to a specific k-mer on a read, along with its associated features such as start and end positions, mean current, standard deviation, and other relevant statistics.”

      We have revised the text throughout the manuscript accordingly to reduce ambiguity and ensure consistent terminology.

      (5) Page 5, “Once aligned, we use Nanopolish's eventalign to obtain paired raw current signal segments and the corresponding fragments of the reference sequence, providing a precise association between the raw signals and the nucleotide sequence.”

      I thought the new method's HHMM was supposed to output an 'eventalign' formatted file. As this is not clearly mentioned elsewhere, is this a mistake in writing? Is this workflow dependent on Nanopolish 'eventalign' function and output or not?

      We apologize for the confusion. To clarify, SegPore is not dependent on Nanopolish’s eventalign function for generating the final segmentation results. As described in our response to your comment point 2 and elaborated in the revised text on page 4, SegPore uses its own HHMM-based segmentation model to divide the raw signal into small fragments, each corresponding to a sub-state of a k-mer. These fragments are then aligned to the reference sequence based on their mean current values.

      As explained in the revised manuscript:

      “In SegPore, we first segment the raw signal into small fragments using a Hierarchical Hidden Markov Model (HHMM), where each fragment corresponds to a sub-state of a k-mer. Unlike Nanopolish and Tombo, which directly align the raw signal to the reference sequence, SegPore aligns the mean values of these small fragments to the reference. After alignment, we concatenate all fragments that map to the same k-mer into a larger segment, analogous to the "eventalign" output in Nanopolish. For RNA modification estimation, we use only the mean signal value of each reconstructed event.”

      To avoid ambiguity, we have also revised the sentence on page 5 to more clearly distinguish the roles of Nanopolish and SegPore in the workflow. The updated sentence now reads:

      “Nanopolish provides two independent commands: "polya" and "eventalign".
The "polya" command identifies the adapter, poly(A) tail, and transcript region in the raw signal, which we refer to as the poly(A) detection results. The raw signal segment corresponding to the poly(A) tail is used to standardize the raw signal for each read. The "eventalign" command aligns the raw signal to a reference sequence, assigning a signal segment to individual k-mers in the reference. It also computes summary statistics (e.g., mean, standard deviation) from the signal segment for each k-mer. Each k-mer together with its corresponding signal features is termed an event. These event features are then passed into downstream tools such as m6Anet and CHEUI for RNA modification detection. For full transcriptome analysis (Figure 3), we extract the aligned raw signal segment and reference sequence segment from Nanopolish's events for each read by using the first and last events as start and end points. For in vitro transcription (IVT) data with a known reference sequence (Figure 4), we extract the raw signal segment corresponding to the transcript region for each input read based on Nanopolish’s poly(A) detection results.”

      (6) Page 5, “Since the polyA tail provides a stable reference, we normalize the raw current signals across reads, ensuring that the mean and standard deviation of the polyA tail are consistent across all reads.”

      Perhaps I misread this statement: I interpret it as using the PolyA tail to do the normalization, rather than using the rest of the signal to do the normalization, and that results in consistent PolyA tails across all reads.

      If it's the latter, this should be clarified, and a little detail on how the normalization is done should be added, but if my first interpretation is correct:

      I'm not sure if its standard deviation is consistent across reads. The (true) value spread in this section of a read should be fairly limited compared to the rest of the signal in the read, so the noise would influence the scale quite quickly, and such noise might be introduced to pores wearing down and other technical influences. Is this really better than using the non-PolyA tail part of the reads signal, using Median Absolute Deviation to scale for a first alignment round, then re-fitting the signal scaling using Theil Sen on the resulting alignments (assigned read signal vs reference expected signal), as Tombo/Nanopolish (can) do?

      Additionally, this kind of normalization should have been part of the Nanopolish eventalign already, can this not be re-used? If it's done differently it may result in different distributions than the ONT kmer table obtained for the next step.

      Thank you for this detailed and thoughtful comment. We apologize for the confusion. The poly(A) tail–based normalization is indeed explained in Supplementary Note 1, Section 3, but we agree that the motivation needed to be clarified in the main text.

      We have now added the following sentence in the revised manuscript (before the original statement on page 5 to provide clearer context:

      “Due to inherent variability between nanopores in the sequencing device, the baseline levels and standard deviations of k-mer signals can differ across reads, even for the same transcript. To standardize the signal for downstream analyses, we extract the raw current signal segments corresponding to the poly(A) tail of each read. Since the poly(A) tail provides a stable reference, we normalize the raw current signals across reads, ensuring that the mean and standard deviation of the poly(A) tail are consistent across all reads. This step is crucial for reducing…..”

      We chose to use the poly(A) tail for normalization because it is sequence-invariant—i.e., all poly(A) tails consist of identical k-mers, unlike transcript sequences which vary in composition. In contrast, using the transcript region for normalization can introduce biases: for instance, reads with more diverse k-mers (having inherently broader signal distributions) would be forced to match the variance of reads with more uniform k-mers, potentially distorting the baseline across k-mers.

      In our newly added RNA004 benchmark experiment, we used the default normalization provided by f5c, which does not include poly(A) tail normalization. Despite this, SegPore was still able to mask out noise and outperform both f5c and Uncalled4, demonstrating that our segmentation method is robust to different normalization strategies.

      (7) Page 7, “The initialization of the 5mer parameter table is a critical step in SegPore's workflow. By leveraging ONT's established kmer models, we ensure that the initial estimates for unmodified 5mers are grounded in empirical data.”

      It looks like the method uses Nanopolish for a first alignment, then improves the segmentation matching the reference sequence/expected 5mer values. I thought the Nanopolish model/tables are based on the same data, or similarly obtained. If they are different, then why the switch of kmer model? Now the original alignment may have been based on other values, and thus the alignment may seem off with the expected kmer values of this table.

      Thank you for this insightful question. To clarify, SegPore uses Nanopolish only to identify the poly(A) tail and transcript regions from the raw signal. In the bulk in vivo data analysis, we use Nanopolish’s first event as the start and the last event as the end to extract the aligned raw signal chunk and its corresponding reference sequence. Since SegPore relies on Nanopolish solely to delineate the transcript region for each read, it independently aligns the raw signals to the reference sequence without refining or adjusting Nanopolish’s segmentation results.

      While SegPore's 5-mer parameter table is initially seeded using ONT’s published unmodified k-mer models, we acknowledge that empirical signal values may deviate from these reference models due to run-specific technical variation and the presence of RNA modifications. For this reason, SegPore includes a parameter re-estimation step to refine the mean and standard deviation values of each k-mer based on the current dataset.

      The re-estimation process consists of two layers. In the outer layer, we select a set of 5mers that exhibit both modified and unmodified states based on the GMM results (Section 6 of Supplementary Note 1), while the remaining 5mers are assumed to have only unmodified states. In the inner layer, we align the raw signals to the reference sequences using the 5mer parameter table estimated in the outer layer (Section 5 of Supplementary Note 1). Based on the alignment results, we update the 5mer parameter table in the outer layer. This two-layer process is generally repeated for 3~5 iterations until the 5mer parameter table converges.This re-estimation ensures that:

      (1) The adjusted 5mer signal baselines remain close to the ONT reference (for consistency);

      (2) The alignment score between the observed signal and the reference sequence is optimized (as detailed in Equation 11, Section 5 of Supplementary Note 1);

      (3) Only 5mers that show a clear difference between the modified and unmodified components in the GMM are considered subject to modification.

      By doing so, SegPore achieves more accurate signal alignment independent of Nanopolish’s models, and the alignment is directly tuned to the data under analysis.

      (8) Page 9, “The output of the alignment algorithm is an eventalign, which pairs the base blocks with the 5mers from the reference sequence for each read (Fig. 1C).”

      “Modification prediction

      After obtaining the eventalign results, we estimate the modification state of each motif using the 5mer parameter table.”

      This wording seems to have been introduced on page 5 but (also there) reads a bit confusingly as the name of the output format, file, and function are now named the exact same "eventalign". I assume the obtained eventalign results now refer to the output of your HHMM, and not the original Nanopolish eventalign results, based on context only, but I'd rather have a clear naming that enables more differentiation.

      We apologize for the confusion. We have revised the sentence as follows for clarity:

      “A detailed description of both alignment algorithms is provided in Supplementary Note 1. The output of the alignment algorithm is an alignment that pairs the base blocks with the 5mers from the reference sequence for each read (Fig. 1C). Base blocks aligned to the same 5-mer are concatenated into a single raw signal segment (referred to as an “event”), from which various features—such as start and end positions, mean current, and standard deviation—are extracted. Detailed derivation of the mean and standard deviation is provided in Section 5.3 in Supplementary Note 1. In the remainder of this paper, we refer to these resulting events as the output of eventalign analysis or the segmentation task. ”

      (9) Page 9, “Since a single 5mer can be aligned with multiple base blocks, we merge all aligned base blocks by calculating a weighted mean. This weighted mean represents the single base block mean aligned with the given 5mer, allowing us to estimate the modification state for each site of a read.”

      I assume the weights depend on the length of the segment but I don't think it is explicitly stated while it should be.

      Thank you for the helpful observation. To improve clarity, we have moved this explanation to the last paragraph of the previous section (see response to point 8), where we describe the segmentation process in more detail.

      Additionally, a complete explanation of how the weighted mean is computed is provided in Section 5.3 of Supplementary Note 1. It is derived from signal points that are assigned to a given 5mer.

      (10) Page 10, “Afterward, we manually adjust the 5mer parameter table using heuristics to ensure that the modified 5mer distribution is significantly distinct from the unmodified distribution.”

      Using what heuristics? If this is explained in the supplementary notes then please refer to the exact section.

      Thank you for pointing this out. The heuristics used to manually adjust the 5mer parameter table are indeed explained in detail in Section 7 of Supplementary Note 1.

      To clarify this in the manuscript, we have revised the sentence as follows:

      “Afterward, we manually adjust the 5mer parameter table using heuristics to ensure that the modified 5mer distribution is significantly distinct from the unmodified distribution (see details in Section 7 of Supplementary Note 1).”

      (11) Page 10, “Once the table is fixed, it is used for RNA modification estimation in the test data without further updates.”

      By what tool/algorithm? Perhaps it is your own implementation, but with the next section going into segmentation benchmarking and using Nanopolish before this seems undefined.

      Thank you for pointing this out. We use our own implementation. See Algorithm 3 in Section 6 of Supplementary Note 1.

      We have revised the sentence for clarity:

      “Once a stabilized 5mer parameter table is estimated from the training data, it is used for RNA modification estimation in the test data without further updates. A more detailed description of the GMM re-estimation process is provided in Section 6 of Supplementary Note 1.”

      (12) Page 11, “A 5mer was considered significantly modified if its read coverage exceeded 1,500 and the distance between the means of the two Gaussian components in the GMM was greater than 5.”

      Considering the scaling done before also not being very detailed in what range to expect, this cutoff doesn't provide any useful information. Is this a pA value?

      Thank you for the observation. Yes, the value refers to the current difference measured in picoamperes (pA). To clarify this, we have revised the sentence in the manuscript to include the unit explicitly:

      “A 5mer was considered significantly modified if its read coverage exceeded 1,500 and the distance between the means of the two Gaussian components in the GMM was greater than 5 picoamperes (pA).”

      (13) Page 13, “The raw current signals, as shown in Figure 1B.”

      Wrong figure? Figure 2B seems logical.

      Thank you for catching this. You are correct—the reference should be to Figure 2B, not Figure 1B. We have corrected this in the revised manuscript.

      (14) Page 14, Figure 2A, these figures supposedly support the jiggle hypothesis but the examples seem to match only half the explanation. Any of these jiggles seem to be followed shortly by another in the opposite direction, and the amplitude seems to match better within each such pair than the next or previous segments. Perhaps there is a better explanation still, and this behaviour can be modelled as such instead.

      Thank you for your comment. We acknowledge that the observed signal patterns may appear ambiguous and could potentially suggest alternative explanations. However, as shown in Figure 2A, the red dots tend to align closely with the baseline of the previous state, while the blue dots align more closely with the baseline of the next state. We interpret this as evidence for the "jiggling" hypothesis, where k-mer temporarily oscillates between adjacent states during translocation.

      That said, we agree that more sophisticated models could be explored to better capture this behavior, and we welcome suggestions or references to alternative models. We will consider this direction in future work.

      (15) Page 15, “This occurs because subtle transitions within a base block may be mistaken for transitions between blocks, leading to inflated transition counts.”

      Is it really a "subtle transition" if it happens within a base block? It seems this is not a transition and thus shouldn't be named as such.

      Thank you for pointing this out. We agree that the term “subtle transition” may be misleading in this context. We revised the sentence to clarify the potential underlying cause of the inflated transition counts:

      “This may be due to a base block actually corresponding to a sub-state of a single 5mer, rather than each base block corresponding to a full 5mer, leading to inflated transition counts. To address this issue, SegPore’s alignment algorithm was refined to merge multiple base blocks (which may represent sub-states of the same 5mer) into a single 5mer, thereby facilitating further analysis.”

      (16) Page 15, “The SegPore "eventalign" output is similar to Nanopolish's "eventalign" command.”

      To the output of that command, I presume, not to the command itself.

      Thank you for pointing out the ambiguity. We have revised the sentence for clarity:

      “The final outputs of SegPore are the events and modification state predictions. SegPore’s events are similar to the outputs of Nanopolish’s "eventalign" command, in that they pair raw current signal segments with the corresponding RNA reference 5-mers. Each 5-mer is associated with various features — such as start and end positions, mean current, and standard deviation — derived from the paired signal segment.”

      (17) Page 15, “For selected 5mers, SegPore also provides the modification rate for each site and the modification state of that site on individual reads.”

      What selection? Just all kmers with a possible modified base or a more specific subset?

      We revised the sentence to clarify the selection criteria:

      “For selected 5mers that exhibit both a clearly unmodified and a clearly modified signal component, SegPore reports the modification rate at each site, as well as the modification state of that site on individual reads.”

      (18) Page 16, “A key component of SegPore is the 5mer parameter table, which specifies the mean and standard deviation for each 5mer in both modified and unmodified states (Figure 2A).”

      Wrong figure?

      Thank you for pointing this out. You are correct—it should be Figure 1A, not Figure 2A. We intended to visually illustrate the structure of the 5mer parameter table in Figure 1A, and we have corrected this reference in the revised manuscript.

      (19) Page 16, Table 1, I can't quite tell but I assume this is based on all kmers in the table, not just a m6A modified subset. A short added statement to make this clearer would help.

      Yes, you are right—it is averaged over all 5mers. We have revised the sentence for clarity as follows:

      " As shown in Table 1, SegPore consistently achieved the best performance averaged on all 5mers across all datasets..…."

      (20) Page 16, “Since the peaks (representing modified and unmodified states) are separable for only a subset of 5mers, SegPore can provide modification parameters for these specific 5mers. For other 5mers, modification state predictions are unavailable.”

      Can this be improved using some heuristics rather than the 'distance of 5' cutoff as described before? How small or big is this subset, compared to how many there should be to cover all cases?

      We agree that more sophisticated strategies could potentially improve performance. In this study, we adopted a relatively conservative approach to minimize false positives by using a heuristic cutoff of 5 picoamperes. This value was selected empirically and we did not explore alternative cutoffs. Future work could investigate more refined or data-driven thresholding strategies.

      (21) Page 16, “Tombo used the "resquiggle" method to segment the raw signals, and we standardized the segments using the polyA tail to ensure a fair comparison.”

      I don't know what or how something is "standardized" here.

      Standardized’ refers to the poly(A) tail–based signal normalization described in our response to point 6. We applied this normalization to Tombo’s output to ensure a fair comparison across methods. Without this standardization, Tombo’s performance was notably worse. We revised the sentence as follows:

      “Tombo used the "resquiggle" method to segment the raw signals, and we standardized the segments using the poly(A) tail to ensure a fair comparison (See preprocessing section in Materials and Methods).”

      (22) Page 16, “To benchmark segmentation performance, we used two key metrics: (1) the log-likelihood of the segment mean, which measures how closely the segment matches ONT's 5mer parameter table (used as ground truth), and (2) the standard deviation (std) of the segment, where a lower std indicates reduced noise and better segmentation quality. If the raw signal segment aligns correctly with the corresponding 5mer, its mean should closely match ONT's reference, yielding a high log-likelihood. A lower std of the segment reflects less noise and better performance overall.”

      Here the segmentation part becomes a bit odd:

      A: Low std can be/is achieved by dropping any noisy bits, making segments really small (partly what happens here with the transition segments). This may be 'true' here, in the sense that the transition is not really part of the segment, but the comparison table is a bit meaningless as the other tools forcibly assign all data to kmers, instead of ignoring parts as transition states. In other words, it is a benchmark that is easy to cheat by assigning more data to noise/transition states.

      B: The values shown are influenced by the alignment made between the read and expected reference signal. Especially Tombo tends to forcibly assign data to whatever looks the most similar nearby rather than providing the correct alignment. So the "benchmark of the segmentation performance" is more of an "overall benchmark of the raw signal alignment". Which is still a good, useful thing, but the text seems to suggest something else.

      Thank you for raising these important concerns regarding the segmentation benchmarking.

      Regarding point A, the base blocks aligned to the same 5mer are concatenated into a single segment, including the short transition blocks between them. These transition blocks are typically very short (4~10 signal points, average 6 points), while a typical 5mer segment contains around 20~60 signal points. To assess whether SegPore’s performance is inflated by excluding transition segments, we conducted an additional comparison: we removed 6 boundary signal points (3 from the start and 3 from the end) from each 5mer segment in Nanopolish and Tombo’s results to reduce potential noise. The new comparison table is shown in the following:

      SegPore consistently demonstrates superior performance. Its key contribution lies in its ability to recognize structured noise in the raw signal and to derive more accurate mean and standard deviation values that more faithfully represent the true state of the k-mer in the pore. The improved mean estimates are evidenced by the clearly separated peaks of modified and unmodified 5mers in Figures 3A and 4B, while the improved standard deviation is reflected in the segmentation benchmark experiments.

      Regarding point B, we apologize for the confusion. We have added a new paragraph to the introduction to clarify that the segmentation task indeed includes the alignment step.

      “The general workflow of Nanopore direct RNA sequencing (DRS) data analysis is as follows. First, the raw electrical signal from a read is basecalled using tools such as Guppy or Dorado, which produce the nucleotide sequence of the RNA molecule. However, these basecalled sequences do not include the precise start and end positions of each ribonucleotide (or k-mer) in the signal. Because basecalling errors are common, the sequences are typically mapped to a reference genome or transcriptome using minimap2 to recover the correct reference sequence. Next, tools such as Nanopolish and Tombo align the raw signal to the reference sequence to determine which portion of the signal corresponds to each k-mer. We define this process as the segmentation task, referred to as "eventalign" in Nanopolish. Based on this alignment, Nanopolish extracts various features—such as the start and end positions, mean, and standard deviation of the signal segment corresponding to a k-mer. This signal segment or its derived features is referred to as an "event" in Nanopolish. The resulting events serve as input for downstream RNA modification detection tools such as m6Anet and CHEUI.”

      (23) Page 17 “Given the comparable methods and input data requirements, we benchmarked SegPore against several baseline tools, including Tombo, MINES (26), Nanom6A (27), m6Anet, Epinano (28), and CHEUI (29).”

      It seems m6Anet is actually Nanopolish+m6Anet in Figure 3C, this needs a minor clarification here.

      m6Anet uses Nanopolish’s estimated events as input by default.

      (24) Page 18, Figure 3, A and B are figures without any indication of what is on the axis and from the text I believe the position next to each other on the x-axis rather than overlapping is meaningless, while their spread is relevant, as we're looking at the distribution of raw values for this 5mer. The figure as is is rather confusing.

      Thanks for pointing out the confusion. We have added concrete values to the axes in Figures 3A and 3B and revised the figure legend as follows in the manuscript:

      “(A) Histogram of the estimated mean from current signals mapped to an example m6A-modified genomic location (chr10:128548315, GGACT) across all reads in the training data, comparing Nanopolish (left) and SegPore (right). The x-axis represents current in picoamperes (pA).

      (B) Histogram of the estimated mean from current signals mapped to the GGACT motif at all annotated m6A-modified genomic locations in the training data, again comparing Nanopolish (left) and SegPore (right). The x-axis represents current in picoamperes (pA).”

      (25) Page 18 “SegPore's results show a more pronounced bimodal distribution in the raw signal segment mean, indicating clearer separation of modified and unmodified signals.”

      Without knowing the correct values around the target kmer (like Figure 4B), just the more defined bimodal distribution could also indicate the (wrongful) assignment of neighbouring kmer values to this kmer instead, hence this statement lacks some needed support, this is just one interpretation of the possible reasons.

      Thank you for the comment. We have added concrete values to Figures 3A and 3B to support this point. Both peaks fall within a reasonable range: the unmodified peak (125 pA) is approximately 1.17 pA away from its reference value of 123.83 pA, and the modified peak (118 pA) is around 7 pA away from the unmodified peak. This shift is consistent with expected signal changes due to RNA modifications (usually less than 10 pA), and the magnitude of the difference suggests that the observed bimodality is more likely caused by true modification events rather than misalignment.

      (26) Page 18 “Furthermore, when pooling all reads mapped to m6A-modified locations at the GGACT motif, SegPore showed prominent peaks (Fig. 3B), suggesting reduced noise and improved modification detection.”

      I don't think the prominent peaks directly suggest improved detection, this statement is a tad overreaching.

      We revised the sentense to the following:

      “SegPore exhibited more distinct peaks (Fig. 3B), indicating reduced noise and potentially enabling more reliable modification detection”.

      (27) Page18 “(2) direct m6A predictions from SegPore's Gaussian Mixture Model (GMM), which is limited to the six selected 5mers.”

      The 'six selected' refers to what exactly? Also, 'why' this is limited to them is also unclear as it is, and it probably would become clearer if it is clearly defined what this refers to.

      It is explained the page 16 in the SegPore’s workflow in the original manuscript as follows:

      “A key component of SegPore is the 5mer parameter table, which specifies the mean and standard deviation for each 5mer in both modified and unmodified states (Fig. 2A1A). Since the peaks (representing modified and unmodified states) are separable for only a subset of 5mers, SegPore can provide modification parameters for these specific 5mers. For other 5mers, modification state predictions are unavailable.”

      e select a small set of 5mers that show clear peaks (modified and unmodified 5mers) in GMM in the m6A site-level data analysis. These 5mers are provided in Supplementary Fig. S2C, as explained in the section “m6A site level benchmark” in the Material and Methods (page 12 in the original manuscript).

      “…transcript locations into genomic coordinates. It is important to note that the 5mer parameter table was not re-estimated for the test data. Instead, modification states for each read were directly estimated using the fixed 5mer parameter table. Due to the differences between human (Supplementary Fig. S2A) and mouse (Supplementary Fig. S2B), only six 5mers were found to have m6A annotations in the test data’s ground truth (Supplementary Fig. S2C). For a genomic location to be identified as a true m6A modification site, it had to correspond to one of these six common 5mers and have a read coverage of greater than 20. SegPore derived the ROC and PR curves for benchmarking based on the modification rate at each genomic location….”

      We have updated the sentence as follows to increase clarity:

      “which is limited to the six selected 5mers that exhibit clearly separable modified and unmodified components in the GMM (see Materials and Methods for details).”

      (28) Page 19, Figure 4C, the blue 'Unmapped' needs further explanation. If this means the segmentation+alignment resulted in simply not assigning any segment to a kmer, this would indicate issues in the resulting mapping between raw data and kmers as the data that probably belonged to this kmer is likely mapped to a neighbouring kmer, possibly introducing a bimodal distribution there.

      This is due to deletion event in the full alignment algorithm. See Page 8 of SupplementaryNote1:

      During the traceback step of the dynamic programming matrix, not every 5mer in the reference sequence is assigned a corresponding raw signal fragment—particularly when the signal’s mean deviates substantially from the expected mean of that 5mer. In such cases, the algorithm considers the segment to be generated by an unknown 5mer, and the corresponding reference 5mer is marked as unmapped.

      (29) Page 19, “For six selected m6A motifs, SegPore achieved an ROC AUC of 82.7% and a PR AUC of 38.7%, earning the third-best performance compared with deep leaning methods m6Anet and CHEUI (Fig. 3D).”

      How was this selection of motifs made, are these related to the six 5mers in the middle of Supplementary Figure S2? Are these the same six as on page 18? This is not clear to me.

      It is the same, see the response to point 27.

      (30) Page 21 “Biclustering reveals that modifications at the 6th, 7th, and 8th genomic locations are specific to certain clusters of reads (clusters 4, 5, and 6), while the first five genomic locations show similar modification patterns across all reads.”

      This reads rather confusingly. Both the '6th, 7th, and 8th genomic locations' and 'clusters 4,5,6' should be referred to in clearer terms. Either mark them in the figure as such or name them in the text by something that directly matches the text in the figure.

      We have added labels to the clusters and genomic locations Figure 4C, and revised the sentence as follows:

      “Biclustering reveals that modifications at g6 are specific to cluster C4, g7 to cluster C5, and g8 to cluster C6, while the first five genomic locations (g1 to g5) show similar modification patterns across all reads.”

      (31) Page 21, “We developed a segmentation algorithm that leverages the jiggling property in the physical process of DRS, resulting in cleaner current signals for m6A identification at both the site and single-molecule levels.”

      Leverages, or just 'takes into account'?

      We designed our HHMM specifically based on the jiggling hypothesis, so we believe that using the term “leverage” is appropriate.

      (32) Page 21, “Our results show that m6Anet achieves superior performance, driven by SegPore's enhanced segmentation.”

      Superior in what way? It barely improves over Nanopolish in Figure 3C and is outperformed by other methods in Figure 3D. The segmentation may have improved but this statement says something is 'superior' driven by that 'enhanced segmentation', so that cannot refer to the segmentation itself.

      We revise it as follows in the revised manuscript:

      ”Our results demonstrate that SegPore’s segmentation enables clear differentiation between m6A-modified and unmodified adenosines.”

      (33) Page 21, “In SegPore, we assume a drastic change between two consecutive 5mers, which may hold for 5mers with large difference in their current baselines but may not hold for those with small difference.”

      The implications of this assumption don't seem highlighted enough in the work itself and may be cause for falsely discovering bi-modal distributions. What happens if such a 5mer isn't properly split, is there no recovery algorithm later on to resolve these cases?

      We agree that there is a risk of misalignment, which can result in a falsely observed bimodal distribution. This is a known and largely unavoidable issue across all methods, including deep neural network–based methods. For example, many of these models rely on a CTC (Connectionist Temporal Classification) layer, which implicitly performs alignment and may also suffer from similar issues.

      Misalignment is more likely when the current baselines of neighboring k-mers are close. In such cases, the model may struggle to confidently distinguish between adjacent k-mers, increasing the chance that signals from neighboring k-mers are incorrectly assigned. Accurate baseline estimation for each k-mer is therefore critical—when baselines are accurate, the correct alignment typically corresponds to the maximum likelihood.

      We have added the following sentence to the discussion to acknowledge this limitation:

      “As with other RNA modification estimation methods, SegPore can be affected by misalignment errors, particularly when the baseline signals of adjacent k-mers are similar. These cases may lead to spurious bimodal signal distributions and require careful interpretation.”

      (34) Page 21, “Currently, SegPore models only the modification state of the central nucleotide within the 5mer. However, modifications at other positions may also affect the signal, as shown in Figure 4B. Therefore, introducing multiple states to the 5mer could help to improve the performance of the model.”

      The meaning of this statement is unclear to me. Is SegPore unable to combine the information of overlapping kmers around a possibly modified base (central nucleotide), or is this referring to having multiple possible modifications in a single kmer (multiple states)?

      We mean there can be modifications at multiple positions of a single 5mer, e.g. C m5C m6A m7G T. We have revised the sentence to:

      “Therefore, introducing multiple states for a 5mer to accout for modifications at mutliple positions within the same 5mer could help to improve the performance of the model.”

      (35) Page 22, “This causes a problem when apply DNN-based methods to new dataset without short read sequencing-based ground truth. Human could not confidently judge if a predicted m6A modification is a real m6A modification.”

      Grammatical errors in both these sentences. For the 'Human could not' part, is this referring to a single person's attempt or more extensively tested?

      Thanks for the comment. We have revised the sentence as follows:

      “This poses a challenge when applying DNN-based methods to new datasets without short-read sequencing-based ground truth. In such cases, it is difficult for researchers to confidently determine whether a predicted m6A modification is genuine (see Supplmentary Figure S5).”

      (36) Page 22, “…which is easier for human to interpret if a predicted m6A site is real.”

      "a" human, but also this probably meant to say 'whether' instead of 'if', or 'makes it easier'.

      Thanks for the advice. We have revise the sentence as follows:

      “One can generally observe a clear difference in the intensity levels between 5mers with an m6A and those with a normal adenosine, which makes it easier for a researcher to interpret whether a predicted m6A site is genuine.”

      (37) Page 22, “…and noise reduction through its GMM-based approach…”

      Is the GMM providing noise reduction or segmentation?

      Yes, we agree that it is not relevant. We have removed the sentence in the revised manuscript as follows:

      “Although SegPore provides clear interpretability and noise reduction through its GMM-based approach, there is potential to explore DNN-based models that can directly leverage SegPore's segmentation results.”

      (38) Page 23, “SegPore effectively reduces noise in the raw signal, leading to improved m6A identification at both site and single-molecule levels…”

      Without further explanation in what sense this is meant, 'reduces noise' seems to overreach the abilities, and looks more like 'masking out'.

      Following the reviewer’s suggestion, we change it to ‘mask out'’ in the revised manuscript.

      “SegPore effectively masks out noise in the raw signal, leading to improved m6A identification at both site and single-molecule levels.”

      Reviewer #3 (Recommendations for the authors):

      I recommend the publication of this manuscript, provided that the following comments (and the comments above) are addressed.

      In general, the authors state that SegPore represents an improvement on existing software. These statements are largely unquantified, which erodes their credibility. I have specified several of these in the Minor comments section.

      Page 5, Preprocessing: The authors comment that the poly(A) tail provides a stable reference that is crucial for the normalisation of all reads. How would this step handle reads that have variable poly(A) tail lengths? Or have interrupted poly(A) tails (e.g. in the case of mRNA vaccines that employ a linker sequence)?

      We apologize for the confusion. The poly(A) tail–based normalization is explained in Supplementary Note 1, Section 3.

      As shown in Author response image 1 below, the poly(A) tail produces a characteristic signal pattern—a relatively flat, squiggly horizontal line. Due to variability between nanopores, raw current signals often exhibit baseline shifts and scaling of standard deviations. This means that the signal may be shifted up or down along the y-axis and stretched or compressed in scale.

      Author response image 1.

      The normalization remains robust with variable poly(A) tail lengths, as long as the poly(A) region is sufficiently long. The linker sequence will be assigned to the adapter part rather than the poly(A) part.

      To improve clarity in the revised manuscript, we have added the following explanation:

      “Due to inherent variability between nanopores in the sequencing device, the baseline levels and standard deviations of k-mer signals can differ across reads, even for the same transcript. To standardize the signal for downstream analyses, we extract the raw current signal segments corresponding to the poly(A) tail of each read. Since the poly(A) tail provides a stable reference, we normalize the raw current signals across reads, ensuring that the mean and standard deviation of the poly(A) tail are consistent across all reads. This step is crucial for reducing…..”

      We chose to use the poly(A) tail for normalization because it is sequence-invariant—i.e., all poly(A) tails consist of identical k-mers, unlike transcript sequences which vary in composition. In contrast, using the transcript region for normalization can introduce biases: for instance, reads with more diverse k-mers (having inherently broader signal distributions) would be forced to match the variance of reads with more uniform k-mers, potentially distorting the baseline across k-mers.

      Page 7, 5mer parameter table: r9.4_180mv_70bps_5mer_RNA is an older kmer model (>2 years). How does your method perform with the newer RNA kmer models that do permit the detection of multiple ribonucleotide modifications? Addressing this comment is crucial because it is feasible that SegPore will underperform in comparison to the newer RNA base caller models (requiring the use of RNA004 datasets).

      Thank you for highlighting this important point. For RNA004, we have updated SegPore to ensure compatibility with the latest kit. In our revised manuscript, we demonstrate that the translocation-based segmentation hypothesis remains valid for RNA004, as supported by new analyses presented in the supplementary Figure S4.

      Additionally, we performed a new benchmark with f5c and Uncalled4 in RNA004 data in the revised manuscript (Table 2), where SegPore exhibit a better performance than f5c and Uncalled4.

      We agree that benchmarking against the latest Dorado models—specifically rna004_130bps_hac@v5.1.0 and rna004_130bps_sup@v5.1.0, which include built-in modification detection capabilities—would provide valuable context for evaluating the utility of SegPore. However, generating a comprehensive k-mer parameter table for RNA004 requires a large, well-characterized dataset. At present, such data are limited in the public domain. Additionally, Dorado is developed by ONT and its internal training data have not been released, making direct comparisons difficult.

      Our current focus is on improving raw signal segmentation quality, which are upstream tasks critical to many downstream analyses, including RNA modification detection. Future work may include benchmarking SegPore against models like Dorado once appropriate data become available.

      The Methods and Results sections contain redundant information - please streamline the information in these sections and reduce the redundancy. For example, the benchmarking section may be better situated in the Results section.

      Following your advice, we have removed redundant texts about the Segmentation benchmark from Materials and Methods in the revised manuscript.

      Minor comments

      (1) Introduction

      Page 3: "By incorporating these dynamics into its segmentation algorithm...". Please provide an example of how motor protein dynamics can impact RNA translocation. In particular, please elaborate on why motor protein dynamics would impact the translocation of modified ribonucleotides differently to canonical ribonucleotides. This is provided in the results, but please also include details in the Introduction.

      Following your advice, we added one sentence to explain how the motor protein affect the translocation of the DNA/RNA molecule in the revised manuscript.

      “This observation is also supported by previous reports, in which the helicase (the motor protein) translocates the DNA strand through the nanopore in a back-and-forth manner. Depending on ATP or ADP binding, the motor protein may translocate the DNA/RNA forward or backward by 0.5-1 nucleotides.”

      As far as we understand, this translocation mechanism is not specific to modified or unmodified nucleotides. For further details, we refer the reviewer to the original studies cited.

      Page 3: "This lack of interpretability can be problematic when applying these methods to new datasets, as researchers may struggle to trust the predictions without a clear understanding of how the results were generated." Please provide details and citations as to why researchers would struggle to trust the predictions of m6Anet. Is it due to a lack of understanding of how the method works, or an empirically demonstrated lack of reliability?

      Thank you for pointing this out. The lack of interpretability in deep learning models such as m6Anet stems primarily from their “black-box” nature—they provide binary predictions (modified or unmodified) without offering clear reasoning or evidence for each call.

      When we examined the corresponding raw signals, we found it difficult to visually distinguish whether a signal segment originated from a modified or unmodified ribonucleotide. The difference is often too subtle to be judged reliably by a human observer. This is illustrated in the newly added Supplementary Figure S5, which shows Nanopolish-aligned raw signals for the central 5mer GGACT in Figure 4B, displayed both uncolored and colored by modification state (according to the ground truth).

      Although deep neural networks can learn subtle, high-dimensional patterns in the signal that may not be readily interpretable, this opacity makes it difficult for researchers to trust the predictions—especially in new datasets where no ground truth is available. The issue is not necessarily an empirically demonstrated lack of reliability, but rather a lack of transparency and interpretability.

      We have updated the manuscript accordingly and included Supplementary Figure S5 to illustrate the difficulty in interpreting signal differences between modified and unmodified states.

      Page 3: "Instead of relying on complex, opaque features...". Please provide evidence that the research community finds the figures generated by m6Anet to be difficult to interpret, or delete the sections relating to its perceived lack of usability.

      See the figure provided in the response to the previous point. We added a reference to this figure in the revised manuscript.

      “Instead of relying on complex, opaque features (see Supplementary Figure S5), SegPore leverages baseline current levels to distinguish between…..”

      (2) Materials and Methods

      Page 5, Preprocessing: "We begin by performing basecalling on the input fast5 file using Guppy, which converts the raw signal data into base sequences.". Please change "base" to ribonucleotide.

      Revised as requested.

      Page 5 and throughout, please refer to poly(A) tail, rather than polyA tail throughout.

      Revised as requested.

      Page 5, Signal segmentation via hierarchical Hidden Markov model: "...providing more precise estimates of the mean and variance for each base block, which are crucial for downstream analyses such as RNA modification prediction." Please specify which method your HHMM method improves upon.

      Thank you for the suggestion. Since this section does not include a direct comparison, we revised the sentence to avoid unsupported claims. The updated sentence now reads:

      "...providing more precise estimates of the mean and variance for each base block, which are crucial for downstream analyses such as RNA modification prediction."

      Page 10, GMM for 5mer parameter table re-estimation: "Typically, the process is repeated three to five times until the 5mer parameter table stabilizes." How is the stabilisation of the 5mer parameter table quantified? What is a reasonable cut-off that would demonstrate adequate stabilisation of the 5mer parameter table?

      Thank you for the comment. We assess the stabilization of the 5mer parameter table by monitoring the change in baseline values across iterations. If the absolute change in baseline values for all 5mers is less than 1e-5 between two consecutive iterations, we consider the estimation to have stabilized.

      Page 11, M6A site level benchmark: why were these datasets selected? Specifically, why compare human and mouse ribonuclotide modification profiles? Please provide a justification and a brief description of the experiments that these data were derived from, and why they are appropriate for benchmarking SegPore.

      Thank you for the comment. These data are taken from a previous benchmark studie about m6A estimation from RNA002 data in the literature (https://doi.org/10.1038/s41467-023-37596-5). We think the data are appropreciate here.

      Thank you for the comment. The datasets used were taken from a previous benchmark study on m6A estimation using RNA002 data (https://doi.org/10.1038/s41467-023-37596-5). These datasets include human and mouse transcriptomes and have been widely used to evaluate the performance of RNA modification detection tools. We selected them because (i) they are based on RNA002 chemistry, which matches the primary focus of our study, and (ii) they provide a well-characterized and consistent benchmark for assessing m6A detection performance. Therefore, we believe they are appropriate for validating SegPore.

      (3) Results

      Page 13, RNA translocation hypothesis: "The raw current signals, as shown in Fig. 1B...". Please check/correct figure reference - Figure 1B does not show raw current signals.

      Thank you for pointing this out. The correct reference should be Figure 2B. We have updated the figure citation accordingly in the revised manuscript.

      Page 19, m6A identification at the site level: "For six selected m6A motifs, SegPore achieved an ROC AUC of 82.7% and a PR AUC of 38.7%, earning the third best performance compared with deep leaning methods m6Anet and CHEUI (Fig. 3D)." SegPore performs third best of all deep learning methods. Do the authors recommend its use in conjunction with m6Anet for m6A detection? Please clarify in the text.

      This sentence aims to convey that SegPore alone can already achieve good performance. If interpretability is the primary goal, we recommend using SegPore on its own. However, if the objective is to identify more potential m6A sites, we suggest using the combined approach of SegPore and m6Anet. That said, we have chosen not to make explicit recommendations in the main text to avoid oversimplifying the decision or potentially misleading readers.

      Page 19, m6A identification at the single molecule level: "one transcribed with m6A and the other with normal adenosine". I assume that this should be adenine? Please replace adenosine with adenine throughout.

      Thank you for pointing this out. We have revised the sentence to use "adenine" where appropriate. In other instances, we retain "adenosine" when referring specifically to adenine bound to a ribose sugar, which we believe is suitable in those contexts.

      Page 19, m6A identification at the single molecule level: "We used 60% of the data for training and 40% for testing". How many reads were used for training and how many for testing? Please comment on why these are appropriate sizes for training and testing datasets.

      In total, there are 1.9 million reads, with 1.14 million used for training and 0.76 million  for testing (60% and 40%, respectively). We chose this split to ensure that the training set is sufficiently large to reliably estimate model parameters, while the test set remains substantial enough to robustly evaluate model performance. Although the ratio was selected somewhat arbitrarily, it balances the need for effective training with rigorous validation.

      (4) Discussion

      Page 21: "We believe that the de-noised current signals will be beneficial for other downstream tasks." Which tasks? Please list an example.

      We have revised the text for clarity as follows:

      “We believe that the de-noised current signals will be beneficial for other downstream tasks, such as the estimation of m5C, pseudouridine, and other RNA modifications.”

      Page 22: "One can generally observe a clear difference in the intensity levels between 5mers with a m6A and normal adenosine, which is easier for human to interpret if a predicted m6A site is real." This statement is vague and requires qualification. Please reference a study that demonstrates the human ability to interpret two similar graphs, and demonstrate how it relates to the differences observed in your data.

      We apologize for the confusion. We have revised the sentence as follows:

      “One can generally observe a clear difference in the intensity levels between 5mers with an m6A and those with a normal adenosine, which makes it easier for a researcher to interpret whether a predicted m6A site is genuine.”

      We believe that Figures 3A, 3B, and 4B effectively illustrate this concept.

      Page 23: How long does SegPore take for its analyses compared to other similar tools? How long would it take to analyse a typical dataset?

      We have added run-time statistics for datasets of varying sizes in the revised manuscript (see Supplementary Figure S6). This figure illustrates SegPore’s performance across different data volumes to help estimate typical processing times.

      (5) Figures

      Figure 4C. Please number the hierachical clusters and genomic locations in this figure. They are referenced in the text.

      Following your suggestion, we have labeled the hierarchical clusters and genomic locations in Figure 4C in the revised manuscript.

      In addition, we revised the corresponding sentence in the main text as follows: “Biclustering reveals that modifications at g6 are specific to cluster C4, g7 to cluster C5, and g8 to cluster C6, while the first five genomic locations (g1 to g5) show similar modification patterns across all reads.”