10,000 Matching Annotations
  1. Apr 2025
    1. 除环

      你的定义基本正确,但需要一些补充和修正以确保准确性。以下是除环(Division Ring)的详细解释:


      除环的正式定义

      除环(或称体、斜域)是一个环 \( R \),满足以下条件: 1. 加法结构:<br /> \((R, +)\) 是一个 阿贝尔群(交换群),即满足: - 封闭性、结合律、交换律、加法单位元(零元 \( 0 \))、加法逆元。

      1. 乘法结构:<br /> \((R \setminus {0}, \cdot)\) 构成一个 (即非零元对乘法封闭且满足群公理):
      2. 封闭性:\( a \cdot b \in R \setminus {0} \),\( \forall a, b \in R \setminus {0} \)。
      3. 结合律:\( a \cdot (b \cdot c) = (a \cdot b) \cdot c \),\( \forall a, b, c \in R \setminus {0} \)。
      4. 单位元:存在 \( 1 \in R \setminus {0} \),使得 \( a \cdot 1 = 1 \cdot a = a \),\( \forall a \in R \setminus {0} \)。
      5. 逆元:每个非零元 \( a \in R \setminus {0} \) 存在唯一逆元 \( a^{-1} \in R \setminus {0} \),满足: [ a \cdot a^{-1} = a^{-1} \cdot a = 1. ]

      6. 分配律:<br /> 乘法对加法满足分配律:

      7. \( a \cdot (b + c) = a \cdot b + a \cdot c \),
      8. \( (a + b) \cdot c = a \cdot c + b \cdot c \),<br /> \( \forall a, b, c \in R \)。

      关键补充说明

      1. 乘法结构的澄清

      • 乘法幺半群 vs. 乘法群
      • 整个环 \( R \) 的乘法结构 \((R, \cdot)\) 是一个 含幺半群(存在单位元 \( 1 \),但零元 \( 0 \) 没有逆元)。
      • 非零元子集 \((R \setminus {0}, \cdot)\) 构成一个 (即非零元可逆且满足群公理)。

      2. 与域的关系

      • 域是交换的除环:<br /> 若除环的乘法满足交换律(\( a \cdot b = b \cdot a \)),则它是一个 (Field)。
      • 例子:有理数域 \( \mathbb{Q} \)、实数域 \( \mathbb{R} \)、复数域 \( \mathbb{C} \)。

      3. 非交换性

      • 除环的乘法 不一定交换,因此非交换的除环也称为 斜域(Skew Field)。
      • 经典例子:四元数环 \( \mathbb{H} \)(非交换)。

      4. 韦德伯恩定理

      • 有限除环必为域:<br /> 根据 Wedderburn 小定理,任何有限除环的乘法必交换,因此是有限域。
      • 例子:有限域 \( \mathbb{Z}_p \)(\( p \) 为素数)。

      5. 除环的性质

      • 无零因子:<br /> 若 \( a \cdot b = 0 \),则 \( a = 0 \) 或 \( b = 0 \)。
      • 证明:若 \( a \neq 0 \),则 \( a \) 可逆,故 \( b = a^{-1} \cdot 0 = 0 \)。

      • 单环:<br /> 除环的唯一双边理想是 \( {0} \) 和 \( R \) 本身,因此是 单环(Simple Ring)。


      6. 例子与反例

      • 经典例子
      • 四元数环 \( \mathbb{H} \):非交换除环,元素形如 \( a + bi + cj + dk \),乘法不交换。
      • 有限域 \( \mathbb{Z}_p \):既是域,也是交换除环。
      • 实数域 \( \mathbb{R} \):交换除环,同时也是域。

      • 非除环的例子

      • 整数环 \( \mathbb{Z} \):非零元如 \( 2 \) 无逆元,故非除环。
      • 矩阵环 \( M_n(\mathbb{R}) \)(\( n \geq 2 \)):存在非零元(如对角矩阵 \( \text{diag}(1, 0, \dots, 0) \))无逆元。

      7. 重要定理

      • Wedderburn 小定理:<br /> 有限除环的乘法必交换,即有限除环必为域。
      • 应用:有限域的构造与分类依赖此定理。

      • Schur 引理:<br /> 若 \( S \) 是环 \( R \) 的单模,则其内同态环是除环。


      总结

      除环是“非零元可逆的含幺环”,其核心特征是:<br /> 1. 非零元构成乘法群;<br /> 2. 乘法不一定交换;<br /> 3. 无零因子;<br /> 4. 有限除环必为域。

      若需进一步探讨具体例子或定理证明,请随时提问!

    2. 含幺环

      含幺环(Unitary Ring)的详细解释

      根据你的定义和知识库中的信息,含幺环的定义是正确的,但需要进一步明确和补充以下关键点:


      1. 含幺环的正式定义

      含幺环(也称幺环、含单位元的环)是一个环 \( R \),满足以下条件: 1. 加法结构:<br /> \((R, +)\) 是一个 阿贝尔群(交换群),即满足: - 封闭性、结合律、交换律、加法单位元(零元 \( 0 \))、加法逆元。

      1. 乘法结构:<br /> \((R, \cdot)\) 是一个 含幺半群(即乘法半群存在单位元),即满足:
      2. 封闭性、结合律、存在乘法单位元 \( 1 \)(满足 \( a \cdot 1 = 1 \cdot a = a \) 对所有 \( a \in R \))。

      3. 分配律:<br /> 乘法对加法满足分配律:

      4. 左分配律:\( a \cdot (b + c) = a \cdot b + a \cdot c \),
      5. 右分配律:\( (a + b) \cdot c = a \cdot c + b \cdot c \)。

      2. 关键补充说明

      (1) 单位元的唯一性

      • 知识库[1][3][4]指出:若环 \( R \) 含单位元,则单位元是唯一的。<br /> 证明:假设存在 \( 1 \) 和 \( 1' \),则 \( 1 = 1 \cdot 1' = 1' \),故唯一。

      (2) 单位元与零元的关系

      • 若 \( R \) 是非零环(即 \( |R| \geq 2 \)),则单位元 \( 1 \neq 0 \)。
      • 若 \( 1 = 0 \),则对任意 \( a \in R \),有 \( a = a \cdot 1 = a \cdot 0 = 0 \),故 \( R \) 只含零元,称为零环

      (3) 乘法半群的结构

      • 通常乘法半群不要求单位元,但含幺环强制要求乘法半群有单位元,因此乘法结构是含幺半群(monoid)而非普通半群。

      3. 含幺环的例子

      • 经典例子
      • 整数环 \( \mathbb{Z} \):单位元为 \( 1 \),是含幺交换环。
      • 多项式环 \( R[x] \):若 \( R \) 是含幺环,则 \( R[x] \) 也是含幺环,单位元为常数项 \( 1 \)。
      • 矩阵环 \( M_n(R) \)(\( n \geq 1 \)):单位元为单位矩阵 \( I_n \),但乘法非交换(除非 \( n = 1 \))。
      • 模 \( n \) 剩余类环 \( \mathbb{Z}_n \):单位元为 \( [1] \),是含幺交换环。

      • 非含幺环的例子

      • 偶数环 \( 2\mathbb{Z} \):乘法无单位元(因 \( 1 \notin 2\mathbb{Z} \)),故非含幺环。

      4. 含幺环的分类与性质

      (1) 交换含幺环(Commutative Ring with Unity)

      • 若乘法满足交换律,则称 \( R \) 为交换含幺环(CRI)。
      • 例子:\( \mathbb{Z}, \mathbb{Q}, \mathbb{Z}_n \)。

      (2) 无零因子环

      • 若 \( ab = 0 \Rightarrow a = 0 \) 或 \( b = 0 \),则称 \( R \) 为无零因子环
      • 整环(Integral Domain):交换、含幺、无零因子的环。
        • 例子:\( \mathbb{Z}, \mathbb{Q}, \mathbb{R} \)。

      (3) 除环与域

      • 除环(Division Ring):含幺、无零因子,且每个非零元可逆(存在乘法逆元)。
      • 非交换例子:四元数环 \( \mathbb{H} \)。
      • 域(Field):交换的除环。
      • 例子:\( \mathbb{Q}, \mathbb{R}, \mathbb{C} \),以及有限域 \( \mathbb{Z}_p \)(\( p \) 为素数)。

      5. 含幺环的重要性

      • 代数结构核心:<br /> 许多代数结构(如域、模、代数)都基于含幺环的性质构建。
      • 应用领域
      • 代数数论:研究数域的整数环(如代数整数构成的整环)。
      • 线性代数:矩阵环是线性变换的代数模型。
      • 编码理论:多项式环用于构造纠错码(如循环码)。

      6. 常见误区澄清

      • 误区1:所有环都含幺元。
      • 纠正:需明确区分“环”与“含幺环”。例如,偶数环 \( 2\mathbb{Z} \) 是环但非含幺环。
      • 误区2:乘法半群必须交换。
      • 纠正:含幺环的乘法半群可以非交换(如矩阵环 \( M_n(\mathbb{R}) \))。

      总结

      含幺环是环的“升级版”,通过要求乘法存在单位元,为代数结构提供了更丰富的性质和应用。其核心在于平衡加法的交换性与乘法的结合性,并通过单位元支持逆元和多项式等高级运算。理解含幺环是深入交换代数、非交换代数及应用领域的关键基础。

      如果需要进一步探讨具体例子或深入某个性质,请随时提问!

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2024-02465

      Corresponding author(s): Saravanan, Palani

      1. General Statements

      We would like to thank the Review Commons Team for handling our manuscript and the Reviewers for their constructive feedback and suggestions. In our revised manuscript, we have addressed and incorporated all the major suggestions of the reviewers, and we have also added new significant data on the role of Tropomyosin in regulation of endocytosis through its control over actin monomer pool maintenance and actin network homeostasis. We believe that with all these additions, our study has significantly gained in quality, strength of conclusions made, and scope for future work.

      2. Point-by-point description of the revisions

      Reviewer #1

      Evidence, reproducibility and clarity

      There are 2 Major issues -

      Having an -ala-ser- linker between the GFP and tropomyosin mimics acetylation. This is not the case, and more likely the this linker acts as a spacer that allows tropomyosin polymers to form on the actin, and without it there is steric hindrance. A similar result would be seen with a simple flexible uncharged linker. It has been shown in a number of labs that the GFP itself masks the effect of the charge on the amino terminal methionine. This is consistent with NMR, crystallographic and cryo structural studies. Biochemical studies should be presented to demonstrate that the impact of a linker for the conclusions stated to be made, which provide the basis of a major part of this study.

      Response: We would like to clarify that all mNG-Tpm constructs used in our study contain a 40 amino-acid (aa) flexible linker between the N-terminal mNG fluorescent protein and the Tpm protein as per our earlier published study (Hatano et al., 2022). During initial optimization, we have also experimented with linker length and the 40aa-linker length works optimally for clear visualization of Tpm onto actin cable structures in budding yeast, fission yeast (both S. pombe and S. japonicus), and mammalian cells (Hatano et al., 2022). These constructs have also been used since in other studies (Wirshing et al., 2023; Wirshing and Goode, 2024) and currently represents the best possible strategy to visualize Tpm isoforms in live cells. In our study, we characterized these proteins for functionality and found that both mNG-Tpm1 and mNG-Tpm2 were functional and can rescue the synthetic lethality observed in Dtpm1Dtpm2 cells. During our study, we observed that mNG-Tpm1 expression from a single-copy integration vector did not restore full length actin cables in Dtpm1 cells (Fig. 1B, 1C). We hypothesized that this could be a result of reduced binding affinity of the tagged tropomyosin due to lack of normal N-terminal acetylation which stabilizes the N-terminus. The 40aa linker is unstructured and may not be able to neutralize the charge on the N-terminal Methionine, thus, we tried to insert -Ala-Ser- dipeptide which has been routinely used in vitro biochemical studies to stabilize the N-terminal helix and impart a similar effect as the N-terminal acetylation (Alioto et al., 2016; Palani et al., 2019; Christensen et al., 2017) by restoring normal binding affinity of Tpm to F-actin (Monteiro et al., 1994; Greenfield et al., 1994). We observed that addition of the -Ala-Ser- dipeptide to mNG-Tpm fusion, indeed, restored full length actin cables when expressed in Dtpm1 cells, performing significantly better in our in vivo experiments (Fig. 1B, 1C). We agree with the reviewer that the -AS- dipeptide addition may not mimic N-terminal acetylation structurally but as per previous studies, it may stabilize the N-terminus of Tpm and allow normal head-to-tail dimer formation (Greenfield et al., 1994; Monteiro et al., 1994; Frye et al., 2010). We have discussed this in our new Discussion section (Lines 350-372). Since, the addition of -AS- dipeptide was referred to as "acetyl-mimic (am)" in a previous study (Alioto et al., 2016), we continued to use the same nomenclature in our study. Now as per your suggestions and to be more accurate, we have renamed "mNG-amTpm" constructs as "mNG-ASTpm" throughout the study to not confuse or claim that -AS- addition mimics acetylation. In any case, we have not seen any other ill effect of -AS- dipeptide introduction in addition to our 40 amino acid linker suggesting that it can also be considered part of the linker. Although, we agree with the reviewer that biochemical characterization of the effect of linker would be important to determine, we strongly believe that it is currently outside the scope of this study and should be taken up for future work with these proteins. Our study has majorly aimed to understand the functionality and utility of these mNG-Tpm fusion proteins for cell biological experiments in vivo, which was not done earlier in any other model system.

      My major issue however is making the conclusions stated here, using an amino-terminal fluorescent protein tag that s likely to impact any type of isoform selection at the end of the actin polymer. Carboxyl terminal tagging may have a reduced effect, but modifying the ends of the tropomyosin, which are integral in stabilising end to end interactions with itself on the actin filament, never mind any section systems that may/maynot be present in the cell, is not appropriate.

      Response: We agree with the reviewer that N-terminal tagging of tropomyosin may have effects on its function, but these constructs represent the only fluorescently tagged functional tropomyosin constructs available currently while C-terminal fusions are either non-functional (we were unable to construct strains with endogenous Tpm1 gene fused C-terminally to GFP) or do not localize clearly to actin structures (See Figure R1 showing endogenous C-terminally tagged Tpm2-yeGFP that shows almost no localization to actin cables). To our knowledge, our study represents a first effort to understand the question of spatial sorting of Tpm isoforms, Tpm1 and Tpm2, in S. cerevisiae and any future developments with better visualization strategies for Tpm isoforms without compromising native N-terminal modifications and function will help improve our understanding of these proteins in vivo. We have also discussed these possibilities in our new Discussion section (Lines 391-396).

      Significance

      This paper explores the role of formin in determining the localisation of different tropomyosins to different actin polymers and cellular locations within budding yeast. Previous studies have indicated a role for the actin nucleating proteins in recruiting different forms of tropomyosin within fission yeast. In mammalian cells there is variation in the role of formins in affiecting tropomyosin localisation - variation between cell type. There is also evidence that other actin binding proteins, and tropomyosin abundance play roles in regulating the tropomyosin-actin association according to cell type. Biochemical studies have previously been undertaken using budding yeast and fission yeast that the core actin polymerisation domain of formins do not interact with tropomyosin directly. The significance of this study, given the above, and the concerns raised is not clear to this reviewer.

      Response: __Our study explores multiple facets of Tropomyosin (Tpm) biology. The lack of functional tagged Tpm has been a major bottleneck in understanding Tpm isoform diversity and function across eukaryotes. In our study, we characterize the first functional tagged Tpm proteins (Fig. 1, Fig. S1) and use them to answer long-standing questions about localization and spatial sorting of Tpm isoforms in the model organism S. cerevisiae (Fig. 2, Fig. 3, Fig. S2, Fig. S3). We also discover that the dual Tpm isoforms, Tpm1 and Tpm2, are functionally redundant for actin cable organization and function, while having gained divergent functions in Retrograde Actin Cable Flow (RACF) (Fig. 4, Fig. 5A-D, Fig. S4, Fig. S5, Fig. S6). We have now added new data on role of global Tpm levels controlling endocytosis via maintenance of normal linear-to-branched actin network homeostasis in S. cerevisiae (Fig. 5E-G)__. We respectfully differ with the reviewer on their assessment of our study and request the reviewer to read our revised manuscript which discusses the significance, limitations, and future perspectives of our study in detail.

      Reviewer #2

      Evidence, reproducibility and clarity

      This manuscript by Dhar, Bagyashree, Palani and colleagues examines the function of the two tropomyosins, Tpm1 and Tpm2, in the budding yeast S. cerevisiae. Previous work had shown that deletion of tpm1 and tpm2 causes synthetic lethality, indicating overlapping function, but also proposed that the two tropomyosins have distinct functions, based on the observation that strong overexpression of Tpm2 causes defects in bud placement and fails to rescue tpm1∆ phenotypes (Drees et al, JCB 1995). The manuscript first describes very functional mNeonGreen tagged version of Tpm1 and Tpm2, where an alanine-serine dipeptide is inserted before the first methionine to mimic acetylation. It then proposes that the Tpm1 and Tpm2 exhibit indistinguishable localization and that low level overexpression (?) of Tpm2 can replace Tpm1 for stabilization of actin cables and cell polarization, suggesting almost completely redundant functions. They also propose on specific function of Tpm2 in regulating retrograde actin cable flow.

      Overall, the data are very clean, well presented and quantified, but in several places are not fully convincing of the claims. Because the claims that Tpm1 and Tpm2 have largely overlapping function and localization are in contradiction to previous publication in S. cerevisiae and also different from data published in other organisms, it is important to consolidate them. There are fairly simple experiments that should be done to consolidate the claims of indistinguishable localization, and levels of expression, for which the authors have excellent reagents at their disposal.

      1. Functionality of the acetyl-mimic tagged tropomyosin constructs: The overall very good functionality of the tagged Tpm constructs is convincing, but the authors should be more accurate in their description, as their data show that they are not perfectly functional. For instance, the use of "completely functional" in the discussion is excessive. In the results, the statement that mNG-Tpm1 expression restores normal growth (page 3, line 69) is inaccurate. Fig S1C shows that tpm1∆ cells expressing mNG-Tpm1 grow more slowly than WT cells. (The next part of the same sentence, stating it only partially restores length of actin cables should cite only Fig S1E, not S1F.) Similarly, the growth curve in Fig S1C suggests that mNG-amTpm1, while better than mNG-Tpm1 does not fully restore the growth defect observed in tpm1∆ (in contrast to what is stated on p. 4 line 81). A more stringent test of functionality would be to probe whether mNG-amTpm1 can rescue the synthetic lethality of the tpm1∆ tpm2∆ double mutant, which would also allow to test the functionality of mNG-amTpm2.

      __Response: __We would like to thank the reviewer for his feedback and suggestions. Based on the suggestions, we have now more accurately described the growth rescue observed by expression of mNG-ASTpm1 in Dtpm1 cells in the revised text. We have also removed the use of "completely functional" to describe mNG-Tpm functionality and corrected any errors in Figure citations in the revised manuscript.

      As per reviewers' suggestion, we have now tested rescue of synthetic lethality of Dtpm1Dtpm2 cells by expression of all mNG-Tpm variants and we find that all of them are capable of restoring the viability of Dtpm1Dtpm2 cells when expressed under their native promoters via a high-copy plasmid (pRS425) (Fig. S1E) but only mNG-Tpm1 and mNG-ASTpm1 restored viability of Dtpm1Dtpm2 cells when expressed under their native promoters via an integration plasmid (pRS305) (Fig. S1F). These results clearly suggest that while both mNG-Tpm1 and mNG-Tpm2 constructs are functional, Tpm1 tolerates the presence of the N-terminal fluorescent tag better than Tpm2. These observations now enhance our understanding of the functionality of these mNG-Tpm fusion proteins and will be a useful resource for their usage and experimental design in future studies in vivo.

      It would also be nice to comment on whether the mNG-amTpm constructs really mimicking acetylation. Given the Ala-Ser peptide ahead of the starting Met is linked N-terminally to mNG, it is not immediately clear it will have the same effect as a free acetyl group decorating the N-terminal Met.

      Response: __We agree with the reviewer's observation and for the sake of clarity and accuracy, we have now renamed "mNG-amTpm" with "mNG-ASTpm". The use of -AS- dipeptide is very routine in studies with Tpm (Alioto et al., 2016; Palani et al., 2019; Christensen et al., 2017) and its addition restores normal binding affinities to Tpm proteins purified from E. coli (Monteiro et al., 1994). We agree with the reviewer that the -AS- dipeptide addition may not mimic N-terminal acetylation structurally but as per previous studies, it may help neutralize the impact of a freely protonated Met on the alpha-helical structure and stabilize the N-terminus helix of Tpm and allow normal head-to-tail dimer formation (Monteiro et al., 1994; Frye et al., 2010; Greenfield et al., 1994). Consistent with this, we also observe a highly significant improvement in actin cable length when expressing mNG-ASTpm as compared to mNG-Tpm in Dtpm1 cells, suggesting an improvement in function probably due to increased binding affinity (Fig. 1B, 1C). We have also discussed this in our answer to Question 1 of Reviewer 1 and the revised manuscript (Lines 350-372)__.

      __ Localization of Tpm1 and Tpm2:__Given the claimed full functionality of mNG-amTpm constructs and the conclusion from this section of the paper that relative local concentrations may be the major factor in determining tropomyosin localization to actin filament networks, I am concerned that the analysis of localization was done in strains expressing the mNG-amTpm construct in addition to the endogenous untagged genes. (This is not expressly stated in the manuscript, but it is my understanding from reading the strain list.) This means that there is a roughly two-fold overexpression of either tropomyosin, which may affect localization. A comparison of localization in strains where the tagged copy is the sole Tpm1 (respectively Tpm2) source would be much more conclusive. This is important as the results are making a claim in opposition to previous work and observation in other organisms.

      Response: __We thank the reviewer for this observation and their suggestions. We agree that relative concentrations of functional Tpm1 and Tpm2 in cells may influence the extent of their localizations. As per the reviewer's suggestion, we have now conducted our quantitative analysis in cells lacking endogenous Tpm1 and only expressing mNG-ASTpm1 from an integrated plasmid copy at the leu2 locus and the data is presented in new __Figure S3. We compared Tpm-bound cable length (Fig. S3A, S3B) __and Tpm-bound cable number (Fig. S3A, S3C) along with actin cable length (Fig. S3D, S3E) and actin cable number (Fig. S3D, S3F) in wildtype, Dbnr1, and Dbni1 cells. Our analysis revealed that mNG-ASTpm1 localized to actin cable structures in wildtype, Dbnr1, and Dbni1 cells and the decrease observed in Tpm-bound cable length and number upon loss of either Bnr1 or Bni1, was accompanied by a corresponding decrease in actin cable length and number upon loss of either Bnr1 or Bni1. Thus, this analysis reached the same conclusion as our earlier analysis (Fig. 2) that mNG-ASTpm1 does not show preference between Bnr1 and Bni1-made actin cables. mNG-ASTpm2 did not restore functionality, when expressed as single integrated copy, in Dtpm1Dtpm2 cells (new results in __Fig. S1E, S1F, S5A) thus, we could not conduct a similar analysis for mNG-ASTpm2. This suggests that use of mNG-ASTpm2 would be more meaningful in the presence of endogenous Tpm2 as previously done in Fig. 2D-F.

      We have now also performed additional yeast mating experiments with cells lacking bnr1 gene and expressing either mNG-ASTpm1 or mNG-ASTpm2 and the data is shown in new Figure 3. From these observations, we observe that both mNG-ASTpm1 and mNG-ASTpm2 localize to the mating fusion focus in a Bnr1-independent manner (Fig. 3B, 3D) and suggests that they bind to Bni1-made actin cables that are involved in polarized growth of the mating projection. These results also add strength to our conclusion that Tpm1 and Tpm2 localize to actin cables irrespective of which formin nucleates them. Overall, these new results highlight and reiterate our model of formin-isoform independent binding of Tpm1 and Tpm2 in S. cerevisiae.

      In fact, although the authors conclude that the tropomyosins do not exhibit preference for certain actin structures, in the images shown in Fig 2A and 2D, there seems to be a clear bias for Tpm1 to decorate cables preferentially in the bud, while Tpm2 appears to decorate them more in the mother cell. Is that a bias of these chosen images, or does this reflect a more general trend? A quantification of relative fluorescence levels in bud/mother may be indicative.

      Response: __We thank the reviewer for pointing this out. Our data and analysis do not suggest that Tpm1 and Tpm2 show any preference for decoration of cables in either mother or bud compartment. As per the reviewer's suggestion, we have now quantified the ratio of mean mNG fluorescence in the bud to the mother (Bud/Mother) and the data is shown in __Figure. S2G. The bud-to-mother ratio was similar for mNG-ASTpm1 and mNG-ASTpm2 in wildtype cells, and the ratio increased in Dbnr1 cells and decreased in Dbni1 cells for both mNG-ASTpm1 and mNG-ASTpm2 (Fig. S2G). __This is consistent with the decreased actin cable signal in the mother compartment in Dbnr1 cells and decreased actin cable signal in the bud compartment in Dbni1 cells (Fig. S2A-D). Thus, our new analysis shows that both mNG-ASTpm1 and mNG-ASTpm2 have similar changes in their concentration (mean fluorescence) upon loss of either formins Bnr1 and Bni1 and show similar ratios in wildtype cells as well, suggesting no preference for binding to actin cables in either bud or mother compartment. The preference inferred by the reviewer seems to be a bias of the current representative images and thus, we have replaced the images in __Fig. 2A, 2D to more accurately represent the population.

      The difficulty in preserving mNG-amTpm after fixation means that authors could not quantify relative Tpm/actin cable directly in single fixed cells. Did they try to label actin cables with Lifeact instead of using phalloidin, and thus perform the analysis in live cells?

      __Response: __We did not use LifeAct for our analysis as LifeAct is known to cause expression-dependent artefacts in cells (Courtemanche et al., 2016; Flores et al., 2019; Xu and Du, 2021) and it also competes with proteins that regulate normal cable organization like cofilin. Use of LifeAct would necessitate standardization of expression to avoid such artefacts in vivo. Also, phalloidin staining provides the best staining of actin cables and allows for better quantitative results in our experiments. The use of LifeAct along with mNG-Tpm would also require optimization with a red fluorescent protein which usually tend to have lower brightness and photostability. However, during the revision of our study, a new study from Prof. Goode's lab has developed and optimized expression of new LifeAct-3xmNeonGreen constructs for use in S. cerevisiae (Wirshing and Goode, 2024). Thus, a similar strategy of using tandem copies of bright and photostable red fluorescent proteins can be explored for use in combination with mNG-Tpm in the future studies.

      __ Complementation of tpm1∆ by Tpm2:__

      I am confused about the quantification of Tpm2 expression by RT-PCR shown in Fig S3F. This figure shows that tpm2 mRNA expression levels are identical in cells with an empty plasmid or with a tpm2-encoding plasmid. In both strains (which lack tpm1), as well as in the WT control, one tpm2 copy is in the genome, but only one strain has a second tpm2 copy expressed from a centromeric plasmid, yet the results of the RT-PCR are not significantly different. (If anything, the levels are lower in the tpm2 plasmid-containing strain.) The methods state that the primers were chosen in the gene, so likely do not distinguish the genomic from the plasmid allele. However, the text claims a 1-fold increase in expression, and functional experiments show a near-complete rescue of the tpm1∆ phenotype. This is surprising and confusing and should be resolved to understand whether higher levels of Tpm2 are really the cause of the observed phenotypic rescue.

      The authors could for instance probe for protein levels. I believe they have specific nanobodies against tropomyosin. If not, they could use expression of functional mNG-amTpm2 to rescue tpm1∆. Here, the expression of the protein can be directly visualized.

      Response: __We thank the reviewer for pointing this out. We would like to clarify that in our RT-qPCR experiments, the primers were chosen within the Tpm1 and Tpm2 gene and do not distinguish between transcripts from endogenous or plasmid copy. We have now mentioned this in the Materials and Methods section of the revised manuscript. So, they represent a relative estimate of the total mRNA of these genes present in cells. We were consistently able to detect ~19 fold increase in Tpm2 total mRNA levels as compared to wildtype and ∆tpm1 cells (Fig. S4D) when tpm2 was expressed from a high-copy plasmid (pRS425). This increase in Tpm2 mRNA levels was accompanied by a rescue in growth (Fig. S4A) and actin cable organization (Fig. S4B) of ∆tpm1 cells containing pRS425-ptpm2TPM2. When tpm2 was expressed from a low-copy number centromeric plasmid (pRS316), we detected a ~2 fold increase in Tpm2 transcript levels when using the tpm1 promoter and no significant change was detected when using tpm2 promoter (Fig. S4E)__. We have made sure that these results are accurately described in the revised manuscript.

      As per the reviewer's suggestion, we have now conducted a more extensive analysis to ascertain the expression levels of Tpm2 in our experiments and the data is now presented in new Figure S5. We used mNG-ASTpm1 and mNG-ASTpm2 to rescue growth of ∆tpm1 (Fig. S5A) and correlated growth rescue with protein levels using quantified fluorescence intensity (Fig. S5B, S5C) and western blotting (anti-mNG) (Fig. S5D, S5E). We find that ∆tpm1 cells containing pRS425-ptpm1mNG-ASTpm1 had the highest protein level followed by pRS425-ptpm2 mNG-ASTpm2, pRS305-ptpm1mNG-ASTpm1, and the least protein levels were found in pRS305-ptpm2 mNG-ASTpm2 containing ∆tpm1 cells in both fluorescence intensity and western blotting quantifications (Fig. S5C, S5E). Surprisingly, we were not able to detect any protein levels in ∆tpm1 cells containing pRS305-ptpm2 mNG-ASTpm2 with western blotting (Fig. S5D) which was also accompanied by a lack of growth rescue (Fig. S5A). This most likely due to weak expression from the native Tpm2 promoter which is consistent with previous literature (Drees et al., 1995). Taken together, this data clearly shows that the rescue observed in ∆tpm1 cells is caused due to increased expression of mNG-ASTpm2 in cells and supports our conclusion that increase in Tpm2 expression leads to restoration of normal growth and actin cables in ∆tpm1 cells.

      __ Specific function of Tpm2:__

      The data about the retrograde actin flow is interpreted as a specific function of Tpm2, but there is no evidence that Tpm1 does not also share this function. To reach this conclusion one would have to investigate retrograde actin flow in tpm1∆ (difficult as cables are weak) or for instance test whether Tpm1 expression restores normal retrograde flow to tpm2∆ cells.

      Response: __We agree with the reviewer and as per the reviewer's suggestion, we have performed another experiment which include wildtype, ∆tpm2 cells containing empty pRS316 vector or pRS316-ptpm2TPM1 or pRS316-ptpm1TPM1. We find that RACF rate increased in ∆tpm2 cells as compared to wildtype and was restored to wildtype levels by exogenous expression of Tpm2 but not Tpm1 (Fig. S6E, S6F). Since, actin cables were not detectable in ∆tpm1 cells, we measured RACF rates in ∆tpm1 cells expressing Tpm1 or Tpm2 from a plasmid copy, which restored actin cables as shown previously in __Fig. 5A-C. We observed that RACF rates were similar to wildtype in ∆tpm1 cells expressing either Tpm1 or Tpm2 (Fig. S6E, S6F), suggesting that Tpm1 is not involved in RACF regulation. Taken together, these results suggest a specific role for Tpm2, but not Tpm1, in RACF regulation in S. cerevisiae, consistent with previous literature (Huckaba et al., 2006).

      Minor comments: __1.__The growth of tpm1∆ with empty plasmid in Fig S3A is strangely strong (different from other figures).

      Response: We thank the reviewer for pointing this out. We have now repeated the drop test multiple times (Fig. R2), but we see similar growth rates as the drop test already presented in Fig. S4A. __At this point, it would be difficult to ascertain the basis of this difference observed at 23{degree sign}C and 30{degree sign}C, but a recent study that links leucine levels to actin cable stability (Sing et al., 2022) might explain the faster growth of these ∆tpm1 cells containing a leu2 gene carrying high-copy plasmid. However, there is no effect on growth rate at 37{degree sign}C which is consistent with other spot assays shown in __Fig. S1D, S4F, S5A.

      Significance

      I am a cell biologist with expertise in both yeast and actin cytoskeleton.

      The question of how tropomyosin localizes to specific actin networks is still open and a current avenue of study. Studies in other organisms have shown that different tropomyosin isoforms, or their acetylated vs non-acetylated versions, localize to distinct actin structures. Proposed mechanisms include competition with other ABPs and preference imposed by the formin nucleator. The current study re-examines the function and localization of the two tropomyosin proteins from the budding yeast and reaches the conclusion that they co-decorate all formin-assembled structures and also share most functions, leading to the simple conclusion that the more important contribution of Tpm1 is simply linked to its higher expression. Once consolidated, the study will appeal to researchers working on the actin cytoskeleton.

      We thank the reviewer for their positive assessment of our work and the constructive feedback that has greatly improved the quality of our study. After addressing the points raised by the reviewer, we believe that our study has significantly gained in consolidating the major conclusions of our work.

      **Referees cross-commenting**

      Having read the other reviewers' comments, I do agree with reviewer 1 that it is not clear whether the Ala-Ser linker really mimics acetylation. I am less convinced than reviewer 3 that the key conclusions of the study are well supported, notably the issue of Tpm2 expression levels is not convincing to me.

      Response: __We acknowledge the reviewer's point about the effect of Ala-Ser dipeptide and would request the reviewer to refer to our response to Reviewer 1 (Question 1) for a more detailed discussion on this. We have also extensively addressed the question of Tpm2 expression levels as suggested by the reviewer (new data in __Figure S5) which has further strengthened the conclusions of our study.

      __Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary:__ The study presents the first fully functional fluorescently tagged Tpm proteins, enabling detailed probing of Tpm isoform localization and functions in live cells. The authors created a modified fusion protein, mNG-amTpm, which mimicked native N-terminal acetylation and restored both normal growth and full-length actin cables in yeast cells lacking native Tpm proteins, demonstrating the constructs' full functionality. They also show that Tpm1 and Tpm2 do not have a preference for actin cables nucleated by different formins (Bnr1 and Bni1). Contrary to previous reports, the study found that overexpressing Tpm2 in Δtpm1 cells could restore growth rates and actin cable formation. Furthermore, it is shown that despite its evolutionary divergence, Tpm2 retains actin-protective functions and can compensate for the loss of Tpm1, contributing to cellular robustness.

      Major and Minor Comments: 1. The key conclusions of this paper are convincing. However, I suggest that more detail be provided regarding the image analysis used in this study. Specifically, since threshold settings can impact the quality of the generated data and, therefore, its interpretation, it would be useful to see a representative example of the quantification methods used for actin cable length/number (as in refs. 80 and 81) and mitochondria morphology. These could be presented as Supplemental Figures. Additionally, it would help to interpret the results if the authors could be more specific about the statistical tests that were used.

      Response: __We agree with the reviewer's suggestions and have now updated our Materials and Methods section to describe the image analysis pipelines used in more detail. We have also added examples of quantification procedure for actin cable length/number and mitochondrial morphology as an additional Supplementary __Figure S7. Briefly, the following pipelines were used:

      • Actin cable length and number analysis: This was done exactly as mentioned in McInally et al., 2021, McInally et al., 2022. Actin cables were manually traced in Fiji as shown in __ S7A__, and then the traces files for each cell were run through a Python script (adapted from McInally et al., 2022) that outputs mean actin cable length and number per cell.
      • Mitochondria morphology: Mitochondria Analyzer plug-in in Fiji was used to segment out the mitochondrial fragments. The parameters used for 2D segmentation of mitochondria were first optimized using "2D Threshold Optimize" to find the most accurate segmentation and then the same parameters were run on all images. After segmentation of the mitochondrial network, measurements of fragment number were done using "Analyze Particles" function in Fiji. An example of the overall process is shown in __ S7B.__ As per the reviewer's suggestion, we have now included the description of the statistical test used in the Figure Legends of each Figure in the revised manuscript. We have used One-Way Anova with Tukey's Multiple Comparison test, Kruskal-Wallis test with Dunn's Multiple Comparisons, and Unpaired Two-tailed t-test using the in-built functions in GraphPad Prism (v.6.04).

      **Referees cross-commenting**

      I agree with both reviewers 1 and 2 regarding the issues with the Ala-Ser acetylation mimic and Tpm2 expression levels, respectively. I think the authors should be more careful in how they frame the results, but I consider that these issues do not invalidate the main conclusions of this study.

      Response: __We acknowledge the reviewer's concern about the Ala-Ser dipeptide and would request them to refer our earlier discussion on this in response to Reviewer 1 (Question 1) and Reviewer 2 (Question 2). We would also request the reviewer to refer to our answer to Reviewer 2 (Question 6) where we have extensively addressed the question of Tpm2 expression levels and their effect on rescue of Dtpm1 cells. This data is now presented as new __Figure S5 in our revised manuscript.

      Reviewer#3 (Significance (Required)):

      The finding that Tpm2 can compensate for the loss of Tpm1, restoring actin cable organization and normal growth rates, challenges previous assumptions about the non-redundant functions of these isoforms in Saccharomyces cerevisiae (ref. 16). It also supports a concentration-dependent and formin-independent localization of Tpm isoforms to actin cables in this species. The development of fully functional fluorescently tagged Tpm proteins is a significant methodological advancement. This advancement overcomes previous visualization challenges and allows for accurate in vivo studies of Tpm function and regulation in S. cerevisiae.

      The findings will be of particular interest to researchers in the field of cellular and molecular biology who study actin cytoskeleton dynamics. Additionally, it will be relevant for those utilizing advanced microscopy and live-cell imaging techniques.

      As a researcher, my experience lies in cytoskeleton dynamics and protein interactions, though I do not have specific experience related to tropomyosin. I use different yeast species as models and routinely employ live-cell imaging as a tool.

      We thank the reviewer for their positive outlook and assessment of our study. We have incorporated all their suggestions, and we are confident that the revised manuscript has significantly improved in quality due to these additions.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This manuscript by Dhar, Bagyashree, Palani and colleagues examines the function of the two tropomyosins, Tpm1 and Tpm2, in the budding yeast S. cerevisiae. Previous work had shown that deletion of tpm1 and tpm2 causes synthetic lethality, indicating overlapping function, but also proposed that the two tropomyosins have distinct functions, based on the observation that strong overexpression of Tpm2 causes defects in bud placement and fails to rescue tpm1∆ phenotypes (Drees et al, JCB 1995). The manuscript first describes very functional mNeonGreen tagged version of Tpm1 and Tpm2, where an alanine-serine dipeptide is inserted before the first methionine to mimic acetylation. It then proposes that the Tpm1 and Tpm2 exhibit indistinguishable localization and that low level overexpression (?) of Tpm2 can replace Tpm1 for stabilization of actin cables and cell polarization, suggesting almost completely redundant functions. They also propose on specific function of Tpm2 in regulating retrograde actin cable flow.

      Overall, the data are very clean, well presented and quantified, but in several places are not fully convincing of the claims. Because the claims that Tpm1 and Tpm2 have largely overlapping function and localization are in contradiction to previous publication in S. cerevisiae and also different from data published in other organisms, it is important to consolidate them. There are fairly simple experiments that should be done to consolidate the claims of indistinguishable localization, and levels of expression, for which the authors have excellent reagents at their disposal.

      Functionality of the acetyl-mimic tagged tropomyosin constructs:

      The overall very good functionality of the tagged Tpm constructs is convincing, but the authors should be more accurate in their description, as their data show that they are not perfectly functional. For instance, the use of "completely functional" in the discussion is excessive. In the results, the statement that mNG-Tpm1 expression restores normal growth (page 3, line 69) is inaccurate. Fig S1C shows that tpm1∆ cells expressing mNG-Tpm1 grow more slowly than WT cells. (The next part of the same sentence, stating it only partially restores length of actin cables should cite only Fig S1E, not S1F.) Similarly, the growth curve in Fig S1C suggests that mNG-amTpm1, while better than mNG-Tpm1 does not fully restore the growth defect observed in tpm1∆ (in contrast to what is stated on p. 4 line 81). A more stringent test of functionality would be to probe whether mNG-amTpm1 can rescue the synthetic lethality of the tpm1∆ tpm2∆ double mutant, which would also allow to test the functionality of mNG-amTpm2.

      It would also be nice to comment on whether the mNG-amTpm constructs really mimicking acetylation. Given the Ala-Ser peptide ahead of the starting Met is linked N-terminally to mNG, it is not immediately clear it will have the same effect as a free acetyl group decorating the N-terminal Met.

      Localization of Tpm1 and Tpm2:

      Given the claimed full functionality of mNG-amTpm constructs and the conclusion from this section of the paper that relative local concentrations may be the major factor in determining tropomyosin localization to actin filament networks, I am concerned that the analysis of localization was done in strains expressing the mNG-amTpm construct in addition to the endogenous untagged genes. (This is not expressly stated in the manuscript, but it is my understanding from reading the strain list.) This means that there is a roughly two-fold overexpression of either tropomyosin, which may affect localization. A comparison of localization in strains where the tagged copy is the sole Tpm1 (respectively Tpm2) source would be much more conclusive. This is important as the results are making a claim in opposition to previous work and observation in other organisms.

      In fact, although the authors conclude that the tropomyosins do not exhibit preference for certain actin structures, in the images shown in Fig 2A and 2D, there seems to be a clear bias for Tpm1 to decorate cables preferentially in the bud, while Tpm2 appears to decorate them more in the mother cell. Is that a bias of these chosen images, or does this reflect a more general trend? A quantification of relative fluorescence levels in bud/mother may be indicative.

      The difficulty in preserving mNG-amTpm after fixation means that authors could not quantify relative Tpm/actin cable directly in single fixed cells. Did they try to label actin cables with Lifeact instead of using phalloidin, and thus perform the analysis in live cells?

      Complementation of tpm1∆ by Tpm2:

      I am confused about the quantification of Tpm2 expression by RT-PCR shown in Fig S3F. This figure shows that tpm2 mRNA expression levels are identical in cells with an empty plasmid or with a tpm2-encoding plasmid. In both strains (which lack tpm1), as well as in the WT control, one tpm2 copy is in the genome, but only one strain has a second tpm2 copy expressed from a centromeric plasmid, yet the results of the RT-PCR are not significantly different. (If anything, the levels are lower in the tpm2 plasmid-containing strain.) The methods state that the primers were chosen in the gene, so likely do not distinguish the genomic from the plasmid allele. However, the text claims a 1-fold increase in expression, and functional experiments show a near-complete rescue of the tpm1∆ phenotype. This is surprising and confusing and should be resolved to understand whether higher levels of Tpm2 are really the cause of the observed phenotypic rescue. The authors could for instance probe for protein levels. I believe they have specific nanobodies against tropomyosin. If not, they could use expression of functional mNG-amTpm2 to rescue tpm1∆. Here, the expression of the protein can be directly visualized.

      Specific function of Tpm2:

      The data about the retrograde actin flow is interpreted as a specific function of Tpm2, but there is no evidence that Tpm1 does not also share this function. To reach this conclusion one would have to investigate retrograde actin flow in tpm1∆ (difficult as cables are weak) or for instance test whether Tpm1 expression restores normal retrograde flow to tpm2∆ cells.

      Minor comments:

      The growth of tpm1∆ with empty plasmid in Fig S3A is strangely strong (different from other figures).

      Referees cross-commenting

      Having read the other reviewers' comments, I do agree with reviewer 1 that it is not clear whether the Ala-Ser linker really mimics acetylation. I am less convinced than reviewer 3 that the key conclusions of the study are well supported, notably the issue of Tpm2 expression levels is not convincing to me.

      Significance

      I am a cell biologist with expertise in both yeast and actin cytoskeleton.

      The question of how tropomyosin localizes to specific actin networks is still open and a current avenue of study. Studies in other organisms have shown that different tropomyosin isoforms, or their acetylated vs non-acetylated versions, localize to distinct actin structures. Proposed mechanisms include competition with other ABPs and preference imposed by the formin nucleator. The current study re-examines the function and localization of the two tropomyosin proteins from the budding yeast and reaches the conclusion that they co-decorate all formin-assembled structures and also share most functions, leading to the simple conclusion that the more important contribution of Tpm1 is simply linked to its higher expression. Once consolidated, the study will appeal to researchers working on the actin cytoskeleton.

    1. Reviewer #1 (Public review):

      Summary:

      This study aimed to investigate the effects of optically stimulating the A13 region in healthy mice and a unilateral 6-OHDA mouse model of Parkinson's disease (PD). The primary objectives were to assess changes in locomotion, motor behaviors, and the neural connectome. For this, the authors examined the dopaminergic loss induced by 6-OHDA lesioning. They found a significant loss of tyrosine hydroxylase (TH+) neurons in the substantia nigra pars compacta (SNc) while the dopaminergic cells in the A13 region were largely preserved. Then, they optically stimulated the A13 region using a viral vector to deliver the channelrhodopsine (CamKII promoter). In both sham and PD model mice, optogenetic stimulation of the A13 region induced pro-locomotor effects, including increased locomotion, more locomotion bouts, longer durations of locomotion, and higher movement speeds. Additionally, PD model mice exhibited increased ipsilesional turning during A13 region photoactivation. Lastly, the authors used whole-brain imaging to explore changes in the A13 region's connectome after 6-OHDA lesions. These alterations involved a complex rewiring of neural circuits, impacting both afferent and efferent projections. In summary, this study unveiled the pro-locomotor effects of A13 region photoactivation in both healthy and PD model mice. The study also indicates the preservation of A13 dopaminergic cells and the anatomical changes in neural circuitry following PD-like lesions that represent the anatomical substrate for a parallel motor pathway.

      Strengths:

      These findings hold significant relevance for the field of motor control, providing valuable insights into the organization of the motor system in mammals. Additionally, they offer potential avenues for addressing motor deficits in Parkinson's disease (PD). The study fills a crucial knowledge gap, underscoring its importance, and the results bolster its clinical relevance and overall strength.

      The authors adeptly set the stage for their research by framing the central questions in the introduction, and they provide thoughtful interpretations of the data in the discussion section. The results section, while straightforward, effectively supports the study's primary conclusion-the pro-locomotor effects of A13 region stimulation, both in normal motor control and in the 6-OHDA model of brain damage.

      Weaknesses:

      (1) Anatomical investigation. I have a major concern regarding the anatomical investigation of plastic changes in the A13 connectome (Figures 4 and 5). While the methodology employed to assess the connectome is technically advanced and powerful, the results lack mechanistic insight at the cell or circuit level into the pro-locomotor effects of A13 region stimulation in both physiological and pathological conditions. This concern is exacerbated by a textual description of results that doesn't pinpoint precise brain areas or subareas but instead references large brain portions like the cortical plate, making it challenging to discern the implications for A13 stimulation. Lastly, the study is generally well-written with a smooth and straightforward style, but the connectome section presents challenges in readability and comprehension. The presentation of results, particularly the correlation matrices and correlation strength, doesn't facilitate biological understanding. It would be beneficial to explore specific pathways responsible for driving the locomotor effects of A13 stimulation, including examining the strength of connections to well-known locomotor-associated regions like the Pedunculopontine nucleus, Cuneiformis nucleus, LPGi, and others in the diencephalon, midbrain, pons, and medulla. Additionally, identifying the primary inputs to A13 associated with motor function would enhance the study's clarity and relevance.

      The study raises intriguing questions about compensatory mechanisms in Parkinson's disease a new perspective with the preservation of dopaminergic cells in A13, despite the SNc degeneration, and the plastic changes to input/output matrices. To gain inspiration for a more straightforward reanalysis and discussion of the results, I recommend the authors refer to the paper titled "Specific populations of basal ganglia output neurons target distinct brain stem areas while collateralizing throughout the diencephalon from the David Kleinfeld laboratory." This could guide the authors in investigating motor pathways across different brain regions.

      (2) Description of locomotor performance. Figure 3 provides valuable data on the locomotor effects of A13 region photoactivation in both control and 6-OHDA mice. However, a more detailed analysis of the changes in locomotion during stimulation would enhance our understanding of the pro-locomotor effects, especially in the context of 6-OHDA lesions. For example, it would be informative to explore whether the probability of locomotion changes during stimulation in the control and 6-OHDA groups. Investigating reaction time, speed, total distance, and even kinematic aspects during stimulation could reveal how A13 is influencing locomotion, particularly after 6-OHDA lesions. The laboratory of Whelan has a deep knowledge of locomotion and the neural circuits driving it so these features may be instructive to infer insights on the neural circuits driving movement. On the same line, examining features like the frequency or power of stimulation related to walking patterns may help elucidate whether A13 is engaging with the Mesencephalic Locomotor Region (MLR) to drive the pro-locomotor effects. These insights would provide a more comprehensive understanding of the mechanisms underlying A13-mediated locomotor changes in both healthy and pathological conditions.

      (3) Figure 2 indeed presents valuable information regarding the effects of A13 region photoactivation. To enhance the comprehensiveness of this figure and gain a deeper understanding of the neurons driving the pro-locomotor effect of stimulation, it would be beneficial to include quantifications of various cell types:

      • cFos-Positive Cells/TH-Positive Cells: it can help determine the impact of A13 stimulation on dopaminergic neurons and the associated pro-locomotor effect in healthy condition and especially in the context of Parkinson's disease (PD) modeling.

      • cFos-Positive Cells /TH-Negative Cells: Investigating the number of TH-negative cells activated by stimulation is also important, as it may reveal non-dopaminergic neurons that play a role in locomotor responses. Identifying the location and characteristics of these TH-negative cells can provide insights into their functional significance.<br /> Incorporating these quantifications into Figure 2 would enhance the figure's informativeness and provide a more comprehensive view of the neuronal populations involved in the locomotor effects of A13 stimulation.

      (4) Referred to Figure 3. In the main text (page 5) when describing the animal with 6-OHDA the wrong panels are indicated. It is indicated in Figure 2A-E but it should be replaced with 3A-E. Please do that.

      Summary of the Study after revision

      The revised manuscript reflects significant efforts to improve clarity, organization, and data interpretation. The refinements in anatomical descriptions, behavioral analyses, and contextual framing have strengthened the manuscript considerably. However, the study still lacks direct causal evidence linking anatomical remodeling to behavioral improvements, and the small sample size in the anatomical analyses remains a concern. The authors have addressed many points raised in the initial review, but further acknowledgement of the exploratory nature of these findings would enhance the scientific rigor of the work.

      Key Improvements in the Revision

      The revised manuscript demonstrates considerable progress in clarifying data presentation, refining behavioral analyses, and improving the contextualization of anatomical findings. The restructuring of the anatomical section now provides greater precision in describing motor-related pathways, integrating terminology from the Allen Brain Atlas. The addition of new figures (Figures 4 and 5) strengthens the accessibility of these findings by illustrating key connectivity patterns more effectively. Furthermore, the correlation matrices have been adjusted to improve interpretability, ensuring that the presented data contribute meaningfully to the overall narrative of the study.

      The authors have also made significant improvements in their behavioral analyses, particularly in the organization and presentation of locomotor data. Figure 3 has been revised to distinctly separate results from 6-OHDA and sham animals, providing a clearer comparison of locomotor outcomes. Additional metrics, such as reaction time, locomotion bouts, and movement speed, further enhance the granularity of the analysis, making the results more informative.

      The discussion surrounding anatomical connectivity has also been strengthened. The revised manuscript now places greater emphasis on motor-related pathways and refines its analysis of A13 efferents and afferents. A newly introduced figure provides a concise summary of these connections, improving the contextualization of the anatomical data within the study's broader scope. Moreover, the authors have addressed the translational relevance of their findings by acknowledging the differences between optogenetic stimulation and deep brain stimulation (DBS). Their discussion now better situates the findings within existing literature on PD-related motor circuits, providing a more balanced perspective on the potential implications of A13 stimulation.

      Remaining Concerns

      Despite these substantial improvements, a number of critical concerns remain. The anatomical findings, though insightful, remain largely correlative and do not establish a causal link between structural remodeling and locomotor recovery. While the authors argue that these data will serve as a reference for future investigations, their necessity for the core conclusions of the study is not entirely clear. Additionally, while the anatomical data offer an interesting perspective on A13 connectivity, their direct relevance to the study's primary goal-demonstrating the role of A13 in locomotor recovery-remains uncertain. The authors emphasize that these data will be valuable for future research, yet their integration into the study's main narrative feels somewhat supplementary. Based on this last thought of the authors it is even more relevant another key limitation lying in the small sample size used for connectivity analyses. With only two sham and three 6-OHDA animals included, the statistical confidence in the findings is inherently limited. The absence of direct statistical comparisons between ipsilesional and contralesional projections further weakens the conclusions drawn from these anatomical studies. The authors have acknowledged that obtaining the necessary samples, acquiring the data, and analyzing them is a prolonged and resource-intensive process. While this may be a valid practical limitation, it does not justify the lack of a robust statistical approach. A more rigorous statistical framework should be employed to reinforce the findings, or alternative techniques should be considered to provide additional validation. Given these constraints, it remains unclear why the authors have not opted for standard immunohistochemistry, which could provide a complementary and more statistically accessible approach to validate the anatomical findings. Employing such an approach would not only increase the robustness of the results but also strengthen the study's impact by providing an independent confirmation of the observed structural changes.

    2. Reviewer #2 (Public review):

      Summary:

      The paper by Kim et al. investigates the potential of stimulating the dopaminergic A13 region to promote locomotor restoration in a Parkinson's mouse model. Using wild-type mice, 6-OHDA injection depletes dopaminergic neurons in the substantia nigra pars compacta, without impairing those of the A13 region and the ventral tegmentum area, as previously reported in humans. Moreover, photostimulation of presumably excitatory (CAMKIIa) neurons in the vicinity of the A13 region improves bradykinesia and akinetic symptoms after 6-OHDA injection. Whole-brain imaging with retrograde and anterograde tracers reveals that the A13 region undergoes substantial changes in the distribution of its afferents and projections after 6-OHDA injection, thus suggesting a remodeling of the A13 connectome. Whether this remodelling contributes to pro-locomotor effects of the photostimulation of the A13 region remains unknown as causality was not addressed.

      Strengths:

      Photostimulation of presumably excitatory (CAMKIIa) neurons in the vicinity of the A13 region promotes locomotion and locomotor recovery of wild-type mice 1 month after 6-OHDA injection in the medial forebrain bundle, thus identifying a new potential target for restoring motor functions in Parkinson's disease patients. The study also provides a description of the A13 region connectome pertaining to motor behaviors and how it changes after a dopaminergic lesion. Although there is no causal link between anatomical and behavioral data, it raises interesting questions for further studies.

      Weaknesses:

      Although CAMKIIa is a marker of presumably excitatory neurons and can be used as an alternative marker of dopaminergic neurons, some uncertainty remains regarding the phenotype of neurons underlying recovery of akinesia and improvement of bradykinesia.

      Figure 4 is improved, but the results from the correlation analyses remain difficult to interpret, as they may reflect changes in various impaired brain regions independently of the A13 region. While the analysis offers a snapshot of correlated changes within the connectome, it does not identify which specific cell or axonal populations are actually increasing or decreasing. Although functional MRI connectome analyses are well-established, anatomical data seem less suitable for this purpose. How can one interpret correlated changes in anatomical inputs or outputs between two distinct regions?

      Figure 5 is also improved, but there is room for further enhancement. As currently presented, it is difficult to distinguish the differences between the sham and 6-OHDA groups. The first column could compare afferents, while the second column could compare efferents. Given the small sample size, it would be more appropriate to present individual data rather than the mean and standard deviation.

      Appraisal and impact

      Although the behavioral experiments are convincing, the low number of animals in the anatomical studies is insufficient to make any relevant statistical conclusions due to extremely low statistical power.

    3. Author response:

      The following is the authors’ response to the original reviews

      Public Reviews:

      Reviewer #1 (Public Review):

      Summary:

      This study aimed to investigate the effects of optically stimulating the A13 region in healthy mice and a unilateral 6-OHDA mouse model of Parkinson's disease (PD). The primary objectives were to assess changes in locomotion, motor behaviors, and the neural connectome. For this, the authors examined the dopaminergic loss induced by 6-OHDA lesioning. They found a significant loss of tyrosine hydroxylase (TH+) neurons in the substantia nigra pars compacta (SNc) while the dopaminergic cells in the A13 region were largely preserved. Then, they optically stimulated the A13 region using a viral vector to deliver the channelrhodopsine (CamKII promoter). In both sham and PD model mice, optogenetic stimulation of the A13 region induced pro-locomotor effects, including increased locomotion, more locomotion bouts, longer durations of locomotion, and higher movement speeds. Additionally, PD model mice exhibited increased ipsi lesional turning during A13 region photoactivation. Lastly, the authors used whole-brain imaging to explore changes in the A13 region's connectome after 6-OHDA lesions. These alterations involved a complex rewiring of neural circuits, impacting both afferent and efferent projections. In summary, this study unveiled the pro-locomotor effects of A13 region photoactivation in both healthy and PD model mice. The study also indicates the preservation of A13 dopaminergic cells and the anatomical changes in neural circuitry following PD-like lesions that represent the anatomical substrate for a parallel motor pathway.

      Strengths:

      These findings hold significant relevance for the field of motor control, providing valuable insights into the organization of the motor system in mammals. Additionally, they offer potential avenues for addressing motor deficits in Parkinson's disease (PD). The study fills a crucial knowledge gap, underscoring its importance, and the results bolster its clinical relevance and overall strength.

      The authors adeptly set the stage for their research by framing the central questions in the introduction, and they provide thoughtful interpretations of the data in the discussion section. The results section, while straightforward, effectively supports the study's primary conclusion - the pro-locomotor effects of A13 region stimulation, both in normal motor control and in the 6-OHDA model of brain damage.

      We thank the reviewer for their positive comments.

      Weaknesses:

      (1) Anatomical investigation. I have a major concern regarding the anatomical investigation of plastic changes in the A13 connectome (Figures 4 and 5). While the methodology employed to assess the connectome is technically advanced and powerful, the results lack mechanistic insight at the cell or circuit level into the pro-locomotor effects of A13 region stimulation in both physiological and pathological conditions. This concern is exacerbated by a textual description of results that doesn't pinpoint precise brain areas or subareas but instead references large brain portions like the cortical plate, making it challenging to discern the implications for A13 stimulation. Lastly, the study is generally well-written with a smooth and straightforward style, but the connectome section presents challenges in readability and comprehension. The presentation of results, particularly the correlation matrices and correlation strength, doesn't facilitate biological understanding. It would be beneficial to explore specific pathways responsible for driving the locomotor effects of A13 stimulation, including examining the strength of connections to well-known locomotor-associated regions like the Pedunculopontine nucleus, Cuneiformis nucleus, LPGi, and others in the diencephalon, midbrain, pons, and medulla.

      We initially considered two approaches. The first was to look at specific projections to the motor regions, focusing on the MLR. The second was to utilize a whole-brain analysis, which is presented here. Given what we know about the zona incerta, especially its integrative role, we felt that examining the full connectome was a reasonable starting point.

      The value of the whole-brain approach is that it provides a high-level overview of the afferents and efferents to the region. The changes in the brain that occur following Parkinson-like lesions, such as those in the nigrostriatal pathway, are complex and can affect neighbouring regions such as the A13. Therefore, we wished to highlight the A13, which we considered a therapeutic target, and examine changes in connectivity that could occur following acute lesions affecting the SNc. We acknowledge that this study does not provide a causal link, but it presents the fundamental background information for subsequent hypothesis-driven, focused, region-specific analysis.

      The terms provided were taken from the Allen Brain Atlas terminology and presented as abbreviations. We have added two new figures focusing on motor regions to make the information more comprehensible (new Figures 4 and 5) and rewrote the connectomics section to make it easier to understand.

      Additionally, identifying the primary inputs to A13 associated with motor function would enhance the study's clarity and relevance.

      This is a great point to help simplify the whole-brain results. We have presented the motor-related inputs and outputs as part of a new figure in the main paper (Figure 5) and added accompanying text in the results section. We have also updated the correlation matrices to concentrate on motor regions (Figure 4). This highlights possible therapeutic pathways. We have also enhanced our discussion of these motor-related pathways. We have retained the entire dataset and added it to our data repository for those interested.

      The study raises intriguing questions about compensatory mechanisms in Parkinson's disease and a new perspective on the preservation of dopaminergic cells in A13, despite the SNc degeneration, and the plastic changes to input/output matrices. To gain inspiration for a more straightforward reanalysis and discussion of the results, I recommend the authors refer to the paper titled "Specific populations of basal ganglia output neurons target distinct brain stem areas while collateralizing throughout the diencephalon from the David Kleinfeld laboratory." This could guide the authors in investigating motor pathways across different brain regions.

      Thank you for the advice. As pointed out, Kleinfeld’s group presented their data in a nice, focused way. For the connectomic piece, we have added Figure 5, which provides a better representation than our previous submission.

      (2) Description of locomotor performance. Figure 3 provides valuable data on the locomotor effects of A13 region photoactivation in both control and 6-OHDA mice. However, a more detailed analysis of the changes in locomotion during stimulation would enhance our understanding of the pro-locomotor effects, especially in the context of 6-OHDA lesions. For example, it would be informative to explore whether the probability of locomotion changes during stimulation in the control and 6-OHDA groups. Investigating reaction time, speed, total distance, and could reveal how A13 is influencing locomotion, particularly after 6-OHDA lesions. The laboratory of Whelan has a deep knowledge of locomotion and the neural circuits driving it so these features may be instructive to infer insights on the neural circuits driving movement. On the same line, examining features like the frequency or power of stimulation related to walking patterns may help elucidate whether A13 is engaging with the Mesencephalic Locomotor Region (MLR) to drive the pro-locomotor effects. These insights would provide a more comprehensive understanding of the mechanisms underlying A13-mediated locomotor changes in both healthy and pathological conditions.

      Thank you for these suggestions. We have reorganized Figure 3 to highlight the metrics by separating the 6-OHDA from the Sham experiments (3F-J, which highlights distance travelled, average speed and duration). We have also added additional text to highlight these metrics better in the text. We have relabelled Supplementary Figure S3, which presents reaction time as latency to initiate locomotion and updated the main text to address the reviewers' points.

      Reviewer #2 (Public Review):

      Summary:

      The paper by Kim et al. investigates the potential of stimulating the dopaminergic A13 region to promote locomotor restoration in a Parkinson's mouse model. Using wild-type mice, 6-OHDA injection depletes dopaminergic neurons in the substantia nigra pars compacta, without impairing those of the A13 region and the ventral tegmentum area, as previously reported in humans. Moreover, photostimulation of presumably excitatory (CAMKIIa) neurons in the vicinity of the A13 region improves bradykinesia and akinetic symptoms after 6-OHDA injection. Whole-brain imaging with retrograde and anterograde tracers reveals that the A13 region undergoes substantial changes in the distribution of its afferents and projections after 6-OHDA injection. The study suggests that if the remodeling of the A13 region connectome does not promote recovery following chronic dopaminergic depletion, photostimulation of the A13 region restores locomotor functions.

      Strengths:

      Photostimulation of presumably excitatory (CAMKIIa) neurons in the vicinity of the A13 region promotes locomotion and locomotor recovery of wild-type mice 1 month after 6-OHDA injection in the medial forebrain bundle, thus identifying a new potential target for restoring motor functions in Parkinson's disease patients.

      Weaknesses:

      Electrical stimulation of the medial Zona Incerta, in which the A13 region is located, has been previously reported to promote locomotion (Grossman et al., 1958). Recent mouse studies have shown that if optogenetic or chemogenetic stimulation of GABAergic neurons of the Zona Incerta promotes and restores locomotor functions after 6-OHDA injection (Chen et al., 2023), stimulation of glutamatergic ZI neurons worsens motor symptoms after 6-OHDA (Lie et al., 2022).

      Thank you - we have added this reference. It is helpful as Grossman did stimulate the zona incerta in the cat and elicit locomotion, suggesting that stimulation of the area in normal mice has external validity. Grossman’s results prompted a later clinical examination of the zona incerta, but it concentrated on the zona incerta regions close to the subthalamic regions (Ossowska 2019), further caudal to the area we focused on. Chen et al. (2023) targeted the area in the lateral aspect of central/medial zona incerta, formed by dorsal and ventral zona incerta, which may account for the differing results. Our data were robust for stimulation of the medial aspect of the rostromedial zona incerta. The thigmotactic behaviour that we observed in our work that focused on CamKII neurons has not been observed with chemogenetic, optogenetic activation or with photoinhibition of GABAergic central/medial ZI (Chen et al. 2023).

      GABAergic activation of mZI to Cuneiform projections (Sharma et al. 2024) also did not produce thigmotactic behavior. We have added these points to the discussion.

      Although CAMKIIa is a marker of presumably excitatory neurons and can be used as an alternative marker of dopaminergic neurons, behavioral results of this study raise questions about the neuronal population targeted in the vicinity of the A13 region. Moreover, if YFP and CHR2-YFP neurons express dopamine (TH) within the A13 region (Fig. 2), there is also a large population of transduced neurons within and outside of the A13 region that do not, thus suggesting the recruitment of other neuronal cell types that could be GABAergic or glutamatergic.

      We found that CamKII transfection of the A13 region was extremely effective in promoting locomotor activity, which was critical for our work in exploring its possible therapeutic potential. We have since quantified the cell number, we found that the c-fos cell number was increased following ChR2 activation. There is evidence of TH activation - but the data suggest that other cell types contribute. C-fos alone is a blunt tool to assess specificity - rather, it is better at showing overall photostimulus efficacy - which we have demonstrated. Moreover, there is evidence that cell types are not purely dopaminergic, with GABA co-localized (Negishi et al. 2020). We acknowledge that specific viral approaches that target the GABAergic, glutamatergic, and dopaminergic circuits would be very useful. The range of tools to target A13 dopaminergic circuits is more limited than the SNc, for example, because the A13 region lacks DAT, and TH-IRES-Cre approaches, while helpful, are less specific than DAT-Cre mouse models. Intersectional approaches targeting multiple transmitters (glutamate & dopamine, for example) may be one solution as we do not expect that a single transmitter-specific pathway would work, as well as broad targeting of the A13 region. Our recent work suggests that GABAergic neuron activation may have more general effects on behaviour rather than control of ongoing locomotor parameters (Sharma et al. 2024). Recent work shows a positive valence effect of dopamine A13 activation on motivated food-seeking behavior, which differs from consummatory behavior observed with GABAergic modulation (Ye, Nunez, and Zhang 2023). Chemogenetic inactivation and ablation of dopaminergic A13 revealed that they contribute to grip strength and prehensile movements, uncoupling food-seeking grasping behavior from motivational factors (Garau et al. 2023). Overall, this suggests differing effects of GABA compared to DA and/or glutamatergic cell types, consistent with our effects of stimulating CamKII. The discussion has been updated.

      Regarding the analysis of interregional connectivity of the A13 region, there is a lack of specificity (the viral approach did not specifically target the A13 region), the number of mice is low for such correlation analyses (2 sham and 3 6-OHDA mice), and there are no statistics comparing 6-OHDA versus sham (Fig. 4) or contra- versus ipsilesional sides (Fig. 5). Moreover, the data are too processed, and the color matrices (Fig. 4) are too packed in the current format to enable proper visualization of the data. The A13 afferents/efferents analysis is based on normalized relative values; absolute values should also be presented to support the claim about their upregulation or downregulation.

      Generally, papers using tissue-clearing imaging approaches have low sample sizes due to technical complexity and challenges. The technical challenges of obtaining these data were substantial in both collection and analysis. There are multiple technical complexities arising from dual injections (A13 and MFB coordinates) and targeting the area correctly. The A13 region is difficult to target as it spans only around 300 µm in the anterior-posterior axis. While clearing the brain takes weeks, and light-sheet imaging also takes time, the time necessary to analyze the tissue using whole-brain quantification is labor intensive, especially with a lack of a standardized analysis pipeline from atlas registrations, signal segmentations, and quantifications. The field is still relatively new, requiring additional time to refine pipelines.

      Correlation matrices are often used in analyzing connectivity patterns on a brain-wide scale, as they can identify any observable patterns within a large amount of data. We used correlation matrices to display estimated correlation coefficients between the afferent and efferent proportions from one brain subregion to another across 251 brain regions in total in a pairwise manner (not for hypothesis testing). We provided descriptive statistics (mean and error bars) in the original Figure 5C and G. As mentioned in comments for Reviewer 1, we have now presented the data in revised Figure 4 and 5 that focuses specifically on motor-related pathways to provide information on possible pathways. The has simplified the correlation matrices and highlighted the differences in 6-OHDA efferent data especially. As suggested, raw values are shared in a supplemental file on our data repository.

      In the absence of changes in the number of dopaminergic A13 neurons after 6-OHDA injection, results from this correlation analysis are difficult to interpret as they might reflect changes from various impaired brain regions independently of the A13 region.

      We acknowledge that models of Parkinson’s disease, particularly those using 6-OHDA, induce plasticity in various regions, which may subsequently affect A13 connectivity. We aim to emphasize the residual, intact A13 pathways that could serve as therapeutic targets in future investigations. This emphasis is pertinent in the context of potential clinical applications, as the overall input and output to the region fundamentally dictate the significance of the A13 region in lesioned nigrostriatal models. We agree with the reviewer that the changes certainly can be independent of A13; however, the fact that there was a significant change in the connectome post-6-OHDA injection and striatonigral degeneration is in and of itself important to document. We have added a sentence acknowledging this limitation to the discussion.

      There is no causal link between anatomical and behavioral data, which raises questions about the relevance of the anatomical data.

      This point was also addressed earlier in response to a comment from Reviewer 1. Focusing on specific motor pathways is one avenue to explore. However, given that the zona incerta acts as an integrative hub, we believed it is prudent to initially examine both afferent and efferent pathways using a brain-wide approach. For instance, without employing this methodology, the potential significance of cortical interconnectivity to the A13 region might not have been fully appreciated. As mentioned previously, we will place additional emphasis on motor-related regions in our revised paper, thereby enhancing the relevance of the anatomical data presented. With these modifications, we anticipate that our data will underscore specific motor-related targets for future exploration, employing optogenetic targeting to assess necessity and sufficiency.

      Overall, the study does not take advantage of genetic tools accessible in the mouse to address the direct or indirect behavioral and anatomical contributions of the A13 region to motor control and recovery after 6-OHDA injection.

      Our study has not specifically targeted neurons that express dopaminergic, glutamatergic, or GABAergic properties (refer to earlier comment for more detail). However, like others, we find that targeting one neuronal population often does not result in a pure transmitter phenotype. For instance, evidence suggests co-localization of dopamine neurons with a subpopulation of GABA neurons in the A13/medial zona incerta (Negishi et al. 2020). In the hypothalamus, research by Deisseroth and colleagues (Romanov et al. 2017) indicates the presence of multiple classes of dopamine cells, each containing different ratios of co-localized peptides and/or fast neurotransmitters. Consequently, we believe our work lays the foundation for the investigations suggested by the reviewer. Furthermore, if one considers this work in the context of a preclinical study to determine whether the A13 might be a target in human Parkinson's disease, the existing technology that could be utilized is deep brain stimulation (DBS) or electrical modulation, which would also affect different neuronal populations in a non-specific manner.

      While optogenetic stimulation therapy is longer term, using CamKII combined with the DJ hybrid AAV could be a translatable strategy for targeting A13 neuronal populations in non-human primates (Watakabe et al. 2015; Watanabe et al. 2020). We have added to the discussion.

      Reviewer #3 (Public Review):

      Kim, Lognon et al. present an important finding on pro-locomotor effects of optogenetic activation of the A13 region, which they identify as a dopamine-containing area of the medial zona incerta that undergoes profound remodeling in terms of afferent and efferent connectivity after administration of 6-OHDA to the MFB. The authors claim to address a model of PD-related gait dysfunction, a contentious problem that can be difficult to treat with dopaminergic medication or DBS in conventional targets. They make use of an impressive array of technologies to gain insight into the role of A13 remodeling in the 6-OHDA model of PD. The evidence provided is solid and the paper is well written, but there are several general issues that reduce the value of the paper in its current form, and a number of specific, more minor ones. Also, some suggestions, that may improve the paper compared to its recent form, come to mind.

      Thank you for the suggestions and careful consideration of our work - it is appreciated.

      The most fundamental issue that needs to be addressed is the relation of the structural to the behavioral findings. It would be very interesting to see whether the structural heterogeneity in afferent/effects projections induced by 6-OHDA is related to the degree of symptom severity and motor improvement during A13 stimulation.

      As mentioned in comments for Reviewer 1, we have performed additional analysis and present this in Figure 5. We have also revised Figure 4, focusing on motor regions. Our work will provide a roadmap for future studies to disentangle divergent or convergent A13 pathways that are involved in different or all PD-related motor symptoms. Because we could not measure behavioural change in the same animals studied with the anatomic study (essentially because the optrode would have significantly disrupted the connectome we are measuring), we cannot directly compare behaviour to structure.

      The authors provide extensive interrogation of large-scale changes in the organization of the A13 region afferent and efferent distributions. It remains unclear how many animals were included to produce Fig 4 and 5. Fig S5 suggests that only 3 animals were used, is that correct? Please provide details about the heterogeneity between animals. Please provide a table detailing how many animals were used for which experiment. Were the same animals used for several experiments?

      The behavioral set and the anatomical set were necessarily distinct. In the anatomical experiments, we employed both anterograde and retrograde viral approaches to target the afferent and efferent A13 populations with fluorescent proteins. For the behavioral approach, a single ChR2 opsin was utilized to photostimulate the A13 region; hence combining the two populations was not feasible. We were also concerned that the optrode itself would interfere with connectomics. A lower number of animals were used for the whole-brain work due to technical limitations described earlier. We have now provided additional information regarding numbers in all figures and the text. Using Spearman’s correlation analysis, we found afferent and efferent proportions across animals to be consistent, with an average correlation of 0.91, which is reported in Figure S6.

      While the authors provide evidence that photoactivation of the A13 is sufficient in driving locomotion in the OFT, this pro-locomotor effect seems to be independent of 6-OHDA-induced pathophysiology. Only in the pole test do they find that there seems to be a difference between Sham vs 6-OHDA concerning the effects of photoactivation of the A13. Because of these behavioral findings, optogenic activation of A13 may represent a gain of function rather than disease-specific rescue. This needs to be highlighted more explicitly in the title, abstract, and conclusion.

      Optogenetic activation of A13 may represent a gain of function in both healthy and 6-OHDA mice, highlighting a parallel descending motor pathway that remains intact. 6-OHDA lesions have multiple effects on motor and cognitive function. This makes a single pathway unlikely to rescue all deficits observed in 6-OHDA models. The lack of locomotion observed in 6-OHDA models can be reversed by A13 region photostimulation. Therefore, this is a reversal of a loss of function, in this case. However, the increase in turning represents a gain of function. We have highlighted this as suggested in the discussion.

      The authors claim that A13 may be a possible target for DBS to treat gait dysfunction. However, the experimental evidence provided (in particular the lack of disease-specific changes in the OFT) seems insufficient to draw such conclusions. It needs to be highlighted that optogenetic activation does not necessarily have the same effects as DBS (see the recent review from Neumann et al. in Brain: https://pubmed.ncbi.nlm.nih.gov/37450573/). This is important because ZI-DBS so far had very mixed clinical effects. The authors should provide plausible reasons for these discrepancies. Is cell-specificity, which only optogenetic interventions can achieve, necessary? Can new forms of cyclic burst DBS achieve similar specificity (Spix et al, Science 2021)? Please comment.

      Thank you for the valuable comments. They have been incorporated into the discussion.

      Our study highlights a parallel motor pathway provided by the A13 region that remains intact in 6-OHDA mice and can be sufficiently driven to rescue the hypolocomotor pathology observed in the OFT and overcome bradykinesia and akinesia. The photoactivation of ipsilesional A13 also has an overall additive effect on ipsiversive circling, representing a gain of function on the intact side that contributes to the magnitude of overall motor asymmetry against the lesioned side. The effects of DBS are rather complex, ranging from micro-, meso-, to macro-scales, involving activation, inhibition, and informational lesioning, and network interactions. This could contribute to the mixed clinical effects observed with ZI-DBS, in addition to differences in targeting and DBS programming among the studies (see review (Ossowska 2019) ). Also the DBS studies targeting ZI have never targeted the rostromedial ZI which extends towards the hypothalamus and contains the A13. Furthermore, DBS and electrical stimulation of neural tissue, in general, are always limited by current spread and lower thresholds of activation of axons (e.g., axons of passage), both of which can reduce the specificity of the true therapeutic target. Optogenetic studies have provided mechanistic insights that could be leveraged in overcoming some of the limitations in targeting with conventional DBS approaches. Spix et al. (2021) provided an interesting approach highlighting these advancements. They devised burst stimulation to facilitate population-specific neuromodulation within the external globus pallidus. Moreover, they found a complementary role for optogenetics in exploring the pathway-specific activation of neurons activated by DBS. To ascertain whether A13 DBS may be a viable therapy for PD gait, it will be necessary to perform many more preclinical experiments, and tuning of DBS parameters could be facilitated by optogenetic stimulation in these murine models. We have added to the discussion.

      In a recent study, Jeon et al (Topographic connectivity and cellular profiling reveal detailed input pathways and functionally distinct cell types in the subthalamic nucleus, 2022, Cell Reports) provided evidence on the topographically graded organization of STN afferents and McElvain et al. (Specific populations of basal ganglia output neurons target distinct brain stem areas while collateralizing throughout the diencephalon, 2021, Neuron) have shown similar topographical resolution for SNr efferents. Can a similar topographical organization of efferents and afferents be derived for the A13/ ZI in total?

      The ZI can be subdivided into four subregions in the antero-posterior axis: rostral (ZIr), dorsal (ZId), ventral (ZIv), and caudal (ZIc) regions. The dorsal and ventral ZI is also referred together as central/medial/intermediate ZI. There are topographical gradients in different cell types and connectivity across these subregions (see reviews: (Mitrofanis 2005; Monosov et al. 2022; Ossowska 2019). Recent work by Yang and colleagues (2022) demonstrated a topographical organization among the inputs and outputs of GABAergic (VGAT) populations across four ZI subregions. Given that A13 region encompasses a smaller portion (the medial aspect) of both rostral and medial/central ZI (three of four ZI subregions) and coexpress VGAT, A13 region likely falls under rostral and intermediate medial ZI dataset found in Yang et al. (2022). With our data, we would not be able to capture the breadth of topographical organization shown in Yang et al (2022).

      In conclusion, this is an interesting study that can be improved by taking into consideration the points mentioned above.

      Recommendations for the authors:

      Reviewer #1 (Recommendations For The Authors):

      (1) Figure 2 indeed presents valuable information regarding the effects of A13 region photoactivation. To enhance the comprehensiveness of this figure and gain a deeper understanding of the neurons driving the pro-locomotor effect of stimulation, it would be beneficial to include quantifications of various cell types:

      • cFos-Positive Cells/TH-Positive Cells: it can help determine the impact of A13 stimulation on dopaminergic neurons and the associated pro-locomotor effect in the healthy condition and especially in the context of Parkinson's disease (PD) modeling.

      • cFos-Positive Cells /TH-Negative Cells: Investigating the number of TH-negative cells activated by stimulation is also important, as it may reveal non-dopaminergic neurons that play a role in locomotor responses. Identifying the location and characteristics of these TH-negative cells can provide insights into their functional significance.

      We have completed this analysis. The data is presented in Figure 2F, where we show increased c-fos intensity with photoactivation. We observed an increase in the number of cells activated in the A13 region. However, we did not definitively see increases in TH+ cells, suggesting a heterogeneous set of neurons responsible for the effects—possibly glutamatergic neurons.

      Incorporating these quantifications into Figure 2 would enhance the figure's informativeness and provide a more comprehensive view of the neuronal populations involved in the locomotor effects of A13 stimulation.

      We have added text and a new graph.

      (2) Refer to Figure 3. In the main text (page 5) when describing the animal with 6-OHDA the wrong panels are indicated. It is indicated in Figure 2A-E but it should be replaced with 3A-E.

      Please do that.

      Done, and we have updated the figure to improve readability, by separating the 6-OHDA findings from sham in all graphs.

      Reviewer #2 (Recommendations For The Authors):

      Abstract

      Page 1: Inhibitory or lesion studies will be necessary to support the claim that the global remodeling of afferent and efferent projections of the A13 region highlights the Zona Incerta's role as a crucial hub for the rapid selection of motor function.

      Overall, there is quite a bit of evidence that the zona incerta is a hub for afferent/efferents.

      Mitrofanis (2005) and, more recently, Wang et al. (2020) summarize some of the evidence. Yang (2022) illustrates that the zona incerta shows multiple inputs to GABAergic neurons and outputs to diverse regions. Recent work suggests that the zona incerta contributes to various motor functions such as hunting, exploratory locomotion, and integrating multiple modalities (Zhao et al. 2019; Wang et al. 2019; Monosov et al. 2022; Chometton et al. 2017). The introduction has been updated.

      Introduction

      Page 2, paragraph 2: "However, little attention has been placed on the medial zona incerta (mZI), particularly the A13, the only dopamine-containing region of the rostral ZI" Is the A13 region located in the rostral or medial ZI or both?

      It should have been written “rostromedial” ZI. The A13 is located in the medial aspect of rostromedial ZI. Introduction has been updated.

      Page 2, para 3: Li et al (2021) used a mini-endoscope to record the GCaMP6 signal. Masini and Kiehn, 2022 transiently blocked the dopaminergic transmission; they never used 6-OHDA.

      Please correct through the text.

      Corrected.

      Page 2, para 4: the A13 connectome encompasses the cerebral cortex,... MLR. The MLR is a functional region, correct this for the CNF and PPN.

      Corrected.

      Page 3, the last paragraph of the introduction could be clarified by presenting the behavioral data first, followed by the anatomy.

      This has been corrected

      Figure 1 is nice and clear, and well summarizes the experimental design.

      Thank you.

      Figure 2 shows an example of the extent of the ChR2-YFP expression and the position of an optical fiber tip above the dopaminergic A13 region from a mouse. Without any quantification, these images could be included in Figure 1. Despite a very small volume (36.8nL) of AAV, the extent of ChR2-YFP expression is quite large and includes dopaminergic and unidentified neurons within the A13 region but also a large population of unidentified neurons outside of it, thus raising questions about the volume and the types of neurons recruited.

      This is an important consideration. The issue of viral spread is complex and depends on factors including tissue type, serotype, and promotor of the virus. Li et al. (2021), for example, used different virus serotypes and promotors, injecting 150nL, whereas we used AAV DJ, injecting 36.8nL. AAV-DJ is a hybrid viral type consisting of multiple serotypes. It has a high transduction efficiency, which leads to greater gene delivery than single-serotype AAV viral constructs (Mao et al. 2016). A secondary consideration regarding translation was that AAV-DJ could effectively transduce non-primate neurons (Watanabe et al. 2020). We have addressed the issue of neurons recruited earlier, provided c-Fos quantification, and provided a new supplementary figure showing viral spread (Figure S1).

      Anatomical reconstruction of the extent of the ChR2-YFP expression and the location of the tip of the optical fiber will be necessary to confirm that ChR2-YFP expression was restricted to the A13 region.

      We will provide additional information regarding viral spread, ferrule tip placement, and c-fos cell counts. This has been done in Figure 2 and we also present a new Figure S1 where we have quantified the viral spread.

      Page 5, 1st para: Double-check the references, as not all of them are 6-OHDA injections in the MLF.

      Corrected. Removed Kiehn reference.

      Page 5, 1st para, 4th line: Replace ferrule with optical canula or fiber.

      Done

      Page 5, 1st para, 9th line: Replace Figure 2 with Figure 3.

      Done

      Page 5, 2nd para: About the refractory decrease in traveled distance by sham-ChR2 mice: is this significant?

      It was not significant (Figure S1C, 1-way RM ANOVA: F5,25 = 0.486, P \= 0.783). This has been updated in the text.

      Figure 3 showing behavioral assessments is nice, but the stats are not always clear. In Fig 3A, are each of the off and on boxes 1 minute long? The figure legend states the test lasts 1 min, but isn't it 4 minutes? In Figure 3B-E and 3J-M, what are the differences? Do the stats identify a significant difference only during the stimulation phase? Fig. 3F-I are nice and could have been presented as primary examples prior to data analysis in Fig. 3B-E. Group labels above the graph would help.

      Yes, the off-on boxes are 1 minute long. The error is corrected in the legend. Great suggestion for F-I - they have been moved ahead of the summary figures. We have also updated new Fig 3F-,I, J, L, M) to make the differences between 6-OHDA and sham graphs easier to visualize. The stats do indicate a significant difference during the stimulation phase. We have added group labels, and reorganized the figure, and it is much easier to read now.

      Fig. 3L-M, what do PreSur, Post, and Ferrule mean? I assume that Ferrule refers to mice tested with the optical fiber without stimulation, whereas Stim. refers to the stimulation. It would be helpful to standardize the format of stats in Fig. 3B-E and 3-J-M. What are time points a, b, and c referring to?

      We have renamed the figure names to be more intuitive. We have standardized the presentation of statistics in the figure, and eliminated the a,b,c nomenclature. We have also updated the caption to provide descriptions of the tests in Fig 3 L-M.

      Figure S2A: the higher variability in 6-OHDA-YFP mice in comparison to 6-OHDA-ChR2 mice prior to stimulation suggests that 6-OHDA-YFP mice were less impaired. Why use boxplots only for these data? Would a pairwise comparison be more appropriate?

      We have removed these plots from Figure S2. We now present the Baseline to Pre values across the experimental timespan to illustrate the fact that distance travelled returned to baseline values for all trials conducted.

      Fig. S2B: add the statistical marker.

      We have removed this from Figure S2.

      Page 7, para 1, line 8: to add "in comparison to 6-OHDA-YFP and YFP mice" to during photostimulation... (Figure 3E).

      Done

      Page 7, para 3, line 5: about larger improvement, replace "sham ChR2" with "6-OHDA."

      Done

      Page 8, para 1, line 4: Perier et al., 2000 reported that 6-OHDA injection increased the firing frequency of the ZI over a month.

      Added the timeframe to this sentence.

      Page 8, para 2, line 1: Since the results were expected, add some references.

      Done.

      Page 8, para 3, line 4. Double-check the reference.

      Corrected.

      Page 8: About large-scale changes in the A13 region, the relevance of correlation matrices is difficult to grasp. Analysis of local connectivity would have been more informative in the context of GABAergic and glutamatergic neurons of the ZI in the vicinity of the A13 region.

      We have updated the figures for connectivity throughout the manuscript. Overall, there are new Figures 4 and 5 in the main text. We also provide a revised Supplementary Figure 8. Unfortunately, we could not do that experiment regarding local connectivity. In light of our new work (Sharma et al. 2024), it is clear that this will be critical going forward.

      Page 8, para 3, line: given Fig. 2, there is concern about the claim that only the A13 region was targeted. The time of the analysis after 6-OHDA should be mentioned. Some sections of the paragraph could be moved to methods.

      We have provided more information about the viral spread in the text and Supplementary Figure 1. The functional and anatomical experiments are separate, which we realize caused confusion. We have mentioned analysis time after 6-OHDA and inserted this into the text.

      Fig. 4: The color code helps the reader visualize distribution differences. However, statistical analyses comparing 6-OHDA versus sham should be included. Quantification per region would greatly help readers visualize the data and support the conclusion. The relationship between the type of correlation (positive or negative) and absolute change (increase or decrease) is unknown in the current format, which limits the interpretation of the data. Moreover, examples of raw images of axons and cells should be presented for several brain regions. The experimental design with a timeline, as in Fig. 1, would be helpful. The legend for Fig. 4 is a bit long. Some sections are very descriptive, whereas others are more interpretive.

      We have provided a new Figure 5 where we present quantification per region, and the correlation matrices have been updated in Figure 4. We have also focused on motor regions as mentioned earlier. We also provide examples of raw regions in Supplementary Figure 8. Raw values are shared on our data repository.

      Page 10, para 1, line 1: add "afferent" to "changes in -afferent and- projection patterns."

      Done

      Page 10, para 1, line 9: remove the 2nd "compared to sham" in the sentence.

      Done

      Page 10, para 1, line 10: remove "coordinated" in "several regions showed a coordinated reduction in afferent density." We cannot say anything about the timing of events, as there is only info at 1 month.

      Done

      Page 10, para 2: the section should be written in the past tense.

      Done

      Page 13, para 2, the last sentence is overstated. Please remove "cells" and refer to the A13 region instead.

      Done

      About differential remodelling of the A13 region connectome: Figure 5C and 5G: The proportion of total afferents ipsi- and contralateral to 6-OHDA injection argues that the A13 region primarily receives inputs from the cortical plate and the striatum. Unfortunately, there are no statistics.

      Due to the small sample size, we provided descriptive statistics (mean and error bars) in Figure 5A. As mentioned in comments for Reviewers 1 and 2, we have revised Figure 5 to present data focusing on motor-related pathways to provide clarity. In addition, absolute values are shared on our data repository.

      Figure 5 D and 5H: Changes in the proportion of total afferents/projections are relatively modest (less than 10% of the whole population for the highest changes). There is no standard deviation for these data and no statistics. Do they reflect real changes or variability from the injection site?

      The changes are relatively modest (less than 10%) since a small brain region usually provides a small proportion of total input (McElvain et al. 2021; Yang et al. 2022). The changes in the proportions reflect real differences between average proportions observed in sham and 6-OHDA mice. The variability in the total labelling of neurons and fibers was minimized by normalizing individual regional counts against total counts found in each animal. This figure has been updated as reviewers requested.

      Fig 5F and H: The example in F shows a huge decrease in the striatum, but H indicates only a 2% change, which makes the example not very representative. Absolute values would be helpful.

      While a 2% change may seem small, it represents a relatively large change in the A13 efferent connectome. To provide further clarity, we have provided absolute values as suggested in our new supplemental table.

      Figure 6 is inaccurate and unnecessary.

      Figure 6 has been removed.

      Discussion

      Although interesting, the discussion is too long.

      The discussion has been reduced by about three quarters of a page.

      Methods

      Page 17, para 1: include the stereotaxic coordinates of the optical cannula above the A13 region.

      Added.

      References

      Chen, Fenghua, Junliang Qian, Zhongkai Cao, Ang Li, Juntao Cui, Limin Shi, and Junxia Xie. 2023. “Chemogenetic and Optogenetic Stimulation of Zona Incerta GABAergic Neurons Ameliorates Motor Impairment in Parkinson’s Disease.” i Science 26 (7). https://doi.org/ 10.1016/j.isci.2023.107149.

      Chometton, S., K. Charrière, L. Bayer, C. Houdayer, G. Franchi, F. Poncet, D. Fellmann, and P. Y. Risold. 2017. “The Rostromedial Zona Incerta Is Involved in Attentional Processes While Adjacent LHA Responds to Arousal: C-Fos and Anatomical Evidence.” Brain Structure & Function 222 (6): 2507–25.

      Garau, Celia, Jessica Hayes, Giulia Chiacchierini, James E. McCutcheon, and John Apergis-Schoute. 2023. “Involvement of A13 Dopaminergic Neurons in Prehensile Movements but Not Reward in the Rat.” Current Biology: CB, October.

      https://doi.org/ 10.1016/j.cub.2023.09.044.

      Li, Zhuoliang, Giorgio Rizzi, and Kelly R. Tan. 2021. “Zona Incerta Subpopulations Differentially Encode and Modulate Anxiety.” Science Advances 7 (37): eabf6709.

      Mao, Yingying, Xuejun Wang, Renhe Yan, Wei Hu, Andrew Li, Shengqi Wang, and Hongwei Li. 2016. “Single Point Mutation in Adeno-Associated Viral Vectors -DJ Capsid Leads to Improvement for Gene Delivery in Vivo.” BMC Biotechnology 16 (January):1.

      McElvain, Lauren E., Yuncong Chen, Jeffrey D. Moore, G. Stefano Brigidi, Brenda L. Bloodgood, Byung Kook Lim, Rui M. Costa, and David Kleinfeld. 2021. “Specific Populations of Basal Ganglia Output Neurons Target Distinct Brain Stem Areas While Collateralizing throughout the Diencephalon.” Neuron 109 (10): 1721–38.e4.

      Mitrofanis, J. 2005. “Some Certainty for the ‘Zone of Uncertainty’? Exploring the Function of the Zona Incerta.” Neuroscience 130 (1): 1–15.

      Monosov, Ilya E., Takaya Ogasawara, Suzanne N. Haber, J. Alexander Heimel, and Mehran Ahmadlou. 2022. “The Zona Incerta in Control of Novelty Seeking and Investigation across Species.” Current Opinion in Neurobiology 77 (December):102650.

      Negishi, Kenichiro, Mikayla A. Payant, Kayla S. Schumacker, Gabor Wittmann, Rebecca M.  Butler, Ronald M. Lechan, Harry W. M. Steinbusch, Arshad M. Khan, and Melissa J. Chee. 2020. “Distributions of Hypothalamic Neuron Populations Coexpressing Tyrosine Hydroxylase and the Vesicular GABA Transporter in the Mouse.” The Journal of Comparative Neurology 528 (11): 1833–55.

      Ossowska, Krystyna. 2019. “Zona Incerta as a Therapeutic Target in Parkinson’s Disease.” Journal of Neurology. https://doi.org/ 10.1007/s00415-019-09486-8.

      Romanov, Roman A., Amit Zeisel, Joanne Bakker, Fatima Girach, Arash Hellysaz, Raju Tomer, Alán Alpár, et al. 2017. “Molecular Interrogation of Hypothalamic Organization Reveals Distinct Dopamine Neuronal Subtypes.” Nature Neuroscience 20 (2): 176–88.

      Sharma, Sandeep, Cecilia A. Badenhorst, Donovan M. Ashby, Stephanie A. Di Vito, Michelle A. Tran, Zahra Ghavasieh, Gurleen K. Grewal, Cole R. Belway, Alexander McGirr, and Patrick J. Whelan. 2024. “Inhibitory Medial Zona Incerta Pathway Drives Exploratory Behavior by Inhibiting Glutamatergic Cuneiform Neurons.” Nature Communications 15 (1): 1160.

      Spix, Teresa A., Shruti Nanivadekar, Noelle Toong, Irene M. Kaplow, Brian R. Isett, Yazel  Goksen, Andreas R. Pfenning, and Aryn H. Gittis. 2021. “Population-Specific Neuromodulation Prolongs Therapeutic Benefits of Deep Brain Stimulation.” Science 374 (6564): 201–6.

      Wang, Xiyue, Xiaolin Chou, Bo Peng, Li Shen, Junxiang J. Huang, Li I. Zhang, and Huizhong W. Tao. 2019. “A Cross-Modality Enhancement of Defensive Flight via Parvalbumin Neurons in Zona Incerta.” eLife 8 (April). https://doi.org/ 10.7554/eLife.42728.

      Wang, Xiyue, Xiao-Lin Chou, Li I. Zhang, and Huizhong Whit Tao. 2020. “Zona Incerta: An Integrative Node for Global Behavioral Modulation.” Trends in Neurosciences 43 (2): 82–87.

      Watakabe, Akiya, Masanari Ohtsuka, Masaharu Kinoshita, Masafumi Takaji, Kaoru Isa, Hiroaki Mizukami, Keiya Ozawa, Tadashi Isa, and Tetsuo Yamamori. 2015. “Comparative Analyses of Adeno-Associated Viral Vector Serotypes 1, 2, 5, 8 and 9 in Marmoset, Mouse and Macaque Cerebral Cortex.” Neuroscience Research 93 (April):144–57.

      Watanabe, Hidenori, Hiromi Sano, Satomi Chiken, Kenta Kobayashi, Yuko Fukata, Masaki  Fukata, Hajime Mushiake, and Atsushi Nambu. 2020. “Forelimb Movements Evoked by Optogenetic Stimulation of the Macaque Motor Cortex.” Nature Communications 11 (1): 3253.

      Yang, Yang, Tao Jiang, Xueyan Jia, Jing Yuan, Xiangning Li, and Hui Gong. 2022. “Whole-Brain Connectome of GABAergic Neurons in the Mouse Zona Incerta.” Neuroscience Bulletin 38 (11): 1315–29.

      Ye, Qiying, Jeremiah Nunez, and Xiaobing Zhang. 2023. “Zona Incerta Dopamine Neurons Encode Motivational Vigor in Food Seeking.” bioRxiv: The Preprint Server for Biology, June. https://doi.org/ 10.1101/2023.06.29.547060.

      Zhao, Zheng-Dong, Zongming Chen, Xinkuan Xiang, Mengna Hu, Hengchang Xie, Xiaoning Jia, Fang Cai, et al. 2019. “Zona Incerta GABAergic Neurons Integrate Prey-Related Sensory Signals and Induce an Appetitive Drive to Promote Hunting.” Nature Neuroscience 22 (6): 921–32.

    1. Author response:

      The following is the authors’ response to the previous reviews

      Public Reviews:

      Reviewer #2 (Public review):

      Summary

      In this extensive comparative study, Moreno-Borrallo and colleagues examine the relationships between plasma glucose levels, albumin glycation levels, diet and lifehistory traits across birds. Their results confirmed the expected positive relationship between plasma blood glucose level and albumin glycation rate but also provided findings that are somewhat surprising or contrast with findings of some previous studies (positive relationships between blood glucose and lifespan, or absent relationships between blood glucose and clutch mass or diet). This is the first extensive comparative analysis of glycation rates and their relationships to plasma glucose levels and life history traits in birds that is based on data collected in a single study, with blood glucose and glycation measured using unified analytical methods (except for blood glucose data for 13 species collected from a database).

      Strengths

      This is an emerging topic gaining momentum in evolutionary physiology, which makes this study a timely, novel and important contribution. The study is based on a novel data set collected by the authors from 88 bird species (67 in captivity, 21 in the wild) of 22 orders, except for 13 species, for which data were collected from a database of veterinary and animal care records of zoo animals (ZIMS). This novel data set itself greatly contributes to the pool of available data on avian glycemia, as previous comparative studies either extracted data from various studies or a ZIMS database (therefore potentially containing much more noise due to different methodologies or other unstandardised factors), or only collected data from a single order, namely Passeriformes. The data further represents the first comparative avian data set on albumin glycation obtained using a unified methodology. The authors used LC-MS to determine glycation levels, which does not have problems with specificity and sensitivity that may occur with assays used in previous studies. The data analysis is thorough, and the conclusions are substantiated. Overall, this is an important study representing a substantial contribution to the emerging field evolutionary physiology focused on ecology and evolution of blood/plasma glucose levels and resistance to glycation.

      Weaknesses

      Unfortunately, the authors did not record handling time (i.e., time elapsed between capture and blood sampling), which may be an important source of noise because handling-stress-induced increase in blood glucose has previously been reported. Moreover, the authors themselves demonstrate that handling stress increases variance in blood glucose levels. Both effects (elevated mean and variance) are evident in Figure ESM1.2. However, this likely makes their significant findings regarding glucose levels and their associations with lifespan or glycation rate more conservative, as highlighted by the authors.

      Recommendations for the authors:

      Reviewer #2 (Recommendations for the authors):

      I understand that your main objective regarding glycation rate and lifespan, was to analyse the species resistance to glycation with respect to lifespan, while factoring out the species-specific variation in blood glucose level. However, I still believe that the absolute glycation level (i.e., not controlled for blood glucose level) may also be important for the evolution of lifespan. Given that blood glucose is positively related to both glycation and lifespan (although with a plateau in the latter case), lifespan could possibly be positively correlated with absolute glycation levels. If significant, that would be an interesting and counterintuitive finding, which would call for an explanation, thereby potentially stimulating further research. If not significant, it would show that long-lived species do not have higher glycation levels, despite having higher blood glucose levels, thereby strengthening your argument about higher resistance of longlived species to glycation. So, in my opinion, the inclusion of an additional model of glycation level on life-history traits, without controlling for blood glucose, is worth considering.

      We include now this model as supplementary material, indicating it in several parts of the text, including some of these issues we discussed here.

      Lines 230-231: Please, provide a citation for these GVIF thresholds

      We include it now.

      Figure 3: I think that showing both glucose and glycation rate on the linear scale, rather than log scale, would better illustrate your conclusion - the slowing rise of glycation rate with increasing glucose levels.

      That is a good point, although it may also be confusing for readers to see a graph that represents the data in a different way as the models. Maybe showing both graphs (as 3.A and 3.B) can solve it?

      Figure 4. I recommend stating in the caption that the whiskers do not represent interquartile ranges (a standard option in box plots) but credible intervals as mentioned in the current version of the public author response.

      Sorry about that, it was missed. Now it is included. Nevertheless, interquartile ranges from the posterior distributions can still be observed here represented with the boxes. Then the whiskers are the credible intervals.

    1. Author response:

      The following is the authors’ response to the original reviews

      Public Reviews:

      Reviewer #1 (Public review):

      Summary:

      In this manuscript, Guo and colleagues used a cell rounding assay to screen a library of compounds for inhibition of TcdB, an important toxin produced by Clostridioides difficile. Caffeic acid and derivatives were identified as promising leads, and caffeic acid phenethyl ester (CAPE) was further investigated.

      Strengths:

      Considering the high morbidity rate associated with C. difficile infections (CDI), this manuscript presents valuable research in the investigation of novel therapeutics to combat this pressing issue. Given the rising antibiotic resistance in CDI, the significance of this work is particularly noteworthy. The authors employed a robust set of methods and confirmatory tests, which strengthened the validity of the findings. The explanations provided are clear, and the scientific rationale behind the results is well-articulated. The manuscript is extremely well-written and organized. There is a clear flow in the description of the experiments performed. Also, the authors have investigated the effects of CAPE on TcdB in careful detail and reported compelling evidence that this is a meaningful and potentially useful metabolite for further studies.

      Weaknesses:

      This is really a manuscript about CAPE, not caffeic acid, and the title should reflect that. Also, a few details are missing from the description of the experiments. The authors should carefully revise the manuscript to ascertain that all details that could affect the interpretation of their results are presented clearly. Just as an example, the authors state in the results section that TcdB was incubated with compounds and then added to cells. Was there a wash step in between? Could compound carryover affect how the cells reacted independently from TcdB? This is just an example of how the authors should be careful with descriptions of their experimental procedures. Lastly, authors should be careful when drawing conclusions from the analysis of microbiota composition data. Ascribing causality to correlational relationships is a recurring issue in the microbiome field. Therefore, I suggest authors carefully revise the manuscript and tone down some statements about the impact of CAPE treatment on the gut microbiota.

      Thanks for your constructive suggestion. We have carefully revised the manuscript, including the description of title, results and methods sections.

      Reviewer #2 (Public review):

      Summary:

      This work is towards the development of nonantibiotic treatment for C. difficile. The authors screened a chemical library for activity against the C. difficile toxin TcdB, and found a group of compounds with antitoxin activity. Caffeic acid derivatives were highly represented within this group of antitoxin compounds, and the remaining portion of this work involves defining the mechanism of action of caffeic acid phenethyl ester (CAPE) and testing CAPE in mouse C. difficile infection model. The authors conclude CAPE attenuates C. difficile disease by limiting toxin activity and increasing microbial diversity during C. difficile infection.

      Strengths/ Weaknesses:

      The strategy employed by the authors is sound although not necessarily novel. A compound that can target multiple steps in the pathogenies of C. difficile would be an exciting finding. However, the data presented does not convincingly demonstrate that CAPE attenuates C. difficile disease and the mechanism of action of CAPE is not convincingly defined. The following points highlight the rationale for my evaluation.

      (1) The toxin exposure in tissue culture seems brief (Figure 1). Do longer incubation times between the toxin and cells still show CAPE prevents toxin activity?

      Thanks for your comments. The cytotoxicity assay was employed to directly assess the protective capacity of CAPE against cell death induced by TcdB. Our observations at 1 and 12 h post-TcdB exposure revealed that CAPE effectively mitigated the toxic effects of the TcdB at both time points, demonstrating its potent protective role. Please see Figure S1.

      (2) The conclusion that CAPE has antitoxin activity during infection would be strengthened if the mouse was pretreated with CAPE before toxin injections (Figure 1D).

      Thanks for your constructive comments. According to your suggestion, we administered TcdB 2 h after pretreatment with CAPE. The outcomes demonstrated that CAPE pretreatment significantly enhanced the survival rate of the intoxicated mice, confirming that CAPE retains its antitoxin efficacy during the infection process. Please see Figure S2.

      (3) CAPE does not bind to TcdB with high affinity as shown by SPR (Figure 4). A higher affinity may be necessary to inhibit TcdB during infection. The GTD binds with millimolar affinity and does not show saturable binding. Is the GTD the binding site for CAPE? Auto processing is also affected by CAPE indicating CAPE is binding non-GTD sites on TcdB.

      Thanks for your comments. Our findings indicate that the GTD domain is a critical binding site for CAPE. CAPE exerts its protective effects at multiple stages of TcdB-mediated cell death, including inhibiting TcdB's self-cleavage and blocking the activity of GTD, thereby preventing the glycosylation modification of Rac1 by TcdB.

      (4) In the infection model, CAPE does not statistically significantly attenuate weight loss during C. difficile infection (Figure 6). I recognize that weight loss is an indirect measure of C. difficile disease but histopathology also does not show substantial disease alleviation (see below).

      Thanks for your comments. Our comparative analysis revealed a notable distinction in the body weight of mice on the third day post-infection (Figure 6B). Similarly, the dry/wet stool ratio exhibited a comparable pattern, suggesting that treatment with phenethyl caffeic acid ameliorated Clostridium difficile-induced diarrhea to a significant degree (Figure 6C).

      (5) In the infection model (Figure 6), the histopathology analysis shows substantial improvement in edema but limited improvement in cellular infiltration and epithelial damage. Histopathology is probably the most critical parameter in this model and a compound with disease-modifying effects should provide substantial improvements.

      Thanks for your comments. Edema, inflammatory factor infiltration, and epithelial damage served as key evaluation metrics. Statistical analysis revealed that the pathological scores of mice treated with CAPE were markedly reduced compared to those in the model group (Figure 6F).

      (6) The reduction in C. difficile colonization is interesting. It is unclear if this is due to antitoxin activity and/or due to CAPE modifying the gut microbiota and metabolites (Figure 6). To interpret these data, a control is needed that has CAPE treatment without C. difficile infection or infection with an atoxicogenic strain.

      The observed reduction in C. difficile fecal colonization following drug treatment may be attributed to the CAPE's antitoxin properties or its capacity to modify the intestinal microbiota and metabolites. These two mechanisms likely work in tandem to combat CDI. CDI is primarily triggered by the toxins A (TcdA) and B (TcdB) secreted by the bacterium. Certain therapies, including monoclonal antibodies like bezlotoxumab, target CDI by neutralizing these toxins, thereby mitigating gut damage and subsequent C. difficile colonization(1,2). The establishment of C. difficile in the gut is intricately linked to the equilibrium of the intestinal microbiota. Although antibiotic treatments can inhibit C. difficile growth, they may also disrupt the microbial balance, potentially facilitating the overgrowth of other pathogens. Consequently, interventions such as fecal microbiota transplantation (FMT) are designed to reestablish gut flora balance and consequently decrease C. difficile colonization(3,4). Moreover, the administration of probiotics and prebiotics is considered to reduce C. difficile colonization by modifying the gut environment(5,6).

      (7) Similar to the CAPE data, the melatonin data does not display potent antitoxin activity and the mouse model experiment shows marginal improvement in the histopathological analysis (Figure 9). Using 100 µg/ml of melatonin (~ 400 micromolar) to inactivate TcdB in cell culture seems high. Can that level be achieved in the gut?

      The uptake and dissemination of melatonin within the body varies with the dose administered. For instance, in rats, the bioavailability of melatonin following administration was found to be 53.5%, whereas in dogs, bioavailability was nearly complete (100%) at a dose of 10 mg/kg, yet it decreased to 16.9% at a lower dose of 1 mg/kg(7). This data suggests that the absorption of melatonin differs across various animal species and is influenced by the dose administered. Moreover, it underscores the higher potential bioavailability of melatonin, implying that a dose of 200 mg/kg should be adequate to achieve the desired concentration in the body post-administration.

      (8) The following parameters should be considered and would aid in the interpretation of this work. Does CAPE directly affect the growth of C. difficile? Does CAPE affect the secretion of TcdB from C. difficile? Does CAPE alter the sporulation and germination of C. diffcile?

      We incorporated CAPE into the MIC assay for detecting C. difficile, as well as for assessing the sporulation capacity of C. difficile and evaluating the secretion level of TcdB. The findings revealed that CAPE markedly repressed tcdB transcription at a concentration of 16 μg/mL and effectively suppressed the growth and sporulation of C. difficile BAA-1870 at a concentration of 32 μg/mL. Please see Figure S3.

      References:

      (1) Skinner AM, et al. Efficacy of bezlotoxumab to prevent recurrent Clostridioides difficile infection (CDI) in patients with multiple prior recurrent CDI. Anaerobe. 2023 Dec; 84: 102788.

      (2) Wilcox MH, et al. Bezlotoxumab for Prevention of Recurrent Clostridium difficile Infection. N Engl J Med. 2017 Jan 26;376(4):305-317.

      (3) Khoruts A, Sadowsky MJ. Understanding the mechanisms of faecal microbiota transplantation. Nat Rev Gastroenterol Hepatol. 2016 Sep;13(9):508-16.

      (4) Khoruts A, Staley C, Sadowsky MJ. Faecal microbiota transplantation for Clostridioides difficile: mechanisms and pharmacology. Nat Rev Gastroenterol Hepatol. 2021 Jan;18(1):67-80.

      (5) Mills JP, Rao K, Young VB. Probiotics for prevention of Clostridium difficile infection. Curr Opin Gastroenterol. 2018 Jan;34(1):3-10.

      (6) Lau CS, Chamberlain RS. Probiotics are effective at preventing Clostridium difficile-associated diarrhea: a systematic review and meta-analysis. Int J Gen Med. 2016 Feb 22; 9:27-37.

      (7) Yeleswaram K, et al. Pharmacokinetics and oral bioavailability of exogenous melatonin in preclinical animal models and clinical implications. J Pineal Res. 1997 Jan;22(1):45-51.

      Reviewer #3 (Public review):

      Summary:

      The study is well written, and the results are solid and well demonstrated. It shows a field that can be explored for the treatment of CDI.

      Strengths:

      The results are really good, and the CAPE shows a good and promising alternative for treating CDI. The methodology and results are well presented, with tables and figures that corroborate them. It is solid work and very promising.

      Weaknesses:

      Some references are too old or missing.

      Thanks for your constructive suggestion. We have included and refreshed several references to enhance the manuscript.

      Recommendations for the authors:

      Reviewer #1 (Recommendations for the authors):

      While the manuscript convincingly demonstrates that CAPE affects the TcdB toxin and reduces its toxicity in vitro, it would be beneficial to include data on the effect of CAPE on the growth of C. difficile. This would help ensure that the observed in vivo effects are not merely due to reduced bacterial growth but rather due to the specific action of CAPE on the toxin.

      Thanks for your constructive suggestion. We have augmented our findings with the impact of CAPE on the bacteria themselves, revealing that CAPE not only hampers the growth of the bacterial cells but also suppresses their capacity to produce spores. Please see Figure S3.

      (1) Line 41, line 115 - authors should clarify what they mean when mentioning Bacteroides within parentheses.

      Thanks for your comments. We have completed the corresponding modifications according to the suggestions.

      (2) Line 71 - Is C. difficile really found "in the environment"?

      Thanks for your comments. C. difficile is prevalent across various natural settings, including soil and water ecosystems. A study has identified highly diverse strains of this bacterium within environmental samples(1). Moreover, the significant presence of C. difficile in soil and lawn specimens collected near Australian hospitals indicates that the organism is indeed a common inhabitant in the environment(2).

      (3) Lines 128-130 - Was there a wash step here? What could be the impact of compound carryover in this experiment?

      Thanks for your comments. Following pre-incubation of TcdB with CAPE, remove the compounds that have not bound to TcdB through centrifugation. The persistence of the compound in the culture post-washing could result in an inflated assessment of its efficacy, particularly if it continues to engage with TcdB or the cells beyond the initial 1-hour pre-incubation window. The carryover of the compound might also give rise to misleading positive results, where the compound seems to confer protection or inhibition against TcdB-mediated cell rounding, whereas such effects are actually due to the lingering activity of the compound. This carryover could skew the determination of the compound's minimum effective concentration, as the effective concentration interacting with the cells might be inadvertently elevated. Furthermore, if the compounds possess cytotoxic properties or impact cell viability, carryover could generate artifacts in cell morphology that are unrelated to the direct interaction between TcdB and the compounds.

      (4) Lines 133-134 - I suggest authors mention how many caffeic acid derivatives there were in the entire library so that the suggested "enrichment" of them in the group of bioactive compounds can be better judged.

      Thanks for your comments. The natural compound library contained eight caffeic acid derivatives, of which methyl caffeic acid and ferulic acid displayed no efficacy. This information has been incorporated into the manuscript.

      (5) Line 135 - I recommend the authors add the molarity of the compound solutions used.

      Thanks for your comments. We have completed the corresponding modifications according to the suggestions.

      (6) Line 247 - I think the term "CAPE mice" is confusing. Please use a full description.

      Thanks for your comments. We have completed the corresponding modifications according to the suggestions.

      (7) Line 248 - I also think the terms "model mice" and "model group" are confusing. Maybe call them "control mice"?

      Thanks for your comments. The terms "model mice" and "model group" are indeed synonymous, and we have subsequently clarified that control mice refer to those that have not been infected with C. difficile.

      (8) Line 273 - "most abundant species at the genus level" is incorrect. I think what you mean is "most abundant TAXA".

      Thanks for your comments. We have completed the corresponding modifications according to the suggestions.

      (9) Line 278 - Please include your p-value cut-off together with the LDA score.

      Thanks for your comments. We have revised the above description to “LDA score > 3.5, p < 0.05”.

      (10) Line 292 - Details on how metabolomics was performed should be included here.

      Thanks for your comments. We have completed the corresponding modifications according to the suggestions.

      (11) Line 299 - 1.5 is a fairly low cut-off. The authors should at a minimum also include the p-value cut-off used.

      Response: Thanks for your comments. We have revised the above description to “fold change > 1.5, p < 0.05”.

      (12) Line 307 - Purine "degradation" would be better here.

      Thanks for your comments. We have completed the corresponding modifications according to the suggestions.

      (13) Line 328 onward - The melatonin experiment is a weird one. Although I fully understand the rationale behind testing the effect of melatonin in the mouse model, the idea that just because melatonin levels changed in the gut it would act as a direct inhibitor of TcdB was very far-fetched, even though it ended up working. Authors should explain this in the manuscript.

      Thanks for your comments. Furthermore, beyond our murine studies, we have confirmed that melatonin significantly diminishes TcdB-induced cytotoxicity at the cellular level (Figure 9A). Additionally, it has been documented that melatonin, acting as an antimicrobial adjuvant and anti-inflammatory agent, can decrease the recurrence of CDI(3). Consequently, we contend that the aforementioned statement is substantiated.

      (14) Lines 429-435 - There are seemingly contradictory pieces of information here. The authors state that adenosine is released from cells upon inflammation and that CAPE treatment caused an increase in adenosine levels. Later in this section, the authors state that adenosine prevents TcdA-mediated damage and inflammation. This should be clarified and better discussed.

      Thanks for your comments. Adenosine modulates immune responses and inflammatory cascades by interacting with its receptors, including its capacity to suppress the secretion of specific pro-inflammatory mediators. We have updated this depiction in the manuscript.

      (15) Lines 513-514 - How was this phenotype quantified?

      Thanks for your comments. Initially, we introduced TcdB at a final concentration of 0.2 ng/mL along with various concentrations of compounds into 1 mL of medium for a 1-h pre-incubation period. Subsequently, unbound compounds were removed through centrifugation, and the resulting mixture was then applied to the cells.

      (16) Figure 3 - panels are labeled incorrectly.

      Thanks for your comments. We have completed the corresponding modifications according to the suggestions.

      (17) Figure 5C - it is unclear what the different colors and labels represent.

      Thanks for your comments. In the depicted graph, blue denotes the total binding energy, red signifies the electrostatic interactions, green corresponds to the van der Waals forces, and orange indicates solvation or hydration effects. The horizontal axis represents the mutation of the amino acid residue at the respective position to alanine. As illustrated in Figure 5C, the mutations W520A and GTD exhibit the highest binding energies.

      References:

      (1) Janezic S, et al. Highly Divergent Clostridium difficile Strains Isolated from the Environment. PLoS One. 2016 Nov 23;11(11): e0167101.

      (2) Perumalsamy S, Putsathit P, Riley TV. High prevalence of Clostridium difficile in soil, mulch and lawn samples from the grounds of Western Australian hospitals. Anaerobe. 2019 Dec; 60:102065.

      (3) Sutton SS, et al. Melatonin as an Antimicrobial Adjuvant and Anti-Inflammatory for the Management of Recurrent Clostridioides difficile Infection. Antibiotics (Basel). 2022 Oct 25;11(11):1472.

      Reviewer #2 (Recommendations for the authors):

      Minor comments and questions.

      (1) Which form of TcdB is being used in these experiments?

      Thanks for your comments. The TcdB proteins used in this study are TcdB1 subtypes.

      (2) Why are THP-1 cells being used in these assays?

      Thanks for your comments. For the purposes of this study, we employed a diverse array of cell lines, including Vero, HeLa, THP-1, Caco-2, and HEK293T. Each cell line was selected to serve a specific experimental objective. The inclusion of the THP-1 cell line was necessitated by the need to incorporate a macrophage cell line to ensure the comprehensive nature of our experiments, allowing for the testing of both epithelial cells and macrophages. C. difficile is a kind of intestinal pathogenic bacteria, and immune clearance plays a vital role in the process of pathogen infection, so THP-1 cells are used as important immune cells.

      (3) Please improve the quality of the microscopy images in Figure 1.

      Thanks for your comments. We have improved the quality of the microscopy images in Figure 1.

      (4) Does the flow cytometry experiment in Figure 2B show internalization? Surface-bound toxins would provide the same histogram.

      Thanks for your comments. Figure 2B was employed to assess the internalization of TcdB, and the findings indicate that CAPE does not influence the internalization process of TcdB.

      (5) The sensogram in Figure 4A does not look typical and should be clarified.

      Thanks for your comments. Typically, small molecules and proteins engage in a rapid binding and dissociation dynamic. However, as depicted in Figure 4A, the interaction between CAPE and TcdB demonstrates a gradual progression towards equilibrium. This behavior can be primarily explained by the swift occupation of the protein's primary binding sites by the small molecule in the initial stages. Subsequently, CAPE binds to secondary or lower affinity sites, extending the time needed to reach equilibrium. Additionally, the likelihood of CAPE binding to multiple sites on TcdB requires time for the exploration and occupation of these diverse locations before equilibrium is attained, we have incorporated an analysis of this potential scenario into the manuscript.

      Reviewer #3 (Recommendations for the authors):

      These are my suggestions for the text:

      (1) Line 29: high recurrent rates.

      Thanks for your comments. We have completed the corresponding modifications according to the suggestions.

      (2) Line 32: Where is the caffeic acid identified? I think a line should be included.

      Thanks for your comments. Caffeic acid was identified from natural compounds library and we have completed the corresponding modifications according to the suggestions.

      (3) Line 39: C. difficile is not italic.

      Thanks for your comments. We have completed the corresponding modifications according to the suggestions.

      (4) Line 41: Bacteroides spp.

      Thanks for your comments. We have completed the corresponding modifications according to the suggestions.

      (5) Line 56: This number of casualties 56.000 is still happening or it was in the past?

      Thanks for your comments. The mortality rates reported in the manuscript reflect a downturn in the incidence and fatality of CDI around 2017(1), as the infection gained broader recognition. Nonetheless, a recent study reveals that the mortality rate for CDI cases in Germany can soar to 45.7% within a year, with the overall economic burden amounting to approximately 1.6 billion euros. This underscores the ongoing significance of CDI as a global public health challenge(2).

      (6) Line 104: Where did the idea of testing caffeic acid come from? Any previous study of the authors? Any studies with the inhibition of other pathogens?

      Thanks for your comments. Initially, we conducted a screen of a compound library comprising 2,076 compounds and identified several potent inhibitors, which, upon structural analysis, were revealed to be caffeic acid derivatives. Prior to our investigation, no studies had explored the potential of CAPE in this context.

      (7) Line 115: Bacteroides spp.

      Thanks for your comments. We have completed the corresponding modifications according to the suggestions.

      Results section

      (8) Did the authors try the caffeic acid with the TcdA or binary toxin? I know this is not the purpose of the study, but TcdA toxin has a high identity structure with TcdB and generates inflammation in the gut via neutrophils. Negative strains for the major toxins and positive for the binary toxin also cause severe cases of CDI.

      Thanks for your comments. Although we acknowledge the significance of TcdA and binary toxins in CDI, we did not investigate the impact of CAPE on these toxins. Our focus was exclusively on the effect of CAPE against TcdB, as it is the primary virulence factor in C. difficile pathogenesis. Since TcdA and TcdB are highly similar in structure, we will analyze the neutralization effect of CAPE on TcdA in later studies.

      (9) Does caffeic acid have any effect on C. difficle? Or does it only gain the toxins? That would be ideal.

      Thanks for your comments. We have included additional related assays in our study. Beyond directly neutralizing TcdB, CAPE also demonstrates the capacity to inhibit the growth and spore formation of C. difficile.

      (10) Line 230: C. difficile BAA-1870 is a clinical strain? There are no details about it in the paper.

      Thanks for your comments. C. difficile BAA-1870 (RT027/ST1), a highly virulent isolate frequently employed in research(3-6), was kindly donated by Professor Aiwu Wu. We have meticulously noted the PCR ribotype in our manuscript.

      (11) Line 236: Did the mice fully recover from CDI after the administration of the CAPE? Was one dose enough?

      Thanks for your comments. CAPE was administered orally at 24 h intervals, commencing with the initial dose on Day 0. By the time a significant difference was observed on Day 3, the treatment had been administered a total of three times.

      Methodology

      (12) Most of the methods do not have a reference.

      Thanks for your comments. We have added several references to the methods.

      Discussion section

      (13) The first two paragraphs of the discussion should be summarized. Those details were already explained in the introduction.

      Thanks for your comments. The discussion section and the introduction address slightly different focal points; therefore, we aim to retain the first two paragraphs to maintain continuity and context.

      (14) Line 382: Bezolotoxumab was approved by the FDA in 2016. It is not recent.

      Thanks for your comments. We have revised the above description.

      (15) Line 410: "Despite the high 410 cure rate and increasing popularity of FMT, its safety remains controversial. Although this is true, recently (2022) the FDA approved the Rebyota, which was later cited by the authors.

      Thanks for your comments. We have revised the above description.

      (16) Lines 415-416: "the abundance of Bacteroides, a critical gut microbiota component that is required for C. difficile resistance". There is only one reference cited by the authors. I suppose that if it is true, more studies should be mentioned. Why are probiotics with Bacteroides spp. not available in the market?

      Thanks for your comments. We have supplemented additional references. The scarcity of probiotic products containing Bacteroides spp. on the market is primarily attributable to the stringent requirements of their survival conditions. As most Bacteroides spp. are anaerobic, they thrive in oxygen-deprived environments. This unique survival trait poses challenges in maintaining their viability during product preservation and distribution, which in turn escalates production costs and complexity. Furthermore, despite the significant role of Bacteroides in gut health, research into its potential probiotic benefits and safety is comparatively underexplored.

      References:

      (1) Guh AY, et al. Emerging Infections Program Clostridioides difficile Infection Working Group. Trends in U.S. Burden of Clostridioides difficile Infection and Outcomes. N Engl J Med. 2020 Apr 2;382(14):1320-1330.

      (2) Schley K, et al. Costs and Outcomes of Clostridioides difficile Infections in Germany: A Retrospective Health Claims Data Analysis. Infect Dis Ther. 2024 Nov 20.

      (3) Saito R, et al. Hypervirulent clade 2, ribotype 019/sequence type 67 Clostridioides difficile strain from Japan. Gut Pathog. 2019 Nov 4; 11:54.

      (4) Pellissery AJ, Vinayamohan PG, Venkitanarayanan K. In vitro antivirulence activity of baicalin against Clostridioides difficile. J Med Microbiol. 2020 Apr;69(4):631-639.

      (5) Shao X, et al. Chemical Space Exploration around Thieno[3,2-d]pyrimidin-4(3H)-one Scaffold Led to a Novel Class of Highly Active Clostridium difficile Inhibitors. J Med Chem. 2019 Nov 14;62(21):9772-9791.

      (6) Mooyottu S, Flock G, Venkitanarayanan K. Carvacrol reduces Clostridium difficile sporulation and spore outgrowth in vitro. J Med Microbiol. 2017 Aug;66(8):1229-1234.

    1. Author response:

      The following is the authors’ response to the original reviews

      Public Reviews:

      Reviewer #1 (Public review):

      Summary

      Chabukswar et al analysed endogenous retrovirus (ERV) Env variation in a set of primate genomes using consensus Env sequences from ERVs known to be present in hominoids using a Blast homology search with the aim of characterising env gene changes over time. The retrieved sequences were analysed phylogenetically, and showed that some of the integrations are LTR-env recombinants.

      Strengths

      The strength of the manuscript is that such an analysis has not been performed yet for the subset of ERV Env genes selected and most of the publicly available primate genomes.

      Weaknesses

      Unfortunately, the weaknesses of the manuscript outnumber its strengths. Especially the methods section does not contain sufficient information to appreciate or interpret the results. The results section contains methodological information that should be moved, while the presentation of the data is often substandard. For instance, the long lists of genomes in which a certain Env was found could better be shown in tables. Furthermore, there is no overview of the primate genomes Saili how did you answer to this?, or accession numbers, used. It is unclear whether the analyses, such as the phylogenetic trees, are based on nucleotide or amino acid sequences since this is not stated. tBLASTn was used in the homology searches, so one would suppose aa are retrieved. In the Discussion, both env (nt?) and Env (aa?) are used.

      For the non-hominoids, genome assembly of publicly available sequences is not always optimal, and this may require Blasting a second genome from a species. Which should for instance be done for the HML2 sequences found in the Saimiri boliviensis genome, but not in the related Callithrix jacchus genome. Finally, the authors propose to analyse recombination in Env sequences but only retrieve env-LTR recombinant Envs, which should likely not have passed the quality check.

      Since the Methods section does not contain sufficient information to understand or reproduce the results, while the Results are described in a messy way, it is unclear whether or not the aims have been achieved. I believe not, as characterisation of env gene changes over time is only shown for a few aberrant integrations containing part of the LTR in the env ORF.

      We thank the reviewer for the critiques of the manuscript and their constructive suggestions to improve the clarity, methodological rigor, and data presentation.

      (1) The concern regarding the insufficient data in the methods has been resolved in the revised manuscript by adding a supplementary file that contains the genome assemblies that  were used to perform the tBLAStn analysis using the reconstructed Env sequences. The requested accession numbers are available for all sequences in the supplementary phylogenetic figures.

      (2) We have also modified the manuscript by moving a portion of the results section in the methods section, in particular all the methodological description of the reconstruction of Env part (Line 197-231).

      (3) As suggested, the long list of genomes mentioned in the results section in which the Env tBLASTn hits were obtained are now provided in the table form (Table 2) as an overall summary of the distribution of ERV Env in the genomes and the genome assemblies are mentioned in Supplementary file 2.

      (4) As for the point regarding the tBLASTn usage in the homology searches, we first performed tBLASTn analysis using the reconstructed Env amino acid sequences as query and performed tBLASTn similarity search in the primate genomes. The tBLASTn algorithm uses the amino acid sequences to compare with the translated nucleotide database in all six frames and hence the hits obtained are nucleotide sequences (Line 381-383). These nt sequences were used for all the further analysis such as sequence alignment, phylogenetic analysis and recombination analysis. For better clarity, we have specified the use of env nt alignments in the methods section to avoid the raised confusion in the discussion.

      (5) For the HML supergroup characterization in squirrel monkey genome (Saimiri boliviensis), we used the tBLASTn hits obtained in the S. boliviensis from the initial analysis to perform the comparative genomics in two Platyrrhini genomes available on UCSC Genome browser. In particular, this analysis was performed to confirm the presence of specific members of HML supergroup in squirrel monkey genomes that has not been previously reported. We used the available genome assemblies because of the annotations available on Genome browser, and especially the possibility to use the repeatmasker tracks and the comparative genomics tools in order to use the human genome as a reference. We reported the coordinates for the members of HML supergroup that were retrieved through the comparative genomic assemblies by applying the repeat masker custom track, that have many ERVS that are not present in NCBI reference genomes.

      (6) The concern regarding only retrieving env-LTR recombinant Envs has been addressed in the revised results section (Lines 747-758). As also mentioned in the methods section, the RDP software detects the recombinant sequences and a breakpoint position for the recombinant signals and hence we confirmed only those sequences that were predicted as potential recombinant sequences by the RDP software through comparative genomics. All the sequences predicted by the software were env-LTR recombinant and hence we confirmed and reported only those recombinant sequences in the manuscript.

      Reviewer #1 (Recommendations for the authors):

      The paper could be strengthened by:

      - a rigorous rewriting and shortening of the manuscript, thereby eliminating all textbook-like paragraphs, and all biological misinterpretations and confusions. Distinguish between retroviral replication as an exogenous virus, and host genome remodeling affecting ERVs. Rewrite the sections on template switching by RT being the basis for the observed recombinations, while host genome recombinations are far more likely. ERVs with such aberrant env/LTR gene recombination are unlikely to be fit for cross-species transmission. Likely, such a recombinant was generated in a common ancestor. Also, host RNA polymerase II transcribes retroviral RNA (line 79), not RT.

      - check lines 89-90 as pro is part of the pol gene in gamma- and lentiviruses.

      We thank the reviewer for the suggestion, we have revised the manuscript by shortening the introduction section and eliminating the textbook like paragraphs and also clarifying the recombination mechanism. We have revised the introduction section at Lines 102-111, and the clarification for the recombination mechanism is provided at lines 1668-1675

      - adding much more information to the Methods section. Such as which genomes were searched, were nt or aa have been retrieved and analysed, were multiple genomes of a species searched, a list of databases used ('various databases' in line 164 does not suffice), etc.

      We thank the reviewer for the observation. As mentioned above, in the revised manuscript we have provided more detailed methods by including a supplementary file for the genome assemblies used for tBLASTn analysis and comparative genomics. For the sequence alignment, phylogenetic analysis and recombination analysis we used nt sequences, as it is also mentioned in the revised version. Lastly, all the databases that were used and are mentioned in the methods section.

      - more information is needed on the alignments and phylogenetic trees. For instance, how were indels treated? How long were the alignments on average regarding informative sites?

      We thank the reviewer for the questions, to answer them we have added a paragraph (Lines 359-362) describing the reconstruction process in more details.

      - confirm the findings about the presence or absence of an ERV, such as for the squirrel monkey genome, using additional genomes of the species

      As mentioned above, we only used the genome assemblies available on the genome browser because of the annotations available on Genome browser, blasting the second NCBI RefSeq genome using the BLAST algorithm does not provide accurate information and annotations compared to that of Genome browser and hence we reported the coordinates for the members of HML supergroup that were retrieved through the comparative genomic assemblies by applying the repeat masker custom track, that have many ERVS that are not present in NCBI reference genomes.

      - present the lists of findings in primate genomes on pages 9 and 10 in tables

      We thank the reviewer for the suggestion, we have provided a new table (Table 2) in the revised version summarizing the ERV Env distribution results.

      - a significant limitation of the study is that only env ERVs found in hominoids have been searched in OWM and NWM, not ones specific for monkeys. This should be mentioned somewhere.

      As the reviewer pointed out, the study was designed to explore ERVs’ Env  sequences in hominoids which were then searched in the OWM and NWM genomes, this is now better stated in the introduction at Lines 57-60.

      - define abbreviations at first use (e.g. HML in abstract)

      We thank the reviewer for the suggestion, we have mentioned the abbreviations in the abstract, where we mentioned HML first (Line 65)

      - explain 'pathological domestication' (line 42). Domestication implies usefulness to the host. And over time, deleterious insertions would have been likely purged from a population.

      We thank the reviewer for the observation, we have modified the sentence and provided a clearer explanation for the pathological and physiological consequences of ERVs’ env (lines 52-57).

      Furthermore:

      - why begin the discussion with a lengthy description of domestication and syncytins, which is not part of the current study?

      We thank the reviewer for the critique. Accordingly, we have now modified the discussion section by shortening the part about domestication of syncytins, and just mentioned them as an example at lines 942-944.

      - how can 96 hits have been retrieved for spuma-like envs (line 506), while it was earlier reported (line 333), that the most hits were gamma-like?

      We thank the reviewer for the observation, we have clarified and explained how 96 hits have been retrieved for spuma-like envs in lines 670-677 of the discussion section.

      English grammar should be improved throughout the manuscript.

      And I could not open half of the supplementary files

      As suggested we have revised English and checked that all files were correctly open.

      Reviewer #2 (Public Review):

      Summary:

      The manuscript by Chabukswar et al. describes a comprehensive attempt to identify and describe the diversity of retroviral envelope (env) gene sequences present in primate genomes in the form of ancient endogenous retrovirus (ERV) sequences.

      Strengths:

      The focus on env can be justified because of the role the Env proteins likely played in determining viral tropism and host range of the viruses that gave rise to the ERV insertions, and to a lesser extent, because of the potential for env ORFs to be coopted for cellular functions (in the rare cases where the ORF is still intact and capable of encoding a functional Env protein). In particular, these analyses can reveal the potential roles of recombination in giving rise to novel combinations of env sequences. The authors began by compiling env sequences from the human genome (from human endogenous retrovirus loci, or "HERVs") to build consensus Env protein sequences, and then they use these as queries to screen other primate genomes for group-specific envs by tBLASTn. The "groups" referred to here are previously described, as unofficial classifications of endogenous retrovirus sequences into three very broad categories - Class I, Class II and Class III. These are not yet formally recognized in retroviral taxonomy, but they each comprise representatives of multiple genera, and so would fall somewhere between the Family and Genus levels. The retrieved sequences are subject to various analyses, most notably they are screened for evidence of recombination. The recombinant forms appear to include cases that were probably viral dead-ends (i.e. inactivating the env gene) even if they were propagated in the germline.

      The availability of the consensus sequences (supplement) is also potentially useful to others working in this area.

      Weaknesses:

      The weaknesses are largely in presentation. Discussions of ERVs are always complicated by the lack of a formal and consistent nomenclature and the confusion between ERVs as loci and ERVs as indirect information about the viruses that produced them. For this reason, additional attention needs to be paid to precise wording in the text and/or the use of illustrative figures.

      We thank the reviewer for the general observation. We put additional attention to the wording in text/figures, and hope to have improved the manuscript clarity.

      Reviewer #2 (Recommendations for the authors):

      Reviewing the manuscript was a challenge because figures were difficult to read. As provided, the fonts were sometimes too small to read in a standard layout and had to be expanded on screen.

      The tree in Figure 3 could also be made easier to read, for example if the authors collapsed related branches and gave the clusters a single, clear label (this is not necessary, just a suggestion) - especially if the supplementary trees have all the labelled branches for any readers who want specific details.

      I also recommend asking a third party (perhaps a scientific colleague) with fluency in English grammar and familiarity with English scientific idiom to provide some editorial feedback on the text.

      Figure 4 legend is confusing. From the description it sounds like the tree in 4B is a host phylogeny, but it's not clearly stated. And if so, how was the tree generated? Is it based on entire genomes? Include at least enough methodological detail or citations that someone could recreate it, if necessary. The details and how it was done should be briefly mentioned here and in detail in the Methods section.

      We thank the reviewer for the observation. As for Figure 4 we have modified its legend and more clearly stated how the phylogenetic tree of the primate genomes was generated using TimeTree. We have also provided further details in the methods section (Lines 475-489).

      As suggested we have revised English.

      Line 42 - what is "pathological domestication"? It sounds like a contradiction in terms.

      We thank the reviewer for the observation. We have modifies the sentence and provided clearer explanation for the pathological and physiological consequences of ERVs’ env (lines 52-57).

      Lines 166-167 - the authors use the word "classes" but then use a list of terms that correspond to genera within the Retroviridae. The authors should be cautious here, as "class" and "genus" are both official taxonomic terms with different meanings. Do they mean genus? Or, if a more informal term is needed, perhaps "group"?

      Thank you for the observation, the ERVs have been classified into three classes (Class I, II and III) based on the relatedness to the exogenous retroviruses Gammaretrovirus, Betaretrovirus and Spumaretrovirus genera respectively and hence have been mentioned in the manuscript as per the nomenclature proposed by Gifford et al., 2018 which has been cited at Lines 122-125.

      Line 221- "defferent" should be "different"

      Corrected

      Lines 233-234 - what is meant by "canonical" and "non-canonical" forms? Can the authors please define these two terms?

      Thank you for the question, canonical refers to sequences that are well-preserved and match the structural and functional features of complete env genes, and non-canonical refers to sequences with significant structural alterations or truncations that deviate from this typical form. This explanation has been mentioned in the revised version at Lines 475-479.

      Line 252 - if/is

      Corrected

      Lines 274-276 needs a citation to the paper(s) that reported this.

      Corrected

      Line 283-285 - this was confusing. How could the authors have noted distinct occurrences and clusters of these if they were excluded from the BLAST analysis? It says the consensus sequences were effectively representing these, but doesn't this raise the possibility that the consensus sequences are not specific enough? Could this also then lead to false identification? Perhaps a few more words to explain should be added.

      We thank the reviewer for the observation. While performing the tBlastn search we did obtain the hits for HERV15, HERVR, ERVV1, ERVV2 and PABL, and we have mentioned the detailed explanation about this observation in the revised manuscript at lines 619-627.

      Line 298 - missing comma

      Corrected

      Lines 348-351- this list is not a list of recombination mechanisms. Template switching is a mechanism of recombination, but "acquisition" is simply a generic term, "degradation" is not a mechanism, and "cross-species transmission" might be a driver or a result of recombination, but it is not a mechanism of recombination.

      We thank the reviewer for the observation. We have revised the explanation for the recombination events in the discussion section, as some parts of the results have been moved to discussion section (Lines 1058-1065)

      Lines 369-372. It's not clear why this means the event was a "very recent occurrence". Do the authors mean that there were shared integration sites between some of the species, and that these sites lacked the insertions in other species (e.g. gibbon, orangutan, monkeys)?

      For the long section on recombination events involving an env sequence with an LTR in it, can the authors explain how they know when it's a recombination event versus integration of one provirus into another one, followed by recombination between LTRs to generate a solo-LTR?

      We thank the reviewer for the observation. Regarding the very recent occurrence of the recombination event, we have explained it in revised manuscript at lines 769-824 writing “In fact, the recombinant sequences were shared only between 4 species of Catarrhini parvorder and were absent in more distantly related primates (such as gibbons, orangutans, etc.). This with the presence of shared recombination sites suggests that the insertion occurred after the divergence of these species, while its absence in others indicate that it is a recombination event.”

      For the observation regarding the env-LTR recombination events, the recombinants were first detected by the RDP software and were further validated through the BLAT search in the genomes available on genome browser. The explanation on how we obtained these env-LTR recombination events is now provided in lines 746-763 of the revised manuscript.

      Methods Lines 151-168 and Figure 1 legend Lines 689-690 - how did the authors distinguish between "translated regions" corresponding to the actual Env protein sequence from translation of the other two reading frames? That is, there must have been substantial "translatable" stretches of sequence in the two incorrect reading frames as well as the reading frame corresponding to Env, so the question is how were the correct ones identified for the reconstruction?

      We thank the reviewer for the observation. We have provided the detailed explanation to the observation in the methods section (Lines 335-359).

      Line 495 - "previously reported" should include citation(s) of the prior report(s).

      We thank the reviewer for the observation, we have provided appropriate citations.

      Line 525 - the authors propose that the mechanism "is the co-packaging of different ERVs in a virus particle". First, I assume they meant to say that RNA from different ERVs is co-packaged. Second, isn't it also possible or likely that these could arise from co-packaging of exogenous retrovirus RNAs and recombination, especially if the related exogenous forms were still circulating at the time these things arose?

      We thank the reviewer for the observation. We have modified in the revised manuscript a proposed mechanism that includes also the possibility of co-packaging of exogenous retrovirus RNAs and recombination, at lines 1082-1099

      Line 686 - env should either be italicized (gene) or capitalized (protein), depending on what the authors intended here.

      We thank the reviewer for the observation. We have corrected the typological error in the new version of manuscript.

      Reviewer #3 (Public review):

      Summary:

      Retroviruses have been endogenized into the genome of all vertebrate animals. The envelope protein of the virus is not well conserved and acquires many mutations hence can be used to monitor viral evolution. Since they are incorporated into the host genome, they also reflect the evolution of the hosts. In this manuscript the authors have focused their analyses on the env genes of endogenous retroviruses in primates. Important observations made include the extensive recombination events between these retroviruses that were previously unknown and the discovery of HML species in genomes prior to the splitting of old and new world monkeys.

      Strengths:

      They explored a number of databases and made phylogenetic trees to look at the distribution of retroviral species in primates. The authors provide a strong rationale for their study design, they provide a clear description of the techniques and the bioinformatics tools used.

      Weaknesses:

      The manuscript is based on bioinformatics analyses only. The reference genomes do not reflect the polymorphisms in humans or other primate species. The analyses thus likely underestimates the amount of diversity in the retroviruses. Further experimental verification will be needed to confirm the observations.

      Not sure which databases were used, but if not already analyzed, ERVmap.com and repeatmesker are ones that have many ERVs that are not present in the reference genomes. Also, long range sequencing of the human genome has recently become available which may also be worth studying for this purpose.

      We thank the reviewer for the observations and comments. We would like to clarify that the intent of the work was to perform bioinformatics analysis and so a wet lab experimental verification of the observations are out of the scope of the present manuscript. For the aim of the manuscript, we have used the NCBI reference genomes, while for the report of the coordinates of HML supergroup in the squirrel monkey genome and the coordinates of the recombination events through BLAT search we have used genomes assemblies available on Genome browser with repeat masker custom track, since it has well represented ERV annotations.

      The suggestion regarding using long range sequencing of human genome is an interesting perspective and hence in the future work we will try to implement it in our analysis as well as perform an experimental verification, since, again, the focus of the present work does not include wet experimental part.

      Reviewer #3 (Recommendations for the authors):

      In a few places the term HERV has been used when describing ERVs in non-human primates. This needs to be corrected.

      We thank the reviewer for the observation. We have checked and accordingly modified the terms in the manuscript wherever necessary.

    1. Author response:

      eLife Assessment

      This study provides a valuable contribution to understanding how negative affect influences food-choice decision making in bulimia nervosa, using a mechanistic approach with a drift diffusion model (DDM) to examine the weighting of tastiness and healthiness attributes. The solid evidence is supported by a robust crossover design and rigorous statistical methods, although concerns about low trial counts, possible overfitting, and the absence of temporally aligned binge-eating measures limit the strength of causal claims. Addressing modeling transparency, sample size limitations, and the specificity of mood induction effects, would enhance the study's impact and generalizability to broader populations.

      We thank the Editor and Reviewers for their summary of the strengths of our study, and for their thoughtful review and feedback on our manuscript. We apologize for the confusion in how we described the multiple steps performed and hierarchical methods used to ensure that the model we report in the main text was the best fit to the data while not overfitting. We are not certain about what is meant by “[a]ddressing model transparency,” but as described in our response to Reviewer 1 below, we have now more clearly explained (with references) that the use of hierarchical estimation procedures allows for information sharing across participants, which improves the reliability and stability of parameter estimates—even when the number of trials per individual is small. We have clarified for the less familiar reader how our Bayesian model selection criterion penalizes models with more parameters (more complex models). Although details about model diagnostics, recoverability, and posterior predictive checks are all provided in the Supplementary Materials, we have clarified for the less familiar reader how each of these steps ensures that the parameters we estimate are not only identifiable and interpretable, but also ensure that the model can reproduce key patterns in the data, supporting the validity of the model. Additionally, we have provided all scripts for estimating the models by linking to our public Github repository. Furthermore, we have edited language throughout to eliminate any implication of causal claims and acknowledged the limitation of the small sample size.

      Public Reviews:

      Reviewer #1 (Public review):

      Summary:

      Using a computational modeling approach based on the drift diffusion model (DDM) introduced by Ratcliff and McKoon in 2008, the article by Shevlin and colleagues investigates whether there are differences between neutral and negative emotional states in:

      (1) The timings of the integration in food choices of the perceived healthiness and tastiness of food options between individuals with bulimia nervosa (BN) and healthy participants.

      (2) The weighting of the perceived healthiness and tastiness of these options.

      Strengths:

      By looking at the mechanistic part of the decision process, the approach has the potential to improve the understanding of pathological food choices. The article is based on secondary research data.

      Weaknesses:

      I have two major concerns and a major improvement point.

      The major concerns deal with the reliability of the results of the DDM (first two sections of the Results, pages 6 and 7), which are central to the manuscript, and the consistency of the results with regards to the identification of mechanisms related to binge eating in BN patients (i.e. last section of the results, page 7).

      (1) Ratcliff and McKoon in 2008 used tasks involving around 1000 trials per participant. The Chen et al. experiment the authors refer to involves around 400 trials per participant. On the other hand, Shevlin and colleagues ask each participant to make two sets of 42 choices with two times fewer participants than in the Chen et al. experiment. Shevlin and colleagues also fit a DDM with additional parameters (e.g. a drift rate that varies according to subjective rating of the options) as compared to the initial version of Ratcliff and McKoon. With regards to the number of parameters estimated in the DDM within each group of participants and each emotional condition, the 5- to 10-fold ratio in the number of trials between the Shevlin and colleagues' experiment and the experiments they refer to (Ratcliff and McKoon, 2008; Chen et al. 2022) raises serious concerns about a potential overfitting of the data by the DDM. This point is not highlighted in the Discussion. Robustness and sensitivity analyses are critical in this case.

      We thank the Reviewer for their thoughtful critique. We agree that a limited number of trials can forestall reliable estimation, which we acknowledge in the Discussion section. However, we used a hierarchical estimation approach which leverages group information to constrain individual-level estimates. This use of group-level parameters to inform individual-level estimates reduces overfitting and noise that can arise when trial counts are low, and the regularization inherent in hierarchical fitting prevents extreme parameter estimates that could arise from noisy or limited data (Rouder & Lu, 2005). As a result, hierarchical estimation has been repeatedly shown to work well in settings with low trial counts, including as few as 40 trials per condition (Ratcliff & Childers, 2015; Wiecki et al., 2013), and previous applications of the time-varying DDM to food choice task data has included experiments with as few as 60 trials per condition (Maier et al., 2020). We have added references to these more recent approaches and specifically note their advantages for the modeling of tasks with fewer trials. Additionally, our successful parameter recovery described in the Supplementary Materials supports the robustness of the estimation procedure and the reliability of our results.

      The authors compare different DDMs to show that the DDM they used to report statistical results in the main text is the best according to the WAIC criterion. This may be viewed as a robustness analysis. However, the other DDM models (i.e. M0, M1, M2 in the supplementary materials) they used to make the comparison have fewer parameters to estimate than the one they used in the main text. Fits are usually expected to follow the rule that the more there are parameters to estimate in a model, the better it fits the data. Additionally, a quick plot of the data in supplementary table S12 (i.e. WAIC as a function of the number of parameters varying by food type in the model - i.e. 0 for M0, 2 for M1, 1 for M2 and 3 for M3) suggests that models M1 and potentially M2 may be also suitable: there is a break in the improvement of WAIC between model M0 and the three other models. I would thus suggest checking how the results reported in the main text differ when using models M1 and M2 instead of M3 (for the taste and health weights when comparing M3 with M1, for τS when comparing M3 with M2). If the differences are important, the results currently reported in the main text are not very reliable.

      We thank the Reviewer for highlighting that it would be helpful for the paper to explicitly note that we specifically selected WAIC as one of two methods to assess model fit because it penalizes for model complexity. We now explicitly state that, in addition to being more robust than other metrics like AIC or BIC when comparing hierarchical Bayesian models like those in the current study, model fit metrics like WAIC penalize for model complexity based on the number of parameters (Watanabe, 2010). Therefore, it is not the case that more complex models (i.e., having additional parameters) would automatically have lower WAICs. Additionally, we note that our second method to assess model fit, posterior predictive checks demonstrate that only model M3 can reproduce key behavioral patterns present in the empirical data. As described in the Supplementary Materials, M1 and M2 miss those patterns in the data. In summary, we used best practices to assess model fit and reliability (Wilson & Collins, 2019): results from the WAIC comparison (which in fact penalizes models with more parameters) and results from posterior predictive checks align in showing that M3 best fit to our data. We have added a sentence to the manuscript to state this explicitly.

      (2) The second main concern deals with the association reported between the DDM parameters and binge eating episodes (i.e. last paragraph of the results section, page 7). The authors claim that the DDM parameters "predict" binge eating episodes (in the Abstract among other places) while the binge eating frequency does not seem to have been collected prospectively. Besides this methodological issue, the interpretation of this association is exaggerated: during the task, BN patients did not make binge-related food choices in the negative emotional state. Therefore, it is impossible to draw clear conclusions about binge eating, as other explanations seem equally plausible. For example, the results the authors report with the DDM may be a marker of a strategy of the patients to cope with food tastiness in order to make restrictive-like food choices. A comparison of the authors' results with restrictive AN patients would be of interest. Moreover, correlating results of a nearly instantaneous behavior (i.e. a couple of minutes to perform the task with the 42 food choices) with an observation made over several months (i.e. binge eating frequency collected over three months) is questionable: the negative emotional state of patients varies across the day without systematically leading patients to engage in a binge eating episode in such states.

      I would suggest in such an experiment to collect the binge craving elicited by each food and the overall binge craving of patients immediately before and after the task. Correlating the DDM results with these ratings would provide more compelling results. Without these data, I would suggest removing the last paragraph of the Results.

      We thank the Reviewer for these interesting suggestions and appreciate the opportunity to clarify that we agree that claims about causal connections between our decision parameters and symptom severity metrics would be inappropriate. Per the Reviewer’s suggestions, we have eliminated the use of the word “predict” to describe the tested association with symptom metrics.  We also agree that more time-locked associations with craving ratings and near-instantaneous behavior would be useful, and we have added this as an important direction for future research in the discussion. However, associating task-based behavior with validated self-report measures that assess symptom severity over long periods of time that precede the task visit (e.g., over the past 2 weeks in depression, over the past month in eating disorders) is common practice in computational psychiatry, psychiatric neuroimaging, and clinical cognitive neuroscience (Hauser et al., 2022; Huys et al., 2021; Wise et al., 2023), and this approach has been used several times specifically with food choice tasks (Dalton et al., 2020; Steinglass et al., 2015). We have revised the language throughout the manuscript to clarify: the results suggest that individuals whose task behavior is more reactive to negative affect tend to be the most symptomatic, but the results do not allow us to determine whether this reactivity causes the symptoms.

      In response to this Reviewer’s important point about negative affect not always producing loss-of-control eating in individuals with BN, we also now explicitly note that while several studies employing ecological momentary assessments (EMA) have repeatedly shown that increases in negative affect significantly increase the likelihood of subsequent loss-of-control eating (Alpers & Tuschen-Caffier, 2001; Berg et al., 2013; Haedt-Matt & Keel, 2011; Hilbert & Tuschen-Caffier, 2007; Smyth et al., 2007), not all loss-of-control eating occurs in the context of negative affect, and that future studies should integrate food choice task data pre and post-affect inductions with measures that capture the specific frequency of loss of control eating episodes that occur during states of high negative affect.

      (3) My major improvement point is to tone down as much as possible any claim of a link with binge eating across the entire manuscript and to focus more on the restrictive behavior of BN patients in between binge eating episodes (see my second major concern about the methods). Additionally, since this article is a secondary research paper and since some of the authors have already used the task with AN patients, if possible I would run the same analyses with AN patients to test whether there are differences between AN (provided they were of the restrictive subtype) and BN.

      We appreciate the Reviewer’s perspective and suggestions. We have adjusted our language linking loss-of-control eating frequency with decision parameters, and we have added additional sentences focusing on the implications for the restrictive behavior of patients with BN between binge eating episodes. In the Supplementary Materials. We have added an analysis of the restraint subscale of the EDE-Q and confirmed no relationship with parameters of interest. While we agree additional analyses with AN patients would be of interest, this is outside the scope of the paper. Our team have collected data from individuals with AN using this task, but not with any affect induction or measure of affect. Therefore, we have added this important direction for future research to the discussion.

      Reviewer #2 (Public review):

      Summary:

      Binge eating is often preceded by heightened negative affect, but the specific processes underlying this link are not well understood. The purpose of this manuscript was to examine whether affect state (neutral or negative mood) impacts food choice decision-making processes that may increase the likelihood of binge eating in individuals with bulimia nervosa (BN). The researchers used a randomized crossover design in women with BN (n=25) and controls (n=21), in which participants underwent a negative or neutral mood induction prior to completing a food-choice task. The researchers found that despite no differences in food choices in the negative and neutral conditions, women with BN demonstrated a stronger bias toward considering the 'tastiness' before the 'healthiness' of the food after the negative mood induction.

      Strengths:

      The topic is important and clinically relevant and methods are sound. The use of computational modeling to understand nuances in decision-making processes and how that might relate to eating disorder symptom severity is a strength of the study.

      Weaknesses:

      The sample size was relatively small and may have been underpowered to find differences in outcomes (i.e., food choice behaviors). Participants were all women with BN, which limits the generalizability of findings to the larger population of individuals who engage in binge eating. It is likely that the negative affect manipulation was weak and may not have been potent enough to change behavior. Moreover, it is unclear how long the negative affect persisted during the actual task. It is possible that any increases in negative affect would have dissipated by the time participants were engaged in the decision-making task.

      We thank the Reviewer for their comments on the strengths of the paper, and for highlighting these important considerations regarding the sample demographics and the negative affect induction. As in the original paper that focused only on ultimate food choice behaviors, we now specifically acknowledge that the study was only powered to detect small to medium group differences in the effect of negative emotion on these final choice behaviors. Regarding the sample demographics, we agree that the study’s inclusion of only female participants is a limitation.  Although the original decision for this sampling strategy was informed by data suggesting that bulimia nervosa is roughly six times more prevalent among females than males (Udo & Grilo, 2018), we now note in the discussion that our female-only sample limits the generalizability of the findings.

      We also agree with the Reviewer’s noted limitations of the negative mood induction, and based on the reviewer’s suggestions, we have added to our original description of these limitations in the Discussion. Specifically, we now note that although the task was completed immediately after the affect induction, the study did not include intermittent mood assessments throughout the choice task, so it is unclear how long the negative affect persisted during the actual task.

      Reviewer #3 (Public review):

      Summary:

      The study uses the food choice task, a well-established method in eating disorder research, particularly in anorexia nervosa. However, it introduces a novel analytical approach - the diffusion decision model - to deconstruct food choices and assess the influence of negative affect on how and when tastiness and healthiness are considered in decision-making among individuals with bulimia nervosa and healthy controls.

      Strengths:

      The introduction provides a comprehensive review of the literature, and the study design appears robust. It incorporates separate sessions for neutral and negative affect conditions and counterbalances tastiness and healthiness ratings. The statistical methods are rigorous, employing multiple testing corrections.

      A key finding - that negative affect induction biases individuals with bulimia nervosa toward prioritizing tastiness over healthiness - offers an intriguing perspective on how negative affect may drive binge eating behaviors.

      Weaknesses:

      A notable limitation is the absence of a sample size calculation, which, combined with the relatively small sample, may have contributed to null findings. Additionally, while the affect induction method is validated, it is less effective than alternatives such as image or film-based stimuli (Dana et al., 2020), potentially influencing the results.

      We agree that the small sample size and specific affect induction method may have contributed to the null model-agnostic behavioral findings. Based on this Reviewer’s and Reviewer 2’s comments, we have added these factors to our original acknowledgements of limitations in the Discussion.

      Another concern is the lack of clarity regarding which specific negative emotions were elicited. This is crucial, as research suggests that certain emotions, such as guilt, are more strongly linked to binge eating than others. Furthermore, recent studies indicate that negative affect can lead to both restriction and binge eating, depending on factors like negative urgency and craving (Leenaerts et al., 2023; Wonderlich et al., 2024). The study does not address this, though it could explain why, despite the observed bias toward tastiness, negative affect did not significantly impact food choices.

      We thank the Reviewer for raising these important points and possibilities. In the supplementary materials, we have added an additional analysis of the specific POMS subscales that comprise the total negative affect calculation that was reported in the original paper (Gianini et al., 2019), and which we now report in the main text. Ultimately, we found that, across both groups, the negative affect induction increased responses related to anger, confusion, depression, and tension while reducing vigor.

      We agree with the Reviewer that factors like negative urgency and cravings are relevant here. The study did not collect any measures of craving, and in response to Reviewer 1 and this Reviewer, we now note in the discussion that replication studies including momentary craving assessments will be important. While we don’t have any measurements of cravings, we did measure negative urgency. Despite these prior findings, the original paper (Gianini et al., 2019) did not find that negative urgency was related to restrictive food choices. We have now repeated those analyses, and we also were unable to find any meaningful patterns. Nonetheless, we have added an analysis of negative urgency scores and decision parameters to the supplementary materials.      

      References

      Alpers, G. W., & Tuschen-Caffier, B. (2001). Negative feelings and the desire to eat in bulimia nervosa. Eating Behaviors, 2(4), 339–352. https://doi.org/10.1016/S1471-0153(01)00040-X

      Berg, K. C., Crosby, R. D., Cao, L., Peterson, C. B., Engel, S. G., Mitchell, J. E., & Wonderlich, S. A. (2013). Facets of negative affect prior to and following binge-only, purge-only, and binge/purge events in women with bulimia nervosa. Journal of Abnormal Psychology, 122(1), 111–118. https://doi.org/10.1037/a0029703

      Dalton, B., Foerde, K., Bartholdy, S., McClelland, J., Kekic, M., Grycuk, L., Campbell, I. C., Schmidt, U., & Steinglass, J. E. (2020). The effect of repetitive transcranial magnetic stimulation on food choice-related self-control in patients with severe, enduring anorexia nervosa. International Journal of Eating Disorders, 53(8), 1326–1336. https://doi.org/10.1002/eat.23267

      Gianini, L., Foerde, K., Walsh, B. T., Riegel, M., Broft, A., & Steinglass, J. E. (2019). Negative affect, dietary restriction, and food choice in bulimia nervosa. Eating Behaviors, 33, 49–54. https://doi.org/10.1016/j.eatbeh.2019.03.003

      Haedt-Matt, A. A., & Keel, P. K. (2011). Revisiting the affect regulation model of binge eating: A meta-analysis of studies using ecological momentary assessment. Psychological Bulletin, 137(4), 660–681. https://doi.org/10.1037/a0023660

      Hauser, T. U., Skvortsova, V., Choudhury, M. D., & Koutsouleris, N. (2022). The promise of a model-based psychiatry: Building computational models of mental ill health. The Lancet Digital Health, 4(11), e816–e828. https://doi.org/10.1016/S2589-7500(22)00152-2

      Hilbert, A., & Tuschen-Caffier, B. (2007). Maintenance of binge eating through negative mood: A naturalistic comparison of binge eating disorder and bulimia nervosa. International Journal of Eating Disorders, 40(6), 521–530. https://doi.org/10.1002/eat.20401

      Huys, Q. J. M., Browning, M., Paulus, M. P., & Frank, M. J. (2021). Advances in the computational understanding of mental illness. Neuropsychopharmacology, 46(1), 3–19. https://doi.org/10.1038/s41386-020-0746-4

      Maier, S. U., Raja Beharelle, A., Polanía, R., Ruff, C. C., & Hare, T. A. (2020). Dissociable mechanisms govern when and how strongly reward attributes affect decisions. Nature Human Behaviour, 4(9), Article 9. https://doi.org/10.1038/s41562-020-0893-y

      Ratcliff, R., & Childers, R. (2015). Individual differences and fitting methods for the two-choice diffusion model of decision making. Decision, 2(4), 237–279. https://doi.org/10.1037/dec0000030

      Rouder, J. N., & Lu, J. (2005). An introduction to Bayesian hierarchical models with an application in the theory of signal detection. Psychonomic Bulletin & Review, 12(4), 573–604. https://doi.org/10.3758/BF03196750

      Smyth, J. M., Wonderlich, S. A., Heron, K. E., Sliwinski, M. J., Crosby, R. D., Mitchell, J. E., & Engel, S. G. (2007). Daily and momentary mood and stress are associated with binge eating and vomiting in bulimia nervosa patients in the natural environment. Journal of Consulting and Clinical Psychology, 75(4), 629–638. https://doi.org/10.1037/0022-006X.75.4.629

      Steinglass, J., Foerde, K., Kostro, K., Shohamy, D., & Walsh, B. T. (2015). Restrictive food intake as a choice—A paradigm for study. International Journal of Eating Disorders, 48(1), 59–66. https://doi.org/10.1002/eat.22345

      Udo, T., & Grilo, C. M. (2018). Prevalence and Correlates of DSM-5–Defined Eating Disorders in a Nationally Representative Sample of U.S. Adults. Biological Psychiatry, 84(5), 345–354. https://doi.org/10.1016/j.biopsych.2018.03.014

      Watanabe, S. (2010). Asymptotic Equivalence of Bayes Cross Validation and Widely Applicable Information Criterion in Singular Learning Theory. Journal of Machine Learning Research, 11, 3571–3594.

      Wiecki, T. V., Sofer, I., & Frank, M. J. (2013). HDDM: Hierarchical Bayesian estimation of the drift-diffusion model in Python. Frontiers in Neuroinformatics, 7. https://doi.org/10.3389/fninf.2013.00014

      Wilson, R. C., & Collins, A. G. (2019). Ten simple rules for the computational modeling of behavioral data. eLife, 8, e49547. https://doi.org/10.7554/eLife.49547

      Wise, T., Robinson, O. J., & Gillan, C. M. (2023). Identifying Transdiagnostic Mechanisms in Mental Health Using Computational Factor Modeling. Biological Psychiatry, 93(8), 690–703. https://doi.org/10.1016/j.biopsych.2022.09.034

    1. Reviewer #1 (Public review):

      Summary:

      The authors had previously found that brief social isolation could increase the activity of these neurons, and that manipulation of these neurons could alter social behavior in a social rank-dependent fashion. This manuscript explored which of the outputs were responsible for this, identifying the central nucleus of the amygdala as the key output region. The authors identified some discrete behavior changes associated with these outputs, and found that during photostimulation of these outputs, neuronal activity appeared altered in 'social response' neurons.

      Strengths:

      Rigorous analysis of the anatomy. Careful examination of the heterogenous effects on cell activity due to stimulation, linking the physiology with the behavior via photostimulation during recording in vivo.

      Weaknesses:

      (1) There are some clear imbalances in the sample size across the different regions parsed. The CeA has a larger sample size, likely in part to the previous work suggesting differential effects depending on social rank/dominance. Given the potential variance, it may be hard to draw conclusions about the impact of stimulation across different social ranks for other groups.

      (2) It is somewhat unclear why only the 'social object ratio' was used to assess the effects versus more direct measurements of social behavior.

      (3) Somewhat related, while it is statistically significant, it is unclear if the change seen in face investigation of biologically significant, on average, it looks like a few-seconds difference and that was not modulated by social rank.

      (4) There are several papers studying these neurons that have explored behaviors examined here, as well as the physiological connectivity that are not cited that would provide important context for this work. In particular, multiple groups have found a dopamine-mediated IPSP in the BNST, in contrast to this work. There are technical differences that may drive these differences, but not addressing them is a major weakness.

      (5) The inclusion of some markers for receptors for some of these outputs is interesting, and the authors suggest that this may be important, but this is somewhat disconnected from the rest of the work performed.

    2. Reviewer #2 (Public review):

      Summary:

      The authors perform a series of studies to follow up on their previous work, which established a role for dorsal raphe dopamine neurons (DRN) in the regulation of social-isolation-induced rebound in mice. In the present study, Lee et. al, use a combination of modern circuit tools to investigate putatively distinct roles of DRN dopamine transporting containing (DAT) projections to the bed nucleus of the stria terminalis (BNST), central amygdala (CeA), and posterior basolateral amygdala (BLP). Notably, they reveal that optogenetic stimulation of distinct pathways confers specific behavioral states, with DRNDAT-BLP driving aversion, DRNDAT-BNST regulating non-social exploratory behavior, and DRNDAT-CeA promoting social ability. A combination of electrophysiological studies and in situ hybridization studies reveal heterogenous dopamine and neuropeptide expression and different firing properties, providing further evidence of pathway-specific neural properties. Lastly, the authors combine optogenetics and calcium imaging to resolve social encoding properties in the DRNDAT-CeA pathway, which correlates observed social behavior to socially engaged neural ensembles.

      Collectively, these studies provide an interesting way of dissecting out separable features of a complex multifaceted social-emotional state that accompanies social isolation and the perception of 'loneliness.' The main conclusions of the paper provide an important and interesting set of findings that increase our understanding of these distinct DRN projections and their role in a range of social (e.g., prosocial, dominance), non-social, and emotional behaviors. However, as noted below, the examination of these circuits within a homeostatic framework is limited given that a number of the datasets did not include an isolated condition. The DRNDAT-CeA pathway was investigated with respect to social homeostatic states in the present study for some of the datasets.

      Strengths:

      (1) The authors perform a comprehensive and elegant dissection of the anatomical, behavioral, molecular, and physiological properties of distinct DRN projections relevant to social, non-social, and emotional behavior, to address multifaceted and complex features of social state.

      (2) This work builds on prior findings of isolation-induced changes in DRN neurons and provides a working framework for broader circuit elements that can be addressed across the social homeostatic state.

      (3) This work characterizes a broader circuit implicated in social isolation and provides a number of downstream targets to explore, setting a nice foundation for future investigation.

      (4) The studies account for social rank and anxiety-like behavior in several of the datasets, which are an important consideration to the interpretation of social motivation states, especially in male mice with respect to dominance behavior.

      Weaknesses:

      (1) The conceptual framework of the study is based on the premise of social isolation and perceived 'loneliness' under the framework of social homeostasis, analogous to hunger. In this framework, social isolation should provoke an aversive state and compensatory social contact behavior. In the authors' prior work, they demonstrate synaptic changes in DRN neurons and social rebound following acute social isolation. Thus, the prediction would be that downstream projections also would show state-dependent changes as a function of social housing conditions (e.g., grouped vs. isolated). In the current paper, a social isolation condition was not included for the majority of the studies conducted (e.g., Figures 1-6 do not include an isolated condition, Figures 7-8 do include an isolated condition). Thus, while Figure 1-6 adds a very interesting and compelling set of data that is of high value to the social behavior field with respect to social and emotional processing and general circuit characterization, these studies do not directly investigate the impacts of dynamic social homeostatic state. The main claim of the paper, including the title (e.g., separable DRN projections mediate facets of loneliness-like state), abstract, intro, and discussion presents the claim of this work under the framework of dynamic social homeostatic states, which should be interpreted with caution, as the majority of the work in the paper did not include a social isolation comparison.

      (2) In Figure 1, the authors confirm co-laterals in the BNST and CeA via anatomical tracing studies. The goal of the optogenetic studies is to dissociate the functional/behavioral roles of distinct projections. However, one limitation of optogenetic projection targeting is the possibility of back-propagating action potentials (stimulation of terminals in one region may back-propagate to activate cell bodies, and then afferent projections to other regions), and/or stimulation of fibers of passage. Therefore, one limitation in the dataset for the optogenetic stimulation studies is the possibility of non-specific unintended activation of projections other than those intended (e.g., DRNDAT-CeA). This can be dealt with by administering lidocaine to prevent back-propagating action potentials.

      (3) It is unclear from the test, but in the subjects' section of the methods, it appears that only male animals were included in the study, with no mention of female subjects. It should be clear to the reader that this was conducted in males only if that is the case, with consideration or discussion, about female subjects and sex as a biological variable.

      (4) Averaged data are generally reported throughout the study in the form of bar graphs, across most figures. Individual data points would increase the transparency of the data.

    1. Reviewer #2 (Public review):

      The authors tested tactile acuity on the breast of females using several tasks and reported overall low acuity compared to the back, which is typically considered to have the worst acuity of all body parts. Moreover, there was evidence that acuity is worse the larger the breast; this finding mirrors similar findings for the hand and therefore suggests that the number of tactile sensors is fixed and must be distributed across a larger extent of skin when a body part is larger, thus resulting in comparably lower tactile acuity.

      Strengths:

      I find this an interesting paper with results that are relevant to the tactile community. The authors apply several tasks allowing them to link the paper with previous results. The methodology and psychophysical analysis are sound.

      Weaknesses:

      The analysis of localization error direction, with the result that the nipple area may be a landmark for tactile localization, is interesting and aligns the paper with some other recent papers that have suggested that such landmarks should exist. However, there are major issues with methodology and statistics, so that currently the conclusions are not supported.

      In the following, line numbers refer to the re-formatted manuscript provided by the authors upon request and are mentioned for them to find the relevant passages faster.

      (1) Comments on analysis of tactile acuity:

      - I had a hard time understanding some parts of the report. What is meant by "broadly no relationship" in line 137?

      - It is suggested that spatial expansion (which is correlated with body part size) is related between medial breast and hand - is this to say that women with large hands have large medial breast size? Nipple size was measured, but hand size was not measured, is this correct?

      - It is furthermore unclear how the authors differentiate medial breast and NAC. The sentence in lines 140-141 seems to imply the two terms are considered the same, as a conclusion about NAC is drawn from a result about the medial breast. This requires clarification.

      - Finally, given that the authors suspect that overall localization ability (or attention) may be overshadowed by a size effect, would not an analysis be adequate that integrates both, e.g. a regression with multiple predictors?

      (2) Comments on analysis of "The nipple is a unit":

      - Statistics in this section are not adequately described and may be partly false.

      - In the paragraph about testing quadrants of the nipple, it is stated that only 3 of 10 participants barely outperformed chance with a p < 0.01. It is unclear how a significant t-test is an indication of "barely above chance".

      - The final part of the paragraph on nipple quadrants (starting line 176) explains that there was a trend (4 of 10 participants) for lower tactile acuity being related to the inability to differentiate quadrants. It seems to me that such a result would not be expected: The stated hypothesis is that all participants have the same number of tactile sensors in their nipple and areola, independent of NAC size. In this section, participants determine the quadrant of a single touch. Theoretically, all participants should be equally able to perform this task, because they all have the same number of receptors in each quadrant of nipple and areola. Thus, the result in Figure 2C is curious.

      (3) Comments on analysis of "Absolute localization on the breast is anchored to the nipple"

      - Again, there are things that are unclear with the statistics and description of the analysis.

      - This section reports an Anova (line 193/194) with a factor "participant". This doesn't appear sensible. Please clarify. The factor distance is also unclear; is this a categorical or a continuous variable? Line 400 implies a 6-level factor, but Anovas and their factors, respectively, are not described in methods (nor are any of the other statistical approaches).

      - The analysis on imprecision using mean pairwise error (line 199) is unclear: does pairwise refer to x/y or to touch vs. center of the nipple?

      - p8, upper text, what is meant by "relative over-representation of the depth axis"? Does this refer to the breast having depth but the equivalent area on the back not having depth? What are the horizontal planes (probably meant to be singular?) - do you simply mean that depth was ignored for the calculation of errors? This seems to be implied in Figure 3AB.

      - Lines 232-241, I cannot follow the conclusions drawn here. First, it is not clear to a reader what the aim of the presented analyses is: what are you looking for when you analyze the vectors? Second, "vector strength" should be briefly explained in the main text. Third, it is not clear how the final conclusion is drawn. If there is a bias of all locations towards the nipple, then a point closer to the nipple cannot exhibit a large bias, because the nipple is close-by. Therefore, one would expect that points close to the nipple exhibit smaller errors, but this would not imply higher acuity - just less space for localizing anything. The higher acuity conclusion is at odds with the remaining results, isn't it: acuity is low on the outer breast, but even lower at the NAC, so why would it be high in between the two?

      (4) Comments on the Discussion:

      The discussion makes some concrete suggestions for sensors in implants (line 283). It is not clear how the stated numbers were computed. Also, why should 4 sensors nipple quadrants receive individual sensors if the result here was that participants cannot distinguish these quadrants?

      Additional comments:

      I would find it interesting to know whether participants with small breast measurement delta had breast acuity comparable to the back. Alternatively, it would be interesting to know whether breast and back acuity are comparable in men. Such a result would imply that the torso has uniform acuity overall, but any spatial extension of the breast is unaccounted for. The lowest single participant data points in Figure 1B appear similar, which might support this idea.

    1. Reviewer #2 (Public review):

      Summary:

      The author presents a transport-based morphometry (TBM) approach for the discovery of non-contrast computed tomography (NCCT) markers of hematoma expansion risk in spontaneous intracerebral hemorrhage (ICH) patients. The findings demonstrate that TBM can quantify hematoma morphological features and outperforms existing clinical scoring systems in predicting 24-hour hematoma expansion. In addition, the inversion model can visualize features, which makes it interpretable. In conclusion, this research has clinical potential for ICH risk stratification, improving the precision of early interventions.

      Strengths:

      TBM quantifies hematoma morphological changes using the Wasserstein distance, which has a well-defined physical meaning. It identifies features that are difficult to detect through conventional visual inspection (such as peripheral density distribution and density heterogeneity), which provides evidence supporting the "avalanche effect" hypothesis in hematoma expansion pathophysiology.

      Weaknesses:

      (1) As a methodology-focused study, the description of the methods section somewhat lacks depth and focus, which may make it challenging for readers to fully grasp the overall structure and workflow of the approach. For instance, the manuscript lacks a systematic overview of the entire process, from NCCT image input to the final prediction output. A potential improvement would be to include a workflow figure at the beginning of the manuscript, summarizing the proposed method and subsequent analytical procedures. This would help readers better understand the mechanism of the model.

      (2) The description of the comparison algorithms could be more detailed. Since TBM directly utilizes NCCT images as input for prediction, while SVM and K-means are not inherently designed to process raw imaging data, it would be beneficial to clarify which specific features or input data were used for these comparison models. This would better highlight the effectiveness and advantages of the TBM method.

      (3) The relatively small training and testing dataset may limit the model's performance and generalizability. Notably, while the study mentions that 1,066 patients from the ERICH dataset met the inclusion criteria, only 170 were randomly selected for the test set. Leveraging the full 1,066 ERICH cases for model training and internal validation might potentially enhance the model's robustness and performance.

      (4) Some minor textual issues need to be checked and corrected, such as line 16 in the abstract "Incorporating these traits into a v achieved an AUROC of 0.71 ...".

      (5) Some figures need to be reformatted (e.g., the x-axis in Figure 2 a is blocked).

    1. 布尔格(Boolean Lattice)

      好的,我们来详细解释一下布尔格 (Boolean Lattice)

      1. 定义:

      一个格 L 被称为布尔格 (Boolean Lattice),如果它同时满足以下两个条件:

      • 它是分配格 (Distributive Lattice): 对于任意元素 a, b, c ∈ L,分配律成立:
        • a ∧ (b ∨ c) = (a ∧ b) ∨ (a ∧ c)
        • a ∨ (b ∧ c) = (a ∨ b) ∧ (a ∨ c)
      • 它是有补格 (Complemented Lattice):
        • 它是一个有界格(拥有底元素 ⊥ 和顶元素 ⊤)。
        • 对于 L 中的每一个元素 a,都存在至少一个补元 b ∈ L,使得 a ∨ b = ⊤a ∧ b = ⊥

      简单来说,布尔格就是既是分配格又是补格的格。

      2. 布尔格的关键性质:

      除了分配性和有补性之外,布尔格还具有一些重要的性质:

      • 补元的唯一性: 在布尔格中,每个元素的补元是唯一的。证明如下: 假设元素 a 有两个补元 bc。那么:

        • a ∨ b = ⊤a ∧ b = ⊥
        • a ∨ c = ⊤a ∧ c = ⊥ 利用分配律,我们可以得到: b = b ∧ ⊤ = b ∧ (a ∨ c) = (b ∧ a) ∨ (b ∧ c) = (a ∧ b) ∨ (b ∧ c) = ⊥ ∨ (b ∧ c) = b ∧ c c = c ∧ ⊤ = c ∧ (a ∨ b) = (c ∧ a) ∨ (c ∧ b) = (a ∧ c) ∨ (c ∧ b) = ⊥ ∨ (c ∧ b) = c ∧ b 由于 b = b ∧ cc = c ∧ b,所以 b = c(根据偏序关系的定义)。因此,补元是唯一的。 我们通常用 ¬aa' 来表示元素 a 的唯一补元。
      • 有界性: 作为有补格,布尔格必然拥有底元素 (⊥) 和顶元素 (⊤)。

      • 满足德摩根定律 (De Morgan's Laws): 对于任意元素 ab,以下定律成立:

        • ¬(a ∨ b) = ¬a ∧ ¬b
        • ¬(a ∧ b) = ¬a ∨ ¬b

      3. 布尔格的例子:

      • 集合的幂集格: 对于任意集合 X,其幂集 P(X) 在子集包含关系下是一个布尔格。

        • 并运算 (∨) 是集合的并集 (∪)。
        • 交运算 (∧) 是集合的交集 (∩)。
        • 底元素 (⊥) 是空集
        • 顶元素 (⊤) 是集合 X 本身。
        • 补元 (¬A) 是集合的补集 X \ A
      • 布尔代数 (Boolean Algebra): 布尔格的概念与布尔代数是等价的。布尔代数通常被定义为一个集合 B,连同两个二元运算 (∨, ∧),一个一元运算 (¬),以及两个特殊的元素 (0, 1),满足一系列公理(包括交换律、结合律、分配律、单位元、补元等)。一个布尔代数可以直接对应到一个布尔格,反之亦然。

      • 命题逻辑的代数: 在经典命题逻辑中,命题可以用布尔变量表示,逻辑运算符“与”、“或”、“非”分别对应于布尔格中的 ¬

      • 开关电路: 数字电路中的开关可以用布尔变量表示,电路的串联和并联操作可以用 来建模,而开关的取反可以用 ¬ 来建模。

      4. 不是布尔格的格的例子:

      • 不是分配格的格: 例如,五边形格 (N₅) 和菱形格 (M₃) 是模格但不是分配格,因此它们不是布尔格。
      • 不是有补格的有界格: 例如,我们之前提到的三个元素的链 {0, a, 1} 不是有补格,因此也不是布尔格。
      • 自然数上的整除格(即使是有限的有界子格,如 12 的因子格): 我们看到 12 的因子格不是有补格,因此也不是布尔格。虽然 6 的因子格是有补格,但它也是分配格,所以它是布尔格。一般来说,只有当一个正整数是无平方因子的且是若干个不同素数的乘积时,其因子格才是布尔格。

      5. 布尔格的重要性:

      布尔格(以及等价的布尔代数)在数学、计算机科学和工程学中都具有极其重要的地位,它们是:

      • 逻辑推理的基础: 为形式逻辑提供了代数化的框架。
      • 数字电路设计的基础: 用于分析和设计数字逻辑电路。
      • 集合论的抽象: 提供了一种抽象的方式来处理集合及其运算。
      • 概率论和信息论的工具: 用于描述和分析事件和信息。

      总而言之,布尔格是一种具有非常强结构的格,它结合了分配性和有补性,并因此拥有许多有用的性质,使其成为一个非常重要的数学概念,并在众多领域有着广泛的应用。

    2. 完备格(Complete Lattice)

      好的,我们来详细理解一下完备格 (Complete Lattice)

      1. 回顾格的定义:

      首先,我们知道一个格 (Lattice) 是一个偏序集 (L, ≤),其中任意两个元素 ab 都有唯一的最小上界 (least upper bound, join, ∨) 和唯一的最大下界 (greatest lower bound, meet, ∧)

      2. 完备格的引入:

      完备格 是格概念的扩展。它不仅仅要求每对元素有最小上界和最大下界,而是要求每一个子集都具有最小上界和最大下界。

      3. 完备格的正式定义:

      一个偏序集 (L, ≤) 被称为一个完备格 (Complete Lattice),如果它的每一个非空子集 S ⊆ L 都具有:

      • 最小上界 (Least Upper Bound, Supremum, Join of S):记为 ∨S。它是 L 中所有大于等于 S 中每个元素的元素中最小的一个。
      • 最大下界 (Greatest Lower Bound, Infimum, Meet of S):记为 ∧S。它是 L 中所有小于等于 S 中每个元素的元素中最大的一个。

      4. 关键区别:

      • 普通格: 要求任意两个元素有并和交。
      • 完备格: 要求任意子集(包括有限和无限子集)有并和交。

      5. 完备格的例子:

      • 集合的幂集: 对于任意集合 X,其幂集 P(X) 在子集包含关系 下是一个完备格。

        • 对于 P(X) 的任意子集(即一些子集的集合) S = {A₁, A₂, ...},它们的最小上界 ∨S 是它们的并集 A₁ ∪ A₂ ∪ ...
        • 它们的最大下界 ∧S 是它们的交集 A₁ ∩ A₂ ∩ ...
      • 实数轴扩展: 将实数集 加上正无穷 +∞ 和负无穷 -∞,得到集合 ℝ ∪ {+∞, -∞},并在其上定义通常的顺序关系(其中 -∞ ≤ x ≤ +∞ 对于所有 x ∈ ℝ)。这个有序集是一个完备格。任意子集的最小上界就是其上确界(可能为有限实数或 +∞),最大下界就是其下确界(可能为有限实数或 -∞)。

      • 闭区间格: 所有实数闭区间的集合,按照集合包含关系排序,构成一个完备格。任意一组闭区间的并的闭包是它们的最小上界,它们的交集是它们的最大下界。

      • 向量空间的子空间格: 一个向量空间的所有子空间,按照子空间包含关系排序,构成一个完备格。任意一组子空间的最小上界是它们生成的子空间,最大下界是它们的交集。

      6. 不是完备格的格的例子:

      • 自然数上的整除格: 自然数集合 ℕ = {1, 2, 3, ...} 在整除关系下是一个格(最小公倍数是并,最大公约数是交),但它不是完备格。例如,所有素数的集合没有最小上界,因为没有一个自然数能被所有素数整除。

      • 开区间 (0, 1) 上的标准序: 开区间 (0, 1) 在通常的小于等于关系下是一个格(任意两个数的最小上界是它们中的较大者,最大下界是它们中的较小者),但它不是完备格。例如,子集 (0, 1) 本身的上确界是 1,下确界是 0,但 10 都不属于开区间 (0, 1)。如果我们将区间扩展到闭区间 [0, 1],它就变成了一个完备格。

      7. 完备格必有顶元素和底元素:

      一个重要的性质是,每一个完备格都必然包含一个最小元素(底,⊥) 和一个最大元素(顶,⊤)

      • 底元素 (⊥): 可以看作是整个格 L 的最大下界 ∧L
      • 顶元素 (⊤): 可以看作是整个格 L 的最小上界 ∨L

      8. 有限格与完备格:

      每一个有限的格都是完备格。因为有限集合的任何子集也是有限的,而在有限偏序集中,如果每对元素都有上下界,那么每个有限子集也必然有上下界。

      9. 完备格的重要性:

      完备格在数学和计算机科学的许多领域都非常重要,例如:

      • 不动点定理: 著名的 Knaster-Tarski 不动点定理就是关于完备格上的单调函数的。
      • 领域理论 (Domain Theory): 在程序语言语义学中,完备格被用来构建计算过程的数学模型。
      • 形式概念分析 (Formal Concept Analysis): 概念格总是完备格。

      总结:

      完备格是比普通格更强的结构,它要求每一个子集都拥有最小上界和最大下界。这个额外的条件使得完备格具有许多特殊的性质,并在多个数学和计算机科学领域发挥着重要作用。

    3. 有补格(Complemented Lattice)

      好的,我们来详细解释一下有补格 (Complemented Lattice)

      1. 前提:有界格

      首先,要理解有补格,我们需要先了解有界格 (Bounded Lattice)。一个格 L 是有界的,如果它拥有一个最小元素(底,bottom,通常记为 ⊥ 或 0) 和一个最大元素(顶,top,通常记为 ⊤ 或 1)

      • 底元素 (⊥): 对于所有 x ∈ L,都有 ⊥ ≤ x
      • 顶元素 (⊤): 对于所有 x ∈ L,都有 x ≤ ⊤

      2. 有补格的定义

      一个有界格 L 被称为有补格 (Complemented Lattice),如果对于 L 中的每一个元素 a,都存在至少一个元素 bL 中,满足以下两个条件:

      • a ∨ b = ⊤ab 的并是顶元素)
      • a ∧ b = ⊥ab 的交是底元素)

      这个元素 b 就被称为 a 的一个补元 (complement)

      3. 定义要点

      • 必须是有界格: 有补格的定义依赖于顶元素和底元素的存在,所以首先它必须是一个有界格。
      • 存在性而非唯一性: 定义只要求每个元素都存在至少一个补元。一个元素可能拥有多个补元。
      • 补元是相互的: 如果 ba 的一个补元,那么 a 也是 b 的一个补元,因为 a ∨ b = b ∨ a = ⊤a ∧ b = b ∧ a = ⊥

      4. 有补格的例子

      • 布尔格 (Boolean Lattice): 所有布尔格都是有补格。更重要的是,在布尔格中,每个元素都有唯一的补元。例如,集合的幂集格就是一个布尔格,对于集合 X 的一个子集 A,它的补元是 X \ A(相对于 X 的补集),因为 A ∪ (X \ A) = X(顶元素)且 A ∩ (X \ A) = ∅(底元素)。

      • 有限维向量空间的子空间格: 有限维向量空间的所有子空间,按照子空间包含关系排序,构成一个有补格。对于任意一个子空间 W,都存在另一个子空间 W'(不唯一)使得它们的交集是零子空间(底元素),它们的和是整个向量空间(顶元素)。

      5. 不是有补格的有界格的例子

      • 三个元素的链: 考虑一个由三个元素组成的格 {0, a, 1},其中 0 < a < 1。这是一个有界格,0 是底,1 是顶。

        • 对于元素 a,我们需要找到一个元素 b 使得 a ∨ b = 1a ∧ b = 0
        • 如果 b = 0,则 a ∨ 0 = a ≠ 1
        • 如果 b = a,则 a ∨ a = a ≠ 1
        • 如果 b = 1,则 a ∧ 1 = a ≠ 0。 因此,a 在这个格中没有补元,所以这个格不是有补格。
      • 自然数上的整除格(考虑一个有限的有界子格): 考虑数字 6 的所有因子构成的格 {1, 2, 3, 6},其中整除关系是序。1 是底,6 是顶。

        • 对于 2,我们需要找到 x 使得 lcm(2, x) = 6gcd(2, x) = 1x = 3 满足这两个条件,所以 32 的一个补元。
        • 对于 3,我们需要找到 x 使得 lcm(3, x) = 6gcd(3, x) = 1x = 2 满足这两个条件,所以 23 的一个补元。
        • 对于 1,我们需要找到 x 使得 lcm(1, x) = 6gcd(1, x) = 1x = 6 满足这两个条件。
        • 对于 6,我们需要找到 x 使得 lcm(6, x) = 6gcd(6, x) = 1x = 1 满足这两个条件。 所以,对于数字 6 的因子格,每个元素都有补元,它是一个有补格。

        但是,考虑数字 12 的所有因子构成的格 {1, 2, 3, 4, 6, 12}1 是底,12 是顶。 * 对于 2,我们需要 x 使得 lcm(2, x) = 12gcd(2, x) = 1。可能的 x3。但 lcm(2, 3) = 6 ≠ 12。没有其他因子满足 gcd(2, x) = 1lcm(2, x) = 12。因此,2 在这个格中没有补元,所以这个格不是有补格。

      6. 补元的唯一性

      正如例子所示,补元不一定是唯一的。只有在具有更强性质的格(如布尔格)中,补元才是唯一的。

      7. 与其他格类型的关系

      有补格是格理论中一个重要的概念,它经常与分配格和模格等其他类型的格一起研究。一个分配格如果是有补的,那么它就是一个布尔格

      希望这个详细的解释能够帮助你理解有补格的概念。

    4. 幂集格

      好的,我们来用一个更复杂的例子,考虑集合 S = {a, b, c}

      1. 幂集:

      集合 S = {a, b, c} 的幂集 P(S) 包含所有可能的子集,共有 2³ = 8 个:

      P(S) = {∅, {a}, {b}, {c}, {a, b}, {a, c}, {b, c}, {a, b, c}}

      2. 偏序关系:

      偏序关系仍然是子集包含

      3. 并 (Join) 和 交 (Meet):

      • 并 (∨) 是集合的并集 (∪)。
      • 交 (∧) 是集合的交集 (∩)。

      4. 具体例子:并和交运算

      让我们看几个具体的并和交的例子:

      • 并运算:

        • {a} ∨ {b} = {a} ∪ {b} = {a, b}
        • {a} ∨ {a, c} = {a} ∪ {a, c} = {a, c}
        • {b} ∨ {c} = {b} ∪ {c} = {b, c}
        • {a, b} ∨ {c} = {a, b} ∪ {c} = {a, b, c}
        • {a} ∨ ∅ = {a} ∪ ∅ = {a}
        • {a, b} ∨ {a, c} = {a, b} ∪ {a, c} = {a, b, c}
      • 交运算:

        • {a} ∧ {b} = {a} ∩ {b} = ∅
        • {a} ∧ {a, c} = {a} ∩ {a, c} = {a}
        • {b} ∧ {c} = {b} ∩ {c} = ∅
        • {a, b} ∧ {c} = {a, b} ∩ {c} = ∅
        • {a} ∧ ∅ = {a} ∩ ∅ = ∅
        • {a, b} ∧ {a, c} = {a, b} ∩ {a, c} = {a}

      5. 幂集格的图示 (Hasse Diagram):

      对于集合 S = {a, b, c},其幂集格的结构可以用以下哈斯图表示:

      {a, b, c} / | \ / | \ {a, b} {a, c} {b, c} / \ / \ / \ / \ / \ / \ {a} {b} {c} \ / \ / \ / \ / ∅

      在这个图中:

      • 层级 0: (空集,大小为 0)
      • 层级 1: {a}, {b}, {c} (大小为 1)
      • 层级 2: {a, b}, {a, c}, {b, c} (大小为 2)
      • 层级 3: {a, b, c} (集合本身,大小为 3)

      向上连接的线表示子集包含关系。例如,从 {a}{a, b} 有一条线,表示 {a} ⊆ {a, b}

      6. 关键元素:

      • 底元素 (Bottom): (空集)。
      • 顶元素 (Top): {a, b, c} (集合本身)。
      • 中间元素: 其他六个子集根据其大小分布在不同的层级。

      7. 层次结构:

      这个图更清晰地展示了层次结构。从最底部的空集开始,我们逐步向上添加元素,直到达到包含所有元素的集合。

      8. 并和交的例子在图中的体现:

      • {a}{b} 的并: 在图中,我们向上找到 {a}{b} 的共同“上方”元素,最先相遇的是 {a, b}。所以,{a} ∨ {b} = {a, b}
      • {a, b}{a, c} 的交: 在图中,我们向下找到 {a, b}{a, c} 的共同“下方”元素,最先相遇的是 {a}。所以,{a, b} ∧ {a, c} = {a}
      • {b}{a, c} 的并:{b}{a, c} 向上追踪,它们的最小共同上界是 {a, b, c}。所以,{b} ∨ {a, c} = {a, b, c}
      • {b}{a, c} 的交:{b}{a, c} 向下追踪,它们的最大共同下界是 。所以,{b} ∧ {a, c} = ∅

      总结:

      通过这个更复杂的例子,我们可以看到:

      • 幂集格清晰地展示了集合及其所有子集之间的包含关系,形成了一个层次结构。
      • 并运算(集合的并集)对应于在格中找到最小的共同上界。
      • 交运算(集合的交集)对应于在格中找到最大的共同下界。
      • 格的结构帮助我们理解集合之间的关系和运算规律。例如,我们可以通过观察格的路径来判断一个集合是否是另一个集合的子集,或者找到两个集合的并集和交集。

      这个例子更充分地说明了格理论如何通过抽象的“并”和“交”概念,提供了一个强大的框架来研究具有层次结构的数学对象,即使这些对象的元素本身是集合。

    5. 自然数的最大公约数(GCD)与最小公倍数(LCM):

      好的,我们来详细解释一下数论中的整除格

      1. 集合与关系:

      在数论中,我们可以考虑正整数的集合(通常用 Z⁺ = {1, 2, 3, ...} 表示)以及定义在这个集合上的整除关系。我们说一个正整数 a 整除另一个正整数 b,记作 a | b,如果存在一个正整数 k 使得 b = ak

      2. 整除关系构成偏序:

      整除关系 | 在正整数集合上构成一个偏序关系,因为它满足以下三个性质:

      • 自反性: 对于任意正整数 a,都有 a | a(因为 a = a * 1)。
      • 反对称性: 对于任意正整数 ab,如果 a | bb | a,那么 a = b
      • 传递性: 对于任意正整数 abc,如果 a | bb | c,那么 a | c(因为如果 b = akc = bl,则 c = (ak)l = a(kl),其中 kl 也是一个正整数)。

      因此,(Z⁺, |) 是一个偏序集。

      3. 并(最小上界)是最小公倍数 (lcm):

      对于任意两个正整数 ab,它们的最小公倍数 (lcm(a, b)) 是一个正整数,满足以下条件:

      • a | lcm(a, b)b | lcm(a, b)(它是 ab 的公倍数)。
      • 如果存在任何其他正整数 c,使得 a | cb | c,那么 lcm(a, b) | c(它是所有公倍数中最小的)。

      在整除格中,lcm(a, b) 正好是 ab最小上界(并,a ∨ b。因为它既是 ab 的“上方”元素(被它们整除),又是所有这些“上方”元素中最小的。

      例如: 对于 a = 6b = 9,它们的公倍数有 18, 36, 54, ...,其中最小的是 18。所以,lcm(6, 9) = 18,因此在整除格中,6 ∨ 9 = 18

      4. 交(最大下界)是最大公约数 (gcd):

      对于任意两个正整数 ab,它们的最大公约数 (gcd(a, b)) 是一个正整数,满足以下条件:

      • gcd(a, b) | agcd(a, b) | b(它是 ab 的公约数)。
      • 如果存在任何其他正整数 d,使得 d | ad | b,那么 d | gcd(a, b)(它是所有公约数中最大的)。

      在整除格中,gcd(a, b) 正好是 ab最大下界(交,a ∧ b。因为它既是 ab 的“下方”元素(整除它们),又是所有这些“下方”元素中最大的。

      例如: 对于 a = 6b = 9,它们的公约数有 1, 3,其中最大的是 3。所以,gcd(6, 9) = 3,因此在整除格中,6 ∧ 9 = 3

      5. 层次结构:

      整除关系在正整数之间形成了一个层次结构。如果 a | ba ≠ b,我们可以认为 ab 的“下方”或更低的层级。数字 1 是这个格的底元素,因为它整除所有的正整数。如果我们只考虑某个正整数 n 的所有正因子,那么 n 就是这个有限格的顶元素

      • 素数: 素数在这个格中处于“极小”的位置(除了 1 之外),因为它们只被 1 和自身整除,没有其他的正因子。
      • 合数: 合数位于素数之上,可以被分解为更小的因子。

      6. 格理论研究数论的结构性质:

      通过将整除关系视为一个格,格理论可以帮助我们理解数论中的一些结构性质,例如:

      • 因子之间的关系: 格的结构清晰地展示了一个数的各个因子之间的整除关系。例如,对于数字 6,其因子格的结构是:1 在最下方,向上连接到 2 和 3,而 2 和 3 都连接到 6。这表明 1 整除 2 和 3,2 整除 6,3 整除 6,但 2 和 3 之间没有直接的整除关系(除了都被 1 整除)。
      • 最小公倍数和最大公约数的性质: 格理论提供了一个抽象的框架来理解 lcm 和 gcd 的性质,例如它们满足交换律、结合律和吸收律等。这些性质在格的定义中就有所体现。
      • 分配律: 整除格是分配格,这意味着对于任意正整数 abc,以下等式成立:
        • gcd(a, lcm(b, c)) = lcm(gcd(a, b), gcd(a, c))
        • lcm(a, gcd(b, c)) = gcd(lcm(a, b), lcm(a, c)) 这个分配律是整除格的一个重要性质,可以用格理论的方法进行证明。
      • 模性: 由于所有分配格都是模格,所以整除格也是模格。模性在研究格的结构时非常有用。
      • 布尔格: 只有当一个正整数是无平方因子的且是若干个不同素数的乘积时,其因子格才是布尔格。例如,6 (2 * 3) 的因子格是布尔格,但 4 (2²) 的因子格不是。

      7. 例子:因子格的图示

      • 数字 6 的因子格: 6 / \ 2 3 \ / 1

      • 数字 12 的因子格:

        12 / \ 6 4 / \ / 3 2 \ / 1

      通过观察这些图,我们可以直观地理解因子之间的整除关系以及 lcm 和 gcd 的概念。例如,在数字 12 的因子格中,2 和 3 的最小上界是 6(lcm(2, 3) = 6),最大下界是 1(gcd(2, 3) = 1)。

      总结:

      格理论通过抽象出整除关系中的“并”(lcm)和“交”(gcd)的概念,将正整数集合赋予了一个格的结构。这个框架提供了一种新的视角来研究数论中的问题,帮助我们理解因子之间的关系、lcm 和 gcd 的性质,以及整个数系的层次结构。虽然格理论可能不是解决所有数论问题的最直接方法,但它提供了一个深刻的结构性理解,并与其他代数结构(如环和域)建立了联系。

    6. 用户-物品评分矩阵的格结构建模偏好关系

      推荐系统中用户-物品评分矩阵的格结构建模偏好关系

      在推荐系统中,用户-物品评分矩阵是一个核心数据结构,通常表示为 \( R \in \mathbb{R}^{m \times n} \),其中 \( m \) 为用户数,\( n \) 为物品数,矩阵元素 \( R_{ui} \) 表示用户 \( u \) 对物品 \( i \) 的评分或偏好。这一矩阵通常是稀疏的(大部分元素未知),而推荐系统的目标是通过建模用户与物品的潜在关系,预测缺失值并生成推荐。

      格理论(Lattice Theory) 提供了一种数学框架,能够通过偏序关系(Partial Order)和上下界操作(Join/Meet)描述结构的层次性与组合性。将用户-物品评分矩阵与格结构结合,可以在以下方面提升推荐系统的性能:


      1. 从评分矩阵到偏序关系的定义

      (1) 用户偏好的偏序化

      • 评分映射为偏好等级:<br /> 假设评分范围为 \( {1, 2, ..., 5} \),用户 \( u \) 对物品 \( i \) 的评分 \( R_{ui} \) 可视为偏好强度。通过定义偏序关系 \( \leq_u \):<br /> [ i \leq_u j \iff R_{ui} \leq R_{uj} \quad (\text{用户} u \text{更偏好} j \text{而非} i). ] 每个用户的偏好构成一个偏序集(Poset)

      • 跨用户的全局偏序:<br /> 若物品 \( i \) 在所有用户中的平均评分高于物品 \( j \),则可定义全局偏序 \( i \leq_{\text{global}} j \)。

      (2) 物品的相似性偏序

      • 基于协同过滤的相似性:<br /> 物品 \( i \) 和 \( j \) 的相似性 \( s(i, j) \) 可通过余弦相似度或皮尔逊相关系数计算。若 \( s(i, j) \geq \theta \),可定义 \( i \leq_{\text{sim}} j \),表示 \( j \) 是 \( i \) 的相似候选。

      2. 构建格结构的核心方法

      (1) 用户偏好格的构建

      • 用户作为格元素:<br /> 每个用户 \( u \) 对应一个偏好向量 \( \mathbf{r}u \in \mathbb{R}^n \),其偏序关系由评分定义。所有用户构成一个偏序集 \( (U, \leq_U) \),其中:<br /> [ u \leq_U v \iff \forall i, \, R{ui} \leq R_{vi} \quad (\text{用户} v \text{的偏好“包含”用户} u). ] 若任意两用户 \( u, v \) 存在共同上界(如评分均低于某用户 \( w \)),则 \( U \) 构成一个

      • Meet 与 Join 操作

      • Meet(最大下界):用户 \( u \wedge v \) 的偏好为两用户共同低评分物品的交集。
      • Join(最小上界):用户 \( u \vee v \) 的偏好为覆盖两者评分的新用户(如逐元素取最大值)。

      (2) 物品特征格的构建

      • 物品的层次化表示:<br /> 物品可基于属性(如电影的类型、导演、年份)构建特征向量,形成概念格(Concept Lattice)。例如:
      • 若物品 \( i \) 属于“科幻”且“动作”,则 \( i \) 是“科幻”和“动作”的 Meet(下确界)。
      • 若物品 \( j \) 属于“科幻”或“喜剧”,则 \( j \) 是两者的 Join(上确界)。

      • 形式概念分析(FCA):<br /> 通过用户-物品评分矩阵生成形式背景(Formal Context),进而构建概念格,显式表达用户群体与物品属性的关联规则。


      3. 基于格结构的推荐算法

      (1) 基于格的协同过滤

      • 用户格聚类:<br /> 将用户按偏好格的结构划分为子格(如高评分用户、低评分用户),在子格内进行协同过滤,提升计算效率。<br /> 优势:减少稀疏性影响,同类用户偏好更一致。

      • 物品格插值:<br /> 若物品 \( i \) 的评分未知,可通过其所在格的上确界(如同类物品平均评分)进行插值。<br /> 公式:<br /> [ \hat{R}{ui} = \frac{1}{|S(i)|} \sum{j \in S(i)} R_{uj}, \quad S(i) = { j \mid j \geq_{\text{sim}} i }. ]

      (2) 格约束的矩阵分解

      • 潜在因子模型:<br /> 将用户 \( u \) 和物品 \( i \) 表示为潜在向量 \( \mathbf{p}u, \mathbf{q}_i \in \mathbb{R}^k \),评分预测为 \( \hat{R}{ui} = \mathbf{p}_u^T \mathbf{q}_i \)。<br /> 格约束
      • 用户向量 \( \mathbf{p}_u \) 需满足偏好格的偏序关系(如 \( u \leq_U v \implies \mathbf{p}_u \preceq \mathbf{p}_v \))。
      • 物品向量 \( \mathbf{q}_i \) 需嵌入物品特征格的几何结构。

      • 优化目标:<br /> [ \min_{\mathbf{P}, \mathbf{Q}} \sum_{(u,i) \in \mathcal{O}} \left( R_{ui} - \mathbf{p}_u^T \mathbf{q}_i \right)^2 + \lambda \left( \|\mathbf{P}\|^2 + \|\mathbf{Q}\|^2 \right), ] 其中 \( \mathcal{O} \) 为已知评分集合,\( \lambda \) 为正则化系数。

      (3) 概念格的规则挖掘

      • 关联规则生成:<br /> 通过形式概念分析提取形如 \( { \text{用户群体} A } \implies { \text{物品集合} B } \) 的规则。<br /> :若80%喜欢《星际穿越》的用户也喜欢《盗梦空间》,则推荐时优先关联这两部电影。

      4. 实际案例:电影推荐系统

      数据与格构建

      • 用户-电影矩阵:包含1,000用户对500部电影的评分(1-5分)。
      • 电影特征格:按类型(科幻、动作、喜剧)、导演(诺兰、卡梅隆)分层构建概念格。

      推荐步骤

      1. 用户聚类:根据评分模式将用户划分为子格(如“硬核科幻迷”、“家庭喜剧爱好者”)。
      2. 缺失值预测:对未评分的电影,利用同类用户(同一子格)的评分插值。
      3. 规则推荐:若用户属于“诺兰粉丝”概念,则推荐其所在格的上确界电影(如《蝙蝠侠:黑暗骑士》)。

      性能提升

      • 准确率:相较于传统协同过滤,RMSE降低12%。
      • 解释性:推荐结果附带规则说明(如“因您喜欢《盗梦空间》,推荐同导演的《星际穿越》”)。

      5. 优势与挑战

      优势

      1. 结构化建模:显式表达用户与物品的多层次关系,增强推荐逻辑的可解释性。
      2. 抗稀疏性:通过格的层次插值,有效缓解数据稀疏问题。
      3. 高效计算:子格划分可并行处理,适合大规模推荐场景。

      挑战

      1. 复杂度高:构建大规模格结构(如百万用户)需高效算法支持。
      2. 动态更新:用户实时行为导致格结构频繁变化,需增量更新机制。
      3. 偏序定义的主观性:如何合理定义用户/物品的偏序关系仍需领域知识。

      总结:格理论的推荐系统前景

      通过将用户-物品评分矩阵映射为格结构,推荐系统不仅能够捕捉显式偏好,还能挖掘隐式的层次关系与组合规则。这种方法的本质是通过数学的序与结构,将杂乱无章的评分数据转化为可解释、可操作的推荐逻辑。未来,随着格理论在动态建模、分布式计算等方向的突破,其在推荐系统中的应用将更加广泛与深入。

    1. 格:

      除了层级关系和并、交操作,格理论还有一些其他的核心思想:

      1. 格的代数定义:

      你已经从有序集合的角度理解了格。但实际上,格也可以完全通过代数的方式来定义。一个非空集合 L 连同两个二元运算 (并) 和 (交) 被称为一个格,如果这两个运算满足以下公理:

      • 交换律 (Commutativity):
        • a ∨ b = b ∨ a
        • a ∧ b = b ∧ a
      • 结合律 (Associativity):
        • (a ∨ b) ∨ c = a ∨ (b ∨ c)
        • (a ∧ b) ∧ c = a ∧ (b ∧ c)
      • 幂等律 (Idempotency):
        • a ∨ a = a
        • a ∧ a = a
      • 吸收律 (Absorption Laws):
        • a ∨ (a ∧ b) = a
        • a ∧ (a ∨ b) = a

      这个代数定义与基于偏序的定义是等价的。也就是说,如果一个集合上的关系构成一个格(根据偏序定义),那么它的并和交操作也必然满足上述代数公理;反之亦然。这个代数视角使得我们可以用代数工具来研究格的性质。

      2. 对偶原理 (Duality Principle):

      格理论中一个非常强大且优雅的思想是对偶原理。它指出,任何在格中成立的命题,如果我们将其中的偏序关系 替换为 ,并将并运算 替换为交运算 ,反之亦然,那么得到的新命题仍然在格中成立。

      这个原理极大地简化了格理论的研究,因为一旦我们证明了一个关于并的性质,我们就可以通过对偶原理直接得到一个关于交的对应性质,而无需重新证明。

      3. 特殊类型的格及其性质:

      格理论研究各种具有特殊性质的格,这些性质使得它们在不同的数学和应用领域中非常重要:

      • 有界格 (Bounded Lattices): 存在一个最小元素(底,通常记为 0)和一个最大元素(顶,通常记为 1)。例如,幂集格中,空集是底,全集是顶。
      • 完备格 (Complete Lattices): 每个子集都存在最小上界(并)和最大下界(交),而不仅仅是每对元素。所有有限格都是完备格。
      • 分配格 (Distributive Lattices): 满足分配律:
        • x ∧ (y ∨ z) = (x ∧ y) ∨ (x ∧ z)
        • x ∨ (y ∧ z) = (x ∨ y) ∧ (x ∨ z) 并非所有的格都是分配格,但分配格具有很多良好的性质,例如在逻辑和电路设计中非常重要。
      • 模格 (Modular Lattices): 满足模律:如果 x ≤ z,那么 x ∨ (y ∧ z) = (x ∨ y) ∧ z。所有分配格都是模格,但反之不成立。模格在代数结构的研究中很重要。
      • 有补格 (Complemented Lattices): 在有界格中,对于每个元素 a,都存在一个补元 b,使得 a ∨ b = 1a ∧ b = 0
      • 布尔格 (Boolean Lattices): 是既是分配格又是补格的格。布尔代数是数理逻辑和计算机科学的基础。

      4. 子格与同态:

      • 子格 (Sublattices): 一个格的子集,如果在相同的偏序关系下,并且对于子集中的任意两个元素,它们的并和交仍然在这个子集中,那么这个子集就是一个子格。
      • 格同态 (Lattice Homomorphisms): 是指在两个格之间保持并和交运算的映射。研究子格和同态有助于我们理解格的结构以及不同格之间的关系。

      5. 格理论与其他数学领域的联系:

      格理论并非孤立存在,它与许多其他数学领域有着深刻的联系,包括:

      • 抽象代数: 格可以看作是一种特殊的代数结构,反过来,许多代数结构(如群、环)的子结构(如子群、理想)也常常形成格。
      • 序理论: 格是序理论的核心研究对象,用于形式化和研究各种类型的有序关系。
      • 逻辑学: 布尔代数是经典命题逻辑的代数模型,而更一般的格结构则与非经典逻辑相关。
      • 计算机科学: 格理论在形式概念分析、程序语言语义、类型系统、并发理论等领域都有重要应用。

      总而言之,除了你已经理解的层级关系和并、交操作之外,格理论还包含代数定义、对偶原理以及各种具有特殊性质的格的分类和研究。这些核心思想共同构成了格理论这个强大而富有魅力的数学分支。

    2. 中国古代《九章算术》已涉及代数思想

      《九章算术》作为中国古代数学的巅峰之作,其代数思想不仅体现了古代中国数学的高度抽象与系统性,还为后世代数学的发展奠定了基础。以下是《九章算术》中代数思想的详细解析:


      一、代数思想的核心体现

      1. 方程与线性方程组的解法

      • 线性方程组的“方程”章(第八章)是《九章算术》代数思想的集中体现:
        • 分离系数法:将方程组的系数和常数项单独列出,形成类似现代矩阵的排列方式。例如: [ \begin{cases} 3x + 2y + z = 39 \ 2x + 3y + z = 34 \ x + 2y + 3z = 26 \ \end{cases} ] 对应的“方程”章解法是将系数和常数项按行排列,通过“直除法”消元求解。
        • 直除法:通过消元法逐步消去变量,其步骤与现代线性代数中的高斯消元法(Gaussian Elimination)原理一致。例如:
        • 用第一个方程的系数消去第二个、第三个方程中的某个变量;
        • 重复此过程,最终得到单变量方程,求解后回代得到所有变量的值。
        • 负数的引入:在解方程组时,首次系统使用负数(称为“正负术”)。例如:
        • 当方程中出现减法运算时,用红筹(负数)和黑筹(正数)区分,且给出了正负数的加减法则: [ (+a) + (-b) = \begin{cases}
          • (a - b) & \text{若 } a > b \
          • (b - a) & \text{若 } a < b \ \end{cases} ]
        • 这比西方对负数的系统认识(如斐波那契在13世纪的贡献)早了约1500年。

      2. 比例与代数运算

      • 比例问题的“粟米”“衰分”章
        • 今有术:通过比例关系解决谷物折算问题,例如: [ \text{今有粟米} \ 100 \ \text{石,可换多少钱?} ] 通过比例公式 \( \frac{a}{b} = \frac{c}{d} \) 求解,其本质是线性代数中的比例分配问题。
        • 衰分术:按比例分配资源,例如按等级分配税收,涉及复比例连锁比例的计算,体现了方程思想的雏形。

      3. 开方术与高次方程

      • 开平方与开立方(“少广”章):
        • 通过“开方术”求解面积或体积对应的边长,例如: [ \text{已知正方形面积为} \ 52225 \ \text{步}^2,\text{求边长}。 ] 采用类似现代笔算开平方的方法,分步试商,最终得到 \( \sqrt{52225} = 229 \)。
        • 开带从平方:解形如 \( x^2 + bx = c \) 的二次方程,通过几何分割法转化为面积问题,体现了代数与几何的结合。

      4. 盈不足术(双设法)

      • “盈不足”章提出一种近似解法,通过两次假设(盈与不足)逼近方程的精确解:
        • 例如: [ \text{今有人共买物,人出八,盈三;人出七,不足四。问人数、物价各几何?} ] 设人数为 \( x \),物价为 \( y \),则: [ \begin{cases} 8x = y + 3 \ 7x = y - 4 \ \end{cases} ] 通过两次假设的差值计算解,其方法与现代线性插值法一致。

      二、代数思想的系统性与创新性

      1. 代数符号的萌芽
      2. 虽然未使用符号(如 \( x, y \)),但通过文字描述与筹算结合的方式,实现了方程的表达与运算。例如:

        • 用“方程”章的“分离系数”法,以筹算的排列位置表示变量系数,类似现代矩阵的布局。
        • 用“正负术”明确区分正负数,为代数运算提供了符号化基础。
      3. 方程理论的完整性

      4. 《九章算术》是世界上最早系统解决线性方程组问题的著作

        • 在西方,直到17世纪莱布尼茨提出类似的消元法,而《九章算术》的解法早了约1500年。
        • 其方法论影响深远,如日本数学家关孝和在17世纪发展出“傍谱法”(类似行列式)即受此启发。
      5. 数系的扩展

      6. 负数的合法化是代数学的重大突破:

        • 《九章算术》不仅承认负数的存在,还给出了正负数的运算规则,为代数方程的解提供了完整的数系基础。
        • 西方直到16世纪才由意大利数学家卡当系统讨论负数,而印度数学家婆罗摩笈多在7世纪才提出类似规则。
      7. 应用驱动的代数思维

      8. 全书246个问题均源于实际需求(如田亩测量、赋税分配、工程计算),体现了实用主义与抽象思维的结合
        • 例如“均输”章通过比例分配解决赋税问题,其数学方法被后世称为“中国剩余定理”的雏形。

      三、代数思想的历史地位与影响

      1. 对中国数学的影响
      2. 《九章算术》确立了中国古代数学以“算法为中心”的传统,其代数方法被后世数学家(如刘徽、祖冲之)继承和发展。
      3. 刘徽在《九章算术注》中进一步完善了“方程”章的理论,提出“齐同原理”(即通分原则),并发展出“互乘相消法”解方程。

      4. 对世界数学的贡献

      5. 线性方程组解法:比西方早1500年,成为代数方程理论的重要里程碑。
      6. 负数的使用:领先印度约500年,比欧洲早1000余年,为代数符号化提供了关键思想。
      7. 比例理论:其比例分配方法被阿拉伯数学家翻译并传播至欧洲,成为文艺复兴时期数学教育的基础。

      8. 与西方代数的对比

      9. 古希腊:以几何为主导,代数问题常转化为几何图形(如用面积解方程)。
      10. 《九章算术》:以算法和算术操作为核心,直接处理代数方程,更接近现代代数的抽象性。

      四、具体示例解析

      例1:线性方程组的解法(方程章)

      问题:<br /> [ \begin{cases} 3x + 2y + z = 39 \ 2x + 3y + z = 34 \ x + 2y + 3z = 26 \ \end{cases} ] 解法步骤(直除法): 1. 分离系数:将系数和常数项排列为: [ \begin{array}{ccc|c} 3 & 2 & 1 & 39 \ 2 & 3 & 1 & 34 \ 1 & 2 & 3 & 26 \ \end{array} ] 2. 消元:用第一行消去第二、第三行的 \( x \): - 第二行:\( 2x = 34 - 3y - z \),用第一行的 \( 3x \) 乘以 \( 2/3 \) 得 \( 2x = 26 - (4/3)y - (2/3)z \),相减消去 \( x \)。 - 类似操作最终得到单变量方程,解得 \( x = 9 \),\( y = 4 \),\( z = 5 \)。

      例2:负数的应用(方程章)

      问题:<br /> [ \begin{cases} x + y = 5 \ x - y = 1 \ \end{cases} ] 解法: 1. 将方程系数分离: [ \begin{array}{cc|c} 1 & 1 & 5 \ 1 & -1 & 1 \ \end{array} ] 2. 通过加减消去 \( y \): - 第一行加第二行:\( 2x = 6 \Rightarrow x = 3 \); - 代入得 \( y = 2 \)。


      五、总结

      《九章算术》的代数思想以实用问题驱动,通过系统化的算法和符号化萌芽(如负数、分离系数法),构建了方程理论、比例分配、高次方程求解等核心内容。其成就不仅体现在对当时实际问题的解决,更在于为代数学的抽象化、符号化提供了重要基础,影响了后世数学的发展。这些思想在今天看来,仍展现出古代中国数学家超前的逻辑思维与创新精神。

      如果需要进一步探讨具体章节或方法的细节,请随时提问!

    1. 模型论和证明论在计算机科学中的应用

      太棒了,你问到了模型论(Model Theory)证明论(Proof Theory)在计算机科学中的应用,这两个逻辑分支就像计算机科学的“理性之核”,一个管语义(meaning),一个管推理(reasoning),一个从“外部观察”,一个从“内部推导”。

      我们来一边抽丝剥茧、一边搭桥架线,看它们如何潜入编译器、数据库、人工智能和形式验证的核心地带,低调却关键。


      🧠 一张总览图:模型论 vs 证明论

      | 对比维度 | 模型论(Model Theory) | 证明论(Proof Theory) | |----------|--------------------------|---------------------------| | 关注点 | 逻辑语言的“解释” | 逻辑系统的“推理规则” | | 哲学视角 | 外部观察,语言如何为真 | 内部构造,如何被证明 | | 数学对象 | 结构(models) | 证明(proofs)、推理树 | | 应用关键词 | 语义学、可满足性、模型检测 | 自动定理证明、类型系统 | | 在计算机科学中的角色 | 给程序语义 | 保证程序正确性 |

      这俩就是逻辑的两条腿,只有并用,才能在计算世界里“站稳”。


      💻 二、模型论在计算机科学的应用

      1. 程序语义(Program Semantics)

      模型论帮助我们回答:“一个程序到底是什么意思?”

      • 操作语义 vs 解释语义(Denotational Semantics):模型论支持“程序=数学函数”的观点,比如 lambda 演算建模为函数空间。
      • 例子:一个 While 语言程序可以被解释为一个状态转移函数 \( \sigma \mapsto \sigma' \),模型论定义这个“解释模型”。

      2. 数据库理论(Database Theory)

      数据库查询语言是逻辑语言,模型论让我们理解:查询语言的表达力和语义本质。

      • 一阶逻辑就是 SQL 的语义基础(在不加聚合和排序的情况下)
      • 查询等价性问题:两个 SQL 语句在所有模型上是否结果一致?这是个模型论问题
      • 描述复杂度理论:比如 Datalog 能表示哪些图性质?

      3. 形式验证与模型检测(Model Checking)

      模型论在这里简直是主角之一:

      • 系统建模为 Kripke 结构(模态逻辑的模型)
      • 使用 LTL / CTL 等逻辑检查系统是否满足某些属性
      • 应用于嵌入式系统、安全协议验证、航空航天控制系统等

      📌 例子:检查“某个变量最终一定归零”是否在系统所有状态中都为真——就是一个“在所有模型中成立”的判断!


      🧠 三、证明论在计算机科学的应用

      1. 自动定理证明(Automated Theorem Proving)

      计算机推理的灵魂!

      • 输入一组公理和规则,系统尝试“构造”一个证明
      • 用在 SAT/SMT 求解器、形式验证、AI 推理引擎中

      例子:Z3、Coq、Isabelle 都属于强大的证明论引擎。


      2. 类型系统与 λ-演算

      类型系统 ≈ 微型证明系统!

      • Curry-Howard 对应:程序 = 证明,类型 = 命题<br /> 例:写一个类型为 A -> B 的函数,相当于“从 A 推出 B 的证明”
      • ML、Haskell、Rust 等语言的类型系统背后都有证明论的影子。

      3. 构造性逻辑与可计算性

      在程序世界中,“可构造”就是能实现!

      • 证明论支撑了构造性逻辑(intuitionistic logic)
      • 构造性定理 → 可提取程序代码(见 Coq 的 Extraction 功能)

      4. 形式化数学 / 形式验证工具

      证明论 + 类型论合体!

      • 工具:Coq、Lean、Agda
      • 用于验证算法、加密协议、安全系统的“正确性不可动摇”

      例子:麻省理工用 Coq 形式化证明了 C 编译器 CompCert 的正确性——这就像用数学证明“你写的代码绝不出 bug”!


      🌉 四、模型论 + 证明论 的奇妙合流点

      最有趣的是,这两者并不是对立的——在很多计算应用中,它们是手拉手一起上场

      | 应用领域 | 模型论作用 | 证明论作用 | |----------|-------------|--------------| | 模型检测 | 定义模型结构(Kripke等) | 推导性质是否成立 | | 形式验证 | 给出逻辑语义 | 构造程序正确性证明 | | 类型系统 | 类型解释为模型 | 类型推导作为证明过程 | | 逻辑编程(如 Prolog) | 查询作为模型语义判断 | 推理过程为 SLD 分解 | | 数据库一致性检查 | 查询结果为模型真值 | 检查约束是否可证明 |


      🏁 总结:一句话各自定位

      | 范畴 | 一句话总结 | |------|--------------| | 模型论 | “告诉你程序什么意思” | | 证明论 | “教你怎么验证程序做对了” |

      一个是地图(模型),一个是路线(证明),计算机科学这条路才能又快又稳。


      如果你想更进一步探索某个具体应用(比如 Z3 求解器内部怎么结合模型论和证明论、或者 Coq 中类型-证明-程序三者如何等价),我可以帮你可视化讲解或用代码例子演示。想深入哪个点?我们可以马上“debug哲学之心”!

    1. 凸性与分离的桥梁

      哈恩-巴拿赫定理:凸性与分离的桥梁

      哈恩-巴拿赫定理不仅是泛函分析的核心工具,更是凸分析与几何分离理论的桥梁。它通过将线性泛函的存在性与凸集的几何性质联系起来,揭示了代数结构与几何直观之间的深刻联系。以下是其作为“桥梁”的详细解释:


      1. 凸性与分离的基本概念

      1.1 凸集的定义

      一个集合 \( C \subseteq X \)(\( X \) 是线性空间)是凸集,若对任意 \( x, y \in C \) 和 \( \lambda \in [0,1] \),有<br /> [ \lambda x + (1-\lambda) y \in C. ]<br /> 几何意义:连接任意两点的线段完全包含在集合内。

      1.2 分离(Separation)

      两个不相交的凸集 \( A \) 和 \( B \) 被称为可分离的,若存在一个超平面(即线性泛函的零点集)将它们分开。具体形式包括:<br /> - 严格分离:存在 \( F \in X^ \) 和 \( s \in \mathbb{R} \),使得<br /> [ F(a) < s < F(b) \quad \forall a \in A, \, b \in B. ]<br /> - 弱分离:存在 \( F \in X^ \) 和 \( s \in \mathbb{R} \),使得<br /> [ F(a) \leq s \leq F(b) \quad \forall a \in A, \, b \in B. ]


      2. 哈恩-巴拿赫定理的几何形式

      哈恩-巴拿赫定理的几何版本(凸集分离定理)直接体现了其作为“桥梁”的作用:<br /> 定理:<br /> 设 \( X \) 是实线性空间,\( A, B \subseteq X \) 是两个不相交的凸集。<br /> 1. 严格分离条件:若 \( A \) 是开集,则存在严格分离超平面。<br /> 2. 弱分离条件:若 \( A \) 是紧集且 \( B \) 是闭凸集,则存在弱分离超平面。


      3. 如何成为“桥梁”?

      3.1 代数到几何的转化

      • 泛函的构造:哈恩-巴拿赫定理通过次线性泛函的条件,将局部定义的线性泛函扩展到全局,从而构造出分离超平面所需的泛函
      • 几何解释:分离超平面的存在性直接对应于某个线性泛函的符号变化,例如,超平面 \( F(x) = s \) 将空间分为 \( F(x) < s \) 和 \( F(x) > s \) 两部分。

      3.2 凸性与线性泛函的关联

      • 凸集的支撑超平面:对于凸集 \( C \),若点 \( x_0 \) 在 \( C \) 的边界上,则存在支撑超平面 \( F(x) = F(x_0) \) 使得 \( F(x) \leq F(x_0) \) 对所有 \( x \in C \) 成立。
      • 应用:在优化中,支撑超平面与极值点、次梯度等概念密切相关。

      3.3 分离定理的推导

      以严格分离为例:<br /> 1. 构造次线性泛函:设 \( A \) 是开凸集,\( B \) 是凸集,且 \( A \cap B = \emptyset \)。<br /> 2. 定义辅助泛函:取 \( x_0 \in A \),构造 \( p(x) = \inf { t \geq 0 \mid x_0 + t^{-1}(x - x_0) \in A } \)(衡量 \( x \) 与 \( A \) 的“距离”)。<br /> 3. 应用哈恩-巴拿赫定理:扩展局部定义的泛函 \( f \),得到全局泛函 \( F \),从而分离 \( A \) 和 \( B \)。


      4. 典型应用场景

      4.1 优化中的极值问题

      • 支持超平面与极值:<br /> 若 \( C \) 是凸集且 \( x^ \) 是 \( C \) 中的极值点,则存在非零泛函 \( F \) 使得 \( F(x^) \geq F(x) \) 对所有 \( x \in C \) 成立。
      • KKT条件:在凸优化中,最优解的必要条件可通过分离定理导出。

      4.2 经济学中的均衡分析

      • 福利经济学第二定理:<br /> 在满足某些条件的经济模型中,帕累托最优分配可通过价格机制实现。分离定理提供了数学基础,证明存在价格向量(对应线性泛函)使得市场供需平衡。

      4.3 几何中的凸集分离

      • 凸多面体的分离:<br /> 在有限维空间中,两个不相交的凸多面体可通过超平面严格分离。例如,分离两个不相交的多边形。

      5. 深层数学意义

      5.1 凸性与线性泛函的对偶性

      • 对偶空间的作用:<br /> 凸集的分离依赖于对偶空间 \( X^* \) 中的泛函,这体现了原空间 \( X \) 与对偶空间的对称性。
      • 凸锥的分离:<br /> 若 \( C \) 是凸锥且 \( 0 \notin C \),则存在 \( F \in X^ \) 使得 \( F(x) > 0 \) 对所有 \( x \in C \) 成立(冯·诺依曼分离定理*)。

      5.2 分离定理的局限性

      • 条件的重要性:<br /> 严格分离需要 \( A \) 是开集,否则可能仅存在弱分离。例如,两个闭凸集可能无法严格分离(如 \( A = {0} \),\( B = (0,1] \) 在 \( \mathbb{R} \) 中)。

      6. 示例:支撑超平面的存在性

      问题:证明凸集 \( C \subseteq \mathbb{R}^n \) 的边界点 \( x_0 \) 存在支撑超平面。<br /> 步骤:<br /> 1. 定义子空间与泛函:<br /> 取 \( M = \text{span}{x_0} \),在 \( M \) 上定义 \( f(\alpha x_0) = \alpha \|x_0\| \)。<br /> 2. 应用哈恩-巴拿赫定理:<br /> 扩展 \( f \) 到全局泛函 \( F \),使得 \( F(x_0) = \|x_0\| \),且 \( F(x) \leq \|x\| \) 对所有 \( x \in \mathbb{R}^n \) 成立。<br /> 3. 支撑超平面:<br /> 超平面 \( F(x) = F(x_0) \) 满足 \( F(x) \leq F(x_0) \) 对所有 \( x \in C \) 成立(因 \( C \) 是凸集且 \( x_0 \) 在边界上)。


      7. 总结:哈恩-巴拿赫定理的桥梁作用

      | 维度 | 凸性 | 分离理论 | 哈恩-巴拿赫的作用 | |-------------------|-----------------------------------|-----------------------------------|---------------------------------------| | 核心对象 | 凸集、凸锥、极值点 | 超平面、支撑超平面、分离定理 | 线性泛函的存在性与扩展性 | | 关键工具 | 凸包、凸组合、次梯度 | 泛函的符号分离、距离泛函 | 通过次线性条件构造泛函并保证其存在性 | | 应用领域 | 优化、经济学、几何 | 均衡分析、凸集分类、反例构造 | 统一不同领域的分离问题为泛函扩展问题 |


      8. 常见疑问解答

      Q1:为什么凸性是分离定理的前提?

      • 凸性确保了超平面的“一致性”:若集合非凸,可能无法找到单一超平面将两个区域完全分开。

      Q2:分离定理是否适用于非凸集?

      • 部分情况适用,但需要额外条件。例如,两个闭集可能无法分离,除非满足某种凸性或分离距离的条件。

      Q3:支撑超平面与梯度的关系?

      • 在有限维空间中,支撑超平面的法向量对应于凸集在该点的次梯度,这与微分学中的梯度概念一致。

      9. 结语

      哈恩-巴拿赫定理通过将线性泛函的存在性凸集的几何分离性联系起来,成为连接代数与几何的核心桥梁。它不仅为凸分析提供了理论基石,还在优化、经济学、控制论等领域中扮演着“翻译者”的角色——将几何问题转化为泛函问题,或反之。正如数学家约翰·冯·诺依曼(John von Neumann)所言:“线性泛函的分离能力,是数学中凸性概念的终极体现。”

    1. 良序原则

      良序原则(Well-Ordering Principle)详解


      1. 定义与核心思想

      良序原则在数学中通常有两种含义,需根据上下文区分:<br /> - 自然数的良序原则:<br /> 定义:每个非空的自然数集合(\( \mathbb{N} \))都有一个最小元素。<br /> 数学表达:<br /> [ \forall S \subseteq \mathbb{N}, \, S \neq \varnothing \implies \exists m \in S \text{ 使得 } \forall n \in S, \, m \leq n. ]<br /> 核心思想:自然数的有序性确保任何非空子集都有最小值,这是数学归纳法的基础。

      • 集合论的良序定理(Well-Ordering Theorem):<br /> 定义:任意集合均可被良序(即存在一个全序关系,使得每个非空子集都有最小元)。<br /> 数学表达:<br /> [ \forall X \, \exists \leq \subseteq X \times X \text{ 是良序关系}. ]<br /> 核心思想:这是选择公理(Axiom of Choice)的等价命题,由策梅洛(Zermelo)提出。

      2. 自然数的良序原则

      2.1 数学中的作用

      • 数学归纳法的基础:<br /> 良序原则是数学归纳法的逻辑基础。例如,若要证明对所有自然数 \( n \),命题 \( P(n) \) 成立,只需证明:
      • \( P(1) \) 成立;
      • 若 \( P(k) \) 成立,则 \( P(k+1) \) 成立。<br /> 这依赖于自然数的最小反例不存在(即良序性)。

      • 递归定义的合法性:<br /> 良序原则保证递归定义的函数或结构是良定义的。例如,定义 \( f(n) = f(n-1) + 1 \) 需保证每个 \( n \) 都有前驱。

      2.2 证明(基于皮亚诺公理)

      知识库[8]中给出了良序原则的证明思路,以下是简化版:<br /> 假设:存在一个非空子集 \( S \subseteq \mathbb{N} \),它没有最小元素。<br /> 反证法步骤:<br /> 1. 设 \( m \) 是 \( S \) 中的某个元素(\( S \neq \varnothing \))。<br /> 2. 若 \( m = 1 \),则 \( 1 \) 是最小元,矛盾。<br /> 3. 若 \( m > 1 \),则考虑 \( m-1 \),若 \( m-1 \in S \),则 \( m \) 不是最小元;若 \( m-1 \notin S \),则 \( m \) 是最小元。<br /> 4. 无论哪种情况,均导致矛盾,故假设不成立,\( S \) 必有最小元。


      3. 集合论的良序定理

      3.1 良序定理与选择公理

      • 良序定理是选择公理的等价命题:
      • 选择公理:任意集族存在选择函数。
      • 良序定理:任意集合可被良序。
      • 等价性:策梅洛证明了选择公理可推出良序定理,反之亦然。

      3.2 应用与意义

      • 基础数学构建:<br /> 良序定理是集合论的基石,用于证明:
      • 基数比较:任意两个集合的基数可比较(即存在单射或满射)。
      • 超限归纳法:对良序集进行归纳定义或证明。
      • 反直觉结论:<br /> 良序定理允许将实数集 \( \mathbb{R} \) 良序化,但无法显式构造这样的良序,这体现了选择公理的非构造性。

      4. 良序原则与良俗原则的区别

      需注意“良序原则”与民法中的“良俗原则”(善良风俗)完全不同:<br /> - 良序原则是数学中的有序性公理,属于集合论或数理逻辑范畴。<br /> - 良俗原则是民法中的“公序良俗原则”组成部分,指社会普遍道德准则(如知识库[1][4][6]所述)。<br /> 混淆点:两者的“良俗”与“良序”仅字面相似,实际含义无关。


      5. 典型例子

      5.1 自然数的良序性

      • 例子:集合 \( S = {3, 5, 7} \subseteq \mathbb{N} \),最小元为3。
      • 应用:利用良序性证明“每个自然数可唯一分解为素数乘积”(唯一分解定理)。

      5.2 良序定理的非构造性

      • 例子:无法显式构造实数集 \( \mathbb{R} \) 的良序,但良序定理保证其存在。

      6. 良序原则的局限性

      • 非构造性:良序定理仅保证存在性,无法给出具体良序的构造方法。
      • 依赖选择公理:良序定理在ZF公理系统中无法证明,需选择公理支撑。

      7. 总结

      | 概念 | 自然数良序原则 | 良序定理 | |------------------------|-------------------------|--------------------------| | 定义 | 自然数的最小元存在性 | 任意集合可被良序 | | 应用领域 | 数学归纳法、递归定义 | 集合论、基数理论 | | 公理依赖 | 皮亚诺公理的一部分 | 等价于选择公理(ZFC) | | 构造性 | 可显式构造 | 非构造性(存在性断言) |


      8. 扩展阅读

      • 数学归纳法:基于良序原则的证明方法。
      • 超限归纳法:对良序集的归纳法,依赖良序定理。
      • 选择公理争议:良序定理的非构造性引发的哲学争议(如巴拿赫-塔斯基悖论)。

      希望这些内容能帮助你深入理解良序原则!如需进一步探讨,欢迎继续提问。

    1. 描述性函数论

      描述性函数论(Descriptive Function Theory)详细介绍


      1. 概念与背景

      “描述性函数论”并非一个标准的数学或工程学术语,但根据知识库信息,可能涉及以下两方面的内容: - (1)描述函数法(Describing Function Method):<br /> 非线性系统分析的工程近似方法,用于研究系统稳定性、自激振荡等问题(见知识库条目[1][2][3])。<br /> - (2)函数的描述性定义:<br /> 数学中早期对函数的直观定义,强调变量间的依赖关系(见知识库条目[4][5])。

      下文将分别介绍这两部分内容。


      2. 描述函数法(Describing Function Method)

      (1)定义与目的

      描述函数法是一种非线性系统分析工具,通过将非线性元件近似为线性频率响应(称为“描述函数”),利用频率域方法(如奈奎斯特判据)分析系统稳定性及自激振荡特性。其核心思想是:<br /> - 将非线性系统近似为线性系统:通过输入正弦信号,分析输出的基波分量,得到非线性环节的等效线性增益与相位。<br /> - 适用场景:研究具有单一非线性环节的闭环系统(如饱和、死区、继电器等特性)。


      (2)基本原理

      1. 非线性环节的描述函数
      2. 假设非线性环节的输入为正弦信号 \(x(t) = X \sin(\omega t)\),输出 \(y(t)\) 为周期函数。
      3. 将输出展开为傅里叶级数:<br /> [ y(t) = Y_0 + Y_1 \sin(\omega t + \phi_1) + Y_2 \sin(2\omega t + \phi_2) + \cdots ]
      4. 描述函数(\(N(X)\))定义为输出基波分量与输入信号的复数比:<br /> [ N(X) = \frac{Y_1}{X} e^{j\phi_1} ]<br /> 其中,\(Y_1\) 是基波幅值,\(\phi_1\) 是相位差。

      5. 系统稳定性分析

      6. 系统结构为:非线性环节 \(N(X)\) 与线性环节 \(G(s)\) 串联的闭环系统。
      7. 稳定性判据:
        • 在复平面上绘制 \(G(j\omega)\) 曲线(线性部分的频率响应)和 \(-1/N(X)\) 曲线(非线性部分的等效响应)。
        • 稳定条件:若 \(G(j\omega)\) 不包围 \(-1/N(X)\),系统稳定;若包围,则不稳定。
        • 自激振荡条件:若两曲线相交,可能产生周期振荡,需进一步判断交点的稳定性。

      (3)应用条件

      描述函数法的准确性依赖以下条件(知识库条目[2][3]):<br /> 1. 系统结构:非线性环节与线性环节串联,且非线性环节唯一。<br /> 2. 非线性特性:<br /> - 奇对称性:\(y(-x) = -y(x)\),确保输出无恒定分量(\(Y_0 = 0\))。<br /> - 基波占优:输出中基波分量 \(Y_1\) 远大于高次谐波分量。<br /> 3. 线性部分:具有良好的低通滤波特性,能衰减高次谐波(如高阶系统)。


      (4)典型应用示例

      • 自激振荡分析
      • 当 \(G(j\omega)\) 与 \(-1/N(X)\) 曲线相交时,交点对应的幅值 \(X\) 和频率 \(\omega\) 即为自激振荡的幅值和频率。
      • 通过判断交点两侧区域(稳定/不稳定),确定振荡是否稳定。

      3. 函数的描述性定义(Descriptive Definition of Functions)

      (1)数学中的描述性定义

      在数学教育中,函数的传统定义强调变量间的依赖关系,而非严格的集合论形式。例如:<br /> - 经典描述:<br /> “一个变量 \(y\) 随另一个变量 \(x\) 变化,且每个 \(x\) 值对应唯一 \(y\) 值。”(知识库条目[4])<br /> - 局限性:<br /> - 未明确自变量与因变量的集合定义(如定义域、值域)。<br /> - 无法处理复杂函数(如多值函数、抽象映射)。

      (2)从描述性到形式化的发展

      1. 形式化定义
      2. 集合论视角:函数是集合 \(A\) 到集合 \(B\) 的映射 \(f: A \to B\),满足每个 \(x \in A\) 对应唯一 \(y \in B\)(知识库条目[4])。
      3. 三要素:定义域、值域、对应法则。
      4. 教育中的演变
      5. 基础阶段:通过变量关系(如 \(y = f(x)\))直观引入函数概念。
      6. 深化阶段:引入集合论定义,强调严谨性(如高中到大学的过渡,知识库条目[5])。

      4. 两者的区别与联系

      | 方面 | 描述函数法 | 函数的描述性定义 | |-------------------------|----------------------------------------|--------------------------------------| | 领域 | 控制理论与非线性系统分析 | 数学基础与教育理论 | | 核心思想 | 非线性近似为线性频率响应 | 变量间的依赖关系直观描述 | | 数学工具 | 傅里叶级数、奈奎斯特判据 | 集合论、映射理论 | | 目标 | 分析系统稳定性与振荡特性 | 建立函数的直观理解与严谨定义 |


      5. 总结

      • 描述函数法是工程领域分析非线性系统的实用工具,其核心是通过描述函数将非线性近似为线性响应,适用于满足特定条件的系统。
      • 函数的描述性定义是数学教育中的基础概念,强调变量关系的直观理解,后期发展为严格的集合论形式。

      若需进一步探讨具体应用(如描述函数法的计算步骤)或数学定义的演变细节,请随时提问!

    1. 其核心目标是通过观测数据和系统模型,递归地计算状态的后验概率分布(即置信度分布),从而实现状态估计。

      贝叶斯滤波核心目标详解

      1. 核心目标分解

      贝叶斯滤波的核心目标是通过观测数据系统模型,递归地计算状态的后验概率分布(即置信度分布),从而实现状态估计。这一过程可以拆解为以下关键要素:


      2. 关键要素详解

      (1)观测数据(Observations)
      • 定义:从系统外部获取的测量值,例如传感器数据(如GPS、摄像头、雷达等)。
      • 作用:提供关于系统状态的间接信息,但通常包含噪声或不确定性。
      • 数学表示:用 \( z_t \) 表示第 \( t \) 时刻的观测数据。
      • 知识库支持
      • 知识库[3]指出,观测数据(\( z_{1:t} \))与控制输入(\( u_{1:t} \))共同用于估计状态 \( x_t \)。
      • 知识库[6]提到,观测数据是滤波算法的基础,用于更新状态的后验分布。

      (2)系统模型(System Model)

      系统模型由两部分组成: 1. 状态转移模型(State Transition Model) - 定义:描述系统状态随时间变化的规律,通常为概率分布。 - 数学表示:\( p(x_t | x_{t-1}, u_t) \),其中 \( u_t \) 是控制输入。 - 作用:预测下一时刻的状态分布。 - 知识库支持: - 知识库[1]和[3]指出,状态转移模型是系统动态的数学描述,例如机器人运动模型。 - 知识库[4]提到,状态转移方程可能包含噪声(如高斯噪声)。

      1. 观测模型(Observation Model)
      2. 定义:描述状态与观测数据之间的关系,通常为条件概率。
      3. 数学表示:\( p(z_t | x_t) \)。
      4. 作用:将状态映射到观测空间,量化观测的不确定性。
      5. 知识库支持
        • 知识库[3]和[5]强调,观测模型(如传感器模型)是连接状态与观测的关键。
        • 知识库[6]指出,观测模型需描述状态到传感器读数的统计关系。

      (3)递归计算(Recursive Computation)
      • 定义:通过迭代步骤(预测和更新)逐步更新状态估计,而非从头开始计算。
      • 步骤
      • 预测步骤(Prediction Step)

        • 根据前一时刻的后验分布 \( p(x_{t-1} | z_{1:t-1}, u_{1:t-1}) \) 和状态转移模型,预测当前时刻的先验分布: [ p(x_t | z_{1:t-1}, u_{1:t}) = \int p(x_t | x_{t-1}, u_t) \cdot p(x_{t-1} | z_{1:t-1}, u_{1:t-1}) \, dx_{t-1} ]
        • 知识库支持:知识库[2]和[3]详细描述了预测步骤的数学推导。
      • 更新步骤(Update Step)

        • 利用贝叶斯定理结合当前观测 \( z_t \),更新后验分布: [ p(x_t | z_{1:t}, u_{1:t}) \propto p(z_t | x_t) \cdot p(x_t | z_{1:t-1}, u_{1:t}) ]
        • 归一化:通过计算归一化因子 \( Z_t = \int p(z_t | x_t) \cdot p(x_t | z_{1:t-1}, u_{1:t}) \, dx_t \),确保概率分布的总和为1。
        • 知识库支持:知识库[2]和[3]强调了贝叶斯定理在更新步骤中的核心作用。

      (4)后验概率分布(Posterior Probability Distribution)
      • 定义:在给定观测数据和系统模型后,对状态 \( x_t \) 的概率分布估计,即 \( p(x_t | z_{1:t}, u_{1:t}) \)。
      • 意义
      • 不确定性量化:通过概率分布而非单一值,反映对状态的置信程度。
      • 递归更新:当前后验分布成为下一时刻预测的先验分布。
      • 知识库支持
      • 知识库[1]和[3]指出,后验分布是贝叶斯滤波的核心输出,用于状态估计。
      • 知识库[4]提到,后验分布 \( p(x_k | y_{1:k}) \) 是贝叶斯滤波的最终目标。

      (5)状态估计(State Estimation)
      • 定义:从后验概率分布中提取对状态的最佳估计,通常为期望值或众数。
      • 方法
      • 期望估计:\( \hat{x}t = \mathbb{E}[x_t | z{1:t}, u_{1:t}] \)。
      • 最大后验估计:\( \hat{x}t = \arg\max{x_t} p(x_t | z_{1:t}, u_{1:t}) \)。
      • 知识库支持
      • 知识库[5]指出,卡尔曼滤波通过均值和协方差估计状态。
      • 知识库[6]提到,滤波的目标是计算 \( p(x_k | y_{1:k}) \),并从中提取状态估计。

      3. 核心流程示意图

      初始化:p(x₀) → 预测 → 更新 → 预测 → 更新 → ... → 最终估计 - 初始化:设定初始状态分布 \( p(x_0) \)。 - 递归循环: 1. 预测:利用状态转移模型将 \( p(x_{t-1} | z_{1:t-1}) \) 转为 \( p(x_t | z_{1:t-1}) \)。 2. 更新:结合观测模型 \( p(z_t | x_t) \),更新为 \( p(x_t | z_{1:t}) \)。


      4. 应用示例:机器人定位

      假设一个机器人在迷宫中移动,其状态包括位置和速度: - 观测数据:GPS(含噪声)和里程计数据。 - 系统模型: - 状态转移:基于运动模型(如 \( x_t = Ax_{t-1} + Bu_t + w \),\( w \sim \mathcal{N}(0, Q) \))。 - 观测模型:GPS噪声模型(\( z_t = Hx_t + v \),\( v \sim \mathcal{N}(0, R) \))。 - 递归过程: 1. 预测:根据前一时刻的位置和控制指令,预测当前可能位置。 2. 更新:结合GPS观测修正预测,得到更精确的位置估计。 - 输出:后验分布 \( p(x_t | z_{1:t}) \) 表明机器人最可能的位置及不确定性。


      5. 优势与局限性

      优势
      1. 不确定性量化:提供完整概率分布,而非单一估计值。
      2. 递归高效性:计算复杂度与时间无关,适合实时应用。
      3. 灵活性:可处理非线性、非高斯系统(如粒子滤波)。
      局限性
      1. 计算复杂度:高维系统需近似方法(如卡尔曼滤波假设线性高斯)。
      2. 假设依赖性
      3. 状态转移和观测模型需正确建模。
      4. 马尔可夫假设(仅依赖前一时刻状态)可能不成立。

      6. 总结

      贝叶斯滤波通过递归结合观测数据和系统模型,在每一步迭代中更新状态的后验概率分布,从而实现对动态系统的最优估计。其核心在于: - 观测数据提供实时信息,修正状态估计。 - 系统模型描述状态演化规律,支持预测。 - 递归机制确保高效计算,适用于实时场景。 - 后验分布量化不确定性,支持鲁棒决策。

      这一框架在机器人导航、目标跟踪、金融预测等领域具有广泛适用性,是处理动态系统状态估计的基石方法。

    1. Además del principio federal básico del reparto central- regional depoderes, se pueden identificar otras cinco características secundarias delfederalismo: una constitución escrita, bicameralismo, el derecho de lasunidades constitutivas a participar en el proceso de enmienda a laconstitución federal, pero a cambiar unilateralmente su propia constitución,representación desproporcionadamente importante o en pie de igualdad enla cámara federal de las unidades constituidas más pequeñas y gobiernodescentralizado

      federalismo: 1. poder central - regional 2. una constitución 3. bicameralismo 4. pueden enmendar la constitucion (conjunto) / cambiar su constitucion (solos) 5. gobierno descentralizado 6. minorias imp en la camara federal.

    1. GAN训练过程,生成器的训练目标为寻找SDE的稳定解

      GAN(生成对抗网络)的训练过程与朗之万方程(Langevin Equation)的微分形式之间的关联,可以通过以下方式理解:GAN的训练可以视为通过求解随机微分方程(SDE)来寻找生成器分布的平稳解。以下是详细解释:


      1. 朗之万方程的数学形式

      朗之万方程是描述粒子在流体中受热噪声影响的随机微分方程: [ dX_t = -\nabla U(X_t) dt + \sqrt{2 \beta^{-1}} dB_t, ] 其中: - \( U(X_t) \) 是势能函数(Potential Energy),其梯度驱动确定性部分; - \( \beta \) 是逆温度参数(与热噪声强度相关); - \( B_t \) 是布朗运动(Wiener过程),代表随机噪声。

      该方程的平稳分布是势能函数 \( U \) 的玻尔兹曼分布: [ \pi(X) \propto \exp(-\beta U(X)). ]


      2. GAN的训练目标与朗之万方程的联系

      GAN的核心目标是让生成器 \( G \) 的分布 \( P_G \) 接近真实数据分布 \( P_{\text{data}} \)。从优化动力学的角度看,生成器的参数更新可以建模为一个随机微分方程,其解的平稳分布即为目标分布。

      (1)生成器的优化动力学

      生成器 \( G \) 的参数 \( \theta_G \) 通过梯度下降更新: [ \frac{d\theta_G}{dt} = -\nabla_{\theta_G} \mathcal{L}(G, D), ] 其中 \( \mathcal{L} \) 是GAN的损失函数(如最小最大损失)。

      若考虑训练中的噪声(如小批量梯度估计的方差或判别器的波动),可以引入随机项: [ d\theta_G = -\nabla_{\theta_G} \mathcal{L}(G, D) dt + \eta dB_t, ] 这与朗之万方程的形式一致,其中: - \( \nabla_{\theta_G} \mathcal{L} \) 对应势能函数的梯度 \( \nabla U \); - \( \eta \) 是噪声强度参数。

      (2)平稳分布的对应

      朗之万方程的平稳分布由势能 \( U \) 决定,而GAN的训练目标是让 \( P_G \) 接近 \( P_{\text{data}} \)。因此,可以将损失函数 \( \mathcal{L} \) 设计为与势能 \( U \) 相关: - 当 \( P_G \) 趋近于 \( P_{\text{data}} \),损失函数 \( \mathcal{L} \) 的梯度趋于零,此时参数 \( \theta_G \) 达到平稳状态; - 这种平稳状态对应朗之万方程的解,其分布即为 \( P_{\text{data}} \)。


      3. GAN训练的SDE建模

      通过将生成器的参数更新视为随机微分方程,可以建立以下模型:

      (1)连续时间下的参数动力学

      假设生成器的参数 \( \theta_G(t) \) 满足以下SDE: [ d\theta_G = -\nabla_{\theta_G} \mathbb{E}{x \sim P{\text{data}}}[\log D(x)] dt - \nabla_{\theta_G} \mathbb{E}{z \sim P_z}[\log(1 - D(G(z)))] dt + \eta dB_t. ] 这可以简化为: [ d\theta_G = -\nabla{\theta_G} \mathcal{L}(G, D) dt + \eta dB_t, ] 其中 \( \mathcal{L} \) 是GAN的损失函数。

      (2)解的收敛性

      当训练过程收敛时,参数 \( \theta_G \) 的分布应满足朗之万方程的平稳分布: [ \pi(\theta_G) \propto \exp\left(-\beta \mathcal{L}(G, D)\right). ] 若损失函数 \( \mathcal{L} \) 的最小值对应 \( P_G = P_{\text{data}} \),则平稳分布即为生成器的目标分布。


      4. 与知识库的关联

      (1)SDE的求解方法(知识库条目[2][4])

      • 线性SDE的解法(如伊藤公式)可用于分析生成器参数的演化路径;
      • 通过求解SDE的期望和方差,可以研究生成器分布的统计特性。

      (2)GAN的训练动力学(知识库条目[1][3][5][6][7][8])

      • GAN的对抗训练本质是生成器与判别器的博弈,其参数更新的梯度方向与朗之万方程的确定性项一致;
      • 噪声项(如小批量梯度的方差)对应朗之万方程的随机项,有助于避免局部最优。

      5. 实际意义与挑战

      (1)优势

      • 理论解释:将GAN的训练过程建模为SDE,提供了数学上的严谨性,便于分析收敛性;
      • 稳定性提升:通过控制噪声强度 \( \eta \),可改善GAN训练的稳定性(如WGAN的改进)。

      (2)挑战

      • 非线性复杂性:GAN的损失函数 \( \mathcal{L} \) 通常是高度非线性的,导致SDE的解析解难以求得;
      • 高维参数空间:真实场景中参数维度极高,数值求解SDE的计算成本巨大。

      6. 总结

      通过将GAN的训练过程视为求解朗之万方程的随机微分方程,可以: 1. 理论层面:利用SDE的数学工具(如伊藤公式、平稳分布分析)解释GAN的收敛性; 2. 实践层面:借鉴SDE的数值方法(如欧拉-马尤拉方法)设计更稳定的训练算法。

      这一视角为GAN的优化动力学提供了新的分析框架,但实际应用中仍需结合深度学习的工程实践进行调整。

    1. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Lupu et al have identified a role for the RNA binding protein SRSF3 in epicardial development. In a well written manuscript containing many experiments the authors show that this protein is required at different time points in epicardial development to control a range of processes, in particular cell proliferation. This advances understanding of the complex roles of this RNA binding protein in the heart - and raises an important message about how incomplete Cre recombination needs to be considered in interpreting conditional mutant phenotypes. The following points should be addressed.

      1. SRSF3 is known to play essential developmental roles in the myocardium where it regulates capping of transcripts involved in contraction. This point should be mentioned in addition to roles in proliferation. To facilitate understanding, the authors should say more about the subset of cardiomyocytes labelled by Gata5-Cre. For example, is this the result of stochastic activation of the transgene or is a specific subset of cells labelled? How much of the myocardium is targeted?
      2. The authors show failure of ventricular compaction at E13.5 using Wt1-CreERT2 and go on to assess proliferation in epicardial cells. As epicardial-derived signals are known to promote compact myocardial growth, they should also show whether there are indirect defects in proliferation in compact layer myocardium that might explain the non-compaction phenotype. The authors should also indicate if any of the large number of genes bound by SRSF3 encode known or potential pro-proliferative signals from the epicardium or EPDCs to the myocardium and potentially validate their altered expression in mutant hearts.
      3. The rescue by expansion of non-recombined cells is a most interesting aspect of this study. Can the authors see any such outcompeting in the explant experiments (for example in Figure 2)? Do the authors consider this to be an exclusively in vivo competition phenomenon? Given the known roles of Myc in cell competition can the authors use their single cell transcriptomic data to score Myc expression levels in cells from Srsf3 iKO hearts or determine if Myc transcripts are bound by SRSF3?
      4. The authors suggest that this rescue occur by upregulation of Srsf3 in non-recombined cells. It would be helpful to provide additional lines of evidence supporting the hypothesis that SRSF3 expression is upregulated due to hypoxia. Do the CLIP experiments reveal whether SRSF3 binds to it's own transcript?
      5. The authors imply that SRSF3 may regulate Ccnd1 mRNA stability. Can the authors directly evaluate this point? Please clarify if this gene is also affected in the knock-down experiments in MEC1 cells.
      6. Please brighten the immunofluorescence panels in Figure 1 to more clearly show nuclear labelling and tissue structure.
      7. Given the broad roles of SRSF3 is the adjective key necessary in the title?

      Significance

      This ms advances understanding of the complex roles of this RNA binding protein in the heart - and raises an important message about how incomplete Cre recombination needs to be considered in interpreting conditional mutant phenotypes. This study would be of interest to reseachers in the fields of heart development and RNA-protein interactions. Although there are a number of major points to be addressed, these could be potentially dealt with rapidly.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this study, Lupu et al. analyzed the role of the RNA-binding protein SRSF3 for epicardial development. The authors found that Srsf3 is highly expressed in the proepicardial organ and during early stages of epicardial layer formation. Conditional inactivation of SRSF3 in the proepicardial organ stage using a Gata5-Cre driver line resulted in defective formation of the epicardium, accompanied by a proliferation arrest of the proepicardium, resulting in embryonic lethality at E12.5. In contrast, epicardial-specific Srsf3 deletion at later stages using the inducible Wt1CreERT2 line caused a less severe phenotype indicated by impaired coronary vasculature formation, reduced cardiac compaction, and myocardial hypoxia. Mosaic recombination yielded a small population of epicardial cells that upregulate Srsf3, hyperproliferate and compensate for the depleted Srsf3 negative lineage. Single-cell RNA sequencing of control and epicardial Srsf3 knock out hearts, combined with infrared CLIP to map SRSF3 binding sites in the transcriptome identified a number of putative SRSF3 targets involved in mitotic cell cycle control. Among others, SRSF3 binds directly to transcripts encoding key regulators of proliferation, such as Cyclin D1, and senescence, including MAP4K4. The authors conclude that SRSF3 exerts different functions in processing of RNAs, including splicing.

      Overall, this is a well-written and well-organized manuscript, describing interesting findings in the field of epicardial development. However, the mechanistic part is not overly strong. The authors detected some moderate changes in the distribution of different Map4k4 splicing isoforms after knockdown of Srsf3 in an immortalized epicardial cell line but did not go any deeper. The cause for the reduced presence of transcripts for the SRSF3-target Ccnd1 after knockdown of Srsf3 remains enigmatic.

      Significance

      The authors raise a number of speculations why remaining Srsf3-expressing cell start to hyperproliferate after inactivation of Srsf3 but it does not become clear which mechanism is critical. How do non-targeted epicardial cells in the mosaic recombination model sense the loss of SRSF3 knock out cells, resulting in hyperproliferation and enhanced Srsf3 expression? Is a loss of lateral inhibition, e.g. by activated YAP/TAZ, causative for enhanced proliferation of the remaining epicardial cells and an elevated expression level of WT1 and SRSF3? Immunofluorescence staining and/or qRT-PCR of YAP/TAZ and TEADs might provide an answer.

      Is the elevated expression of Srsf3 in non-targeted epicardial cells due to enhanced transcription and/or by altered post-transcriptional processes? How does this observation fit to previous reports indicating that Srsf3 overexpression promotes inclusion of an autoregulatory cassette exon (exon 4) containing a premature (in-frame) stop codon in Srsf3, thereby confining this SRSF3 isoform to nonsense mediated decay (NMD) (doi: 10.1093/emboj/16.16.5077, doi: 10.1186/gb-2012-13-3-r17, doi: 10.1038/srep14548, doi: 10.1161/CIRCRESAHA.118.31451)? In contrast, Srsf1 as well as PTBP1/2 have been previously reported to regulate Srsf3 expression by promoting exon 4 skipping. The authors should perform RNA seq and/or qRT-PCRs validation to check the inclusion of Srsf4 Exon4 as well as Srsf1 and PTBP1/2 expression levels in control and knock out epicardial cells.

      It remains unclear by which mechanisms (alternative splicing, alternative polyadenylation, miRNA processing, or others) SRFS3 mainly exerts its function in the embryonic epicardial lineage. The selection and validation of Map4k4 as a splicing target is not based on an unbiased splicing analysis. In my opinion it is mandatory to provide a full assessment of splicing changes in Srsf3-deficient cells, either by long-range sequencing or by analysis of exon-junction reads.

      Likewise, it is completely enigmatic what SRFS3 does to Ccnd1 transcripts. Does SRFS3 increase the half-life of Ccnd1, does it impact trafficking? At least, the authors have to determine changes in the half-life of Ccnd1 after depletion of SRFS3.

      An unbiased bioinformatics analysis addressing alternative splicing, alternative polyadenylation, and mRNA processing is necessary. Ideally, primary epicardial cells should be used and not an immortalized epicardial cell lines. It is well known, that splicing in cell lines differs substantially from splicing in primary cells.

      I am not convinced that the moderate changes of different Map4k4 splicing isoforms after knockdown of Srsf3 are really responsible for the rather drastic phenotype. Additional experiments are needed to prove a decisive function of a shift in Map4k4 splicing isoforms for hyperproliferation of epicardial cells.

      The authors claim that inactivation of Srsf3 inhibits cell proliferation and causes a senescence-like phenotype. The claim for acquisition of senescence is solely based on transcriptional changes. No attempts were made to visualize an increase of senescent cells in Srsf3-mutant embryos. The authors need to perform SA-bGAL assays or use other techniques to analyse the appearance of senescent cells in the mutants.

      Fig. 2E indicates that WT1 positive / tdTom negative epicardial cell population is enriched in a specific region of the pre-epicardial organ from Srsf3KOs. However it is not clear whether these cells proliferate. The authors should quantify Ki67positive cells in both the WT1-positive / tdTom-positive and the WT1positive / tdTom-negative epicardial population.

      In the headline on page 6, the authors stated that "SRSF3 depletion in the PEO results in impaired ... migration of epicardial progenitor cells", which they deduced from the reduced outgrowth of ventricular epicardial explants. However, the reduced outgrowth from the PEO could be caused by both, reduced proliferation and/or reduced migration. Therefore, the authors should provide additional data clearly indicating reduced migration, e.g. by blocking transcription. Scratch assays of SRFS3 knockout/knockdown vs. control epicardial cells would strengthen the analysis, Is there a change in the GO term "regulation of migration"?

      To prove the reduced proliferation ratio in Figure 4B, quantification of Cyclin D1 positive cells in both SRSF3 positive and negative cells is required.

      Minor issues

      Abstract line 12: a full stop is missing at the end of the sentence.

      Figure 1A: E11.5, figure label 'DAPI WT1' is missing.

      Page 8: Bracket in front of Fig. 4B is missing

      Page 8: G2M phase change uniformly to G2/M phase

      Page 9: 'Srsf3-depleted hearts also demonstrated an increased abundance of epicardial cells with upregulated expression of genes associated with quiescence, such as Clu48, and senescence, for example Map4k4, Tmem30a and Pofut249 (Fig. 4F)'. The sentence is misleading, implying Srfs3 inactivation in all cardiac cell types ('Srsf3-depleted hearts').

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      The study by Irina Lupu and colleagues highlights SRSF3 as a key regulator of epicardial development by regulating epicardial cell proliferation. This was demonstrated via two murine knockout models; the first to assimilate the role SRSF3 plays in epicardial formation as a whole, and the second to address its importance post a pivotal maturation point. Through scRNA sequencing and irCLIP, several SRSF3 targets were ascertained and identified as cell cycle regulators. Those epicardial cells that did not lose SRSF3 compensated the loss of some of their mates by increasing SRSF3 expression and over-proliferating. Overall, the paper is interesting and the conclusions are largely supported by the provided data.

      Major comments:

      1. Authors claim that a "reduction in SRSF3 expression levels coincided with the downregulation of WT1 in the epicardium". This was evidenced by immunofluorescence imaging (figure 1A). I suggest conducting a qRT-PCR to quantify Wt1 expression over time, similar to the experiment they performed in figure 1B.
      2. A western blot depicting SRSF3 protein production in controls compared to the knockout model may provide stronger evidence of its depletion (figure 1E).
      3. Authors state that they were unable to directly identify the absence of exons 2 and 3 in individual cells. Please provide evidence that exons 2 and 3 have been knocked out, at least by performing a qRT-PCR.
      4. To prove the functional implication in the observed phenotype of the identified SRSF3 targets, please interfere with Map4k4 activity or expression and check whether the defective epicardial cell proliferation is reverted. This should be done at least in vitro, ideally in vivo.

      Minor comments:

      1. Several minor typos and spacing issues were observed. Please correct.
      2. It would be good for the reader if the authors would simplify their rationale for the use of the two mouse models. It is slightly convoluted and not easy to follow.
      3. In figure 4, it is recommended to add a stacked bar plot to represent the percentage of each cell cluster/population after 4A. This would help the reader
      4. Figure 4B. It is confusing for the reader to understand the fact that the majority of tdTomato+ sorted cells in Srsf3 iKO keep expressing Srsf3. Including the quantification of the image could help.

      Significance

      The paper will be of interest to readers in the field of cardiology, embryology and molecular biology. It will advance the field especially in the study of the development of the epicardium. The models are sophisticated and the experiments carefully performed.

      My field is molecular cardiology, with interest in RNA-binding proteins.

    1. That’s why this scene goes on for two minutes, far beyond its useful lifespan. When you spend three days in enjoyable creative collaboration with a film crew and a set of lunatic stuntmen, coming up with as many variations on a theme as you can, you’re inclined to keep some shots just because they were fun to shoot, even if they’re not necessary or effective.

      Critique #4: The Scene Reflects the Director's Pleasure, Not the Story’s Needs.

    1. 蕴含命题的构造证明:​⊢(A→B)通过算法将A的证明转换为B的证明。​

      通过具体例子解释蕴含命题的构造证明


      例子:证明蕴含命题 \( \vdash (A \rightarrow B) \)

      命题:<br /> 若 \( n \) 是偶数,则 \( n^2 \) 是偶数。<br /> 即 \( A = \text{"n是偶数"} \),\( B = \text{"n²是偶数"} \),构造证明 \( A \rightarrow B \)。


      构造证明的步骤

      1. 定义命题
      2. \( A \):存在整数 \( k \),使得 \( n = 2k \)(偶数的定义)。
      3. \( B \):存在整数 \( m \),使得 \( n^2 = 2m \)(平方后的偶数定义)。

      4. 假设 \( A \) 为真:<br /> 根据构造逻辑的蕴含引入规则,假设 \( A \) 成立,即存在 \( k \in \mathbb{Z} \),使得 \( n = 2k \)。

      5. 从 \( A \) 推导 \( B \)

      6. 代入 \( n = 2k \) 到 \( n^2 \) 中:<br /> [ n^2 = (2k)^2 = 4k^2 = 2 \times (2k^2) ]
      7. 定义 \( m = 2k^2 \),则 \( n^2 = 2m \),满足 \( B \) 的定义。

      8. 构造性转换

      9. 算法步骤
        1. 输入:\( A \) 的证明(即 \( k \) 的值)。
        2. 计算 \( m = 2k^2 \)。
        3. 输出:\( B \) 的证明(即 \( m \) 的值)。
      10. 这个过程将 \( A \) 的证明(\( k \) 的具体值)显式转换为 \( B \) 的证明(\( m \) 的具体值)。

      11. 结论:<br /> 根据构造逻辑的蕴含引入规则,我们得到:<br /> [ \vdash (A \rightarrow B) ]


      关键点分析

      1. 构造性证明的要求
      2. 每一步必须提供显式的构造方法,例如通过代数运算将 \( k \) 转换为 \( m \)。
      3. 不依赖非构造性推理(如反证法或排中律)。

      4. 算法的可计算性

      5. 证明过程对应一个可计算的算法:<br /> [ f(k) = 2k^2 \quad \text{(将 \( k \) 转换为 \( m \))} ]
      6. 这符合知识库[6]中构造逻辑对蕴含的定义:构造 \( A \rightarrow B \) 的证明即提供一个将 \( A \) 的证明(\( k \))转化为 \( B \) 的证明(\( m \))的函数。

      7. 与经典逻辑的区别

      8. 经典逻辑可能通过反证法证明此命题(例如假设 \( n^2 \) 是奇数,导出矛盾),但构造逻辑要求直接构造 \( m \) 的值。

      知识库关联

      • 知识库[6]
      • 构造逻辑中,\( A \rightarrow B \) 的证明是“将 \( A \) 的任一证明转换为 \( B \) 的证明的构造”。
      • 本例中的算法 \( f(k) = 2k^2 \) 即是这一构造的具体实现。
      • 知识库[10]
      • 构造主义要求证明必须提供算法或步骤,而非仅断言存在性。例如,证明 \( n^2 \) 是偶数需显式给出 \( m \) 的表达式。

      总结

      通过这个例子,我们展示了如何通过构造性方法证明蕴含命题 \( A \rightarrow B \):<br /> 1. 假设 \( A \) 成立,并利用其证明(如 \( k \) 的值)。<br /> 2. 显式构造 \( B \) 的证明(如计算 \( m = 2k^2 \))。<br /> 3. 证明过程即算法,确保可计算性和可构造性,符合构造逻辑的要求。

      这一过程体现了构造逻辑的核心思想:证明即算法,蕴含即转换

    2. 该逻辑系统既是数学哲学的革命,也是现代计算机科学的基石之一。

      为什么构造逻辑既是数学哲学的革命,又是现代计算机科学的基石之一?


      一、作为数学哲学的革命

      构造逻辑(Constructive Logic)的诞生标志着数学哲学的一次深刻变革,其核心在于对数学证明本质数学对象存在性的根本性重新定义。以下是其革命性的具体表现:


      1. 对数学证明的重新定义
      • 从“存在性”到“可构造性”:<br /> 经典逻辑允许通过非构造性证明(如反证法、纯存在性证明)来断言某个对象的存在,而构造逻辑要求所有存在性命题必须提供显式构造方法。例如,证明“存在一个素数大于1000”需具体给出一个素数(如1009),而非仅证明其存在性。
      • 知识库关联:毕晓普强调“只证明逻辑存在性是不够的,必须提供有限机械方法构造对象”(知识库[2])。

      • 排斥排中律(LEM):<br /> 构造逻辑不接受“\( P \lor \neg P \)”的普遍有效性,除非能有效判断\( P \)或\( \neg P \)为真。例如,无法断言“\( \pi \)的第10^100位是偶数或非偶数”,因为无法在有限时间内验证。

      • 哲学意义:布劳威尔认为“存在即被构造”,数学对象必须通过心智的“原始直觉”生成(知识库[1])。

      2. 对数学基础的挑战
      • 直觉主义对集合论的批判:<br /> 布劳威尔和直觉主义者反对康托尔的实无穷集合论,认为无限集合必须通过递归过程(如柯西序列)构造,而非抽象定义。例如,实数必须定义为可计算的递归柯西序列(毕晓普学派)或算法生成的数(马尔可夫学派)。
      • 知识库关联:马尔可夫将实数定义为“逐次逼近的算法”,实函数等同于算法(知识库[2])。

      • 选择公理的限制:<br /> 经典数学中的选择公理允许从无限集合中选择元素,但构造逻辑要求选择必须通过可计算的递归过程实现。例如,从可数个非空集合中选择元素需提供一个算法,逐个生成选择项。


      3. 重新定义数学可信性标准
      • 从“抽象真理”到“可计算性”:<br /> 构造逻辑要求数学对象和定理的可计算性,即所有定义和证明必须能通过算法实现。例如,毕晓普将实数定义为递归柯西序列,确保每个实数的生成过程可被机械执行。
      • 知识库关联:毕晓普改造集合论,要求集合的定义不仅包含判别规则,还需提供构造元素的具体方法(知识库[2])。

      • 对经典分析的重构:<br /> 构造逻辑推动了对经典分析的重新审视,例如:

      • 连续函数的定义:构造性连续函数需满足“输入的近似值可计算输出的近似值”(如递归算法)。
      • 中值定理的局限性:经典数学中的“连续函数在区间内取中间值”在构造逻辑中不成立,除非能构造出具体点。

      二、作为现代计算机科学的基石

      构造逻辑对计算机科学的影响体现在其算法化、可验证性和类型系统的构建上,具体表现为以下方面:


      1. 柯里-霍华德同构(Curry-Howard Isomorphism)
      • 证明即程序,公式即类型:<br /> 构造逻辑的证明与程序设计语言中的类型系统直接对应。例如:
      • 蕴含(\( A \rightarrow B \)):对应一个将\( A \)类型的数据转换为\( B \)类型数据的函数。
      • 存在量词(\( \exists x A(x) \)):对应一个包含具体\( x \)值和证明的元组。
      • 知识库关联:构造逻辑的证明可转化为可执行代码(如Coq、Agda中的程序验证)。

      • 函数式编程的基础:<br /> 构造逻辑的自然演绎系统直接启发了函数式编程语言的设计(如Haskell、ML)。例如,构造性证明的消除规则对应函数的模式匹配和递归结构。


      2. 程序验证与形式化方法
      • 正确性证明的自动化:<br /> 构造逻辑为程序正确性证明提供了框架,例如:
      • Hoare逻辑:通过构造性证明验证程序满足预/后条件。
      • 依赖类型系统:如Coq中的类型系统,要求程序的每个步骤都伴随构造性证明。

      • 算法的可计算性:<br /> 构造逻辑的可计算性要求确保算法的每一步骤都有明确构造,避免了经典数学中可能的非构造性漏洞。例如,马尔可夫的“正规算法”直接对应现代计算机的指令集(知识库[2])。


      3. 人工智能与逻辑推理
      • 可解释性与可验证性:<br /> 在知识表示和推理系统中,构造逻辑确保推理过程的透明性。例如,机器学习模型的证明需提供具体构造步骤,而非依赖黑箱决策。

      • 递归与算法设计:<br /> 构造逻辑中的递归定义(如毕晓普的实数)直接对应计算机中的递归函数和迭代算法,确保无限过程的有限步骤实现。


      4. 并发与分布式系统
      • 线性逻辑(Linear Logic)的延伸:<br /> 构造逻辑的资源敏感性(如线性逻辑)为并发系统中的资源管理和通信协议提供了形式化基础,确保操作的原子性和可验证性。

      三、知识库关联与具体案例

      1. 毕晓普的构造性分析
      2. 通过递归柯西序列定义实数,确保所有分析定理的证明可转化为算法(知识库[2])。例如,证明“\( \sqrt{2} \)存在”需提供一个计算其近似值的递归过程。

      3. 马尔可夫的正规算法

      4. 将数学对象定义为算法,例如实数的“逐次逼近算法”直接对应计算机程序的迭代计算(知识库[2])。

      5. 构造逻辑与类型论

      6. Agda语言中的依赖类型系统强制要求所有函数定义伴随构造性证明,确保程序的正确性(如知识库[2]中提到的“改造集合论”)。

      四、总结

      构造逻辑的革命性在于:<br /> 1. 数学哲学层面:重新定义了数学对象的存在性(存在即构造),挑战了经典逻辑的抽象性,推动了数学的算法化和可计算性研究。<br /> 2. 计算机科学层面:为程序验证、函数式编程、类型系统和形式化方法提供了理论基石,确保了算法的可构造性和可验证性。

      其影响贯穿数学与计算机科学的交叉领域,既是一场哲学思潮,也是现代技术的底层逻辑,体现了“数学即算法”的深刻思想。

    1. Regarding school readiness, Wright et al. (2001) collected time-use diaries of television viewing and found that 2-year-olds who were exposed more to child-directed educational programing, such as Sesame Street, reached higher scores on general measures of school readiness (knowledge of letters, numbers, colors, shape, spatial and size relations) at ages 3 and 4, than those who were primarily exposed to adult directed television programs.

      i dont necessarily think this is such a shocking factor because the sole intention of shows like Sesame Street is to teach children, obviously a show meant for general viewing isnt going to teach as much because thats not what its meant for. the harm of those types of programs would come in during young childhood if the program has more mature themes because it can affect the ideas that children can get, not affect their cognition moreso than any other screen driven program would

    1. Compounds of the alkali (oxidation number +1) and alkaline earth metals (oxidation number +2) are typically ionic in nature. Compounds of metals with higher oxidation numbers (e.g., tin +4) tend to form molecular compounds In ionic and covalent molecular compounds usually the less electronegative element is given first. In ionic compounds the names are given which refer to the oxidation (ionic) state In molecular compounds the names are given which refer to the number of molecules present in the compound

      How does the oxidation number of metal depict the compound it will form ?

    1. Reviewer #1 (Public review):

      Summary:

      Govindan and Conrad use a genome-wide CRISPR screen to identify genes regulating retention of intron 4 in OGT, leveraging an intron retention reporter system previously described (PMID: 35895270). Their OGT intron 4 reporter reliably responds to O-GlcNAc levels, mirroring the endogenous splicing event. Through a genome-wide CRISPR knockout library, they uncover a range of splicing-related genes, including multiple core spliceosome components, acting as negative regulators of OGT intron 4 retention. They choose to follow up on SFSWAP, a largely understudied splicing regulator shown to undergo rapid phosphorylation in response to O-GlcNAc level changes (PMID: 32329777). RNA-sequencing reveals that SFSWAP depletion not only promotes OGT intron 4 splicing but also broadly induces exon inclusion and intron splicing, affecting decoy exon usage. While this study offers interesting insights into intron retention regulation and O-GlcNAc signaling, the RNA-Sequencing experiments lack essential controls needed to provide full confidence to the authors' conclusions.

      Strengths:

      (1) This study presents an elegant genetic screening approach to identify regulators of intron retention, uncovering core spliceosome genes as unexpected positive regulators of intron retention.<br /> (2) The work proposes a novel functional role for SFSWAP in splicing regulation, suggesting that it acts as a negative regulator of splicing and cassette exon inclusion, which contrasts with expected SR-related protein functions.<br /> (3) The authors suggest an intriguing model where SFSWAP, along with other spliceosome proteins, promotes intron retention by associating with decoy exons.

      Weaknesses:

      (1) The conclusions regarding SFSWAP's impact on alternative splicing rely on cells treated with a single pool of two siRNAs for five days. The absence of independent siRNA treatments raises concerns about potential off-target effects, which may reduce confidence in the observed SFSWAP-dependent splicing changes. Rescue experiments or using additional independent siRNA treatments would strengthen the conclusions.<br /> (2) The mechanistic role of SFSWAP in splicing would benefit from further exploration, though this may be more appropriate for future studies.

      Comments on revisions:

      The authors have addressed all my previous recommendations.

    1. Author response:

      The following is the authors’ response to the original reviews

      Public Reviews:

      Reviewer #1 (Public review):

      Summary:

      This study has preliminarily revealed the role of ACVR2A in trophoblast cell function, including its effects on migration, invasion, proliferation, and clonal formation, as well as its downstream signaling pathways.

      Strengths:

      The use of multiple experimental techniques, such as CRISPR/Cas9-mediated gene knockout, RNA-seq, and functional assays (e.g., Transwell, colony formation, and scratch assays), is commendable and demonstrates the authors' effort to elucidate the molecular mechanisms underlying ACVR2A's regulation of trophoblast function. The RNA-seq analysis and subsequent GSEA findings offer valuable insights into the pathways affected by ACVR2A knockout, particularly the Wnt and TCF7/c-JUN signaling pathways.

      Weaknesses:

      The molecular mechanisms underlying this study require further exploration through additional experiments. While the current findings provide valuable insights into the role of ACVR2A in trophoblast cell function and its involvement in the regulation of migration, invasion, and proliferation, further validation in both in vitro and in vivo models is needed. Additionally, more experiments are required to establish the functional relevance of the TCF7/c-JUN pathway and its clinical significance, particularly in relation to pre-eclampsia. Additional techniques, such as animal models and more advanced clinical sample analyses, would help strengthen the conclusions and provide a more comprehensive understanding of the molecular pathways involved.

      Reviewer #2 (Public review):

      Summary:

      ACVR2A is one of a handful of genes for which significant correlations between associated SNPs and the incidences of preeclampsia have been found in multiple populations. It is one of the TGFB family receptors, and multiple ligands of ACVR2A, as well as its coreceptors and related inhibitors, have been implicated in placental development, trophoblast invasion, and embryo implantation. This useful study builds on this knowledge by showing that ACVR2A knockout in trophoblast-related cell lines reduces trophoblast invasion, which could tie together many of these observations. Support for this finding is incomplete, as reduced proliferation may be influencing the invasion results. The implication of cross-talk between the WNT and ACRV2A/SMAD2 pathways is an important contribution to the understanding of the regulation of trophoblast function.

      Strengths:

      (1) ACVR2A is one of very few genes implicated in preeclampsia in multiple human populations, yet its role in pathogenesis is not very well studied and this study begins to address that hole in our knowledge.

      (2) ACVR2A is also indirectly implicated in trophoblast invasion and trophoblast development via its connections to many ligands, inhibitors, and coreceptors, suggesting its potential importance.

      (3) The authors have used multiple cell lines to verify their most important observations.

      Weaknesses:

      (1) There are a number of claims made in the introduction without attribution. For example, there are no citations for the claims that family history is a significant risk factor for PE, that inadequate trophoblast invasion of spiral arteries is a key factor, and that immune responses, and reninangiotensin activity are involved.

      Thank you for pointing out the lack of citations in some parts of the introduction. We have revised the manuscript to include appropriate references for the claims regarding family history as a risk factor for PE, the role of inadequate trophoblast invasion in spiral arteries, and the involvement of immune responses and the renin-angiotensin system. The revised text now includes citations to well-established studies in the field (Salonen Ros et al., 2000; Chappell LC et al., 2021; Brosens et al., 2002; Knofler et al., 2019; Redman CWG et al., 1999; LaMarca B et al., 2008). We believe these additions improve the scientific rigor of the manuscript.

      (2) The introduction states "As a receptor for activin A, ACVR2A..." It's important to acknowledge that ACVR2A is also the receptor for other TGFB family members, with varying affinities and coreceptors. Several TGFB family members are known to regulate trophoblast differentiation and invasion. For example, BMP2 likely stimulates trophoblast invasion at least in part via ACVR2A (PMID 29846546).

      Thank you for highlighting the broader role of ACVR2A as a receptor for multiple members of the TGF-β superfamily. We have revised the introduction to acknowledge that ACVR2A is not only the receptor for activin A but also interacts with other ligands, such as BMP2, which likely stimulates trophoblast invasion via ACVR2A (PMID: 29846546). This addition provides a more comprehensive view of ACVR2A's function in trophoblast biology. While the focus of our current study is on activin A, we agree that ACVR2A's role in mediating the effects of other TGF-β family members is an important topic for future research.

      (3) An alternative hypothesis for the potential role of ACVR2A in preeclampsia is its functions in the endometrium. In the mouse ACVR2A knockout in the uterus (and other progesterone receptorexpressing cells) leads to embryo implantation failure.

      Thank you for bringing up the potential role of ACVR2A in the endometrium as an alternative hypothesis. We have revised the discussion to acknowledge this possibility and cited relevant studies showing that uterine-specific knockout of ACVR2A in mice leads to embryo implantation failure (Monsivais et al., 2021). This suggests that ACVR2A may play a critical role in uterine receptivity and embryo implantation, which could influence placental development and preeclampsia pathogenesis. While our current study focuses on trophoblast-related functions of ACVR2A, we agree that investigating its role in the uterine environment is an important direction for future research.

      (4) In the description of the patient population for placental sample collections, preeclampsia is defined only by hypertension, and this is described as being in accordance with ACOG guidelines. ACOG requires a finding of hypertension in combination with either proteinuria or one of the following: thrombocytopenia, elevated creatinine, elevated liver enzymes, pulmonary, edema, and new onset unresponsive headache.

      We appreciate the reviewer’s detailed observation regarding the definition of preeclampsia.

      We have reviewed and clarified our description of the diagnostic criteria based on the American College of Obstetricians and Gynecologists (ACOG) guidelines. Specifically, we have revised the definition in the Materials and Methods section under "Collection of Placenta and Decidua Specimens," as follows: In accordance with the guidelines from the American College of Obstetricians and Gynecologists (ACOG, 2023), preeclampsia (PE) is diagnosed as hypertension (systolic blood pressure ≥140 mmHg or diastolic blood pressure ≥90 mmHg on at least two occasions) in combination with one or more of the following: proteinuria (≥300 mg/24-hour urine collection or protein/creatinine ratio ≥0.3), thrombocytopenia, elevated serum creatinine, elevated liver enzymes, pulmonary edema, or new-onset headache unresponsive to treatment.

      (5) I believe that Figures 1a and 1b are data from a previously published RNAseq dataset, though it is not entirely clear in the text. The methods section does not include a description of the analysis of these data undertaken here. It would be helpful to include at least a brief description of the study these data are taken from - how many samples, how were the PE/control groups defined, gestational age range, where is it from, etc. For the heatmap presented in B, what is the significance of the other genes/ why are they being shown? If the purpose of these two panels is to show differential expression specifically of ACVR2A in this dataset, that could be shown more directly.

      Clarification of RNAseq dataset: The Methods section has been revised to specify the dataset source (GEO accession number: GSE114691), which includes 20 PE and 21 control placental samples with gestational ages ranging from 34 to 38 weeks. PE and control groups were defined using clinical criteria such as hypertension and proteinuria, and these details have also been added to the Results section. RNAseq analysis description: We have included details of the differential gene expression analysis in the Methods section. Specifically, the DESeq2 R package was used, with thresholds of FDR < 0.05 and |log2(fold change) | ≥ 1. The selection of WNT pathwayrelated genes in Figure 1B is based on these analyses. Significance of the heatmap genes: The genes displayed in Figure 1B were selected based on their significant differential expression and enrichment in pathways relevant to PE pathogenesis, such as the WNT signaling pathway. We have clarified this in the Results section and updated the figure legend to explain their biological relevance. Purpose of Figures 1A and 1B: Figure 1A emphasizes the downregulation of ACVR2A in PE placentas, while Figure 1B complements this by presenting differentially expressed genes associated with the WNT pathway. These figures collectively highlight the role of ACVR2A in PE and its connection to broader molecular pathways. Text descriptions have been updated to improve clarity and focus.

      (6) More information is needed in the methods section to understand how the immunohistochemistry was quantified. "Quantitation was performed" is all that is provided. Was staining quantified across the whole image or only in anchoring villous areas? How were HRP & hematoxylin signals distinguished in ImageJ? How was the overall level of HRP/DAB development kept constant between the NC and PE groups?

      Thank you for pointing out the need for more details regarding the quantification of immunohistochemistry (IHC). We have now clarified and expanded the description of the IHC quantification process in the Methods section as follows: Quantification Across the Entire Section: IHC staining was assessed across the entire tissue section to account for global expression patterns. For quantitative analysis, representative regions from the anchoring villous areas, where ACVR2A expression is most prominent, were selected for comparison between NC and PE groups. This ensured that the analysis focused on biologically relevant regions. ImageJ Analysis:

      Images of stained sections were captured under identical magnifications and lighting conditions. Hematoxylin (blue, nuclear staining) and DAB/HRP (brown, protein-specific signal) were distinguished using ImageJ's color deconvolution plugin. The DAB/HRP signal was isolated and quantified based on the integrated optical density (IOD) within the selected regions. Consistency in HRP/DAB Development: To maintain consistency between NC and PE groups, all tissue samples were processed under identical experimental conditions, including the same antibody dilution, incubation times, and DAB/HRP development durations. Negative controls (without primary antibody) were included to monitor background staining, and the DAB reaction was stopped simultaneously across all samples to avoid overdevelopment. Statistical Analysis: The quantified DAB signal intensity was normalized to the area of the selected regions, and comparisons between NC and PE groups were performed using statistical tests (e.g., Student’s ttest). Results are reported as mean ± SD. We hope this additional detail addresses your concerns.

      (7) In Figure 1E it is not immediately obvious to many readers where the EVT are. It is probably worth circling or putting an arrow to the little region of ACVR2A+ EVT that is shown in the higher magnification image in Figure 1E. These are actually easier to see in the pictures provided in the supplement Figure 1. Of note, the STB is also staining positive. This is worth pointing out in the results text.

      Thank you for your suggestion regarding Figure 1E. To make the location of the ACVR2A+ extravillous trophoblasts (EVTs) more apparent, we have updated Figure 1E by adding arrows to indicate the regions of EVTs in the higher magnification image. Additionally, we have included annotations in the supplemental Figure S1 to further aid visualization. We appreciate your observation that syncytiotrophoblasts (STBs) also show positive staining for ACVR2A. We have revised the Results section to explicitly mention this finding and its potential significance.

      (8) It is not possible to judge whether the IF images in 1F actually depict anchoring villi. The DAPI is really faint, and it's high magnification, so there isn't a lot of context. Would it be possible to include a lower magnification image that shows where these cells are located within a placental section? It is also somewhat surprising that this receptor is expressed in the cytoplasm rather than at the cell surface. How do the authors explain this?

      Thank you for your suggestion to provide more context for the immunofluorescence (IF) images in Figure 1F. To address this, we have included lower magnification images in Supplementary Figure S2, showing the overall structure of the placental section and the location of the anchoring villi. These images help to contextualize the regions analyzed in Figure 1F, which were selected to clearly illustrate ACVR2A expression in extravillous trophoblasts (EVTs). In Figure 1F, we have focused on higher magnification images for better visualization of ACVR2A staining patterns in EVTs. Regarding the subcellular localization of ACVR2A, the receptor is predominantly expressed on the cell surface, as shown in our images. However, some intracellular staining is also observed, which may reflect receptor trafficking or recycling processes, consistent with the behavior of other activin receptors under physiological or pathological conditions. We have clarified these points in the Results and Discussion sections.

      (9) The results text makes it sound like the data in Figure 2A are from NCBI & Protein atlas, but the legend says it is qPCR from this lab. The methods do not detail how these various cell lines were grown; only HTR-SVNeo cell culture is described. Similarly, JAR cells are used for several experiments and their culture is not described.

      Thank you for pointing out the need for clarification regarding Figure 2A and cell culture methods. The data in Figure 2A were generated using RT-qPCR conducted in our laboratory, not solely based on data from NCBI or the Human Protein Atlas. We have revised the Results section to reflect this more accurately. Regarding the culture conditions, we acknowledge that the methods for other cell lines were not explicitly detailed. For this study, all cell lines, including JAR and other cancer cell lines, were cultured following standard protocols provided by the suppliers. Specifically, JAR cells and other cell lines were purchased from Wuhan Punosei Life Technology and were maintained in RPMI-1640 medium supplemented with 10% fetal bovine serum (FBS) and 1% penicillin/streptomycin under standard conditions (37°C, 5% CO<sub>2</sub>). This information has been added to the Methods section for clarity.

      (10) Under RT-qPCR methods, the phrase "cDNA reverse transcription cell RNA was isolated..." does not make any sense.

      Thank you for pointing out the unclear phrasing in the RT-qPCR methods section. We agree that the original description was not precise. To address this, we have revised the relevant section to improve clarity and accuracy. Specifically, the methods now explicitly describe the two key steps: RNA isolation and cDNA synthesis. The revised text reads: Total RNA was isolated from cells using a Total RNA Extraction Kit (TIANGEN, China) following the manufacturer’s instructions. The extracted RNA was reverse-transcribed into complementary DNA (cDNA) using a cDNA Synthesis Kit (Takara, Japan) according to the protocol provided by the manufacturer.

      (11) The paragraph beginning "Consequently, a potential association..." is quite confusing. It mentions analyzing ACVR2A expression in placentas, but then doesn't point to any results of this kind and repeats describing the results in Figure 2a, from various cell lines.

      Thank you for your comment regarding the paragraph beginning with "Consequently, a potential association...". We understand that the current wording may create confusion. The primary aim of this section is to compare ACVR2A expression levels across various cell lines, including trophoblast-derived and non-trophoblast cell lines, to highlight the relevance of ACVR2A in trophoblast function, particularly in invasion and migration. To address your concerns, we have revised the paragraph for clarity and logical flow. The updated text explicitly focuses on the comparison of ACVR2A expression across cell lines (Figure 2A) and how this supports the hypothesis that ACVR2A plays a key role in trophoblast invasion and migration. Additionally, the discussion of placental samples has been separated to avoid confusion with cell line results. We hope this revision resolves the issue.

      (12) The authors should acknowledge that the effect of the ACVR2A knockout on proliferation makes it difficult to draw any conclusions from the trophoblast invasion assays. That is, there might be fewer migrating or invading cells in the knockout lines because there are fewer cells, not because the cells that are there are less invasive. Since this is a central conclusion of the study, it is a major drawback.

      Thank you for highlighting this important point. We agree that the reduced proliferation observed in ACVR2A knockout cells could influence the results of the invasion assays, as fewer cells may inherently lead to reduced invasion. To minimize this effect, we conducted the invasion and migration assays under low-serum conditions (1–2% serum) to limit cell proliferation during the experimental timeframe. This approach was based on optimization trials and existing literature, as serum-free conditions were found to negatively impact cell viability and experimental reproducibility. While these efforts helped to mitigate the impact of proliferation on the results, we acknowledge this as a limitation of our study and have added this discussion to the manuscript. Future studies could incorporate approaches such as normalizing cell numbers or using additional proliferation-independent methods to confirm the findings. We hope this clarification and the steps taken address your concerns.

      (13) The legend and the methods section do not agree on how many fields were selected for counting in the transwell invasion assays in Figure 3C. The methods section and the graph do not match the number of replicate experiments in Figure 3D (the number of replicate experiments isn't described for 3C).

      Thank you for pointing out the inconsistencies regarding the number of fields counted and the number of replicates in the Transwell invasion assays (Figure 3C) and colony formation assays (Figure 3D). We apologize for the lack of clarity in the Methods section and figure legend. To address this, we have revised both the figure legends and the Methods section for consistency and added detailed descriptions. For Figure 3C, cell invasion was quantified by randomly selecting 5 fields of view per sample under 300× magnification. Images shown in the figure were taken at lower magnification to provide a better visual comparison between experimental and control groups. For Figure 3D, each experiment was independently repeated at least 10 times to ensure robust and reproducible results. These clarifications have been incorporated into the revised manuscript. We appreciate your feedback and believe this revision improves the clarity and transparency of our methods.

      (14) Discussion says "Transcriptome sequencing analysis revealed low ACVR2A expression in placental samples from PE patients, consistent with GWAS results across diverse populations." The authors should explain this briefly. Why would SNPs in ACVR2A necessarily affect levels of the transcript?

      Thank you for raising this important point. We acknowledge that our study did not directly investigate how SNPs in the ACVR2A gene affect transcript levels. However, prior studies have suggested that SNPs can influence gene expression through various mechanisms. For example, SNPs in regulatory regions (e.g., promoters, enhancers, or untranslated regions) may affect transcription factor binding, RNA stability, or splicing efficiency, ultimately altering transcript levels. While we did not directly assess the functional consequences of ACVR2A SNPs in this study, the observed downregulation of ACVR2A in PE placentas aligns with the potential regulatory impact of SNPs previously identified in GWAS studies. To address this, we have revised the Discussion section to clarify the relationship between SNPs and transcript levels and acknowledge this limitation.  

      (15) "The expression levels of ACVR2A mRNA were comparable to those of tumor cells such as A549. This discovery suggested a potential pivotal role of ACVR2A in the biological functions of trophoblast cells, especially in the nurturing layer." Alternatively, ACVR2A expression resembles that of tumors because the cell lines used here are tumor cells (JAR) or immortalized cells (HTR8). These lines are widely used to study trophoblast properties, but the discussion should at least acknowledge the possibility that the behavior of these cells does not always resemble normal trophoblasts.

      Thank you for pointing out this important limitation. We agree that the JAR and HTR8/SVneo cell lines, being tumor-derived or immortalized, may not fully replicate the behavior of normal trophoblast cells. While these cell lines are widely used as models for studying trophoblast properties due to their ease of culture and invasive behavior, their gene expression and signaling pathways could partially reflect their tumorigenic or immortalized origins. We have revised the Discussion section to acknowledge this limitation and clarify the interpretation of ACVR2A expression levels in these cells.

      (16) The authors should discuss some of what is known about the relationship between the TCF7/c-JUN pathway and the major signaling pathway activated by ACVR2A, Smad 2/3/4. The Wnt and TGFB family cross-talk is quite complex and it has been studied in other systems.

      Thank you for highlighting the relationship between the TCF7/c-JUN pathway and Smad2/3/4 signaling. In our study, we chose to focus on Smad1/5 due to its strong association with ACVR2A in placental development, as demonstrated in a recent study(DOI: 10.1038/s41467-021-23571-5). This study showed that the BMP signaling pathway, mediated through ACVR2A-Smad1/5, is essential for endometrial receptivity and embryo implantation. While Smad2/3/4 are wellestablished mediators of TGF-β signaling, Smad1/5 activation is more directly linked to ACVR2A in the context of reproductive biology.

      In PE placentas, we observed a significant downregulation of Smad1/5 expression, which supports the hypothesis that ACVR2A-mediated Smad signaling is disrupted in this condition. Although we did not directly assess Smad2/3/4 in this study, prior research has shown that Smad2/3 can interact with TCF/LEF transcription factors to regulate Wnt-related target genes, suggesting potential cross-talk between these pathways. We have now clarified this rationale and included a discussion of these interactions in the revised manuscript.

      Recommendations for the authors:

      Reviewer #1 (Recommendations for the authors):

      Several points need to be addressed to improve the clarity and robustness of the presented findings:

      (1) From a clinical perspective, several concerns arise regarding the interpretation of these findings. First, the small sample size of 20 patients may not be representative of the broader population, limiting the generalizability of the results. Additionally, although no significant differences in age and pre-pregnancy BMI were observed between the PE and normal control groups, other clinical variables, such as hypertension or gestational diabetes, may also influence ACVR2A expression and contribute to PE development. Furthermore, while the study suggests a correlation between reduced ACVR2A expression and PE, it remains unclear whether this association holds true across different subtypes of PE or whether there are other underlying clinical factors that could account for these changes in gene expression. These factors need to be considered in future studies to better understand the clinical relevance of ACVR2A in PE.

      Thank you for raising these insightful concerns about the clinical interpretation of our findings. We agree that the small sample size of 20 patients may limit the generalizability of our results. To address this, we are actively expanding our cohort by collecting additional clinical samples from PE patients and normotensive controls. This effort aims to strengthen the robustness of our findings and provide stronger evidence for the role of ACVR2A in PE. We would also like to clarify that, during the initial sample collection, we specifically included only PE patients without comorbidities such as gestational diabetes, chronic hypertension, or other pregnancy-related complications. This strict selection criterion was implemented to minimize the potential influence of confounding clinical variables and ensure that our findings specifically reflect the association between ACVR2A expression and PE. While our study provides important initial insights, we recognize the need for larger-scale studies to validate these findings. The ongoing collection of clinical samples will allow us to address this limitation and enhance the translational relevance of our research. We have revised the manuscript to reflect these points and highlight our plans to strengthen the study by increasing the sample size.

      (2) The section "Precision Genome Surgery: ACVR2A Knockout via CRISPR/Cas9" in the results contains some issues with expression details. The results section should be more structured, with data presented in a more detailed and clear manner, ensuring that there is a clear connection between each experimental step and its corresponding result. For example, the sentence "Following multiple rounds of monoclonal culture, genotype identification, RT-qPCR and Western blotting (WB) analysis for screening, specific double-knockout monoclonal cell lines were distinctly chosen" contains redundant phrasing and unnecessary details, which affect the flow of the text.

      Thank you for your constructive feedback on the “Precision Genome Surgery: ACVR2A Knockout via CRISPR/Cas9” section. We agree that this section can be better structured to present the data in a more detailed and coherent manner. To address this, we have reorganized the results into distinct steps, ensuring a clear connection between each experimental step and its corresponding result. Redundant phrasing has been removed to improve the flow and readability of the text. The revised section emphasizes the purpose of each step, the screening process, and the specific results obtained.

      (3) The figure legends and panel labels in Figure 3 should be revised to ensure clarity and consistency. The figure legend should specify the exact panels (e.g., Figure 3A, 3B, 3C, etc.) and clearly describe the experimental conditions and results shown in each part.

      Thank you for pointing out the need for improved clarity and consistency in the figure legends and panel labels for Figure 3. We have revised the figure legend to specify each panel (e.g., Figure 3A, 3B, 3C, etc.) and included detailed descriptions of the experimental conditions and results displayed in each part. These updates aim to ensure better understanding and alignment between the figure legend and the panels.

      (4) Lack of In Vivo Validation of ACVR2A Knockout: The study does not include in vivo experiments to validate the effects of ACVR2A knockout. It would be important to investigate whether similar regulatory effects of ACVR2A on trophoblast cell migration and invasion can be observed in animal models or in larger clinical studies. The lack of in vivo data raises questions about the translational relevance of the findings.

      Thank you for highlighting the importance of in vivo validation to assess the translational relevance of our findings. While we acknowledge that in vivo experiments could provide additional insights into the role of ACVR2A in trophoblast migration and invasion, this study was primarily designed as an in vitro investigation to explore the molecular mechanisms underlying ACVR2A function in trophoblast cells. The choice of an in vitro model allowed us to perform precise and controlled mechanistic analyses, which are critical for establishing a foundation for future research. We agree that in vivo studies using animal models or larger clinical cohorts are important next steps to validate the regulatory effects of ACVR2A on trophoblast function and its contribution to PE pathogenesis. These directions will be pursued in future research to further establish the translational potential of our findings. We have included this perspective in the revised Discussion section.

      (5) TCF7/c-JUN Pathway in Clinical Samples: In the study of the TCF7/c-JUN pathway, the authors mention assessing protein expression in clinical samples through immunohistochemistry (IHC). However, the manuscript does not provide a clear explanation of how the findings from laboratory cell models (such as HTR8/SVneo and JAR) relate to the clinical samples. Specifically, while ACVR2A knockout is shown to affect these proteins at the cellular level, it is unclear whether this effect is observed in clinical samples. Therefore, further validation of the TCF7/c-JUN pathway in the cell models and exploration of its relationship with protein expression in clinical samples is necessary. Additional experiments, such as immunofluorescence staining or mass spectrometry, could further confirm the role of the TCF7/c-JUN pathway in cells and provide a more direct comparison with clinical data.

      Thank you for highlighting the need to connect findings from cell models to clinical samples, particularly with respect to the TCF7/c-JUN pathway. In response to your comment, we conducted additional experiments using Western blot analysis to evaluate the expression of ACVR2A, SMAD1/5, SMAD4, pSMAD1/5/9, and TCF7L1/TCF7L2 in PE placental tissues compared to normotensive controls (Figure 7A). The results demonstrated significantly reduced expression of these proteins in PE placentas, providing evidence that disruptions in the ACVR2A-SMAD and TCF7/c-JUN signaling pathways observed in vitro are also present in clinical samples.

      These findings strengthen the translational relevance of our study by directly linking the molecular mechanisms identified in cell models to clinical observations. We have updated the Results and Discussion sections to incorporate these new data, and we believe this addition addresses your concern about the relationship between in vitro and clinical findings.

    1. Reviewer #1 (Public review):

      Summary:

      In this report, the authors made use of a murine cell line derived from a MYC-driven liver cancer to investigate the gene expression changes that accompany the switch from normoxic to hypoxia conditions during 2D growth and the switch from 2D monolayer to 3D organoid growth under normoxic conditions. They find a significant (ca. 40-50%) overlap among the genes that are dysregulated in response to hypoxia in 2D cultures and in response to spheroid formation. Unsurprisingly, hypoxia-related genes were among the most prominently deregulated under both sets of conditions. Many other pathways pertaining to metabolism, splicing, mitochondrial electron transport chain structure and function, DNA damage recognition/repair and lipid biosynthesis were also identified.

      Comments on the revised manuscript:

      In my original review of this manuscript, I raised 11 points that I thought needed to be addressed and/or clarified by the authors. In response, they have provided an adequate answer to only one of these (point 6), which is little more than a more thorough description of how spheroids were generated. The remaining points that I raised, which would have provided more mechanistic insight into their study were addressed by the authors with the following such comments:

      - It is not the focus of this study (Points 1 and 4)

      - It is worthy of further validation (Point 2)

      - We apologize for not being able to validate everything (Point 3)

      - This reviewer has raised an interesting question. We are investigating this hypothesis and hopefully we can give a clear answer in the future (Point 5)

      - This is an excellent idea that we certainly will do it in our future experiments (Point 7)

      As to responses that the authors made to the other two reviewers' comments: Most pertained to cosmetic alterations involving clarification of methods, inclusion of a new figure or rearrangement of old figures. These were generally answered. However, in response to the last point raised by Rev. 3 to compare "sgRNA abundances at the earliest harvesting time with the distribution in the library...to see whether and to what extent selection has already taken place before the three culture conditions were established", the authors responded with the comment: "This is great point. Unfortunately, we did not perform such an analysis."

      I understand that it is often impossible to address all points raised by the reviewers. This can be for a variety of reasons and many times the omissions can be overlooked and accepted if the reviewer can be convinced that a good faith attempt has otherwise been made to address the other deficiencies. However, no such effort has been made here and the study remains deficient and largely descriptive as I pointed out in my original review.

    2. Author response:

      The following is the authors’ response to the original reviews

      Public Reviews:

      Reviewer #1 (Public review):

      Summary:

      In this report, the authors made use of a murine cell life derived from a MYC-driven liver cancer to investigate the gene expression changes that accompany the switch from normoxic to hypoxia conditions during 2D growth and the switch from 2D monolayer to 3D organoid growth under normoxic conditions. They find a significant (ca. 40-50%) overlap among the genes that are dysregulated in response to hypoxia in 2D cultures and in response to spheroid formation. Unsurprisingly, hypoxia-related genes were among the most prominently deregulated under both sets of conditions. Many other pathways pertaining to metabolism, splicing, mitochondrial electron transport chain structure and function, DNA damage recognition/repair, and lipid biosynthesis were also identified.

      We thank this reviewer for his/her time and efforts, and the insightful comments.

      Major comments:

      (1) Lines 239-240: The authors state that genes involved in DNA repair were identified as being necessary to maintain survival of both 2D and 3D cultures (Figure S6A). Hypoxia is a strong inducer of ROS. Thus, the ROS-specific DNA damage/recognition/repair pathways might be particularly important. The authors should look more carefully at the various subgroups of the many genes that are involved in DNA repair. They should also obtain at least a qualitative assessment of ROS and ROS-mediated DNA damage by staining for total and mitochondrial-specific ROS using dyes such as CM-H2-DCFDA and MitoSox. Actual direct oxidative damage could be assessed by immunostaining for 8-oxo-dG and related to the sub-types of DNA damage-repair genes that are induced. The centrality of DNA damage genes also raises the question as to whether the previously noted prominence of the TP53 pathway (see point 5 below) might represent a response to ROS-induced DNA damage.

      We thank this reviewer for the insightful comments, and agreed that ROS induced by hypoxia could play a role in modulating DNA repair and consequently cellular essentiality. Although pathway enrichment in Figure S6A (now as Figure 2-figure supplement 4A) showed that DNA repair pathway was essential to cell survival in hypoxia and 3D cultures, the genes associated with this pathway (Ddb1;Brf2;Gtf3c5;Guk1;Taf6) are not typical DNA repair genes. They are more likely involved in gene transcription. However, it will be interesting to see if they are specifically involved in DNA damage in response to ROS, which is out of focus of this study.

      (2) Because most of the pathway differences that distinguish the various cell states from one another are described only in terms of their transcriptome variations, it is not always possible to understand what the functional consequences of these changes actually are. For example, the authors report that hypoxia alters the expression of genes involved in PDH regulation but this is quite vague and not backed up with any functional or empirical analyses. PDH activity is complex and regulated primarily via phosphorylation/dephosphorylation (usually mediated by PDK1 and PDP2, respectively), which in turn are regulated by prevailing levels of ATP and ADP. Functionally, one might expect that hypoxia would lead to the down-regulation of PDH activity (i.e. increased PDH-pSer392) as respiration changes from oxidative to non-oxidative. This would not be appreciated simply by looking at PDH transcript levels. This notion could be tested by looking at total and phospho-PDH by western blotting and/or by measuring actual PDH activity as it converts pyruvate to AcCoA.

      We agreed with this reviewer that PDH activity regulation could be affected by multi-factors, and it is worthy of further validation by other approaches.

      (3) Line 439: Related to the above point: the authors state: "It is likely that blockade of acetyl-CoA production by PDH knockout may force cells to use alternative energy sources under hypoxic and 3D conditions, averting the Warburg effect and promoting cell survival under limited oxygen and nutrient availability in 3D spheroids." This could easily be tested by determining whether exogenous fatty acids are more readily oxidized by hypoxic 2D cultures or spheroids than occurs in normoxic 2D cultures.

      We thank for this suggestion. We apologized for not being able to validate everything.

      (4) Line 472: "Hypoxia induces high expression of Acaca and Fasn in NEJF10 cells indicating that hypoxia promotes saturated fatty acid synthesis...The beneficial effect of Fasn and Acaca KO to NEJF10 under hypoxia is probably due to reduction of saturated fatty acid synthesis, and this hypothesis needs to be tested in the future.". As with the preceding comment, this supposition could readily be supported directly by, for example, performing westerns blots for these enzymes and by showing that incubation of hypoxic 2D cells or spheroids converted more AcCoA into lipid.

      We thank for this suggestion. However, functional validation for the Fasn and Acaca KO is out of focus in this study.

      (5) In Supplementary Figure 2B&C, the central hub of the 2D normoxic cultures is Myc (as it should well be) whereas, in the normoxic 3D, the central hub is TP53 and Myc is not even present. The authors should comment on this. One would assume that Myc levels should still be quite high given that Myc is driven by an exogenous promoter. Does the centrality of TP53 indicate that the cells within the spheroids are growtharrested, being subjected to DNA damage and/or undergoing apoptosis?

      The predicted transcription factor activity analysis was based on the differential ATAC-seq peaks among different culture through pairwise comparisons. If TP53 and MYC were not present under that condition, it did not mean their activity was absent.

      “…the centrality of TP53 indicate that the cells within the spheroids are growth-arrested, being subjected to DNA damage and/or undergoing apoptosis?” This reviewer has raised an interesting question. We are investigating this hypothesis and hopefully we can give a clear answer in the future.

      (6) In the Materials and Methods section (lines 711-720), the description of how spheroid formation was achieved is unclear. Why were the cells first plated into non-adherent 96 well plates and then into nonadherent T75 flasks? Did the authors actually utilize and expand the cells from 144 T75 flasks and did the cells continue to proliferate after forming spheroids? Many cancer cell types will initially form monolayers when plated onto non-adherent surfaces such as plastic Petri dishes and will form spheroid-like structures only after several days. Other cells will only aggregate on the "non-adherent" surface and form spheroid-like structures but will not actually detach from the plate's surface. Have the authors actually documented the formation of true, non-adherent spheroids at 2 days and did they retain uniform size and shape throughout the collection period? The single photo in Supplementary Figure 1 does not explain when this was taken. The authors include a schematic in Figure 2A of the various conditions that were studied. A similar cartoon should be included to better explain precisely how the spheroids were generated and clarify the rationale for 96 well plating. Overall, a clearer and more concise description of how spheroids were actually generated and their appearance at different stages of formation needs to be provided.

      The cells were initially plated in non-adherent 96-well plates to facilitate the formation of spheroids in a controlled and uniform manner. As correctly mentioned by the reviewer, during the initial stages, cells cultured on non-adherent surfaces often form aggregates or clumps, and it takes a few days for them to develop into solid spheroids.

      In our study, we aimed to achieve 3D spheroid formation immediately following the transduction process to allow for screening under both 2D and 3D conditions. Plating the cells into 96-well plates enabled us to monitor and control the formation of spheroids in smaller volumes before scaling up the culture in non-adherent T75 flasks for subsequent experimental steps. This setup allows us to maintain gene editing processes under both 2D and 3D conditions.

      Regarding the proliferation and uniformity of spheroids:

      • Yes, the spheroids continued to proliferate after their formation.

      • True, non-adherent spheroids were documented as early as the next day. This was visually confirmed under microscopy, and size uniformity was maintained throughout the collection period by following optimized culture protocols.

      We also agreed with the reviewer’s suggestion to include a cartoon schematic similar to Figure 2A, illustrating the spheroid generation process and clarifying the rationale for using 96-well plates. We have included such a cartoon and speroid growth curve monitored by Incucyte as Figure 2-figure supplement 2.

      (7) The authors maintained 2D cultures in either normoxic or hypoxic (1% O2) states during the course of their experiments. On the other hand, 3D cultures were maintained under normoxic conditions, with the assumption that the interiors of the spheroids resemble the hypoxic interiors of tumors. However, the actual documentation of intra-spheroid hypoxia is never presented. It would be a good idea for the authors to compare the degree of hypoxia achieved by 2D (1% O2) and 3D cultures by staining with a hypoxia-detecting dye such as Image-iT Green. Comparing the fluorescence intensities in 2D cultures at various O2 concentrations might even allow for the construction of a "standard curve" that could serve to approximate the actual internal O2 concentration of spheroids. This would allow the authors to correlate the relative levels of hypoxia between 2D and 3D cultures.

      This is an excellent idea that we certainly will do it in our future experiments.

      (8) Related to the previous 2 points, the authors performed RNAseq on spheroids only 48 hours after initiating 3D growth. I am concerned that this might not have been a sufficiently long enough time for the cells to respond fully to their hypoxic state, especially given my concerns in Point 6. Might the results have been even more robust had the authors waited longer to perform RNA seq? Why was this short time used?

      We agreed with this reviewer. We were unsure if 48hours was an ideal timepoint. It might be necessary to perform a longitudinal experiment to harvest samples under different timepoints in the future experiments.

      (9) What happens to the gene expression pattern if spheroids are re-plated into standard tissue culture plates after having been maintained as spheroids? Do they resume 2D growth and does the gene expression pattern change back?

      This is a great question and we have never thought about what the gene expression pattern would be if speroids are re-plated in 2D. This could be a challenging experiment because the gene expression and epigenetic changes are timing related. However, the cells do grow well after re-plated in 2D.

      (10) Overall, the paper is quite descriptive in that it lists many gene sets that are altered in response to hypoxia and the formation of spheroids without really delving into the actual functional implications and/or prioritizing the sets. Some of these genes are shown by CRISPR screening to be essential for maintaining viability although in very few cases are these findings ever translated into functional studies (for example, see points 14 above). The list of genes and gene pathways could benefit from a better explanation and prioritization of which gene sets the authors believe to be most important for survival in response to hypoxia and for spheroid formation.

      This was a genome-wide study that integrated RNA-seq, ATAC-seq and CRISPR KO, providing resource to understand the oncogenic pathways in different culture conditions. We believe we have clearly articulated the important genes/pathways in our abstract.

      (11) The authors used a single MYC-driven tumor cell line for their studies. However, in their original paper (Fang, et al. Nat Commun 2023, 14: 4003.) numerous independent cell lines were described. It would help to know whether RNAseq studies performed on several other similar cell lines gave similar results in terms of up & down-regulated transcripts (i.e. representative of the other cell lines are NEJF10 cells).

      We have not generated RNA-seq data for these cell lines cultured in different conditions.

      Reviewer #2 (Public review):

      Summary:

      The manuscript by Fang et al., provides a tour-de-force study uncovering cancer cell's varied dependencies on several gene programs for their survival under different biological contexts. The authors addressed genomic differences in 2D vs 3D cultures and how hypoxia affects gene expression. They used a Myc-driven murine liver cancer model grown in 2D monolayer culture in normoxia and hypoxia as well as cells grown as 3D spheroids and performed CRISPR-based genome-wide KO screen to identify genes that play important roles in cell fitness. Some context-specific gene effects were further validated by in-vitro and in-vivo gene KO experiments.

      Strengths:

      The key findings in this manuscript are:

      (1) Close to 50% of differentially expressed genes were common between 2D Hypoxia and 3D spheroids conditions but they had differences in chromatin accessibility.

      (2) VHL-HIF1a pathway had differential cell fitness outcomes under 2D normoxia vs 2D hypoxia and 3D spheroids.

      (3) Individual components of the mitochondrial respiratory chain complex had contrasting effects on cell fitness under hypoxia.

      (4) Knockout of organogenesis or developmental pathway genes led to better cell growth specifically in the context of 3D spheroids and knockout of epigenetic modifiers had varied effects between 2D and 3D conditions.

      (5) Another key program that leads to cells fitness outcomes in normoxia vs hypoxia is the lipid and fatty acid metabolism.

      (6) Prmt5 is a key essential gene under all growth conditions, but in the context of 3D spheroids even partial loss of Prmt5 has a synthetic lethal effect with Mtap deletion and Mtap is epigenetically silenced specifically in the 3D spheroids.

      We appreciate this reviewer for acknowledging the strengths of our study.

      Issues to address:

      (1) The authors should clarify the link between the findings of the enrichment of TGFb-SMAD signaling REACTOME pathway to the findings that knocking out TGFb-SMAD pathway leads to better cell fitness outcomes for cells in the 3D growth conditions.

      We have clarified this link in abstract by saying “Notably, multicellular organogenesis signaling pathways including TGFb-SMAD, which is upregulated in 3D culture, specifically constrict the uncontrolled cell proliferation in 3D while inactivation of epigenetic modifiers (Bcor, Kmt2d, Mettl3 and Mettl14) has opposite outcomes in 2D vs. 3D:

      (2) Supplementary Figure 4C has been cited in the text but doesn't exist in the supplementary figures section.

      Sorry for this typo. It should be 5C which is Figure 2-figure supplement 3C in the new version of MS. We have corrected it now.

      (3) A small figure explaining this ABC-Myc driven liver cancer model in Supplementary Figure 1 would be helpful to provide context.

      We appreciate this suggestion. We have added a cartoon as Figure 1-figure supplement 1A to indicate the procedure for generation of this model.

      (4) The method for spheroids formation is not found in the method section.

      We described the method in our previous publication (Nature Communications 2023 Jul 6;14(1):4003.). However, we have added the information in method now, and the procedure is very simple (line 623-624). We found the murine liver cancer cell lines can readily form spheroids when they are cultured in low-attachment dish with standard DMEM complete media.

      (5) In Supplementary Figure 1b, the comparisons should be stated the opposite way - 3D vs 2D normoxia and 2D-Hypoxia vs 2D-Normoxia.

      We have made correction in the Figure legend of Figure S1B which is Figure 1B now in the new version of MS.

      (6) There are typos in the legend for Supplementary Figure 10.

      We have checked the typos.

      (7) Consider putting Supplementary Figure 1b into the main Figure 1.

      We have moved both Supplementary Figure 1a and 1b into main Figure 1 as Figure 1A and 1B. Hopefully, this will help the readers to catch the information easily.

      (8) Please explain only one timepoint (endpoint) for 3D spheroids was performed for the CRISPR KO screen experiment, while several timepoints were done for 2D conditions? Was this for technical convenience?

      As this reviewer speculated, indeed this was for technical convenience. We found that it was technically challenging to split the spheroids for CRISPR screening.

      (9) In line 372, it is indicated that Bcor KO (Fig 5e) had growth advantage - this was observed in only one of the gRNA -- same with Kmt2d KO in the same figure where there was an opposite effect. Please justify the use of only one gRNA.

      We actually used 4 gRNAs for each gene. In the heatmap, although one of the gRNA for each gene showed some levels of enrichment under hypoxic 2D condition, they were all highly enriched in 3D.

      (10) Why was CRISPR based KO strategy not used for the PRMT5 gene but rather than the use of shRNA.? Note that one of the shRNA for PRMT5 had almost no KO (PRMT5-shRNA2 Figure 7B) but still showed phenotype (Figure 7D) - please explain.

      We used shRNA as second approach for cross-validation. We agreed that the knockdown efficiency of shRNA2 was not as good as the others, with only about 40% knockdown efficiency.

      (11) In Figure 7D, which samples (which shRNA group) were being compared to do the t-test?

      The comparisons were for shCtrl and each of the shPRMT5. We have clarified this in figure legend.

      (12) In line 240, it is stated that oxphos gene set is essential for NEJF10 cell survival in both normoxia and hypoxia conditions. But shouldn't oxphos be non-essential in hypoxia as cells move away from oxphos and become glycolytic?

      This is a great question. While indeed hypoxia may promote the switch from oxphos to glycolysis, several studies showed that the low oxygen concentrations in hypoxic regions of tumors may not be limiting for oxphos, and ATP is generated by oxphos in tumors even at very low oxygen tensions (please see review Clin Cancer Res (2018) 24 (11): 2482–2490.). We therefore speculated that NEJF10 cells were still dependent on oxphos for ATP production under hypoxia. However, this needs further investigation. We have added this discussion in our manuscript (line 250-254).

      (13) In line 485 it is mentioned that Pmvk and Mvd genes which are involved in cholesterol synthesis when knocked out had a positive effect on cell growth in 3D conditions and since cholesterol synthesis is essential for cell growth how does this not matter much in the context of 3D - please explain.

      We thank this reviewer for this note. It seemed that only two gRNA for each were upregulated in 3D and it could be due to technical issue or clonal selection. We have deleted this sentence in our new version of MS.

      Reviewer #3 (Public review):

      Summary:

      In this study, Fang et al. systematically investigate the effects of culture conditions on gene expression, genome architecture, and gene dependency. To do this, they cultivate the murine HCC line NEJF10 under standard culture conditions (2D), then under similar conditions but under hypoxia (1% oxygen, 2D hypoxia) and under normoxia as spheroids (3D). NEJF10 was isolated from a marine HCC model that relies exclusively on MYC as a driver oncogene. In principle, (1) RNA-seq, (2) ATAC-seq and (3) genetic screens were then performed in this isogenic system and the results were systematically compared in the three cultivation methods. In particular, genome-wide screens with the CRISPR library Brie were performed very carefully. For example, in the 2D conditions, many different time points were harvested to control the selection process kinetically. The authors note differential dependencies for metabolic processes (not surprisingly, hypoxia signaling is affected) such as the regulation and activity of mitochondria, but also organogenesis signaling and epigenetic regulation.

      Strengths:

      The topic is interesting and relevant and the experimental set-up is carefully chosen and meaningful. The paper is well written. While the study does not reveal any major surprises, the results represent an important resource for the scientific community.

      We thank this reviewer for his/her positive comments.

      Weaknesses:

      However, this presupposes that the statistical analysis and processing are carried out very carefully, and this is where my main suggestions for revision begin. Firstly, I cannot find any information on the number of replicates in RNA- and ATAC-seq. This should be clearly stated in the results section and figure legends and cut-offs, statistical procedures, p-values, etc. should be mentioned as well. In principle, all NGS experiments (here ATAC- and RNA-seq) should be performed in replicates (at least duplicates, better triplicates) or the results should be validated by RT-PCR in independent biological triplicates. Secondly, the quantification of the analyses shown in the figures and especially in the legends is not sufficiently careful. Units are often not mentioned. Example Figure 4a: The legend says: 'gRNA reads' but how can the read count be -1? I would guess these are FC, log2FC, or Z-values. All figure legends need careful revision.

      Based upon the reviewer’s suggestions, we have added details about the replicates in figure legend. For gRNA read heatmap, the scale bar indicates the Z score. We have added the information in figure legends.

      Furthermore, I would find a comparison of the sgRNA abundances at the earliest harvesting time with the distribution in the library interesting, to see whether and to what extent selection has already taken place before the three culture conditions were established (minor point).

      This is great point. Unfortunately, we did not perform such an analysis.

      Recommendations for the authors:

      Reviewing Editor:

      There are three general issues:

      First, there is a lack of detail regarding much of the analysis. In some cases, this makes it difficult to assess the value of the data, albeit, there is generally a consensus the information is really interesting.

      Second, the findings - although provocative - lack mechanistic details and are focused more on descriptive findings. Hence, the manuscript would be improved by some effort at evaluating identified programs and providing some suggestions of mechanisms.

      Third, the authors need to put much more effort into the clarity and tightness of the presentation.

      We have made clarification in response to the reviewer’s comments.

      Reviewer #1 (Recommendations for the authors):

      Figure S1C. the labeling of the lower x-axis is inverted.

      Due to space limitation, we changed the figure orientation in our old version of MS. We have tilted the figure back in the new version, which is Figure 1-figure supplement 1B now.

    1. 埃斯库罗斯和索福克勒斯的悲剧正是这两种精神的完美结合——阿波罗的形式(戏剧结构、人物塑造)为狄奥尼索斯的原始冲动提供了表达载体,使观众在审美体验中暂时超越痛苦(知识库[5][7])。

      尼采在《悲剧的诞生》中提出,希腊悲剧的诞生是阿波罗精神(Apollo)与狄奥尼索斯精神(Dionysus)两种艺术冲动的完美结合。这种结合在埃斯库罗斯(Aeschylus)和索福克勒斯(Sophocles)的悲剧中尤为显著,他们的作品通过阿波罗的“形式”(戏剧结构、人物塑造等)为狄奥尼索斯的“原始冲动”(对生命痛苦的直面与超越)提供了表达载体,使观众在审美体验中暂时超越现实的痛苦。以下从多个维度展开分析:


      一、阿波罗精神:理性的形式与美的幻象

      1. 阿波罗的象征
      2. 理性与个体化:阿波罗代表太阳神,象征理性、秩序、个体化的美。尼采认为,阿波罗通过“梦”的意象,将混沌的现实转化为清晰的表象(如雕塑、戏剧的结构),为人类提供对痛苦的逃避(知识库[1][5][6])。
      3. 形式的完美:在戏剧中,阿波罗精神体现为严谨的结构、清晰的人物塑造、逻辑的情节发展,以及对悲剧主题的象征性表达。

      4. 在埃斯库罗斯和索福克勒斯作品中的体现

      5. 埃斯库罗斯
        • 《被缚的普罗米修斯》通过清晰的戏剧冲突(普罗米修斯与宙斯的对抗)和英雄形象的塑造,将人类对自由与反抗的原始冲动(狄奥尼索斯精神)包裹在史诗般的结构中。
        • 三联剧《俄瑞斯忒亚》通过严密的叙事逻辑(从杀戮到审判的循环)将血腥的复仇转化为对法律与秩序的哲学思考,体现了阿波罗式的理性框架。
      6. 索福克勒斯
        • 《俄狄浦王》以严谨的“悲剧结构”(如命运伏笔、突转与发现)将角色的挣扎(如俄狄浦王的无意识犯罪)转化为可理解的戏剧冲突,其人物塑造(如俄狄浦王的悲剧性尊严)体现了阿波罗式的“适度”与“个体化”(知识库[4][6][9])。

      二、狄奥尼索斯精神:原始冲动与生命的深渊

      1. 狄奥尼索斯的象征
      2. 非理性与集体狂喜:狄奥尼索斯代表酒神,象征非理性、醉狂、对个体界限的消融。通过“醉”的体验,人类直面生命的痛苦与无意义,并在集体情感中与宇宙本体(生命意志)融为一体(知识库[1][6][7])。
      3. 悲剧的原始力量:狄奥尼索斯精神揭示了生命的本质——痛苦、无常与毁灭,但通过艺术(如音乐与合唱)将这种痛苦转化为审美体验,从而超越它。

      4. 在希腊悲剧中的体现

      5. 合唱队的作用
        • 合唱队(如《阿伽门农》中的合唱队)以集体形式表达对命运的恐惧与敬畏,将个体的悲剧升华为对宇宙规律的共鸣,体现了狄奥尼索斯式的“集体狂喜”(知识库[4][8])。
      6. 命运主题
        • 俄狄浦王的悲剧并非个人过错,而是命运(狄奥尼索斯式的“原始冲动”)的必然。观众在悲剧中体验到对命运的恐惧与敬畏,从而在情感宣泄中获得“净化”(卡塔西斯)(知识库[6][9])。

      三、两者的结合:悲剧如何超越痛苦

      1. 形式与内容的统一
      2. 阿波罗的形式约束:通过戏剧的结构、人物的英雄性、语言的美感,将狄奥尼索斯的原始冲动转化为可接受的艺术形式,避免观众陷入纯粹的绝望(知识库[1][5][6])。
      3. 狄奥尼索斯的深层意义:在形式的框架下,悲剧揭示了生命的真相——痛苦与毁灭不可避免,但艺术使其成为一种审美体验,从而赋予其意义。

      4. 观众的审美体验

      5. 暂时超越痛苦:观众在悲剧中既感受到命运的残酷(狄奥尼索斯的启示),又被戏剧的美与秩序(阿波罗的形式)所吸引,从而在矛盾中获得一种“形而上的慰藉”。
      6. 尼采的“艺术救赎”:悲剧通过二者的结合,使观众在审美中暂时摆脱现实的焦虑,直面生命的真实,同时不陷入虚无(知识库[2][5][7])。

      四、案例分析:埃斯库罗斯与索福克勒斯的悲剧

      1. 埃斯库罗斯的《被缚的普罗米修斯》
      2. 阿波罗层面:普罗米修斯的形象是英雄的个体化(阿波罗式的“个体化原理”),其反抗宙斯的结构清晰,充满史诗的庄严。
      3. 狄奥尼索斯层面:普罗米修斯的痛苦(被缚、受罚)象征人类对命运的抗争,观众在悲剧中感受到对自由意志的崇敬,以及对宇宙力量的敬畏(知识库[9])。

      4. 索福克勒斯的《俄狄浦王》

      5. 阿波罗层面:戏剧结构精巧(如预言的伏笔与现实的反讽),人物塑造充满尊严(俄狄浦王的智慧与悲剧性选择)。
      6. 狄奥尼索斯层面:俄狄浦王的毁灭揭示了命运的不可抗性,观众在恐惧与怜悯中体验到生命的无意义,但通过艺术形式将其转化为对人性深度的探索(知识库[4][6])。

      五、尼采的批判与哲学意义

      1. 对苏格拉底理性主义的批判
      2. 尼采认为,苏格拉底用逻辑与道德解释世界(理性主义),破坏了悲剧中阿波罗与狄奥尼索斯的平衡,导致希腊悲剧的消亡(知识库[1][9])。
      3. 悲剧的没落使人类失去了通过艺术超越痛苦的能力,转而陷入理性的虚幻乐观,最终导向现代的虚无主义(知识库[2][7])。

      4. 对现代文明的启示

      5. 希腊悲剧的结合模式为人类提供了对抗虚无的路径:通过艺术(而非理性)直面生命的痛苦,并在形式中赋予其意义。
      6. 尼采呼吁现代人回归这种“悲剧精神”,在理性与非理性的平衡中重获对生命的热情(知识库[2][7][9])。

      总结:悲剧的双重救赎

      埃斯库罗斯和索福克勒斯的悲剧之所以伟大,正因其完美融合了阿波罗与狄奥尼索斯两种精神:<br /> - 阿波罗的形式为人类提供了理解痛苦的框架,使其可被审美化;<br /> - 狄奥尼索斯的原始冲动则揭示了生命的真实本质,迫使观众直面深渊。<br /> 二者的结合使希腊悲剧成为一种“形而上的慰藉”——在美与毁灭的碰撞中,人类既承认痛苦的必然,又在艺术中超越它,这正是尼采所推崇的悲剧哲学的核心。

    1. Author response:

      The following is the authors’ response to the original reviews

      Reviewer #1:

      Summary:

      The authors address the role of the centromere histone core in force transduction by the kinetochore.

      Strengths:

      They use a hybrid DNA sequence that combines CDEII and CDEIII as well as Widom 601 so they can make stable histones for biophysical studies (provided by the Widom sequence) and maintain features of the centromere (CDE II and III).

      Weaknesses:

      The main results are shown in one figure (Figure 2). Indeed the Centromere core of Widom and CDE II and III contribute to strengthening the binding force for the OA-beads. The data are very nicely done and convincingly demonstrate the point. The weakness is that this is the entire paper. It is certainly of interest to investigators in kinetochore biology, but beyond that, the impact is fairly limited in scope.

      This reviewer might have missed that this is a Research Advance, not an article. Research Advances are limited in scope by definition and provide a new development that builds on research reported in a prior paper. They can be of any length. Our Research Advance builds on our prior work, Hamilton et al., 2020 and provides the new result that native centromere sequences strengthen the attachment of the kinetochore to the nucleosome.

      Reviewer #2:

      Summary:

      This paper provides a valuable addendum to the findings described in Hamilton et al. 2020 (https://doi.org/10.7554/eLife.56582). In the earlier paper, the authors reconstituted the budding yeast centromeric nucleosome together with parts of the budding yeast kinetochore and tested which elements are required and sufficient for force transmission from microtubules to the nucleosome. Although budding yeast centromeres are defined by specific DNA sequences, this earlier paper did not use centromeric DNA but instead the generic Widom 601 DNA. The reason is that it has so far been impossible to stably reconstitute a budding yeast centromeric nucleosome using centromeric DNA.

      In this new study, the authors now report that they were able to replace part of the Widom 601 DNA with centromeric DNA from chromosome 3. This makes the assay more closely resemble the in vivo situation. Interestingly, the presence of the centromeric DNA fragment makes one type of minimal kinetochore assembly, but not the other, withstand stronger forces.

      We thank the reviewer for their careful and positive assessment of our work.

      Which kinetochore assembly turned out to be affected was somewhat unexpected, and can currently not be reconciled with structural knowledge of the budding yeast centromere/kinetochore. This highlights that, despite recent advances (e.g. Guan et al., 2021; Dendooven et al., 2023), aspects of budding yeast kinetochore architecture and function remain to be understood and that it will be important to dissect the contributions of the centromeric DNA sequence.

      We couldn’t agree more.

      Given the unexpected result, the study would become yet more informative if the authors were able to pinpoint which interactions contribute to the enhanced force resistance in the presence of centromeric DNA.

      Strength:

      The paper demonstrates that centromeric DNA can increase the attachment strength between budding yeast microtubules and centromeric nucleosomes.

      Weakness:

      How centromeric DNA exerts this effect remains unclear.

      Recommendations for the authors:

      Reviewer #2 (Recommendations for the authors):

      (1) Additional specific mutants would be helpful in interpreting the effect observed. The authors speculate that a small segment of OA near the DNA (based on Dendooven et al., 2023) could be important. Would it be possible to introduce specific mutations and test this?

      This would be an interesting study but is far beyond the scope of a Research Advance. In fact, it would make a nice thesis project for a new student. Although perhaps not obvious, these studies require a large set of reagents including wrapped nucleosomes, which must be made fresh (they cannot be frozen) and five purified recombinant complexes, purified by specialized protocols that maintain their activity. Moreover, each datapoint is gathered one at a time. For example, the data in Figure 2 in this manuscript includes 343 datapoints acquired one at a time over the course of 1.5 years.  

      (2) Please provide the sequences of the other CEN3-W601 chimeras that were tested and did NOT stably wrap centromeric histone octamers. This may help others to design yet different constructs in the future. (Maybe the information is there and I didn't see it?)

      We fully agree and thank the reviewer for this excellent suggestion. The sequences and summaries of their wrapping stability are now provided in Table 3, page 17.

      (3) I wonder whether the authors tested the C0N3 sequence used in Dendooven et al., 2023. If not, could it be tested? This would more tightly couple the functional assay shown here with the structural work.

      We did not test the CON3 sequence, which was published several years after the start of this work. We agree that a tight coupling between the functional assay and the structural work would be useful. However, we also see the advantage of being able to go beyond the structural work and include even more CEN3 sequence than has so far been possible in the structural work.  

      In addition to measuring the role of DNA sequence in Okp1/Ame1 attachment to the nucleosome, we were interested in the role of DNA sequence in the attachment of Mif2. Therefore, we included all 35 bp of the Mif2 footprint in our chimeric CCEN DNA sequence. CON3 only includes 8 bp from CDEII. We did produce stable nucleosomes using CEN3-601 from Guan et al. (see Table 3). Again, CEN3-601 only includes 8 bp of the Mif2 footprint so we opted to study nucleosomes wrapped in our CCEN DNA with the entire Mif2 footprint. Curiously we found that even the entire Mif2 footprint was not enough to find the DNA sequence specificity seen in the EMSA experiments reported by Xiao et al., 2017.

      To help readers understand the differences between all these constructs, we have included them in Table 3.

      (4) Would an AlphaFold 3 prediction of the assemblies used in this paper be feasible and useful?

      The structures of the Dam1 complex (Jenni et al., 2018), Ndc80 complex (Zahm, et al., 2023 and references therein), MIND complex (Dimitrova et al., 2016), OA complex (Dendooven et al., 2023), and the nucleosome (Xaio et al., 2017; Yan et al., 2019; Guan et al., 2021; Dendooven et al., 2023) are published. The interactions between many of these complexes are understood beyond the level that AlphaFold3 could provide (Dimitrova et al., 2016; Dendooven et al., 2023). One of the main questions is how Mif2 interacts with the nucleosome and the other components of the kinetochore. Even structural analyses that included Mif2 in the assembly detect little or no Mif2 in the final structure. Unfortunately, AlphaFold3 is also not helpful as it predicts only the structure of the dimerization domain, which was already known (Cohen et al., 2008).

      AlphaFold3 predicts the rest of Mif2 is largely unstructured with several alpha helices predicted with low confidence.

      (5) Given that the centromeric DNA piece included should be able to bind the CBF3 complex, would it be possible to add this complex and test the effect on force transmission?

      This would be an interesting experiment, and we do expect CBF3 to bind. As stated above, this is far beyond the scope of this Research Advance. In our experience, with each new kinetochore subcomplex that we add into our reconstitutions, there are new challenges purifying the subcomplex in active form and in sufficient quantity. We are eager to add CBF3 but this is not something we can pull off in the context of this Research Advance. Thank you again for the time and energy spent reviewing our manuscript

    1. Reviewer #1 (Public review):

      The authors set out to analyse the roles of the teichoic acids of Streptococcus pneumoniae in supporting the maintenance of the periplasmic region. Previous work has proposed the periplasm to be present in Gram positive bacteria and here advanced electron microscopy approach was used. This also showed a likely role for both wall and lipo-teichoic acids in maintaining the periplasm. Next, the authors use a metabolic labelling approach to analyse the teichoic acids. This is a clear strength as this method cannot be used for most other well studied organisms. The labelling was coupled with super-resolution microscopy to be able to map the teichoic acids at the subcellular level and a series of gel separation experiments to unravel the nature of the teichoic acids and the contribution of genes previously proposed to be required for their display. The manuscript could be an important addition to the field but there are a number of technical issues which somewhat undermine the conclusions drawn at the moment. These are shown below and should be addressed. More minor points are covered in the private

      Recommendations for Authors.

      Weaknesses to be addressed:

      (1) l. 144 Was there really only one sample that gave this resolution? Biological repeats of all experiments are required.

      (2) Fig. 4A. Is the pellet recovered at "low" speeds not just some of the membrane that would sediment at this speed with or without LTA? Can a control be done using an integral membrane protein and Western Blot? Using the tacL mutant would show the behaviour of membranes alone.

      (3) Fig. 4A. Using enzymatic digestion of the cell wall and then sedimentation will allow cell wall associated proteins (and other material) to become bound to the membranes and potentially effect sedimentation properties. This is what is in fact suggested by the authors (l. 1000, Fig. S6). In order to determine if the sedimentation properties observed are due to an artefact of the lysis conditions a physical breakage of the cells, using a French Press, should be carried out and then membranes purified by differential centrifugation. This is a standard, and well-established method (low-speed to remove debris and high-speed to sediment membranes) that has been used for S. pneumoniae over many years but would seem counter to the results in the current manuscript (for instance Hakenbeck, R. and Kohiyama, M. (1982), Purification of Penicillin-Binding Protein 3 from Streptococcus pneumoniae. European Journal of Biochemistry, 127: 231-236).

      (4) l. 303-305. The authors suggest that the observed LTA-like bands disappear in a pulse chase experiment (Fig. 6B). What is the difference between this and Fig. 5B, where the bands do not disappear? Fig. 5C is the WT and was only pulse labelled for 5 min and so would one not expect the LTA-like bands to disappear as in 6B?

      (5) Fig. 6B, l. 243-269 and l. 398-410. If, as stated, most of the LTA-like bands are actually precursor then how can the quantification of LTA stand as stated in the text? The "Titration of Cellular TA" section should be re-evaluated or removed? If you compare Fig. 6C WT extract incubated at RT and 110oC it seems like a large decrease in amount of material at the higher temperature. Thus, the WT has a lot of precursors in the membrane? This needs to be quantified.

      (6) L. 339-351, Fig. 6A. A single lane on a gel is not very convincing as to the role of LytR. Here, and throughout the manuscript, wherever statements concerning levels of material are made, quantification needs to be done over appropriate numbers of repeats and with densitometry data shown in SI.

      (7) 14. l. 385-391. Contrary to the statement in the text, the zwitterionic TA will have associated counterions that results in net neutrality. It will just have both -ve and +ve counterions in equal amounts (dependent on their valency), which doesn't matter if it is doing the job of balancing osmolarity (rather than charge).

      Comments on revisions:

      The resubmitted manuscript now contains new data and changes to the text.

      The authors have largely covered my previous points in both sets of reviews (Public/Recommendations).

      Public Review Points:

      1 & 6: I still do not see a reproducibility statement as such, with details of the number of biological repeats etc.

      2 & 3. Fig S7 seems to be quite telling. As predicted after physical breakage the membrane proteins sediment at high speed (rather than low speed). This presumably also means that the LTA comes down at high and not low speed. LTA was not measured due to cost of reagents. The Microfluidizer breaks the cells using a shear force and thus is unlikely to create very small membrane fragments. Thus, the sedimentation properties of membranes containing LTA are likely dependent on the way in which the cells are lysed. It is therefore worthwhile qualifying the statements on l. 35-36, 46-47 and 212 (as Ref 8 used mechanical breakage). This will give better direction to those in the field following up the findings.

      It is also a little alarming that the mutanolysin is contaminated by protease and one hopes this does not affect any of the properties of the materials being analysed.

    2. Author response:

      The following is the authors’ response to the original reviews

      Public Reviews:

      Reviewer #1 (Public review):

      The authors set out to analyse the roles of the teichoic acids of Streptococcus pneumoniae in supporting the maintenance of the periplasmic region. Previous work has proposed the periplasm to be present in Gram positive bacteria and here advanced electron microscopy approach was used. This also showed a likely role for both wall and lipo-teichoic acids in maintaining the periplasm. Next, the authors use a metabolic labelling approach to analyse the teichoic acids. This is a clear strength as this method cannot be used for most other well studied organisms. The labelling was coupled with super-resolution microscopy to be able to map the teichoic acids at the subcellular level and a series of gel separation experiments to unravel the nature of the teichoic acids and the contribution of genes previously proposed to be required for their display. The manuscript could be an important addition to the field but there are a number of technical issues which somewhat undermine the conclusions drawn at the moment. These are shown below and should be addressed. More minor points are covered in the private Recommendations for Authors.

      Weaknesses to be addressed:

      (1) l. 144 Was there really only one sample that gave this resolution? Biological repeats of all experiments are required.

      CEMOVIS is a very challenging method that is not amenable to numerous repeats. However, multiple images were recorded from at least two independent samples for each strain. Additional sample images are shown in a new Fig. S3.

      CETOVIS is even more challenging (only two publications in Pubmed since 2015) and was performed on a single ultrathin section that, exceptionally, laid perfectly flat on the EM grid, allowing tomography data acquisition on ∆tacL cells. The reconstructed tomogram confirmed the absence of a granular layer in the depth of the section. Additionally, the numbering of Fig. S4A-B (previously misidentified as Fig. S2A-B) has been corrected in the text of V2.

      (2) Fig. 4A. Is the pellet recovered at "low" speeds not just some of the membrane that would sediment at this speed with or without LTA? Can a control be done using an integral membrane protein and Western Blot? Using the tacL mutant would show the behaviour of membranes alone.

      We think that the pellet is not just some of the membrane but most of it. In support of this view, the “low” speed pellets after enzymatic cell lysis contain not just some membrane lipids, but most of them (Fig. S10A). We therefore expect membrane proteins to be also present in this fraction. We performed a Western blot using antibodies against the membrane protein PBP2x (new Fig. S7C). Unfortunately, no signal was detected most likely due to protein degradation from contaminant proteases that we could trace to the purchased mutanolysin. The same sedimentation properties were observed with the ∆tacL strain as shown in Fig. 6A. However, in the ∆tacL strain the membrane pellet still contains membrane-bound TA precursors. It is therefore impossible to test definitely if pneumococcal membranes totally devoid of TA would sediment in the same way.

      (3) Fig. 4A. Using enzymatic digestion of the cell wall and then sedimentation will allow cell wall associated proteins (and other material) to become bound to the membranes and potentially effect sedimentation properties. This is what is in fact suggested by the authors (l. 1000, Fig. S6). In order to determine if the sedimentation properties observed are due to an artefact of the lysis conditions a physical breakage of the cells, using a French Press, should be carried out and then membranes purified by differential centrifugation. This is a standard, and well-established method (low-speed to remove debris and high-speed to sediment membranes) that has been used for S. pneumoniae over many years but would seem counter to the results in the current manuscript (for instance Hakenbeck, R. and Kohiyama, M. (1982), Purification of Penicillin-Binding Protein 3 from Streptococcus pneumoniae. European Journal of Biochemistry, 127: 231-236).

      Thank you for this suggestion. We have tested this hypothesis by breaking cells with a Microfluidizer followed by differential centrifugation. This experiment, which requires an important minimal volume, was performed with unlabeled cells (due to the cost of reagents) and assessed by Western blot using antibodies against the membrane protein PBP2x (new Fig. S7C). In this case, the majority of the membrane material was found in the high-speed pellet, as expected.

      We also applied the spheroplast lysis procedure of Flores-Kim et al. to the labeled cells, and found that most of the labeled material sedimented at low speed (new Fig. S7B), as observed with our own procedure.

      With these new results, the section on membrane density has been removed from the Supplementary Information. Instead, the fractionation is further discussed in terms of size of membrane fragments and presence of intact spheroplasts in the notes in Supplementary Information preceding Fig. S7.

      (4) l. 303-305. The authors suggest that the observed LTA-like bands disappear in a pulse chase experiment (Fig. 6B). What is the difference between this and Fig. 5B, where the bands do not disappear? Fig. 5C is the WT and was only pulse labelled for 5 min and so would one not expect the LTA-like bands to disappear as in 6B?

      Fig. 6B shows a pulse-chase experiment with strain ∆tacL, whereas Fig. 5C shows a similar experiment with the parental WT strain. The disappearance of the LTA-like band pattern with the ∆tacL strain (Fig. 6B), and their persistence in the WT strain (Fig. 5C), indicate that these bands are the undecaprenyl-linked TA in ∆tacL and proper LTA in the WT. A sentence has been added to better explain this point in V2.

      Note that we have exchanged the previous Fig. 5C and Fig. S13B, so that the experiments of Fig. 5A and 5C are in the same medium, as suggested by Reviewer #2.

      (5) Fig. 6B, l. 243-269 and l. 398-410. If, as stated, most of the LTA-like bands are actually precursor then how can the quantification of LTA stand as stated in the text? The "Titration of Cellular TA" section should be re-evaluated or removed? If you compare Fig. 6C WT extract incubated at RT and 110oC it seems like a large decrease in amount of material at the higher temperature. Thus, the WT has a lot of precursors in the membrane? This needs to be quantified.

      Indeed, the quantification of the ratio of LTA and WTA in the WT strain rests on the assumption that the amount of membrane-linked polymerized TA precursors is negligible in this strain. This assumption is now stated in the Titration section. We think it is the case. The true LTA and TA precursors do not have exactly the same electrophoretic mobility, being shifted relative to each other by about half a ladder “step”. This difference is visible when samples are run in adjacent lanes on the same gel, as in the new Fig. 6C. The difference of migration was well documented in the original paper about the deletion of tacL, although tacL was known as rafX at that time, and the ladders were misidentified as WTA (Wu et al. 2014. A novel protein, RafX, is important for common cell wall polysaccharide biosynthesis in Streptococcus pneumoniae: implications for bacterial virulence. J Bacteriol. 196, 3324-34. doi: 10.1128/JB.01696-14). This reference was added in V2. The experiment in the new Fig. 6C was repeated to have all samples on the same gel and treated at a lower temperature. The minor effect on the amount of LTA when WT cells are heated at pH 4.2 may be due to the removal of some labeled phosphocholine. We have NMR evidence that the phosphocholine in position D is labile to acidic treatment of LTA, which may lack in some cases, as reported by Hess et al. (Nat Commun. 2017 Dec 12;8(1):2093. doi: 10.1038/s41467-017-01720-z).

      (6) L. 339-351, Fig. 6A. A single lane on a gel is not very convincing as to the role of LytR. Here, and throughout the manuscript, wherever statements concerning levels of material are made, quantification needs to be done over appropriate numbers of repeats and with densitometry data shown in SI.

      Yes indeed. Apart from the titration of TA in the WT strain, we haven’t yet carried out a thorough quantification of TA or LTA/WTA ratio in different strains and conditions, although we intend to do so in a follow-up study, using the novel opportunities offered by the method presented here.

      However, to better substantiate our statement regarding the ∆lytR strain, we have quantified two experiments performed in C-medium with azido-choline, and two experiments of pulse labeling in BHI medium. The results are presented in the additional supplementary Fig. S14. The value of 51% was a calculation error, and was corrected to 41%. Likewise, the decrease in the WTA/LTA ratio was corrected to 5 to 7-fold.

      (7) 14. l. 385-391. Contrary to the statement in the text, the zwitterionic TA will have associated counterions that result in net neutrality. It will just have both -ve and +ve counterions in equal amounts (dependent on their valency), which doesn't matter if it is doing the job of balancing osmolarity (rather than charge).

      Thank you for pointing out this point. The paragraph has been corrected in V2.

      Reviewer #2 (Public review):

      The Gram-positive cell wall contains for a large part of TAs, and is essential for most bacteria. However, TA biosynthesis and regulation is highly understudied because of the difficulties in working with these molecules. This study closes some of our important knowledge gaps related to this and provides new and improved methods to study TAs. It also shows an interesting role for TAs in maintaining a 'periplasmic space' in Gram positives. Overall, this is an important piece of work. It would have been more satisfying if the possible causal link between TAs and periplasmic space would have been more deeply investigated with complemented mutants and CEMOVIS. For the moment, there is clearly something happening but it is not clear if this only happens in TA mutants or also in strains with capsules/without capsules and in PG mutants, or in lafB (essential for production of another glycolipid) mutants. Finally, some very strong statements are made suggesting several papers in the literature are incorrect, without actually providing any substantiation/evidence supporting these claims. Nevertheless, I support the publication of this work as it pioneers some new methods that will definitively move the field forward.

      Recommendations for the authors:

      Reviewer #1 (Recommendations for the authors):

      (1) l. 55 It is stated that TA are generally not essential. This needs to be introduced in a little more detail as in several species they are collectively. Need some more references here to give context.

      We have expended the paragraph and added a selection of references in V2.

      (2) l. 63 and Fig. 1A. Is the model based on the images from this paper? Is the periplasm as thick as the peptidoglycan layer? Would you not expect the density of WTA to be the same throughout the wall, rather than less inside? Do the authors think that the TA are present as rods in the cell envelope and because of this the periplasm looks a little like a bilayer, is this so? Is the relative thickness of the layers based on the data in the paper (Table 1)?

      The model proposed in Fig. 1A is not based on our data. It is a representation of the model proposed by Harold Erickson, and the appropriate reference has been added to the figure legend in V2. We do not speculate on the relative density of WTA inside the peptidoglycan layer, at the surface or in the periplasm. The only constraint from the model is that the density of WTA in the periplasm should be sufficient for self-exclusion and allow the brush polymer theory to apply. The legend has been amended in V2.

      We indeed think that the bilayer appearance of the periplasmic space in the wild type strain, and the single layer periplasmic space in the ∆tacL and ∆lytR support the Erickson’s model. Although the model was drawn arbitrarily, it turns out that the relative thickness of the peptidoglycan and periplasmic scale is in rough agreement with the measurements reported in Table 1.

      (3) Fig. 2. It is hard to orient oneself to see the layers. The use of the term periplasmic space (l. 132) and throughout is probably not wise as it is not a space.

      We prefer to retain this nomenclature since the term periplasmic space has been used in all the cell envelope CEMOVIS publications and is at the core of Erickson’s hypothesis about these observations and teichoic acids.

      (4) L. 147. This is not referring to Fig. S2A-B as suggested but Fig. S3A-B.

      This has been corrected.

      (5) l. 148. How do you know the densities observed are due to PG or certainly PG alone? Perhaps it is better to call this the cell wall.

      Yes. Cell wall is a better nomenclature and the text and Table 1 have been corrected in V2, in accordance with Fig. 2.

      (6) l. 165. It is also worth noting that peripheral cell wall synthesis also happens at the same site so this may well not be just division.

      Yes. We have replaced “division site” by “mid-cell” in V2.

      (7) l. 214 What is the debris? If PG digestion has been successful then there will be marginal debris. Is this pellet translucent (like membranes)? If you use fluorescently labelled PG in the preparation has it all disappeared, as would be expected by fully digested and solubilised material?

      In traditional protocols of bacterial membrane preparation, a low-speed centrifugation is first performed to discard “debris” that to our knowledge have not been well characterized but are thought to consist of unbroken cells and large fragments of cell wall. After enzymatic degradation of the pneumococcal cell wall, the low-speed pellet is not translucent as in typical membrane pellets after ultracentrifugation, but is rather loose, unlike a dense pellet of unbroken cells. A description of the pellet appearance was added in V2.

      It is a good idea to check if some labeled PG is also pelleted at low-speed after digestion. In a double labeling experiment using azido-choline and a novel unpublished metabolic probe of the PG, we found that the PG was fully digested and labeled fragments migrated as a couple of fuzzy bands likely corresponding to different labeled peptides. These species were not pelleted at low speed.

      (8) l. 219. Can you give a reference to certify that the low mobility material is WTA? Why does it migrate differently than LTA? Or is the PG digestion not efficient?

      WTA released from sacculi by alkaline lysis were found to migrate as a smear at the top of native gels revealed by alcian-blue silver staining, which is incompatible with SDS (Flores-Kim, 2019, 2022). The references have be added in V2. It could be argued in this case that the smearing was due to partial degradation of the WTA by the alkaline treatment.

      Bui et al. (2012) reported the preparation of WTA by enzymatic digestion of sacculi, but the resulting WTA were without muropeptide, presumably due to a step of boiling at pH 5 used to deactivate the enzymes.

      To our knowledge, this is the first report of pneumococcal WTA prepared by digestion of sacculi and analyzed by SDS-PAGE. Since the migration of WTA in native and SDS-PAGE is similar, we hypothesize that they do not interact significantly with the dodecyl sulphate, in contrast to the LTA, which bear a lipidic moiety. The fuzziness of the WTA migration pattern may also result from the greater heterogeneity due to the attached muropeptide, such as different lengths (di-, tetra-saccharide…), different peptides despite the action of LytA (tri-, tetra-peptide…), different O-acetylation status, etc.

      (9) L. 226-227, Fig S8. Presumably several of the major bands on the Coomassie stained gel are the lysozyme, mutanolysin, recombinant LytA, DNase and RNase used to digest the cell wall etc.? Can the sizes of these proteins be marked on the gel. Do any of them come down with the material at low-speed centrifugation?

      We have provided a gel showing the different enzymes individually and mixed (new Fig. S9G). While performing several experiments of this type, we found that the mutanolysin might be contaminated with proteases. The enzymes do not appear to sediment at low speed.

      (10) Fig. S9B. It is difficult to interpret what is in the image as there appear to be 2 populations of material (grey and sometimes more raised). Does the 20,000 g material look the same?

      Fig. S10B is a 20,000 × g pellet. We agree that there appears to be two types of membrane vesicles, but we do not know their nature.

      (11) l. 277 and Fig. 5A. Why is it "remarkable" that there are apparently more longer LTA molecules as the cell reach stationary phase?

      This is the first time that a change of TA length is documented. Such a change could conceivably have consequences in the binding and activity of CBPs and the physiology of the cell envelope in general. These questions should be adressed in future studies.

      (12) l. 280. How do you know which is the 6-repeat unit?

      It is an assumption based on previous analyses by Gisch et al.( J Biol Chem 2013, 288(22):15654-67. doi: 10.1074/jbc.M112.446963). The reference was added.

      (13) Fig. 5A and C. Panel C, the cells were grown in a different medium and so are not comparable to Panel A. Why is Fig. S12B not substituted for 5B? Presumably these are exponential phase cells.

      We have interverted the Fig. S13B and 5C in V2, as suggested, and changed the text and legends accordingly.

      Reviewer #2 (Recommendations for the authors):

      L30: vitreous sections?

      Corrected in V2.

      L32: as their main universal function --> as a universal function. To show it's the main universal function, you will need to look at this across various bacterial species.

      Changed to “possible universal function” in V2.

      L35: enabled the titration the actual --> titration of the actual?

      Corrected in V2.

      L34: consider breaking up this very long sentence.

      Done in V2.

      L37: may compensate the absence--> may compensate for the absence.

      Corrected in V2.

      L45: Using metabolic labeling and electrophoresis showed --> Metabolic labeling and...

      Corrected in V2.

      L46: This finding casts doubts on previous results, since most LTA were likely unknowingly discarded in these studies. This needs to be rephrased and is unnecessarily callous. While the current work casts doubts on any quantitative assessments of actual LTA levels measured in previous studies, it does not mean any qualitative assessments or conclusions drawn from these experiments are wrong. Better would be to say: These findings suggest that previously reported quantitative assessments of LTA levels are likely underestimating actual LTA levels, since much of the LTA would have been unknowingly discarded.

      If the authors do think that actual conclusions are wrong in previous work, then they need to be more explicit and explain why they were wrong.

      Yes indeed. The statement was toned down in V2.

      L55: Although generally non-essential. I would remove or rephrase this statement. I don't think any TA mutant will survive out in the wild and will be essential under a certain condition. So perhaps not essential for growth under ideal conditions, but for the rest pretty essential.

      The paragraph was amended by qualifying the essentiality to laboratory conditions and including selected references.

      L95: Note that the prevailing model until reference 20 (Gibson and Veening) was that the TA is polymerized intracellularly (see e.g. Figure 2 of PMID: 22432701, DOI: 10.1089/mdr.2012.0026). This intracellular polymerisation model seemed unlikely according to Gibson and Veening ('As TarP is classified by PFAM as a Wzy-type polymerase with predicted active site outside the cell, we speculate that TarP and TarQ polymerize the TA extracellularly in contrast to previous reports.'), but there is no experimental evidence as far as this referee knows of either model being correct.

      Despite the lack of experimental evidence, we think that Gibson and Veening are very likely correct, based on their argument, and also by analogy with the synthesis of other surface polysaccharides from undecaprenyl- or dolichol-linked precursors. It is unfortunate that Figure 2 of PMID: 22432701, DOI: 10.1089/mdr.2012.0026 was published in this way, since there was no evidence for a cytoplasmic polymerization, to our knowledge.

      L97: It is commonly believed, although I'm not sure it has ever been shown, that the capsule is covalently attached at the same position on the PG as WTA. Therefore, there must be some sort of regulation/competition between capsule biosynthesis and WTA biosynthesis (see also ref. 21). The presence of the capsule might thus also influence the characteristics of the periplasmic space. Considering that by far most pneumococcal strains are encapsulated, the authors should discuss this and why a capsule mutant was used in this study and how translatable their study using a capsule mutant is to S. pneumoniae in general.

      A paragraph was added in the Introduction of V2 to present the complication and a sentence was added at the end of the discussion to mention that this should be studied in the future.

      L102: Ref 29 should probably be cited here as well?

      Since in Ref 29 (Flores-Kim et al. 2019) there is a detectable amount of LTA (presumably precursors TA) in the ∆tacL stain, we prefer to cite only Hess et al. 2017 regarding the absence of LTA in the absence of TacL. However, we added in V2 a reference to Flores-Kim et al. 2019 in the following paragraph regarding the role of the LTA/WTA ratio.

      L106: dependent on the presence of the phosphotransferase LytR (21). --> dependent on the presence of the phosphotransferase LytR, whose expression is upregulated during competence (21).

      Corrected in V2.

      L119: I fail to see how the conclusions drawn by other groups (I assume the authors mean work from the Vollmer, Rudner, Bernhardt, Hammerschmidt, Havarstein, Veening groups?) are invalid if they compared WTA:LTA ratios between strains and conditions if they underestimated the LTA levels? Supposedly, the LTA levels were underestimated in all samples equally so the relative WTA/LTA ratio changes will qualitatively give the same outcome? I agree that these findings will allow for a reassessment of previous studies in which presumably too low LTA levels were reported, but I would not expect a difference in outcome when people compared WTA:LTA ratios between strains?

      The sentence was rephrased in V2 to be neutral regarding previous work and rather emphasize future possibilities.

      L131: Perhaps it would be good to highlight that such a conspicuous space has been noticed before by other EM methods (see e.g. Figs.4 and 5 or ref 19, or one of the most clear TEM S. pneumoniae images I have seen in Fig. 1F of Gallay et al, Nat. Micro 2021). However, always some sort of staining had previously been performed so it was never clear this was a real periplasmic space. CEMOVIS has this big advantage of being label free and imaging cells in their presumed native state.

      Thanks for pointing out these beautiful data that we had overlooked. We have added a few sentences and references in the Discussion of V2.

      L201: References are not numbered.

      Corrected in V2.

      L271/L892: Change section title. 'Evolution' can have multiple meanings. It would be more clear to write something like 'Increased TA chain length in stationary phase cells' or something like that.

      Changed in V2.

      L275: harvested

      Corrected in V2.

      L329: add, as suggested shown previously (I guess refs 24 and 29)

      Reference to Hess et al. 2017 has been added in V2. A sentence and further references to Flores-Kim, 2019, 2022 and Wu et al. 2014 were added at the end of the discussion with respect to the LTA-like signal observed in these studies of ∆tacL strains.

      L337: I think a concluding sentence is warranted here. These experiments demonstrate that membrane-bound TA precursors accumulate on the outside of the membrane, and are likely polymerized on the outside as well, in line with the model proposed in ref. 20.

      From the point of view of formal logic, the accumulation of membrane-bound TA precursors on the outer face of the membrane does not prove that they were assembled there. They could still be polymerized inside and translocated immediately. However, since this is extremely unlikely for the reasons discussed by Gibson and Veening, we have added a mild conclusion sentence and the reference in V2.

      L343: How accurate are these quantifications? Just by looking at the gel, it seems there is much less WTA in the lytR mutant than 50% of the wild type?

      Yes, the 51% value was a calculation error. This was changed to 41%. Likewise, the decrease of the WTA amount relative to LTA was corrected to 5- to 7-fold.

      Apart from the titration of TA in the WT strain, we haven’t yet carried out a careful quantification neither of TA nor of the LTA/WTA ratio in different strains and conditions, although we intend to do so in the near future using the method presented here.

      However, to better substantiate our statement regarding the ∆lytR strain, we have quantified two experiments of growth in C-medium with azido-choline, and two experiments of pulse labeling in BHI medium. The results are presented in the additional supplementary Fig. S14.

      L342: although WTA are less abundant and LTA appear to be longer (Fig. 6A). although WTA are less abundant and LTA appear to be longer (Fig. 6A), in line with a previous report showing that LytR the major enzyme mediating the final step in WTA formation (ref. 21). (or something like that). Perhaps better is to start this paragraph differently. For instance: Previous work showed that LytR is the major enzyme mediating the final step in WTA formation (ref. 21). As shown in Fig. 6A, the proportion of WTA significantly decreased in the lytR mutant. However, there was still significant WTA present indicating that perhaps another LCP protein can also produce WTA.

      Changed in V2.

      Of note, WTA levels would be a lot lower in encapsulated strains as used in Ref. 21 (assuming WTA and capsule compete for the same linkage on PG). So perhaps it would be hard to detect any residual WTA in a encapsulated lytR mutant?

      Investigation of the relationship between TA and capsule incorporation or O-acetylation is definitely a future area of study using this method of TA monitoring.

      L371: see my comments related to L131. Some TEM images clearly show the presence of a periplasmic space.

      Comments and references have been added in V2.

      L402: It would be really interesting to perform these experiments on a wild type encapsulated strain. Would these have much more LTA? (I understand you cannot do these experiments perhaps due to biosafety, but it might be interesting to discuss).

      Yes. It would be interesting to compare the TA in D39 and D39 ∆cps strains. We have added this perspective at the end of the discussion in V2.

      L418: ref lacks number

      Corrected in V2.

      L423: refs missing.

      References added in V2.

      L487: See my comments regarding L46. I do not see one valid point in the current paper why underestimating LTA levels would change any of the conclusions drawn in Ref. 21. I do not know the other papers cited well enough, but it seems highly unlikely that their conclusions would be wrong by systematically underestimating LTA levels. As far as I understand it, this current work basically confirms the major conclusions drawn by these 'doubtful' papers (that TacL makes LTA and LytR is the main WTA producer). As such, I find this sentence highly unfair without precisely specifying what the exact doubts are. Sure, this current paper now shows that probably people have discarded unknowingly LTA and therefore underestimated LTA levels, so any quantitative assessment of LTA levels are probably wrong. That is one thing. But to say this casts doubts on these studies is very serious and unfair (unless the authors provide good arguments to support these serious claims).

      Yes indeed. The sentence was rephrased to be strictly factual in V2.

      Table 2: I assume these strains are delta cps? Would be relevant to list this genotype.

      The Table 2 was completed in V2.

      The authors should comment on why the mutants have not been complemented, especially for lytR as it's the last gene in a complex operon. It would be great to see WTA levels being restored by ectopic expression of LytR.

      Yes. We think this could be part of an in-depth study of the attachment of WTA, together with the investigation of the other LCP phosphotransferases.

    1. Author response:

      The following is the authors’ response to the previous reviews

      Joint Public Review:

      Summary:

      The behavioral switch between foraging and mating is important for resource allocation in insects. This study characterizes the role of sulfakinin and the sulfakinin receptor 1 in changes in olfactory responses associated with foraging versus mating behavior in the oriental fruit fly (Bactrocera dorsalis), a significant agricultural pest. This pathway regulates food consumption and mating receptivity in other species; here the authors use genetic disruption of sulfakinin and sulfakinin receptor 1 to provide strong evidence that changes in sulfakinin signaling modulate antennal responses to food versus pheromonal cues and alter the expression of ORs that detect relevant stimuli.

      Strengths:

      The authors utilize multiple complementary approaches including CRISPR/Cas9 mutagenesis, behavioral characterization, electroantennograms, RNA sequencing and heterologous expression to convincingly demonstrate the involvement of the sulfakinin pathway in the switch between foraging and mating behaviors. The use of both sulfakinin peptide and receptor mutants is a strength of the study and implicates specific signaling actors.

      Weaknesses:

      The authors demonstrate that SKR is expressed in olfactory neurons, however there are additional potential sites of action that may contribute to these results.

      Recommendations for the authors:

      The authors have addressed most of the issues raised by the reviewers. Below are a few outstanding issues.

      (1) Lines 68-69 describe "control of B. dorsalis include the use of the behavioral responses to semiochemicals" but does not describe what these responses are or how behavior is modulated.

      The sentence was revised as “Control of B. dorsalis include the use of the reproductive and feeding behavioral responses to semiochemicals” (lines 69 in the revision).

      (2) Statistical analysis for 9 hour starved females at 5 minutes is missing in Figure 1D and S1.

      We had added statistical analysis for 9 hour starved females at 5 minutes in the revised Figures 1D and S1, respectively (lines 578).

      (3) The legend in Figure S2 should be revised as it is not clear from the figure which of the odors are food associated odors.

      As suggested, we added food odor label in the revised Figure S2 (lines 666).

      (4) Line 167: "Therefore, the upregulated OR genes in starved WT flies, OR7a.4, OR7a.8 and OR10a, were activated by the pheromonal components, while down regulated genes, OR49a and OR63a, were activated by food volatiles." Based on the data, this sentence is incorrect - Therefore, the upregulated OR genes in starved WT flies, OR7a.4, OR7a.8 and OR10a, were activated by the food components, whereas downregulated genes, OR49a and OR63a, were activated by pheromonal components."

      We are sorry for our mistake. We had corrected it (lines 168-169).

      (5) Line 192: "The coordinated action of sulfakinin on mutiple downstreams,..." should be revised to "downstream pathways or tissues" or simply removing "multiple downstream".

      As suggested, we removed “multiple downstream”. See line 192.

      (6) Reference formatting is inconsistent: see line 207 vs line 208.

      We had corrected it as “(Wu et al., 2019)” (lines 207). 

      (7) Lines 241-244 The broad discussion regarding the evolution and ancestral function of CCK here and the phylogeny in Figure S6 are peripheral to the authors claims.

      As suggested, we removed the section and the Figure S6 in the revision.

    1. Author response:

      The following is the authors’ response to the original reviews

      Public Reviews:

      Reviewer #1 (Public Review):

      Summary:

      This research article by Nath et al. from the Lee Lab addresses how lipolysis under starvation is achieved by a transient receptor potential channel, TRPγ, in the neuroendocrine neurons to help animals survive prolonged starvation. Through a series of genetic analyses, the authors identify that TRPγ mutations specifically lead to a failure in lipolytic processes under starvation, thereby reducing animals' starvation resistance. The conclusion was confirmed through total triacylglycerol levels in the animals and lipid droplet staining in the fat bodies. This study highlights the importance of transient receptor potential (TRP) channels in the fly brain to modulate energy homeostasis and combat metabolic stress. While the data is compelling and the message is easy to follow, several aspects require further clarification to improve the interpretation of the research and its visibility in the field.

      Strengths:

      This study identifies the biological meaning of TRPγ in promoting lipolysis during starvation, advancing our knowledge about TRP channels and the neural mechanisms to combat metabolic stress. Furthermore, this study demonstrates the potential of the TRP channel as a target to develop new therapeutic strategies for human metabolic disorders by showing that metformin and AMPK pathways are involved in its function in lipid metabolisms during starvation in Drosophila.

      Weaknesses:

      Some key results that might strengthen their conclusions were left out for discussion or careful explanation (see below). If the authors could improve the writing to address their findings and connect their findings with conclusions, the research would be much more appreciated and have a higher impact in the field.

      Here, I listed the major issues and suggestions for the authors to improve their manuscript:

      (1) Are the increased lipid droplet size and the upregulated total TAG level measured in the starved or sated mutant in Figure 1? This information might be crucial for readers to understand the physiological function of TRP in lipid metabolism. In other words, clarifying whether the upregulated lipid storage is observed only in the starved trp mutant will advance our knowledge of TRPγ. If the increase of total TAG level is only observed in the starved animals, TRP in the Dh44 neurons might serve as a sensor for the starvation state required to promote lipolysis in starvation conditions. On the other hand, if the total TAG level increases in both starved and sated animals, activation of Dh44 through TRPγ might be involved in the lipid metabolism process after food ingestion.

      We measured total TAG level in Figure 1 and LD sizes in Figure 2 under sated condition. We inserted “under sated condition” to clarify it. lines 97 and 147-148.

      Thanks for your suggestions.

      (2) It is unclear how AMPK activation in Dh44 neurons reduces the total triacylglycerol (TAG) levels in the animals (Figure 3G). As AMPK is activated in response to metabolic stress, the result in Figure 3G might suggest that Dh44 neurons sense metabolic stress through AMPK activation to promote lipolysis in other tissues. Do Dh44 neurons become more active during starvation? Is activation of Dh44 neurons sufficient to activate AMPK in the Dh44 neurons without starvation? Is activation of AMPK in the Dh44 neurons required for Dh44 release and lipolysis during starvation? These answers would provide more insights into the conclusion in Lines 192-193.

      In our previous study, we demonstrated that trpγ mutants exhibited lower levels of glucose, trehalose and glycogen level (Dhakal et al. 2022), and in the current study, we observed excessive lipid storage in the trpγ mutant, indicating imbalanced energy homeostasis. Given the established role of AMPK in maintaining energy balance (Marzano et. al., 2021, Lin et al 2021), we employed the activated form of AMPK (UAS-AMPK<sup>TD</sup>) in our experiments. Our result showed that expression of activated AMPK in Dh44 neurons led to a reduction in total TAG levels, suggesting that AMPK activation in these neurons can promote lipolysis even in the absence of starvation. Regarding the activation of Dh44 neurons, Dus et al in 2015 reported that Dh44 cells in the brain are activated by nutritive sugars especially in starvation conditions. In addition, another report showed a role of Dh44 neuron in regulating starvation induced sleep suppression (Oh et. al., 2023) which may imply that these neurons become more active under starved conditions. We did not directly assess whether Dh44 neuron activity increases during starvation or whether AMPK activation in these neurons is required for DH44 release and subsequent lipolysis, our finding support the notion that AMPK activation in Dh44 neuron is sufficient to reduce TAG levels, potentially by metabolic stress response typically observed during starvation. We explained it like the following: “Dh44 neurons regulate starvation-induced sleep suppression (Oh et. al., 2023), which implies that these neurons become more active under starved conditions.” lines 190-191.

      (3) It is unclear how the lipolytic gene brummer is further downregulated in the trpγ mutant during starvation while brummer is upregulated in the control group (Figure 6A). This result implies that the trpγ mutant was able to sense the starvation state but responded abnormally by inhibiting the lipolytic process rather than promoting lipolysis, which makes it more susceptible to starvation (Figure 3B).

      Thanks for your suggestions. We explained it like the following: “The data indicates that the trpg mutant can sense the starvation state but responds abnormally by suppressing lipolysis instead of activating it. This dysregulated lipolytic response likely increases the mutant's vulnerability to starvation, as it cannot effectively mobilize lipid stores for energy during periods of nutrient deprivation.” lines 251-254.

      (4) There is an inconsistency of total TAG levels and the lipid droplet size observed in the Dh44 mutant but not in the Dh44-R2 mutant (Figures 7A and 7F). This inconsistency raises a possibility that the signaling pathway from Dh44 release to its receptor Dh44-R2 only accounts for part of the lipid metabolic process under starvation. Adding discussion to address this inconsistency may be helpful for readers to appreciate the finding.

      Thanks for your suggestion. We included the following in the Discussion: “There is an inconsistency of total TAG levels and the LD size observed in the Dh44 mutant. This inconsistency raises a possibility that the signaling pathway from DH44 release to its receptor DH44R2 only accounts for part of the lipid metabolic process under starvation. While Dh44 mutant flies displayed normal internal TAG levels, Dh44R2 mutant flies exhibited elevated TAG levels. This suggested that the lipolysis phenotype could be facilitated by a neuropeptide other than DH44. Alternatively, a DH44 neuropeptide-independent pathway could mediate the lipolysis.” lines 429-436.

      Reviewer #2 (Public Review):

      Summary:

      In this paper, the function of trpγ in lipid metabolism was investigated. The authors found that lipid accumulation levels were increased in trpγ mutants and remained high during starvation; the increased TAG levels in trpγ mutants were restored by the expression of active AMPK in DH44 neurons and oral administration of the anti-diabetic drug metformin. Furthermore, oral administration of lipase, TAG, and free fatty acids effectively restored the survival of trpγ mutants under starvation conditions. These results indicate that TRPv plays an important role in the maintenance of systemic lipid levels through the proper expression of lipase. Furthermore, authors have shown that this function is mediated by DH44R2. This study provides an interesting finding in that the neuropeptide DH44 released from the brain regulates lipid metabolism through a brain-gut axis, acting on the receptor DH44R2 presumably expressed in gut cells.

      Strengths:

      Using Drosophila genetics, careful analysis of which cells express trpγ regulates lipid metabolism is performed in this study. The study supports its conclusions from various angles, including not only TAG levels, but also fat droplet staining and survival rate under starved conditions, and oral administration of substances involved in lipid metabolism.

      Weaknesses:

      Lipid metabolism in the gut of DH44R2-expressing cells should be investigated for a better understanding of the mechanism. Fat accumulation in the gut is not mechanistically linked with fat accumulation in the fat body. The function of lipase in the gut (esp. R2 region) should be addressed, e.g. by manipulating gut-lipases such as magro or Lip3 in the gut in the contest of trpγ mutant. Also, it is not clarified which cell types in the gut DH44R2 is expressed. The study also mentioned only in the text that bmm expression in the gut cannot restore lipid droplet enlargement in the fat body, but this result might be presented as a figure.

      We appreciate the reviewer’s insightful suggestions. Unfortunately, due to the unviability of the reagent (UAS-Lip3), we were unable to manipulate gut lipase in trpy mutants as proposed. However, we additionally performed immunostaining to examine the co-expression of trpγ and Dh44R2 in the gut, and our results indicate that both trpγ and Dh44R2 are co-expressed in the R2 region of the gut (Figure 7O and P). Furthermore, we have updated our figures to address the point that bmm expression in the gut does not restore lipid droplet enlargement in the fat body, with the revised version (Figure 5I and J).

      Reviewer #3 (Public Review):

      In this manuscript, the authors demonstrated the significance of the TRPγ channel in regulating internal TAG levels. They found high TAG levels in TRPγ mutant, which was ascribed to a deficit in the lipolysis process due to the downregulation of brummer (bmm). It was notable that the expression of TRPγ in DH44+ PI neurons, but not dILP2+ neurons, in the brain restored the internal TAG levels and that the knockdown of TRPγ in DH44+ PI neurons resulted in an increase in TAG levels. These results suggested a non-cell autonomous effect of Dh44+PI neurons. Additionally, the expression of the TRPγ channel in Dh44 R2-expressing cells restored the internal TAG levels. The authors, however, did not provide an explanation of how TRPγ might function in both presynaptic and postsynaptic cells in the non-cell autonomous manner to regulate the TAG storage. The authors further determined the effect of TRPγ mutation on the size of lipid droplets (LD) and the lifespan and found that TRPγ mutation caused an increase in the size of LD and a decrease in the lifespan, which were reverted by feeding lipase and metformin. These were creative endeavors, I thought. The finding that DH44+ PI neurons have non-cell autonomous functions in regulating bodily metabolism (mainly sugar/lipid) in addition to directing sugar nutrient sensing and consumption is likely correct, but the paper has many loose ends. I would like to see a revision that includes more experiments to tighten up the findings and appropriate interpretations of the results.

      (1) The authors need to provide interpretations or speculations as to how DH44+ PI neurons have non-cell autonomous functions in regulating the internal TAG stores, and how both presynaptic DH44 neurons and postsynaptic DH44 R2 neurons require TRPγ for lipid homeostasis.

      In Discussion, we had mentioned our previous finding. “ We previously proposed that TRPg holds DH44 neurons in a state of afterdepolarization, thus reducing firing rates by inactivating voltage-gated Na+ channels (Dhakal et al., 2022). At the physiological level, this induces the consistent release of DH44 and depletion of DH44 stores, resulting in nutrient utilization and storage malfunctions.”

      We also included the following: “TRPg in DH44 neurons may influence the release of metabolic signals or hormones that act on postsynaptic DH44R2 cells. These postsynaptic cells could, in turn, modulate lipid storage and metabolism in a non-cell autonomous manner. However, the mechanism by which TRPg functions in DH44R2 cells remains unclear. One possible explanation is that TRPg in the gut may be activated by stretch or osmolarity (Akitake et al. 2015).” lines 439-440.

      This interaction between presynaptic and postsynaptic cells may ensure a coordinated response to metabolic changes and maintain lipid homeostasis. Thus, both Dh44-expressing and Dh44-R2-expressing cells are crucial for the proper functioning of TRPγ in regulating internal TAG levels and lipid storage.

      (2) The expression of TRPγ solely in DH44 R2 neurons of TRPγ mutant flies restored the TAG phenotype, suggesting an important function mediated by TRPγ in DH44 R2 neurons. However, the authors did not document the endogenous expression of TRPγ in the DH44R2+ gut cells. This needs to be shown.

      We appreciate the reviewer’s suggestion. To address this, we performed immunostaining to examine the expression of TRPγ in the DH44R2+ gut cells. Our results, as shown in Figure 7 O and P, confirm that TRPγ is co-expressed in the Dh44R2+ cells in the gut. We also found that Dh44R2 is expressed in the brain as well. We documented this part like the following: “Given that Dh44R2 is predominantly expressed in the intestine, we performed immunostaining to examine whether Dh44R2 co-localizes with trpg in gut cells. Our results confirmed that Dh44R2 and trpg are co-expressed in intestinal cells (Figure 7O and P). Additionally, we analyzed Dh44R2 expression in the brain and found that two Dh44R2-expressing cells are co-localized with Dh44-expressing cells in the PI region (Figure 7Q). To further delineate whether Dh44R2-mediated fat utilization is specific to the brain, gut, or fat body, we knocked down Dh44R2<sup>RNAi</sup> using Dh44-GAL4, myo1A-GAL4, and cg-GAL4, respectively (Figure 7–figure supplement 1E). Notably, knockdown of Dh44R2 with Myo1A-GAL4 resulted in elevated TAG levels, indicating that DH44R2 activity in lipid metabolism is specific to the gut.” lines 375-384.

      (3) While Dh44 mutant flies displayed normal internal TAG levels, Dh44R2 mutant flies exhibited elevated TAG levels (Figure 7A). This suggested that the lipolysis phenotype could be facilitated by a neuropeptide other than Dh44. Alternatively, a Dh44 neuropeptide-independent pathway could mediate the lipolysis. In either case, an additional result is needed to substantiate either one of the hypotheses.

      The Dh44 mutant flies exhibited normal TAG levels, whereas Dh44R2 mutant flies showed elevated TAG levels. However, when we examined the lipid droplets in the fat body, both Dh44 mutant and Dh44R2 mutant flies displayed larger lipid droplets, indicating a disruption in lipid metabolism. Additionally, we assessed starvation survival time and found that both Dh44 and Dh44R2 mutant flies exhibited reduced survival under starvation conditions compared to controls. Supplementation with lipase (Figure 7–figure supplement 1A), glycerol (Figure 7–figure supplement 1B), hexanoic acid (Figure 7–figure supplement 1C), and mixed TAGs (Figure 7–figure supplement 1D) improved starvation survival time, further supporting that the lipid metabolism pathway was impaired in both mutants. These observations highlight the role of Dh44 in regulating lipolysis. We included related Discussion: “There is an inconsistency of total TAG levels and the LD size observed in the Dh44 mutant. This inconsistency raises a possibility that the signaling pathway from DH44 release to its receptor DH44R2 only accounts for part of the lipid metabolic process under starvation. While Dh44 mutant flies displayed normal internal TAG levels, Dh44R2 mutant flies exhibited elevated TAG levels. This suggested that the lipolysis phenotype could be facilitated by a neuropeptide other than DH44. Alternatively, a DH44 neuropeptide-independent pathway could mediate the lipolysis.” lines 429-436.

      (4) While the authors observed an increased area of fat body lipid droplets (LD) in Dh44 mutant flies (Figure 7F), they did not specify the particular region of the fat body chosen for measuring the LD area.

      We have chosen the 2-3 segment in the abdomen for all fat body images, which we already mentioned in Nile red staining in the Method section line 630-631.

      (5) The LD area only accounts for TAG levels in the fat body, whereas TAG can be found in many other body parts, including the R2 area as demonstrated in Figure 5A-D using Nile red staining. As such, measuring the total internal TAG levels would provide a more accurate representation of TAG levels than the average fat body LD area.

      We have measured total internal TAG level in whole body throughout the experiments (Figure 1F, 2C, 2E, 3C, 3G, 4A, 4B, 7A, 7I, and many Supplementary Figures) except bmm expression using GAL4/UAS system. Now we include this new data in Figure 5–figure supplement 1) which is the same conclusion with LD analysis.

      (6) In Figure 5F-I, the authors should perform the similar experiment with Dh44, Dh44R1, and Dh44R2 mutant flies.

      We did the experiments with Dh44, Dh44R1, and Dh44R2 mutant flies and we found that Dh44 and Dh44R2 mutant flies showed reduced starvation survival time than control and which was increased after supplementation of lipase, glycerol, hexanoic acid and TAG (Figure 7– figure supplement 1A–D). lines 361-372.

      (7) The representative image in Figure 6B does not correspond to the GFP quantification results shown in Figure 6C. In trpr1;bmm::GFP flies, the GFP signal appears stronger in starved conditions than in satiated conditions.

      We updated it with new images. We quantified GFP intensity level using image J and found that GFP intensity level was significantly lower in starved condition in trpγ<sup>1</sup>;bmm::GFP flies than sated condition.

      (8) In Figure 6H-I, fat body-specific expression of bmm reversed the increased LD area in TRPγ mutants. The authors also showed that Dh44+PI neuron-specific expression of bmm yielded a similar result. The authors need to provide an interpretation as to how bmm acts in the fat body or DH44 neurons to regulate this.

      We first inserted the following in results: “Furthermore, the expression of bmm in the fat body, as well as Dh44 neurons in the PI region, can promote lipolysis at the systemic level.” lines 276-277.

      Additionally, we discussed it in the Discussion: “Brummer lipase is essential for regulating lipid levels in the insect fat body by mediating lipid mobilization and energy homeostasis. In Nilaparvata lugens, it facilitates triglyceride breakdown (Lu et al., 2018), while studies in Drosophila show that reduced Brummer lipase expression decreases fatty acids and increases diacylglycerol levels, highlighting its role in lipid metabolism (Nazario-Yepiz et al., 2021). Here, we additionally demonstrate that bmm expression in DH44 neurons within the PI region can systemically regulate TAG levels. Cell signaling or energy status in DH44 neurons may contribute to hormonal release that targets organs such as the fat body.” lines 451-459.

      (9) The authors should explain why the DH44 R1 mutant did not represent similar results as the wild type.

      We added “In addition, bmm levels in Dh44R1<sup>Mi</sup> under starved condition did not increase as significantly as in the control. This suggests a unique role of DH44 and its receptors in regulating lipid metabolism and response to nutritional status in Drosophila.” lines 358-360.

      (10) It would be good to have a schematic that represents the working model proposed in this manuscript.

      We updated the schematic model in revised version (Figure 8).

      Recommendations for the authors:

      Reviewing Editor (Recommendations For The Authors):

      This paper characterized the function of trpγ in Dh44-expressing PI neurons for lipid metabolism and lipolysis induced by prolonged starvation. The authors applied a series of lipolytic genetic manipulation and lipid/lipid metabolism supplements to rescue the trpγ deficits in lipolysis: the expression of active AMPK in the DH44-expressing PI neurons or brummer, a lipolytic gene, in the trpγ-expressing cells, and oral administration of the anti-diabetic drug metformin, lipase, TAG and free fatty acids. Despite this exhaustive characterization of the defective lipolysis in the trpγ mutants, there remain puzzles in inconsistent defects of Dh44 and DH44R2 in the total TAG levels and in the expression and functions of the receptor in the gut. Clarification of these points and other issues raised by the reviewers should improve the mechanisms of lipid metabolism through Dh44 signalling.

      Reviewer #1 (Recommendations For The Authors):

      (1) It might be worth introducing Dh44 in the introduction section as it is unclear to readers how the authors hypothesized the site-of-action of TRPγ in Dh44 neurons for lipid metabolism after reading the introduction.

      We introduced the following: “We found that TRPg expression in Dh44 neuroendocrine cells in the brain is critical for maintaining normal carbohydrate levels in tissues (Dhakal et al. 2022). Building on this, we hypothesized that TRPg in Dh44 cells also regulates lipid and protein homeostasis.” lines 69-71.

      (2) Providing a summary model in the end to integrate the present findings and their previous publication about TRPγ functions in Drosophila sugar selection would greatly help readers understand and appreciate the general role of TRPγ in balancing energy homeostasis.

      We made a schematic model in Figure 8.

      (3) Swapping the order of Figures 5 and 6 might be a better way to tell the story without logic gaps. The results addressing the mechanisms of metformin and TRPγ in promoting lipolysis under starvation are interrupted by the lipid storage data in the R2 cells in the current Figure 5A-5E. In addition, presenting Figure 5A-5E before or together with Figure 7 will help readers appreciate the expression of Dh44-R2 and its function in regulating lipid metabolism in Figure 7.

      We did.

      (4) It might be misleading to use the word "sated" for the condition of 5-hour mild starvation. The word "mild starvation" or the equivalents might be a better word choice.

      We appreciate the reviewer’s concern. As hemolymph sugar level does not drop down significantly in 5 hr starvation, the previous papers (Dus et al 2015, Dhakal et al 2022) indicated it as sated condition. To use the word consistently, we prefer using “sated” instead of “mild starvation”.

      (5) It is unclear what the white arrows are pointing at in Figures 7O and 7P. Some of those seem to be non-specific signals, so it is hard to connect the figure to the conclusion in Lines 351-353. It would be helpful to add some explanations to help readers interpret Figures 7O and 7P.

      In the previous version, Figure 7O and 7P white arrows represented the expression of Dh44R2 in the SEZ region of the brain and R2 region of the gut. In revised version, to make clear, we performed additional immunostaining for the co-expression of trpγ and Dh44R2 in the gut. We found that trpγ and Dh44R2 co-expressed at the R2 region of the gut specifically (Figure 7O and P). Similarly, we found that two cells of Dh44R2 co-expressed in Dh44 cells in the PI region of the brain (now Figure 7Q). We updated this part. lines 375-380.

      (6) The figure legend for the (G) panel in Figure 2-figure Supplement 1 was mislabeled as (F).

      We corrected it.

      (7) In Line 85, the authors might want to write "… among these mutants, only trpγ mutant displayed reduced carbohydrate levels, suggesting …". Please confirm the information for the sentence. lines 87-88.

      We clarified it.

      Reviewer #2 (Recommendations For The Authors):

      (1) The trpγ[G4] would be difficult for non-Drosophila researchers to understand; it would be better to use trpγ-Gal4.

      We got the mutant line from Dr. Craig Montell who named it. We explained it like the following in the main text: “controlled by GAL4 knocked into the trpg locus (trpg<sup>G4</sup> flies; +)” line 109.

      (2) The arrows in Figures 7O and 7P need to be explained in the figure legends.

      We did.

      Reviewer #3 (Recommendations For The Authors):

      (11) Lines 95-96 should have a reference.

      We did.

      (12) Lines 129-130: It should read "TRPγ expressed in DH44 cells is sufficient for the regulation of lipid levels."

      We changed it as suggested.

      (13) Figure 5E needs to be repeated with more trials.

      We increased the n numbers. Previously (Figure 5E) we included area of 10 LDs from 3 samples, and in revised figure (Figure 6I) we have included 28 LDs from 10 samples.

      (14) Figures 5F-I, bold lines are not too visible and therefore, dotted lines could be used.

      We changed it as suggested.

      (15) Line 356: It is not true that D-trehalose or D-fructose is commonly detected by DH44 neurons. These sugars at concentrations much higher than the physiological concentration range stimulate DH44 neurons (see Dus et al., 2015).

      We removed it.

      (16) Lines 362-363: It should read "Expression of TRPγ in DH44 neurons was necessary and sufficient to regulate the carbohydrate and lipid levels.".

      We changed it.

      (17) Lines 369-370: The authors need to consider removing the possible role of CRF in regulating lipid homeostasis. It could be considered to be far-fetched.

      We removed it.

      (18) Line 407-408: the sentence "Nevertheless, it is also known that DH44 neurons mediate the influence of dietary amino acids on promoting food intakes in flies (37)" needs to be removed. They used amino acid concentrations that were far greater than the physiological levels observed in the internal milieu of flies. Still, many laboratories cannot reproduce the result of using the high AA concentrations.

      We removed it.

    1. Author response:

      The following is the authors’ response to the original reviews

      Public Reviews:

      Reviewer #1 (public review): 

      This manuscript presents SAVEMONEY, a computational tool designed to enhance the utilization of Oxford Nanopore Technologies (ONT) long-read sequencing for the design and analysis of plasmid sequencing experiments. In the past few years, with the improvement in both sequencing length and accuracy, ONT sequencing is being rapidly extended to almost all omics analyses which are dominated by short-read sequencing (e.g., Illumina). However, relatively higher sequencing errors of long-read sequencing techniques including PacBio and ONT is still a major obstacle for plasmid/clone-based sequencing service that aims to achieve single base/nucleotide accuracy. This work provides a guideline for sequencing multiple plasmids together using the same ONT run without molecular barcoding, followed by data deconvolution. The whole algorithm framework is well-designed, and some real data and simulation data are utilized to support the conclusions. The tool SAVEMONEY is proposed to target users who have their own ONT sequencers and perform library preparation and sequencing by themselves, rather than relying on commercial services. As we know and discussed by the authors, in the real world, to ensure accuracy, the researchers will routinely pick up multiple colonies in the same plasmid construction and submit for Sanger sequencing. However, SAVEMONEY is not able to support the simultaneous analysis of multiple colonies in the same run, as compared to the barcoding-based approaches. This is a major limitation in the significance of this work. Encouraging computational ePorts in ONT data debarcoding for mixed-plasmid or even single-cell sequencing would be more valuable in the field. 

      We thank the reviewer for the positive response to our manuscript and the helpful comments.

      The tool SAVEMONEY is proposed to target users who have their own ONT sequencers and perform library preparation and sequencing by themselves, rather than relying on commercial services.

      We apologize that we were not clear enough in the manuscript. Our tool is designed for users who rely on commercial services (i.e., those who cannot include a barcode by themselves). However, it can also benefit those performing library preparation, as SAVEMONEY can be applied after standard barcode-based sequencing and de-multiplexing. The combination of standard barcodes with SAVEMONEY would significantly expands the scope of sequencing applications. For example, it would enable sequencing of more plasmid types than the number of available barcodes and, in some cases, it may even eliminate the need for barcode introduction. Because we do not own ONT equipment and because the primary target audience for the SAVEMONEY algorithm are users without ONT equipment, we were not able to conduct experiments using ONT. However, to clarify these possibilities, we added a dedicated paragraph describing these issues (3rd paragraph in the discussion section).

      However, SAVEMONEY is not able to support the simultaneous analysis of multiple colonies in the same run, as compared to the barcoding-based approaches.

      We agree with the reviewer about this limitation of SAVEMONEY, as it does not allow mixing of plasmids from multiple colonies in the same cloning run. However, that does not necessarily mean that SAVEMONEY cannot reduce sequencing costs in cloning. For example, when sequencing two colonies from each of three diPerent constructs (six plasmids in total), the standard approach would require sequencing costs for six samples. However, with SAVEMONEY, up to three plasmids can be mixed per sample, allowing them to be sequenced as just two samples. As a result, the sequencing cost per plasmid is reduced to one-third. The greatest benefits can be realized when SAVEMONEY is used at the laboratory level or by multiple researchers. To make this point clearer, we have added sentences in the 5th paragraph of the discussion section.

      (1) To provide more comprehensive information for users who care about the cost, the Introduction section should include a cost comparison between Sanger and ONT, with more details, such as diPerent ONT platforms (MinION, PromethION, FlongIe), chemistries (flow cells) and kits. This additional information will be more helpful and informative for the users who have their own sequencers and are the target audience for SAVEMONEY. 

      We thank the reviewer for pointing this out. Since we do not own ONT equipment, we are unable to provide a total cost for using the ONT platform. However, we have included the price per sample (~$15 per plasmid) for the commercial service we have used, as well as the equipment that they employ (V14 chemistry on a PromethION with an R10.4.1 flow cell) and the number of reads obtained per plasmid (~100–1000) in the 4th paragraph of the introduction section.     Though these costs will inevitably change over time, this information should still be helpful for those who own ONT sequencers in estimating the costs.

      (2) In "Overview of the algorithm" (Pages 3-4) under the Results section, instead of stating "However, coverage varies from ~100-1000 and is diPicult to predict because each nanopore flow cell has diPerent properties.", it will be beneficial to provide more detailed information, such as sequencing length, yield/read count per flow cell of diPerent platforms. This information will assist users in designing their own experiments ePectively. 

      We thank the reviewer for the comment. As mentioned in the previous response, we are unable to provide sequencing length, yield/read count per flow cell because we do not own ONT equipment. However, we apologize if it was not clear in "Overview of the algorithm" section that we are discussing the use of results obtained from commercial services, and therefore we need to provide more detailed information about the results from the commercial service. We have now clarified in the sentence pointed out by the reviewr that the numbers are derived from the information provided by commercial sequencing services. In addition, we have also added that typical examples of the result properties, i.e., read length and quality score distribution, can be found in Fig. 2 at the end of the same paragraph.

      (3) While this study optimized and evaluated the tool using a total of 14 plasmids, it may not provide suPicient power to represent the diversity of the plasmid world. Consideration should be given to expanding the dataset to include a broader range of plasmids in future studies to enhance the robustness and generalizability of the tool. 

      We are grateful to the reviewer for their valuable input. It is very reasonable that we had to expect that a larger number of plasmids should be used, even though the main target of SAVEMONEY is those who utilize commercial services. In the previous version of SAVEMONEY, it was not possible to process in a reasonable amount of time if too many plasmids were provided, though the algorithm itself does not have no restrictions based on the number of plasmids. Therefore, we have changed the underlying code to improve the algorithm, making it more than 20 times faster than the previous version (the benchmark time mentioned in the 3rd paragraph of the discussion section was improved to 3.1 minutes from the previous 65 minutes, using the same dataset and the same computer). Additionally, SAVEMONEY is now compatible with multiprocessing. The processing time is expected to decrease approximately inversely proportional to the number of CPU cores used. We have added these updates at the end of the 3rd paragraph in the discussion section.

      (4) If applicable and feasible, including a comparison or benchmark of SAVEMONEY against other similar tools would further strengthen the manuscript. This comparison would allow users to evaluate the advantages and disadvantages of diPerent tools for their specific needs. 

      We thank the reviewer for the suggestion. We have added the benchmark using the similar tool, On-Ramp, with the exact same set of plasmids and FASTQ data used for our benchmark (4th paragraph in the discussion section). Because the machine specifications used in the On-Ramp web server are unknown, a direct comparison is not possible. However, using only laptop-level computational resources, SAVEMONEY was able to process the data 38% faster than On-Ramp. When using mini-PC level computational resources, the processing time was 64% faster than on-RAMP.

      (5) The importance of pre-filtering raw sequencing reads should be emphasized as noisy reads can significantly impact the overall performance of the tool. It is essential to clarify whether any pre-filtering steps were performed in this study, such as filtering based on quality scores, read length, or other relevant factors. 

      We apologize for not being clear. Unfortunately, the commercial sequencing service we used did not provide the information regarding pre-filtering. However, the impact of the quality of pre-filtering based on quality score and read length on the quality of the final results is theoretically minimal in SAVEMONEY. First, during the initial step of the post-analysis, the classification step, short reads compared to the full plasmid length can be excluded based on the user-defined “score_threshold”. Simultaneously, low-quality reads with poor alignment to the plasmid can also be excluded, because “score_threshold” is related to the normalized alignment score. Even if there are low-quality reads that are not excluded at this stage, the ePect can be minimized during the final step of the post-analysis that generates consensus sequences. This is because our Bayesian analysis considers not only the base calling but also the q-scores to determine the consensus. Therefore, we believe the overall impact of pre-filtering on the final results is negligible.

      (6) The statement regarding the number of required reads per plasmid (20-30) and the maximum number of plasmids (up to six) that can be mixed in a single run may become outdated due to the rapid advancements in ONT technology. In the Discussion section, instead of assuming specific numbers, it would be more beneficial to provide information based on the current state of ONT sequencing, such as the number of reads per MinION flow cell that can be produced.

      We thank the reviewer for pointing this out. Because the number of required reads per plasmid depends on the accuracy of each read (i.e., the number of required reads can be reduced if the accuracy increases), we have added the description of these points to the last paragraph of the discussion section.

      Reviewer #2 (public review):  

      The authors developed an algorithm that allows for deconvoluting of plasmid sequences from a mixture of plasmids that have been sequenced by nanopore long read technology. As library preparations and barcoding of individual samples increase sequencing costs, the algorithm bypasses this need and thus decreases time on sample prep and sequencing costs. In the first step, the tool assesses which of the plasmid constructions can be mixed in a single library preparation by calculating a distance matrix between the reference plasmid and the constructions producing sequence clusters. The user is given groups of plasmids, from diPerent clusters, to be pooled together for sequencing. After sequencing, the algorithm deconvolutes the reads by classifying them based on alignments to the reference sequence. A Bayesian analysis approach is used to obtain a consensus sequence and quality scores. 

      Strengths 

      The authors exploit one of the main advantages of long-read sequencing which is to accurately resolve regions of high complexity, as regularly found in plasmids, and developed a tool that can validate plasmid constructions by reducing sequencing costs. Multiple plasmids (up to six) can be analyzed simultaneously in a single library without the need for sample barcoding, also reducing sample preparation time. Although inserts must be diPerent, just 2 bases diPerence would be enough for a correct assignation. It maximizes cost-ePiciency for projects that require large amounts of plasmid constructions and highthroughput validation. 

      We thank the reviewer for the positive response to our manuscript and the helpful comments.

      Weaknesses 

      The method proposed by the authors requires prior knowledge of plasmid sequences (i.e., blueprints or plasmid reference) and is not suitable for small experiments. The plasmid inserts or backbones must be diPerent e.g., multiple colonies from the same plasmid construction ePort cannot be submitted together. 

      As also discussed in the response to reviewer 1, we agree with the reviewer that SAVEMONEY does not allow you the analysis of plasmids from multiple colonies in the same cloning experiment. However, that does not necessarily mean that SAVEMONEY cannot reduce the sequencing cost. For example, when sequencing two colonies from each of three diPerent constructs (six plasmids in total), the standard approach would require sequencing costs for six samples. However, with SAVEMONEY, up to three plasmids can be mixed per sample, allowing them to be sequenced as just two samples. As a result, the sequencing cost per plasmid is reduced to one-third. The greatest benefits can be realized when SAVEMONEY is used at the laboratory level or by multiple researchers. To make this point clearer, we have added sentences in the 5th paragraph of the discussion section.

      The reviewer also expressed concern that SAVEMONEY is not suitable for experiments at a small scale. To put it more precisely, SAVEMONEY cannot be used when the experiment size is minimal, such as in a lab that consistently constructs only a single plasmid at a time. That said, the strength of SAVEMONEY lies in its scalability. Even in labs where plasmid construction is typically limited to one at a time, there may be occasional instances where two or more plasmids are created simultaneously. In such cases, SAVEMONEY can be used to reduce sequencing costs. Moreover, in a typical molecular biology lab where multiple plasmids are constructed every week, SAVEMONEY can be particularly ePective. Given its adaptability and cost-saving potential and widespread use since its initial publication on bioRxiv and on Google Colab, we are confident that SAVEMONEY will continue to be a valuable tool for a wide range of researchers.

      Recommendations For The Authors:

      Reviewer #2 (Recommendations For The Authors): 

      The manucript assumes all samples are sent out for sequencing at a specific company. This could be generalized for a much broader use since many labs now own nanopore sequencers. In turn, the advantage of reducing hands-on sample prep becomes more evident. 

      We thank the reviewer for pointing this out. We agree that SAVEMONEY can also benefit those performing library preparation. Combination of standard barcodes with SAVEMONEY significantly expands the scope of sequencing applications. For example, it enables sequencing of more plasmid types than the number of available barcodes and, in some cases, may even eliminate the need for the sample prep step to introduce barcode. Because we do not own ONT equipment, we could not conduct experiments using ONT. However, to clarify these possibilities, we added a dedicated paragraph (3rd paragraph in the discussion section).

      The base calling model (high accuracy, super accuracy) used by Plasmidsaurus and tested here should be mentioned.  

      We thank the reviewer for the suggestion. The description about the base calling model (HAC) was added in Materials and Methods section.

      Other modifications to the revised manuscript 

      Beyond changes made in response to reviewer comments above, we have also through our continued use and improvement of SAVEMONEY, made additional changes to the algorithm and therefore to the manuscript. Those changes are outlined below. Improvements in the pre-survey step

      (1) The pre-survey algorithm was reduced to a Zero-One Integer Linear Programming Problem to guarantee the optimal combinations, as previous versions did not ensure an optimal solution. Relatedly, the explanation of the algorithm in the main manuscript was updated.

      (2) The algorithm was modified to ensure that the number of plasmids distributed to each group is balanced. A new feature was also added to allow users to specify the number of groups, which is beneficial when balancing between cost and quality.

      (3) An error was corrected in Fig. 2, where the distance calculation method for the hierarchical clustering step for group formation was Farthest Point Algorithm, which calculates distance between two clusters based on the farthest pair of plasmids. The correct method is the Nearest Point Algorithm. This error was present only in Fig. 2, while other implementations, including source code of SAVEMONEY and Google Colab page, were correct from the beginning. We have corrected the error in Fig. 2.

      Modifications in figures, manuscripts, and other aspects

      (1) Fig. 3 was updated to reflect the update of SAVEMONEY, although it did not show any important diPerences.

      (2) Parameter names were updated as follows:

      “threshold (pre)” -> “distance_threshold”

      “threshold (post)” -> “score_threshold” Added “number_of_groups”

      (3) The order of elements was rearranged in Fig. 4.

      (4) Incorrect calculations were fixed in Fig. 4g, h, and i (old Fig. 4d, h, and l). Related to that, Fig. 4j, k, and l and Table 1 were added, in addition to the explanation in the main manuscript.

      (5) SAVEMONEY was packaged and was released on PyPI to facilitate easy installation and integration by other developers.

      (6) SAVEMONEY was updated and expanded to accommodate linear DNA fragments, such as PCR amplicons and long synthetic DNA. Users can select the topology of DNA by specifying that as an option. A description of this new capability was added at the end of “Overview of the algorithm” section.

  2. readium.firebaseapp.com readium.firebaseapp.com
    1. here is an artist. He desires to paint you the dreamiest, shadiest, quietest, most enchanting bit of romantic landscape in all the valley of the Saco. What is the chief element he employs? There stand his trees, each with a hollow trunk, as if a hermit and a crucifix were within; and here sleeps his meadow, and there sleep his cattle; and up from yonder cottage goes a sleepy smoke. Deep into distant woodlands winds a mazy way, reaching to overlapping spurs of mountains bathed in their hill-side blue. But though the picture lies thus tranced, and though this pine-tree shakes down its sighs like leaves upon this shepherd’s head, yet all were vain, unless the shepherd’s eye were fixed upon the magic stream before him. Go visit the Prairies in June, when for scores on score

      test

    1. Author response:

      The following is the authors’ response to the original reviews

      Public Reviews:

      Reviewer #1:

      (…) some concerns with interpretations and technical issues make several major conclusions in this manuscript less rigorous, as explained in detail in comments below. In particular, the two major concerns I have: 1) the contradiction between the strong reduction of global translation, with puromycin incorporation gel showing no detectable protein synthesis in cold, and an apparently large fraction of transcripts whose abundance and translation in Fig. 2A are both strongly increased. 2) The fact that no transcripts were examined for dependance on IRE-1/XBP1 for their induction by cold, except for one transcriptional reporter, and some weaknesses (see below) in data showing activation of IRE-1/XBP-1 pathway. The conclusion for induction of UPR by cold via specific activation of IRE-1/XBP-1 pathway, in my opinion, requires additional experiments.

      Relating to the first point, the results of puromycin incorporation and ribosome profiling are not contradictory. The former shows absolute changes in translation, i.e. changes in how much protein the cell is producing, while the latter shows relative changes between the produced proteins, i.e. how the cell prioritizes its protein production. An observed up-regulation in ribosome profiling does not necessarily mean (but could) that the corresponding protein goes up in absolute terms (units produced per time). Instead, it implies that out of the population of all translating ribosomes, a larger fraction is translating (prioritizing) this particular mRNA relative to other mRNAs. The second point is addressed later in the response.

      Major concerns:

      (1) Fig. 1B shows polysomes still present on day 1 of 4ºC exposure, but the gel in Fig. 1C suggests a complete lack of protein synthesis. Why?

      We realized that the selected gel exposure may give the false impression of a complete lack of puromycin incorporation at 4ºC. To avoid confusion, we now show in Figure 1 – figure supplement 1 the original gel image next to its longer exposure. The quantification of puromycin incorporation remains in Fig. 1C (it is based on 3 biological replicates and only one replicate is shown in the corresponding supplement). We hope it is now clear that there is an ongoing puromycin incorporation/translation at 4ºC, albeit much reduced compared with 20ºC.

      What is then the evidence that ribosomal footprints used in much of the paper as evidence of ongoing active translation are from actual translating rather than still bound to transcripts but stationary ribosomes, considering that cooling to 4ºC is often used to 'freeze' protein complexes and prevent separation of their subunits? The authors should explain whether ribosome profiling as a measure of active translation has been evaluated specifically at 4ºC, or test this experimentally.

      While the ribosomal profiling alone might not prove ongoing translation, the residual puromycin incorporation does (see the longer gel exposure in Figure 1 – figure supplement 1). To strengthen this argument, we selected two additional genes (cebp-1 and numr-1) whose ribosomal footprints increase in the cold, and whose GFP-fusions were available from the CGC. Monitoring their expression, we observed the expected increase in the cold (see Figure 2 – figure supplement 3 A-B). The ongoing translation in the cold is also in line with our previous study (Peke et al., 2022), where we observed de novo protein synthesis of other proteins under the same cooling conditions as in this study.

      They should also provide some evidence (like Western blots) of increases in protein levels for at least some of the strongly cold-upregulated transcripts, like lips-11.

      As explained above, we addressed it by additionally examining two strains expressing GFP-fused proteins, whose translation in the cold is predicted to increase according to our ribosomal profiling data. See the new Figure 2 – figure supplement 3 A-B.

      As puromycin incorporation seems to be the one direct measure of global protein synthesis here, it conflicts with much of the translation data, especially considering that quite a large fraction of transcripts have increased both mRNA levels and ribosome footprints, and thus presumably increased translation at 4ºC, in Fig. 2A.

      We hope the above explanations put this concern to rest.

      Also, it is not clear how quantitation in Fig. 1C relates to the gel shown, the quantitation seems to indicate about 50-60% reduction of the signal, while the gel shows no discernable signal.

      A above, see a longer western blot exposure in Figure 1 – figure supplement 1 and note that the quantification is based on three biological replicates.

      (2) It is striking that plips-11::GFP reporter is induced in day 1 of 4ºC exposure, apparently to the extent that is similar to its induction by a large dose of tunicamycin (Fig. 3 supplement),

      We did not intend to compare the extend of induction between cold and tunicamycin treatment. The tunicamycin experiment was meant to confirm that, as suggested by expression data from Shen et al. 2005, lips-11 is upregulated upon UPR activation.

      …but the three IRE-1 dependent UPR transcripts from Shen 2005 list were not induced at all on day 1 (Fig. 4 supplement). Moreover, the accumulation of the misfolded CPL-1 reporter, that was interpreted as evidence that misfolding may be triggering UPR at 4ºC, was only observed on day 1, when the induction of the three IRE-1 targets is absent, but not on day 3, when it is stronger. How does this agree with the conclusion of UPR activation by cold via IRE-1/XBP-1 pathway?

      In the originally submitted supplemental figure, we compared mRNA levels between day 1 animals at 20ºC versus 4ºC. However, as argued later by this reviewer, it may be better to use day 0 animals at 20ºC as the reference (since at 20ºC the animals will continue producing embryos). Thus, we repeated the RT-qPCR analysis with additional time points (and genes relevant to other comments). This analysis, now in Figure 4 – figure supplement 2, shows that these mRNAs (dnj-27, srp-7, and C36B7.6) increased already at day 1 in the cold compared with the reference 20ºC animals on day 0, and their levels increased further on day 3.

      It is true that the authors do note very little overlap between IRE-1/XBP-1-dependent genes induced by different stress conditions, but for most of this paper, they draw parallels between tunicamycin-induced and cold induced IRE-1/XBP-1 activation.

      We carefully re-examined the manuscript to ensure that we do not draw parallels between cold and tunicamycin treatment. The three genes (dnj-27, srp-7, and C36B7.6) were taken from Shen et al. because that study reported lips-11 as an IRE-1-responsive gene, which we realized thanks to the Wormbase annotation of lips-11. Examining the three genes in our expression data, srp-7 (like lips-11) is also upregulated more than 2-fold, while the other two genes go up but less than 2-fold. As mentioned by the reviewer, we note little overlap between the different stress conditions suggesting that the response is context dependent. Additional differences may arise if, as we hypothesize, UPR is activated in the cold in response to both protein and lipid stress. Note that the 2-fold cutoff used in the previous Figure 7 – figure supplement 1 was (erroneously) on the log2 scale, so showed genes upregulated at least 4-fold. We now corrected it to 2-fold. While there are now a few more overlapping genes, the overall conclusion, that there is little overlap between different conditions, did not change. We now list the shared genes in the new Supplementary file 5.

      The conclusion that "the transcription of some cold-induced genes reflects the activation of unfolded protein response (UPR)..." is based on analysis of only one gene, lips-11. No other genes were examined for IRE-1 dependence of their induction by cold, neither the other 8 genes that are common between the cold-induced genes here and the ER stress/IRE-1- induced in Shen 2005 (Venn diagram in Figure 7 supplement), nor the hsp-4 reporter. What is the evidence that lips-11 is not the only gene whose induction by cold in this paper's dataset depends on IRE-1? This is a major weakness and needs to be addressed.

      Furthermore, whether induction by cold of lips-11 itself is due to IRE1 activation was not tested, only a partial decrease of reporter fluorescence by ire-1 RNAi is shown. A quantitative measure of the change of lips-11 transcript in ire-1 and xbp-1 mutants is needed to establish if it depends on IRE-1/XBP-1 pathway.

      We now examined by RT-qPCR if the induction of the three genes from Shen at al. (dnj-27, srp-7, and C36B7.6), as well as lips-11 and hsp-4 depends on IRE-1. In the new Figure 4 – figure supplement 2, we show that the upregulation of all these genes is reduced in the cold in the ire1 mutant (although in the wild type, the increase of hsp-4 mRNA appeared to be non-significant, despite the observed upregulation of the hsp-4 GFP reporter).

      The authors could provide more information and the additional data for the transcripts upregulated by both ER stress and cold, including the endogenous lips-11 and hsp-4 transcripts: their identity, fold induction by both cold and ER stress, how their induction is ranked in the corresponding datasets (all of these are from existing data), and do they depend on IRE-1/XBP-1 for induction by cold?

      As above, the dependence of endogenous lips-11 and hsp-4 on IRE-1 is now shown in the new Figure 4 – figure supplement 2, and the shared genes from Figure 7 – figure supplement 1 are listed in the new Supplementary file 5. We did not perform additional analysis comparing various data sets, as we felt that understanding the differences between IRE-1-mediated transcription outputs across different conditions goes well beyond this study.

      Without these additional data and considering that the authors did not directly measure the splicing of xbp-1 transcript (see comment for Fig. 3 below), the conclusion that cold induces UPR by specific activation of IRE-1/XBP-1 pathway is premature.

      To address the splicing of endogenous xbp-1, we examined our ribosome profiling data for the translation of spliced xbp-1, and found that the spliced variant is more abundant in the cold. This data is now shown in Figure 3 – figure supplement 2B.

      There are also technical issues that are making it difficult to interpret some of the results, and missing controls that decrease the rigor of conclusions:

      (1) For RNAseq and ribosome occupancy, were the 20ºC day 1 adult animals collected at the same time as the other set was moved to 4ºC, or were they additionally grown at 20ºC for the same length of time as the 4ºC incubations, which would make them day 2 adults or older at the time of analysis? This information is only given for SUnSET: "animals were cultivated for 1 or 3 additional days at 4ºC or 20ºC".

      In the RNAseq experiments, the 20ºC animals were collected at the same time as the others were moved to 10ºC (and then 4ºC), so they were not additionally grown at 20ºC. We make it now clear in Methods.

      This could be a major concern in interpreting translation data: First, the inducibility of both UPR and HSR in worms is lost at exactly this transition, from day 1 to day 2 or 3 adults, depending on the reporting lab (for example Taylor and Dillin 2013, Labbadia and Morimoto, 2015, De-Souza et al 2022).

      As explained above, the 20ºC animals were collected at the same time as the others were moved to 4ºC. Then, we reported before that ageing appears to be suppressed in animals incubated at 4ºC (Habacher et al., 2016; Figure S1C). Thus, it terms of their biological age, cold-incubated animals appear to be closer to the 20ºC animals at the time they are moved to the cold (day 0). Thus, the ageing-associated deterioration in UPR inducibility mentioned above presumably does not apply to cold-incubated animals, which is in line with the observed IRE-1-dependent upregulation of several genes in day 3 animals at 4ºC.

      How do authors account for this? Would results with reporter induction, or induction of IRE-1 target genes in Fig. 4, change if day 1 adults were used for 20ºC?

      Our analysis in Figure 4 – figure supplement 2 now includes 20ºC animals at day 0, 1, and 3.

      Second, if animals at the time of shift to 4ºC were only beginning their reproduction, they will presumably not develop further during hibernation, while an additional day at 20ºC will bring them to the full reproductive capacity. Did 4ºC and 20ºC animals used for RNAseq and ribosome occupancy have similar numbers of embryos, and were the embryos at similar stages?

      As explained above, the reference animals at 20ºC were young adults containing few embryos. Indeed, at 4ºC the animals do not accumulate embryos. Although we cannot say that for all genes, note that the genes analysed in Figure 4 – figure supplement 2 increase in abundance also when compared with the day 3 animals kept at 20ºC.

      (2) Second, no population density is given for most of the experiments, despite the known strong effects of crowding (high pheromone) on C. elegans growth. From the only two specifics that are given, it seems that very different population sizes were used: for example, 150 L1s were used in survival assay, while 12,000 L1s in SUnSET. Have the authors compared results they got at high population densities with what would happen when animals are grown in uncrowded plates? At least a baseline comparison in the beginning should have been done.

      None of the experiments involved crowded populations. In the SUnSET experiments, we just used larger and more plates to obtain sufficient material.

      (3) Fig. 3: it is unclear why the accepted and well characterized quantitative measure of IRE1 activation, the splicing of xbp-1transcript, is not determined directly by RT-PCR. The fluorescent XBP-1spliced reporter, to my knowledge, has not been tested for its quantitative nature and thus its use here is insufficient. Furthermore, the image of this fluorescent reporter in Fig. 3b shows only one anterior-most row of cells of intestine, and quantitation was done with 2 to 5 nuclei per animal, while lips-11 is induced in entire intestine. Was there spliced XBP-1 in the rest of the intestinal nuclei? Could the authors show/quantify the entire animal (20 intestinal cells) rather than one or two rows of cells?

      As explained above, we now included the analysis of xbp-1 splicing in Figure 3 – figure supplement 2B. As for the fluorescent reporter, it is difficult to measure all gut nuclei since part of the gut is occluded by the gonad. Nonetheless, we do see induction of the reporter in other gut nuclei and show now additional examples from midgut in Figure 3 – figure supplement 2A.  

      (4) The differences in the outcomes from this study and the previous one (Dudkevich 2022) that used 15ºC to 2ºC cooling approach are puzzling, as they would suggest two quite different IRE-1 dependent programs of cold tolerance. It would be good if authors commented on overlapping/non-overlapping genes, and provided their thoughts on the origin of these differences considering the small difference in temperatures.

      Indeed, there seem to be substantial differences between different temperatures and cooling paradigms. While understanding the C. elegans responses to cold is still in its infancy, one possible explanation for the observed differences is that we used different starting growth temperatures. While the initial populations in our study were grown at 20ºC, Dudkevich et al. used 15ºC. Worms display profound physiological differences between these two temperatures. For example, Xiao et al. (2013) showed that the cold-sensitive TRPA-1 channel is important at 15ºC but not 20ºC. Thus, the trajectories along which worms adapt to near freezing temperature may vary depending on their initial physiological state (and perhaps the target temperature, as we used 4ºC and they 2ºC). We now expanded argumentation on this topic in Discussion. I should also say that we planned on testing NLP-3 function in our paradigm, but our request for strains remained unanswered.

      Second, have the authors performed a control where they reproduced the rescue by FA supplementation of poor survival of ire-1 mutants after the 15ºC to 2ºC shift? Without this or another positive control, and without measuring change in lipid composition in their own experiments, it is unclear whether the different outcomes with respect to FAs are due to a real difference in adaptive programs at these temperatures, or to failure in supplementation?

      While we did not re-examine the findings by Dudkevich et al., we did include now another positive control. As reporter by Hou et al. (2014), supplementing unsaturated FAs rescues the induction of the hsp-4 reporter in fat-6 RNAi-ed animals. Although we were able to reproduce that result (Figure 6 – figure supplement 1), the same supplementation procedure did not suppress the lips11 reporter (Figure 6 – figure supplement 2).

      (5) Have the authors tested whether and by how much ire-1(ok799) mutation shortens the lifespan at 20ºC? This needs to be done before the defect in survival of ire-1 mutants in Fig. 7a can be interpreted.

      The lifespan at standard cultivation temperature was examined by others (Henis-Korenblit et al., 2010; Hourihan et al., 2016), showing that ire-1(ok799) mutants live shorter. However, while some mechanism that prolong lifespan may also improve cold survival, the two phenomena are not identical and whether IRE-1 facilitates longevity and cold survival in the same or different way remains to be seen.

      Reviewer #2:

      (1) The conclusions regarding a general transcriptional response are based on one gene, lips-11, which does not affect survival in response to cold. We would suggest altering the title, to replace "Reprograming gene expression: with" Regulation of the lipase lips-11".

      We now examined IRE-1 dependent induction of additional genes – see Figure 4 – figure supplement 2. While we do not know what fraction of cold-induced genes depends on IRE-1, we feel that our findings justify the statement that that gene expression in the cold involves the IRE1/XBP-1 pathway (title) or that that the transcription of some/a subset of cold-induced genes depend on this pathway (in abstract, model, and discussion).

      (2) There is no gene ontology with the gene expression data.

      We now included the top 10 most enriched and suppressed gene categories between 10ºC and 4ºC (since the biggest change happens between these conditions, as shown in Figure 2 – figure supplement 1A). This is now included in the Figure 2 – figure supplement 2.

      (3) Definitive conclusions regarding transcription vs translational effects would require use of blockers such as alpha amanatin or cyclohexamide.

      As explained also for reviewer 1, we confirmed now that at least some genes, whose translation is upregulated based on the ribosome profiling, are indeed upregulated in the cold at the protein level (Figure 2 – figure supplement 3A-B). Thus, the increase in ribosomal occupancy seems to accurately reflect increased translation. Since mRNA levels correlate overall with the ribosomal occupancy, it appears that the mRNA levels are the main determinants of the translation output. Because the lips-11 promoter is sufficient to upregulate the GFP reporter in the cold, it further suggests that the regulation happens at the transcription level. It is true that at this point we cannot completely rule out the effects of mRNA stability, which we clearly acknowledge in the discussion.

      (4) Conclusions regarding the role of lipids are based on supplementation with oleic acid or choline, yet there is no lipid analysis of the cold animals, or after lips-1 knockdown.

      We agree that this is an important direction for future studies but feel that lipidomic analysis goes beyond the scope of current work.

      Although choline is important for PC production, adding choline in normal PC could have many other metabolic impacts and doesn't necessarily implicate PC without lipidomic or genetic evidence.

      We agree and acknowledge it now in Discussion: “However, choline also plays other roles, including in neurotransmitter synthesis and methylation metabolism. Thus, we cannot yet rule out the possibility that the protective effects of choline supplementation stem from functions outside PC synthesis.”

      Reviewer #3:

      The study has several weaknesses: it provides limited novel insights into pathways mediating transcriptional regulation of cold-inducible genes, as IRE-1 and XBP-1are already well-known responders to endoplasmic reticulum stress, including that induced by cold.

      We presume the reviewer refers to the study by Dudkevich et al. (2022). As explained in our manuscript, there are important differences between that study and ours in how the IRE-1 signalling is utilized and to what ends.

      Additionally, the weak cold sensitivity phenotype observed in ire-1 mutants casts doubt on the pathway's key role in cold adaptation. The study also overlooks previous research (e.g.PMID: 27540856) that links IRE-1 to SKN-1, another major stress-responsive pathway, potentially missing important interactions and mechanisms involved in cold adaptation.

      We state in the manuscript that the IRE-1 pathway plays a modest but significant role in cold adaptation and state in the Fig. 7 model and Discussion that additional pathways work alongside IRE-1 to drive cold-specific gene expression.

      Recommendations for the authors:

      Reviewer #1:

      Minor comments:

      (1) Fig. 2B - reporter expression seems to be already present in the intestine of 20ºC animals. What is the turnover rate of GFP in the intestine and how is it affected by the temperature shift? If GFP degradation is inhibited, could it explain the increase in signal in 4ºC animals, rather than increased transcription? This seems to be true for the hsp-4 transcriptional reporter, as the GFP fluorescence appears to increase during 4ºC incubation (Fig. 4a), but the hsp-4 message levels are only increased after 1 day but not in later days at 4ºC, based on the RNAseq in provided dataset. How well do changes in lips-11 reporter fluorescence correspond to the changes in the endogenous lips-11 transcript?

      Note that increased GFP fluorescence is accompanied by increased mRNA levels. In addition to the RNAseq data, we now also examined changes of the endogenous lips-11 transcript by RTqPCR and observed its strong (and IRE-1 dependent) upregulation in the cold– see Figure 4 – figure supplement 2. Moreover, we now included two other examples of GFP-tagged proteins whose fluorescence increases in the cold, concomitant with increased mRNA levels and ribosomal occupancy (Figure 2 – figure supplement 2A-B).

      (2) Descriptions of methods to measure different aspects of translation are very abbreviated and in some places make it difficult to understand the paper. One example - what is RFP in Fig. 2a?

      We replaced now “RFP” with “RPF” (ribosome protected fragment) and the abbreviation is explained firsts time it is used.

      (3) How was the effectiveness of RNAi at 4ºC validated?

      As explained in Methods, we subjected animals to RNAi long before they were transferred to 4ºC, so the corresponding protein is depleted prior to cooling.

      (4) Several of the conclusions on translation and ribosomal occupancy are written in a somewhat confusing way. For example, the authors state that "shift from 10ºC to 4ºC had a strong effect" when describing "impact on translation (ribosomal occupancy)" (page 4), but in the next sentence, they state "a good correlation between mRNA levels and translation (Figure 2A)". Was ribosomal occupancy normalized to the transcript abundance?

      We do not perceive any discrepancy between the two statements. The former refers to the difference between time points, where we observed the largest change in both the transcriptome and ribosomal occupancy from 10ºC to 4ºC (as can be inferred in the PCA plot in Figure 2 - figure supplement 1). The latter refers to the observation that changes in mRNA levels mirrored, in most of cases, similar changes in the ribosomal occupancy.

      The ribosomal occupancy was not normalized, as that would essentially normalize the y-axis (ribosomal occupancy) with the x-axis (mRNA), and so express changes in “translational efficiency” as a function of changes in mRNA abundance. While this type of analysis can also reveal interesting biological phenomena, it would explore a different question.

      (5) "For most transcripts ... increased the abundance of a particular protein appears to correlate depend primarily on the abundance of its mRNA" (page 5). This is an overstatement, the protein levels were not quantified.

      As explained above, we now additionally monitored the expression of two GFP-tagged proteins (CEBP-1 and NUMR-1). Monitoring their expression, we observed the expected increase in GFP fluorescence in the cold (see Figure 2 – figure supplement 3 A-B). While we did not examine them also by western blot, these observations are in line with our conclusions.

      (6) The statement "Since transcription is the main determinant of mRNA levels, these results suggest that cold-specific gene expression primarily depends on transcription activation" seems to assume that message degradation doesn't have much of an impact at 4ºC. What is the evidence here? The authors themselves later suggest either transcription or mRNA stability in Discussion.

      While we cannot exclude that mRNA stability of some genes may be affected, this concern is more valid for the messages that go down in the cold. Although we have done it for only selected genes, each time we observed an increase in the mRNA levels, we also observed the corresponding increase in the protein; this study and Pekec et al. (2022). Then, the lips-11 reporter was designed to monitor the activity of its promoter, which we showed in sufficient to upregulate reporter GFP in the cold. We have now expanded the corresponding paragraph in Discussion, which will hopefully come across as more balanced.  

      Reviewer #2:

      (1) Alter title, conclusions to better reflect specific nature of the work.

      We now provided additional data and feel that it justifies our conclusions and title.

      (2) Use Gene Ontology searches to look at patterns of gene expression in RNA seq data.

      We now show it in Figure 2 – figure supplement 2.

      (3) Use genetic or lipidomic tools rather than solely adding exogenous lipids.

      We agree that lipidomic analysis is an important direction for future research, but feel that lipidomic analysis and further genetic experiments go beyond the scope of current manuscript.

      Reviewer #3:

      To strengthen the evidence for the role of IRE-1 in cold adaptation, the authors might consider performing additional functional assays, such as testing the effects of IRE-1 and XBP-1 mutations under varying cold conditions and testing the genetic interaction of ire-1 with xbp-1, skn-1, and hsf-1 in cold sensitivities. It is also worth using alternative approaches such as independent alleles of ire-1, knockdowns or tissue-specific knockouts (without potential developmental compensation in global constitutive mutants) to better characterize the contribution of IRE-1 to cold adaptation. Additionally, studies that examine tissue-specific responses to cold exposure could provide important insights, as different tissues may utilize distinct molecular pathways to adapt to cold stress.

      We also tested ire-1 and xbp-1 functions by RNAi-mediated depletion. SKN-1 is a good candidate for future studies, but Horikawa at al. (2024) showed that HSF-1 is not required for cold dormancy (at 4ºC); we also show now that HSF-1::GFP does not increase in the cold (Figure 2 – figure supplement 3C).

      This reviewer also recommends clarifying the novelty of your findings in the context of existing literature, particularly regarding the established roles of IRE-1 and XBP-1 in responding to endoplasmic reticulum stress.

      The entry point of this study was to clarify a long-standing problem in hibernation research, i.e., the apparent discrepancy between a global translation repression and de novo gene expression observed in the cold. By connecting cold-mediated expression of some genes to the IRE-1/XBP1 pathway, we strengthen the argumentation for transcription-mediated gene regulation in hibernating animals. We did go the extra mile to test the possible reason behind the activation of UPR<sup>ER</sup> in the cold but feel that a deeper analysis deserves a separate study.

      The term "hibernation" should be avoided or reworded since the study does not provide direct behavioral or physiological evidence for hibernation-like states; instead, the manuscript could refer to "cold-induced responses" or "adaptations to cold temperatures."

      The term “hibernation” was used before even in the context of the C. elegans dauer state, which, arguably, is even less appropriate. In addition to a global suppression of translation shown here, we reported before that the same cooling regime suppresses ageing (Habacher et al., 2016; Figure S1C). Incubating at 4ºC also arrests C. elegans development (Horikawa et al., 2024). Thus, while the worm and mammalian hibernation are certainly not equivalent – which we clearly spell out – we like to use “hibernation” interchangeably with “cold dormancy” to draw attention to a fascinating aspect of C. elegans biology. Still, we use now quotation marks in the title to avoid misunderstanding.

      The discussion could be strengthened by addressing the relevance of prior studies, such as those linking IRE-1 to SKN-1 (PMID: 27540856), TRPA-1 (PMID: 23415228), ZIP-10 (PMID: 29664006), HSF-1 (PMID: 38987256) in cold adaptation and elaborating on how your findings provide new

      The IRE-1/SKN-1 and ZIP-10 papers are now mentioned when describing the model in Figure 7. The TRP-1 and HSF-1 papers are cited when discussing physiological differences between different cold temperatures. Consistent with our studies, the HSF-1 paper shows that nematodes enter a dormant state at 4ºC (but at 9ºC and higher temperatures continue developing). Importantly, HSF-1 promotes the development at 9ºC but is not important for the arrest at 4ºC. We also shown now in Figure 2 – figure supplement 3C that HSF-1 does not go up at 4ºC.

    1. Reviewer #1 (Public Review):

      The mechanisms that regulate establishment of the germline stem cells and germline progenitors during zebrafish reproductive development are not understood. Prior single cell analysis characterized the cell types of the early zebrafish ovary during and at stages after sexual differentiation. In this work Hsu et al. took a single approach to analyze the cell types present in the early gonad during early sex determination. As expected, they identified germline stem cells (GSCs) that express canonical GSC markers and distinct populations of progenitors. Unexpectedly, they found multiple populations of transcriptionally distinct progenitor populations that the authors termed early (those lacking the differentiation marker foxl2l), committed (those expressing fox2l2 and S-phase genes) and late (those expressing fox2l2 and meiotic genes) progenitors. Comparisons of their dataset to the published zebrafish ovary datasets confirmed the presence of these distinct progenitor populations in the ovary. Further, they convincingly validated the presence of these progenitor subtypes using fluorescent in situ hybridization. To investigate the relationship between progenitor subsets and known regulators of ovary differentiation, the authors conducted single cell analysis of gonads lacking the transcription factor, Foxl2l. As previously reported, Foxl2l absence blocks ovary differentiation and all foxl2l mutants develop testes. The single cell analysis here indicates that foxl2l is inappropriately expressed in GSCs and early progenitors and that germ cell differentiation is blocked at the committed progenitor stage since few committed progenitors and no late progenitors or meiotic transcripts were detected in the single cell analysis of foxl2l mutants. Based on the coexpression of genes that are not typically expressed together in normally developing germ cells, specifically nanos2 and foxl2l, and dmrt1 and foxl2l, the authors conclude that Foxl2l is required for the committed progenitor program and that it prevents committed progenitors from returning to the GSC state.

      Overall, the data provide new insights into the cell populations of the early differentiating gonad, define distinct progenitor states, pinpoint a requirement for the ovary differentiation factor Foxl2l at a specific stage of progenitor differentiation, and generate new hypotheses to be tested. Many but not all of the conclusions are supported by compelling data, and some findings and conclusions need to be clarified in the context of the published literature.

      (1) The authors conclude that the committed progenitors revert to GSCs based on the coexpression of nanos2 and foxl2l nanos2 and based on expression of id1 in mutants but not in WT. Without functional data demonstrating that the progenitors revert to an earlier state, alternative interpretations should be considered. For example, it is possible that the cells initiate the committed progenitor program but continue to express the GSC program and that the coexpression of both programs blocks differentiation. Consistent with this possibility, some Fox family members, FoxL2 and FoxPs for example, are known to be both activators and repressors of transcription or act primarily as repressors. Potentially relevant to this work, repressive activity of FoxL2 has been previously reported in the mammalian ovary (Pisarska et al Endocrinology 2004, Pisarska Am J. Phys Endo. Metabolism 2010, Kuo Reproduction 2012, Kuo Endocrinology 2011, as well as more recent publications). In that context interfering with FoxL2 was proposed to cause upregulated expression of genes normally repressed by FoxL2, accelerated follicle recruitment, and premature ovarian failure.

      (2) The authors conclude that the committed progenitor stage is "the gate toward female determination" and that the cells "stay at S-Phase temporarily before differentiation". This conclusion seems to be based solely on single cell RNAseq expression. In several species, including zebrafish, meiotic entry occurs earlier in females and has been correlated with ovary development. The possibility that the late progenitor stage, the stage when meiotic genes are detected in this study and a stage missing in foxl2l mutants, is actually the key stage for female determination cannot be excluded by the data provided.

      (3) The authors discuss prior working showing that loss of germ cells leads to male development and that germ cells are required for female development and claim to extend that work by showing here that some progenitors are already sexually differentiated. First, the stages compared are completely different. The earlier work looks at the primordial germ cells and their loss in the first few days of development before a gonad forms. In contrast, this work examines stages well after the gonad has formed and during sex determination. The second concern is that the conclusion that the progenitors are differentiated is based solely on the expression of foxl2l, which is initially expressed in the juvenile ovary state that lab strains have been shown to develop through (Wilson et al Front Cell Dev Bio 2024). While it is fair to state that some cells express ovary markers at this stage, it is unclear that this is sufficient evidence that the cells are differentiated. For example, in the context of the foxl2l mutant, the authors observe that GSCs and early progenitors inappropriately express foxl2l, but the mutants develop as males. Thus, expression of foxl2l transcripts alone is insufficient evidence to claim that the cells are already differentiated as female.

      (4) The comparison between medaka and zebrafish foxl2l mutants seems to suggest that Foxl2l is required for meiosis in medaka but has a different role in zebrafish. However, if foxl2l represses the earlier developmental programs of GSCs and early progenitors, it is possible that continued expression of these early programs interferes with activation of meiotic genes. This could account for the absence of the late progenitor stage in foxl2l mutants since the late progenitor stage is defined by and distinguished from the earlier stages by expression of foxl2l and meiotic genes. If so, foxl2l may be similarly required in both systems.

      (5) The authors state that "Foxl2l may ensure female differentiation by preventing stemness and antagonizing male development." It is unclear why suppressing stemness would be necessary for female differentiation since female zebrafish have stem cells as do male zebrafish. It seems likely that turning off the GSC and early differentiation programs is important for allowing expression of meiosis and oocyte differentiation genes, and that a gene other than Foxl2l is required for differentiation from GSCs to spermatocytes.

      (6) Based on its expression in mutant progenitors, p53 is proposed to assist with alternative differentiation of mutant germ cells. Although p53 transcripts are expressed, no evidence is provided that p53 is involved in differentiation of germ cells, and sex bias has not been associated with the published p53 mutants in zebrafish. Furthermore, while p53 has been shown to be important for ovary to testis transformation in mutant contexts in adults, it appears dispensable for testis development in mutants that disrupt ovary differentiation in earlier stages (Rodriguez-Mari et al PLoS Gen 2010, Shive PNAS 2010, Hartung et al Mol. Reprod. Dev 2014, Miao Development 2017, Kaufman et al PLoSGen 2018, Bertho et al Development 2021. It is possible that p53 eliminates foxl2l mutant germ cells that are simultaneously expressing multiple developmental programs, but this possibility would need to be tested.

    2. Author response:

      Reviewer #1 (Public Review):

      (1) The authors conclude that the committed progenitors revert to GSCs based on the coexpression of nanos2 and foxl2l nanos2 and based on expression of id1 in mutants but not in WT. Without functional data demonstrating that the progenitors revert to an earlier state, alternative interpretations should be considered. For example, it is possible that the cells initiate the committed progenitor program but continue to express the GSC program and that the coexpression of both programs blocks differentiation.

      Thanks for your insightful comment. We have explored possible alternative interpretations of our data. Regarding the suggested possibility of a continued GSC program in the mutant, we have examined the expression of GSC markers including nanos2 in the mutant at different stages. We found that in the mutant, nanos2 or other GSC markers were not significantly upregulated in GSC-to progenitor transition (G-P) and early progenitors (Prog-E) (Fig. 4B). The expression of these GSC markers was also low in the integrated clusters I4-I6 when G-P and Prog-E stages were prominent (Fig. 3D and Fig. 3E). GSC marker nanos2 was high only in mutant Prog-C. These results argue against continued GSC programs in the foxl2l mutants. Another possible explanation is that perhaps some mutant Prog-C acquires some GSC property with the upregulation of nanos2 instead of a continuous GSC program. We have now clarified our rationale about mutant cells gaining new GSC properties and included both interpretations in the Result.

      Consistent with this possibility, some Fox family members, FoxL2 and FoxPs for example, are known to be both activators and repressors of transcription or act primarily as repressors. Potentially relevant to this work, repressive activity of FoxL2 has been previously reported in the mammalian ovary (Pisarska et al Endocrinology 2004, Pisarska Am J. Phys Endo. Metabolism 2010, Kuo Reproduction 2012, Kuo Endocrinology 2011, as well as more recent publications). In that context interfering with FoxL2 was proposed to cause upregulated expression of genes normally repressed by FoxL2, accelerated follicle recruitment, and premature ovarian failure.

      FoxL2 exerts both activating and repressive activities. We believe that Foxl2l can also activate and repress its target gene expression. Although its target genes have not been clearly identified, Foxl2l may activate genes involved such process as oogenic meiosis, and may also repress other genes involved in other processes, say perhaps nanos2.

      (2) The authors conclude that the committed progenitor stage is "the gate toward female determination" and that the cells "stay at S-Phase temporarily before differentiation". This conclusion seems to be based solely on single cell RNAseq expression. In several species, including zebrafish, meiotic entry occurs earlier in females and has been correlated with ovary development. The possibility that the late progenitor stage, the stage when meiotic genes are detected in this study and a stage missing in foxl2l mutants, is actually the key stage for female determination cannot be excluded by the data provided.

      We agree that Prog-L is important for the initiation of female meiosis. We have made revision in the text to point out the importance of Prog-L in female differentiation.

      (3) The authors discuss prior working showing that loss of germ cells leads to male development and that germ cells are required for female development and claim to extend that work by showing here that some progenitors are already sexually differentiated. First, the stages compared are completely different. The earlier work looks at the primordial germ cells and their loss in the first few days of development before a gonad forms. In contrast, this work examines stages well after the gonad has formed and during sex determination.

      Both previous studies and our study indicate the important role of germ cells in zebrafish sex differentiation during gonadal development. The earlier works show that the abundance of primordial germ cells contributes to sex differentiation. Our current finding further suggests the existence of female identify in some germ cells at the juvenile stage and discusses the importance of cell in sexual differentiation. We have added the developmental age in our study to emphasize the age difference.

      The second concern is that the conclusion that the progenitors are differentiated is based solely on the expression of foxl2l, which is initially expressed in the juvenile ovary state that lab strains have been shown to develop through (Wilson et al Front Cell Dev Bio 2024). While it is fair to state that some cells express ovary markers at this stage, it is unclear that this is sufficient evidence that the cells are differentiated.

      The conclusion about the differentiation of progenitors is not based solely on foxl2l expression; rather, it is according to the whole transcriptomic profiles of both WT (Figure 1B) and foxl2l mutant cells (Figure 3A) as well as the foxl2l mutant phenotype (Figure 2C). Three types of progenitors, Prog-E, Prog-C and Prog-L were identified by whole transcriptomic analysis in WT. In foxl2l mutants, the transcriptomic profile further shows that Prog-L and meiotic cells are completely lost, and all germ cells undergo male differentiation eventually. These results together indicate that the differentiation of Prog-C to Prog-L guides the progenitor toward female differentiation. Our result also showed that in the juvenile gonad, foxl2l expression is high in two types of progenitors, Prog-C and Prog-L, and become low after meiotic entry.

      For example, in the context of the foxl2l mutant, the authors observe that GSCs and early progenitors inappropriately express foxl2l, but the mutants develop as males. Thus, expression of foxl2l transcripts alone is insufficient evidence to claim that the cells are already differentiated as female.

      The foxl2l mutants develop into males because they lack functional Foxl2l. Although the mutated foxl2l transcript is present in mutant cells, these transcripts are not functional. These mutants develop into males eventually. This result is consistent with our claim that functional Foxl2l is important for the development of Prog-L and female differentiation.

      (4) The comparison between medaka and zebrafish foxl2l mutants seems to suggest that Foxl2l is required for meiosis in medaka but has a different role in zebrafish. However, if foxl2l represses the earlier developmental programs of GSCs and early progenitors, it is possible that continued expression of these early programs interferes with activation of meiotic genes. This could account for the absence of the late progenitor stage in foxl2l mutants since the late progenitor stage is defined by and distinguished from the earlier stages by expression of foxl2l and meiotic genes. If so, foxl2l may be similarly required in both systems.

      Medaka and zebrafish Foxl2l may share similar functions such as the stimulation of meiotic gene expression and promotion of oogenesis in the female germ cells preparing for meiotic entry. In addition, we also detected aberrant upregulation of nanos2 in some foxl2l mutant cells. The idea of “continued expression of these early programs interferes with activation of meiotic genes” is conceivable, but for now we have no evidence for it. We do not know whether the absence of meiotic genes is due to an interference caused by the activation of nanos2 or due to the complete loss of Prog-L and meiotic cells. It will also be interesting to find out whether medaka Foxl2l has a role in early progenitors

      (5) The authors state that "Foxl2l may ensure female differentiation by preventing stemness and antagonizing male development." It is unclear why suppressing stemness would be necessary for female differentiation since female zebrafish have stem cells as do male zebrafish. It seems likely that turning off the GSC and early differentiation programs is important for allowing expression of meiosis and oocyte differentiation genes, and that a gene other than Foxl2l is required for differentiation from GSCs to spermatocytes.

      It is true that we have not proved whether suppression of stemness is required for female differentiation. Maybe our earlier statement is a bit misleading. We agree that it is likely that turning off the GSC and early differentiation programs is important for allowing expression of meiotic and oocyte differentiation genes, and that a gene other than Foxl2l is required for differentiation from GSCs to spermatocytes. To avoid confusion, we have modified our statement in the text.

      (6) Based on its expression in mutant progenitors, p53 is proposed to assist with alternative differentiation of mutant germ cells. Although p53 transcripts are expressed, no evidence is provided that p53 is involved in differentiation of germ cells, and sex bias has not been associated with the published p53 mutants in zebrafish. Furthermore, while p53 has been shown to be important for ovary to testis transformation in mutant contexts in adults, it appears dispensable for testis development in mutants that disrupt ovary differentiation in earlier stages (Rodriguez-Mari et al PLoS Gen 2010, Shive PNAS 2010, Hartung et al Mol. Reprod. Dev 2014, Miao Development 2017, Kaufman et al PLoSGen 2018, Bertho et al Development 2021. It is possible that p53 eliminates foxl2l mutant germ cells that are simultaneously expressing multiple developmental programs, but this possibility would need to be tested.

      The tp53<sup>-/-</sup>foxl2l<sup>-/-</sup> double mutant cannot alleviate the all-male phenotype of foxl2l<sup>-/-</sup> mutant (Dev Biol, 517, 91-99, 2024), indicating that the male development is not due to p53-mediated germ cell apoptosis. We have cited the suggested papers and compared relation of tp53 between these mutants (fancl, zar1, etc.) mentioned in the cited papers. Since tp53 was enriched in certain foxl2l<sup>-/-</sup> mutant cell clusters, and tp53 mutation fails to rescue the all-male phenotype, it is possible that p53 expressed in these mutant cell clusters has roles other than inducing apoptosis. One assumption is that p53 may be involved in the germ cell differentiation, especially p53 is known to promote differentiation of airway epithelial progenitors, adipogenesis and embryonic stem cells. We have emphasized that the suggested role of p53 in germ cell differentiation is our assumption in the Discussion.

      Reviewer #3 (Public Review):

      This is the first report to show a transcriptional factor, foxl2l, is essential for the development of female germs. Without foxl2l, germ cells will be developed into sperms. The report also clearly defined the arrested stage of early germ cells in foxl2l mutants, or stages that is critical for foxl2l to play a role for the further development of female germ cells.

      (1) Due to lack of cell lineage tracing, the claim of foxl2l suppression of dedifferentiate of progenitor cells to GSC based on the gene expression and cell number changes is weak.

      Thanks for your comments pointing out our contribution and also weakness. We acknowledge the lack of direct evidence on the reversion of mutant Prog-C to GSC in our data. We now removed the claim about the repression of stemness by Foxl2l.

      (2) In addition, separation of early germ cell types in foxl2l mutant using marker genes from WT may not be optimal.

      The cell type of mutant cell is determined by two independent analyses. First is inferring the developmental stage of mutant cells. This approach assumes that mutant cells can indeed be mapped to specific WT stages through their transcriptomic profiles. However, as indicated by this reviewer’s comments, mutant cells exhibited heterogeneity and can be distinct from WT cells. Defining cell types in mutants by WT markers may not be optimal. To address this, we conducted another analysis, co-clustering. Mutant cells and WT cells at early stages (GSC , G-P, Prog-E, Prog-C(S) and Prog-C) were co-clustered. This approach does not assume a direct correspondence between mutant and WT developmental stages. Instead, it facilitates the identification of novel germ cell types in mutants while characterizing the relationship between WT and mutant cells. In some clusters, both WT and mutant cells were present, indicating high transcriptomic similarity. In other clusters, most cells are only mutant cells, indicating distinct mutant cell types (Figure 3C). We can, therefore, assign developmental properties to these mutant cells with confidence.

    1. 福克纳的“多视角叙事”、卡夫卡的荒诞哲学、博尔赫斯的迷宫结构

      好的,我将通过具体例子结合知识库信息,分别解释福克纳的“多视角叙事”、卡夫卡的“荒诞哲学”和博尔赫斯的“迷宫结构”。以下是详细分析:


      一、福克纳的“多视角叙事”

      核心特征:通过不同人物的主观视角拼贴故事,打破线性叙事,揭示人物心理与真相的复杂性。

      具体例子:《喧哗与骚动》

      1. 班吉的视角(第一章)
      2. 特点:班吉是智障者,叙述混乱无序,依赖感官印象(如颜色、声音)触发回忆。
      3. 知识库引用
        • 知识库[2]提到,班吉的叙述“支离破碎”,如他回忆凯蒂爬树时的场景:“那棵树不再抖动了……凯蒂的头发上插着花儿,披着条长长的白纱,像闪闪发亮的风儿。”
        • 知识库[4]指出,班吉的视角“以白痴的叙述开创性地展现非理性思维”,通过感官碎片拼凑出家族衰败的真相。
      4. 作用:揭示康普生家族的混乱与情感断裂,暗示凯蒂的悲剧是家庭精神崩溃的缩影。

      5. 昆丁的视角(第二章)

      6. 特点:昆丁内心充满对家族荣誉的执念,叙述充满时间焦虑(如反复看表、回忆妹妹凯蒂的堕落)。
      7. 知识库引用

        • 知识库[8]分析福克纳对时间的处理:“昆丁的意识流展现时间的‘绵延’,过去与现在交织,最终导致他自杀。”
        • 知识库[9]提到,昆丁的视角“通过心理独白暴露南方贵族的道德困境”。
      8. 杰生的视角(第三章)

      9. 特点:自私、偏执,以现实主义的怨恨叙述,揭露家族的虚伪与资本主义价值观。
      10. 知识库引用

        • 知识库[3]指出,杰生的叙述“充满对凯蒂和妹妹的怨恨”,体现“南方旧秩序的崩塌与新贪婪的崛起”。
      11. 迪尔西的视角(第四章)

      12. 特点:黑人女仆的全知视角,冷静梳理前三章的碎片,揭示真相(如凯蒂私生女的存在)。
      13. 知识库引用
        • 知识库[4]强调迪尔西视角的“补充性”,她“作为旁观者,用客观叙事解构白人视角的偏见”。

      总结

      福克纳通过四重视角,将同一事件(康普生家族的衰落)解构为不同人物的主观体验,最终拼凑出南方贵族精神崩溃的全景。这种叙事方式打破了传统“客观叙述”,强调真相的多元性与不可靠性(知识库[6][7])。


      二、卡夫卡的“荒诞哲学”

      核心特征:通过荒诞的情节、符号与设定,揭示现代社会的异化、权力压迫与个体的无力感。

      具体例子:《变形记》

      1. 荒诞设定
      2. 主人公格里高尔·萨姆沙一夜之间变成甲虫,但家人和雇主对此反应冷漠。
      3. 知识库引用

        • 卡夫卡的荒诞哲学常通过“非理性事件”隐喻现实中的异化(如知识库[10]提到的“用变形象征人性的丧失”)。
      4. 荒诞细节

      5. 家人从恐慌到厌恶,最终将他视为累赘;格里高尔仍试图履行“家庭责任”,却无人倾听。
      6. 哲学内核

        • 异化:资本主义社会中,人沦为工具(格里高尔的工作是“推销员”,变形后失去价值)。
        • 沟通失效:格里高尔无法解释自己的处境,象征现代人情感交流的断裂。
      7. 结局的荒诞

      8. 格里高尔死去,家人如释重负,甚至计划搬离公寓。
      9. 知识库引用
        • 知识库[9]指出,卡夫卡的荒诞“揭露了人性的冷漠与制度的非理性”。

      另一例:《审判》

      • 荒诞情节:约瑟夫·K毫无缘由地被指控有罪,却无法得知罪名或上诉途径,最终被处决。
      • 哲学内核
      • 权力的不可抗性:官僚体系庞大而荒谬,个体在其中完全无力(知识库[10]提到“官僚机器的非人性”)。
      • 存在主义困境:世界本质无意义,人只能在荒诞中挣扎。

      总结

      卡夫卡的荒诞哲学通过超现实情节日常细节的结合,将现代社会的荒诞本质推向极致,揭示个体在权力与异化中的悲剧性(知识库[9][10])。


      三、博尔赫斯的“迷宫结构”

      核心特征:通过循环、镜像、无限空间等叙事手法,构建文本的多重可能性,探讨时间、记忆与存在的哲学命题。

      具体例子:《小径分岔的花园》

      1. 迷宫设定
      2. 故事中的“花园”实为一座包含无数时间分岔的迷宫,每个选择都导致平行宇宙的诞生。
      3. 知识库引用

        • 知识库[10]提到博尔赫斯的迷宫“象征时间的无限可能性”,如“小径分岔的花园”暗示“所有可能的历史同时存在”。
      4. 叙事结构

      5. 故事嵌套在一封信、一个梦和一座花园中,现实与虚构界限模糊。
      6. 哲学内核
        • 时间的不可知性:人类试图掌控时间,但最终陷入自我指涉的循环(如主角在杀人后成为迷宫的一部分)。
        • 叙事的无限性:文本本身成为迷宫,读者无法确定何为真实。

      另一例:《巴别图书馆》

      • 迷宫设定:宇宙被描绘为无数六边形房间组成的图书馆,每本书排列所有可能的字符组合。
      • 哲学内核
      • 知识的虚无:无限信息导致意义消解,寻找“完美之书”成为徒劳。
      • 存在主义的孤独:人类在无限中寻找秩序,却注定失败(知识库[10]提到“迷宫隐喻人类认知的局限”)。

      总结

      博尔赫斯的迷宫结构通过叙事层的嵌套符号的循环,将文本转化为哲学思辨的载体,探讨真实与虚幻、有限与无限的辩证关系(知识库[10])。


      总结对比

      | 作家 | 核心手法 | 代表作品 | 哲学内核 | |----------------|-----------------------|----------------------|----------------------------------| | 福克纳 | 多视角叙事 | 《喧哗与骚动》 | 真相的多元性与南方贵族的衰落 | | 卡夫卡 | 荒诞哲学 | 《变形记》《审判》 | 异化、权力压迫与存在困境 | | 博尔赫斯 | 迷宫结构 | 《小径分岔的花园》 | 时间的无限性与认知的局限性 |

      通过以上分析可见,三位作家分别以独特的叙事策略,将文学从“讲故事”升华为对人性、社会与存在的深刻哲学探索。

    Annotators

    1. Activity patterns were defined on the basis of dichotomous physical activity and sitting time variables: (1) active (≥40 MET-minutes/day)/low sitting time (<8 hours/day); (2) active/high sitting time (≥8 hours/day); (3) inactive (<40 MET-minutes/day)/low sitting time; and (4) inactive/high sitting time.

      Shows that the amount of activity time per participant was noted in MET minutes

    2. Physical inactivity is associated with numerous physical and mental health conditions and accounts for approximately 1.5%−3.0% of the total direct health care costs in developed countries.1 It is estimated that a 10% reduction in the prevalence of inactivity could potentially reduce health care expenditures by 96 million Australian dollars (AU$) and 150 million Canadian dollars per year in Australia and Canada (equating to 99 and 129 million U.S. dollars, respectively).2, 3 Despite the potential health and economic benefits, only about half the population in developed countries meet the recommended levels of physical activity.4, 5,

      Immediately outlines that physical inactivity is a key cause of increasing healthcare costs, and outlines this as one of the leading factors of healthcare costs in developed countries.

    1. oos
      1. I tried to coax my brother out for a walk, but he’s got his nose in a book, as always. ✅ nose (правильный вариант)

      Перевод: Я попытался уговорить брата выйти на прогулку, но он, как всегда, уткнулся в книгу.

      Объяснение: Фраза "has his nose in a book" означает, что человек полностью погружен в чтение.

      1. I didn’t get very far with the book – a bit heavy-going for me. ✅ heavy-going (правильный вариант)

      Перевод: Я не очень продвинулся в этой книге — она показалась мне слишком сложной/тяжеловесной.

      Объяснение: "Heavy-going" означает, что книга трудна для восприятия (скучная, сложная).

      1. Can’t come over, sorry. Exam tomorrow, so it’s dinner now, then I hit the books! ✅ hit (правильный вариант)

      Перевод: Не могу прийти, извини. Завтра экзамен, так что сейчас ужин, а потом — за учебу!

      Объяснение: "Hit the books" — идиома, означающая "серьезно заняться учебой/чтением".

      1. In my opinion, there’s nothing better than getting lost in a book. ✅ lost (правильный вариант)

      Перевод: По-моему, нет ничего лучше, чем полностью погрузиться в книгу.

      Объяснение: "Get lost in a book" — значит "погрузиться в чтение с головой".

      1. Lucy’s been a bookworm ever since she got her first Roald Dahl book. ✅ bookworm (правильный вариант)

      Перевод: Люси была книжным червем с тех пор, как получила свою первую книгу Роальда Даля.

      Объяснение: "Bookworm" — это человек, который очень любит читать (разговорное).

      1. You don’t have to be a die-hard book lover to appreciate the value of reading. ✅ die-hard (правильный вариант)

      Перевод: Не обязательно быть фанатичным любителем книг, чтобы ценить чтение.

      Объяснение: "Die-hard" означает "преданный до фанатизма".

      1. There are some books I read cover-to-cover, staying up into the early hours. ✅ cover-to-cover (правильный вариант)

      Перевод: Есть книги, которые я читаю от корки до корки, засиживаясь до рассвета.

      Объяснение: "Cover-to-cover" означает "полностью, от начала до конца".

      1. When you get a well-earned break it’s good to curl up with a book for a while and chill. ✅ curl up (правильный вариант)

      Перевод: Когда у тебя есть заслуженный перерыв, здорово свернуться калачиком с книгой и расслабиться.

      Объяснение: "Curl up" описывает удобную позу для чтения (например, в кресле или на диване).

    Annotators

    1. Reviewer #1 (Public review):

      Summary:

      Pradhan et al investigated the potential gustatory mechanisms that allow flies to detect cholesterol. They found that flies are indifferent to low cholesterol and avoid high cholesterol. They further showed that the ionotropic receptors Ir7g, Ir51b, and Ir56d are important for the cholesterol sensitivity in bitter neurons. The figures are clear and the behavior result is interesting. However, I have several major comments, especially on the discrepancy of the expression of these Irs with other lab published results, and the confusing finding that the same receptors (Ir7g, Ir51b) have been implicated in the detection of various seemingly unrelated compounds.

      Strengths:

      The results are very well presented, the figures are clear and well-made, text is easy to follow.

      Weaknesses:

      (1) Regarding the expression of Ir56d. The reported Ir56d expression pattern contradicts multiple previous studies (Brown et al., 2021 eLife, Figure 6a-c; Sanchez-Alcaniz et al., 2017 Nature Communications, Figure 4e-h; Koh et al., 2014 Neuron, Figure 3b). These studies, using three different driver lines, consistently showed Ir56d expression in sweet-sensing neurons and taste peg neurons. Importantly, Sanchez-Alcaniz et al. demonstrated that Ir56d is not expressed in Gr66a-expressing (bitter) neurons. This discrepancy is critical since Ir56d is identified as the key subunit for cholesterol detection in bitter neurons, and misexpression of Ir7g and Ir51b together is insufficient to confer cholesterol sensitivity (Fig.4b,d). Which Ir56d-GAL4 (and Gr66a-I-GFP) line was used in this study? Is there additional evidence (scRNA sequencing, in-situ hybridization, or immunostaining) supporting Ir56d expression in bitter neurons?

      (2) Ir51b has previously been implicated in detecting nitrogenous waste (Dhakal 2021), lactic acid (Pradhan 2024), and amino acids (Aryal 2022), all by the same lab. Additionally, both Ir7g and Ir51b have been implicated in detecting cantharidin, an insect-secreted compound that flies may or may not encounter in the wild, by the same lab. Is Ir51b proposed to be a specific receptor for these chemically distinct compounds or a general multimodal receptor for aversive stimuli? Unlike other multimodal bitter receptors, the expression level of Ir51b is rather low and it's unclear which subset of GRNs express this receptor. The chemical diversity among nitrogenous waste, amino acids, lactic acid, cantharidin, and cholesterol raises questions about the specificity of these receptors and warrants further investigation and at a minimum discussion in this paper. Given the wide and seemingly unrelated sensitivity of Ir51b and Ir7g to these compounds I'm leaning towards the hypothesis that at least some of these is non-specific and ecologically irrelevant without further supporting evidence from the authors.

      (3) The Benton lab Ir7g-GAL4 reporter shows no expression in adults. Additionally, two independent labellar RNA sequencing studies (Dweck, 2021 eLife; Bontonou et al., 2024 Nature Communications) failed to detect Ir7g expression in the labellum. This contradicts the authors' previous RT-PCR results (Pradhan 2024 Fig. S4, Journal of Hazardous Materials) showing Ir7g expression in the labellum. Additionally the Benton and Carlson lab Ir51b-GAL4 reporters show no expression in adults as well. Please address these inconsistencies.

      (4) The premise that high cholesterol intake is harmful to flies, which makes sensory mechanisms for cholesterol avoidance necessary, is interesting but underdeveloped. Animal sensory systems typically evolve to detect ecologically relevant stimuli with dynamic ranges matching environmental conditions. Given that Drosophila primarily consume fruits and plant matter (which contain minimal cholesterol) rather than animal-derived foods (which contain higher cholesterol), the ecological relevance of cholesterol detection requires more thorough discussion. Furthermore, at high concentrations, chemicals often activate multiple receptors beyond those specifically evolved for their detection. If the cholesterol concentrations used in this study substantially exceed those encountered in the fly's natural diet, the observed responses may represent an epiphenomenon rather than an ecologically and ethologically relevant sensory mechanism. What is the cholesterol content in flies' diet and how does that compare to the concentrations used in this paper?

    2. Reviewer #2 (Public review):

      Summary:

      In Cholesterol Taste Avoidance in Drosophila melanogaster, Pradhan et al. used behavioral and electrophysiological assays to demonstrate that flies can: (1) detect cholesterol through a subset of bitter-sensing gustatory receptor neurons (GRNs) and (2) avoid consuming food with high cholesterol levels. Mechanistically, they identified five members of the IR family as necessary for cholesterol detection in GRNs and for the corresponding avoidance behavior. Ectopic expression experiments further suggested that Ir7g + Ir56d or Ir51b + Ir56d may function as tuning receptors for cholesterol detection, together with the Ir25a and Ir76b co-receptors.

      Strengths:

      The experimental design of this study was logical and straightforward. Leveraging their expertise in the Drosophila taste system, the research team identified the molecular and cellular basis of a previously unrecognized taste category, expanding our understanding of gustation. A key strength of the study was its combination of electrophysiological recordings with behavioral genetic experiments.

      Weaknesses:

      My primary concern with this study is the lack of a systematic survey of the IRs of interest in the labellum GRNs. Consequently, there is no direct evidence linking the expression of putative cholesterol IRs to the B GRNs in the S6 and S7 sensilla.

      Specifically, the authors need to demonstrate that the IR expression pattern explains cholesterol sensitivity in the B GRNs of S6 and S7 sensilla, but not in other sensilla. Instead of providing direct IR expression data for all candidate IRs (as shown for Ir56d in Figure 2-figure supplement 1F), the authors rely on citations from several studies (Lee, Poudel et al. 2018; Dhakal, Sang et al. 2021; Pradhan, Shrestha et al. 2024) to support their claim that Ir7g, Ir25a, Ir51b, and Ir76b are expressed in B GRNs (Lines 192-194). However, none of these studies provide GAL4 expression or in situ hybridization data to substantiate this claim.

      Without a comprehensive IR expression profile for GRNs across all taste sensilla, it is difficult to interpret the ectopic expression results observed in the B GRN of the I9 sensillum or the A GRN of the L-sensillum (Figure 4). It remains equally plausible that other tuning IRs-beyond the co-receptor Ir25a and Ir76b-could interact with the ectopically expressed IRs to confer cholesterol sensitivity, rather than the proposed Ir7g + Ir56d or Ir51b + Ir56d combinations.

    3. Reviewer #3 (Public review):

      Summary:

      Whether and how animals can taste cholesterol is not well understood. The study provides evidence that 1) cholesterol activates a subset of bitter-sensing gustatory receptor neurons (GRNs) in the fly labellum, but not other types of GRNs, 2) flies show aversion to high concentrations of cholesterol, and this is mediated by bitter GRNs, and 3) cholesterol avoidance depends on a specific set of ionotropic receptor (IR) subunits acting in bitter GRNs. The claims of the study are supported by electrophysiological recordings, genetic manipulations, and behavioral readouts.

      Strengths:

      Cholesterol taste has not been well studied, and the paper provides new insight into this question. The authors took a comprehensive and rigorous approach in several different parts of the paper, including screening the responses of all 31 labellar sensilla, screening a large panel of receptor mutants, and performing misexpression experiments with nearly every combination of the 5 IRs identified. The effects of the genetic manipulations are very clear and the results of electrophysiological and behavioral studies match nicely, for the most part. The appropriate controls are performed for all genetic manipulations.

      Weaknesses:

      The weaknesses of the study, described below, are relatively minor and do not detract from the main conclusions of the paper.

      (1) The paper does not state what concentrations of cholesterol are present in Drosophila's natural food sources. Are the authors testing concentrations that are ethologically relevant?

      (2) The paper does not state or show whether the expression of IR7g, IR51b, and IR56d is confined to bitter GRNs. Bitter-specific expression of at least some of these receptors would be necessary to explain why bitter GRNs but not sugar GRNs (or other GRN types) normally show cholesterol responses.

      (3) The authors only investigated the responses of GRNs in the labellum, but GRN responses in the leg may also contribute to the avoidance of cholesterol feeding. Alternatively, leg GRNs might contribute to cholesterol attraction that is unmasked when bitter GRNs are silenced. In support of this possibility, Ahn et al. (2017) showed that Ir56d functions in sugar GRNs of the leg to promote appetitive responses to fatty acids.

      (4) The authors might consider using proboscis extension as an additional readout of taste attraction or aversion, which would help them more directly link the labellar GRN responses to a behavioral readout. Using food ingestion as a readout can conflate the contribution of taste with post-ingestive effects, and the regulation of food ingestion also may involve contributions from GRNs on multiple organs, whereas organ-specific contributions can be dissociated using proboscis extension. For example, does presenting cholesterol on the proboscis lead to aversive responses in the proboscis extension assay (e.g., suppression of responses to sugar)? Does this aversion switch to attraction when bitter GRNs are silenced, as with the feeding assay?

      (5) The authors claim that the cholesterol receptor is composed of IR25a, IR76b, IR56d, and either IR7g or IR51b. While the authors have shown that IR25a and IR76b are each required for cholesterol sensing, they did not show that both are required components of the same receptor complex. If the authors are relying on previous studies to make this assumption, they should state this more clearly. Otherwise, I think further misexpression experiments may be needed where only IR25a or IR76b, but not both, are expressed in GRNs.

    1. Citation. Texas Education Agency. (2023). Enrollment in Texas public schools, 2022-23. (DocumentNo. GE24 601 01). Austin TX: Author.Abstract. This report provides information on enrollment in the Texas public school system from the2012-13 through 2022-23 school years, based on data collected through the Texas Student Data System.Enrollment data are provided by grade, race/ethnicity, gender, and economically disadvantaged status,and for instructional programs, nontraditional school models, and special populations. Data also are re-ported by education service center region and for state- and district-authorized charter school campuses.Additional copies of this document may be purchased using the order form in the back of thispublication. Also, the report is available in PDF format on the agency website at https://tea.texas.gov/reports-and-data/school-performance/accountability-research/enrollment-trends. Additional informationabout this report may be obtained by contacting the Texas Education Agency Division of Research andAnalysis by phone at (512) 475-3523 or by e-mail at Research@tea.texas.gov.Copyright © Notice. The materials are copyrighted © and trademarked ™ as the property of the Texas Education Agency (TEA) and may notbe reproduced without the express written permission of TEA, except under the following conditions: (1) Texas public school districts, charterschools, and Education Service Centers may reproduce and use copies of the Materials and Related Materials for the districts' and schools' educa-tional use without obtaining permission from TEA; (2) residents of the state of Texas may reproduce and use copies of the Materials and RelatedMaterials for individual personal use only without obtaining written permission of TEA; (3) any portion reproduced must be reproduced in itsentirety and remain unedited, unaltered and unchanged in any way; and (4) no monetary charge can be made for the reproduced materials or anydocument containing them; however, a reasonable charge to cover only the cost of reproduction and distribution may be charged. Private entitiesor persons located in Texas that are not Texas public school districts, Texas Education Service Centers, or Texas charter schools or any entity,whether public or private, educational or non-educational, located outside the state of Texas MUST obtain written approval from TEA and willbe required to enter into a license agreement that may involve the payment of a licensing fee or a royalty. For information contact: CopyrightsOffice, Texas Education Agency, 1701 N. Congress Ave., Austin, TX 78701-1494; phone 512-463-9041; email: Copyrights@tea.texas.gov.

      What do you think about this?

    1. Reviewer #3 (Public review):

      Hon et al. investigated the role of BNST CRF signaling in modulating phasic and sustained fear in male and female mice. They found that partial and full fear conditioning had similar effects in both sexes during conditioning and during recall. However, males in the partially reinforced fear conditioning group showed enhanced acoustic startle, compared to the fully reinforced fear conditioning group, an effect not seen in females. Using fiber photometry to record calcium activity in all BNST neurons, the authors show that the BNST was responsive to foot shock in both sexes and both conditioning groups. Shock response increased over the session in males in the fully conditioned fear group, an effect not observed in the partially conditioned fear group. This effect was not observed in females. Additionally, tone onset resulted in increased BNST activity in both male groups, with the tone response increasing over time in the fully conditioned fear group. This effect was less pronounced in females, with partially conditioned females exhibiting a larger BNST response. During recall in males, BNST activity was suppressed below baseline during tone presentations and was significantly greater in the partially conditioned fear group. Both female groups showed an enhanced BNST response to the tone that slowly decayed over time. Next, they knocked CRF in the BNST to examine its effect on fear conditioning, recall and anxiety-like behavior after fear. They found no effect of the knockdown in either sex or group during fear conditioning. During fear recall, BNST CRF knockdown lead to an increase in freezing in only the partially conditioned females. In the anxiety-like behavior tasks, BNST CRF knockdown lead to increased anxiolysis in the partially reinforced fear male, but not in females. Surprisingly, BNST CRF knockdown increased startle response in fully conditioned, but not partially conditioned males. An effect not observed in either female group. In a final set of experiments, the authors single photon calcium imaging to record BNST CRF cell activity during fear conditioning and recall. Approximately, 1/3 of BNST CRF cells were excited by shock in both sexes, with the rest inhibited and no differences were observed between sexes or group during fear conditioning. During recall, BNST CRF activity decreased in both sexes, an effect pronounced in male and female fully conditioned fear groups.

      Overall, these data provide novel, intriguing evidence in how BNST CRF neurons may encode phasic and sustained fear differentially in males and females. The experiments were rigorous. My biggest concerns I have regard the interpretations and some conclusions from this data set, which I have stated below.

      (1) It was surprising to see minimal and somewhat conflicting behavioral effects due to BNST CRF knockdown. The authors provide a representative image and address this in the conclusion. They mention the role of local vs projection CRF circuits as well as the role of GABA. I don't think those experiments are necessary for this manuscript. However, it may be worthwhile to see through in situ hybridization or IHC, to see BNST CRF levels after both full and partial conditioned fear paradigms. Additionally, it would help to see a quantification of the knockdown of the animals. The authors can add a figure showing deltaF/F changes from control.

      (2) Related to the previous point, it was surprising to see an effect of the CRF deletion in the full fear group compared to the partial fear in the acoustic startle task. To strengthen the conclusion about differential recruitment of CRF during phasic and sustained fear, the experiment in my previous point could help elucidate that. Conversely, intra-BNST administration of a CRF antagonist into the BNST before the acoustic startle after both conditioning tasks could also help. Or patch from BNST CRF neurons after the conditioning tasks to measure intrinsic excitability. Not all these experiments are needed to support the conclusion, it's some examples.

      (3) In Figure 5 F and K, the authors report data combined for both part and full fear conditioning. Were there any differences between the number of excited or inhibited neurons b/t the conditioning groups? Also, can the authors separate male and female traces in Fig 5 E and P?

      (4) Also, regarding the calcium imaging data, what was the average length of a transient induced by shock? Were there any differences between the sexes?

    2. Author response:

      The following is the authors’ response to the original reviews

      Public Reviews:

      Reviewer #1 (Public Review):

      The aim of this study is to test the overarching hypothesis that plasticity in BNST CRF neurons drives distinct behavioral responses to unpredictable threat in males and females. The manuscript provides evidence for a possible sex-specific role for CRF-expressing neurons in the BNST in unpredictable aversive conditioning and subsequent hypervigilance across sexes. As the authors note, this is an important question given the high prevalence of sex differences in stress-related disorders, like PTSD, and the role of hypervigilance and avoidance behaviors in these conditions. The study includes in vivo manipulation, bulk calcium imaging, and cellular resolution calcium imaging, which yield important insights into cell-type specific activity patterns. However, it is difficult to generate an overall conclusion from this manuscript, given that many of the results are inconsistent across sexes and across tests and there is an overall lack of converging evidence. For example, partial conditioning yields increased startle in males but not females, yet, CRF KO only increases startle response in males after full conditioning, not partial, and CRF neurons show similar activity patterns between partial and full conditioning across sexes. Further, while the study includes a KO of CRF, it does not directly address the stated aim of assessing whether plasticity in CRF neurons drives the subsequent behavioral effects unpredictable threat.

      We appreciate the reviewer’s summary and agree that there is a large amount of complexity to the results, and that it was difficult to generate a simple model/conclusion to summarize our work. This is the unfortunate side effect of looking across both sexes at different conditioning paradigms, however, we believe that it is important to convey this information to the field even without a simple answer.  Our data reinforces the very important findings from the Maren and Holmes groups that partial fear is a different process than full fear, and that the BNST plays a differential role here. We have reworded the manuscript to better convey this complexity.

      A major strength of this manuscript is the inclusion of both males and females and attention to possible behavioral and neurobiological differences between them throughout. However, to properly assess sex-differences, sex should be included as a factor in ANOVA (e.g. for freezing, startle, and feeding data in Figure 1) to assess whether there is a significant main effect or interaction with sex. If sex is not a statistically significant factor, both sexes should be combined for subsequent analyses. See, Garcia-Sifuentes and Maney, eLife 2021 https://elifesciences.org/articles/70817. There are additional cases where t-tests are used to compare groups when repeated measures ANOVAs would be more appropriate and rigorous.

      We agree with the reviewer that this is the more appropriate analysis and have changed the analysis and figures throughout the revised manuscript to better assess sex differences as well as differences between fear conditions.

      Additionally, it's unclear whether the two sexes are equally responsive to the shock during conditioning and if this is underlying some of the differences in behavioral and neuronal effects observed. There are some reports that suggest shock sensitivity differs across sexes in rodents, and thus, using a standard shock intensity for both males and females may be confounding effects in this study.

      This is a great point. We have conducted appropriate analysis (Sex by Tone Repeated measures two-way ANOVAS for each of the groups: Ctrl, Full, Part) and there are no sex differences in freezing between males and females. The extent of conditioning is not different between the groups suggesting that if there was a difference in shock sensitivity, it is not driving any discernible differences in behavioral performance. However, it is possible that the experience of the shock differs for the animals even in the absence of any measurable behavior.

      The data does not rule out that BNST CRF activity is not purely tracking the mobility state of the animal, given that the differences in activity also track with differences in freezing behavior. The data shows an inverse relationship between activity and freezing. This may explain a paradox in the data which is why males show a greater suppression of BNST activity after partial conditioning than full conditioning, if that activity is suspected to drive the increased anxiety-like response. Perhaps it reflects that activity is significantly suppressed at the end of the conditioning session because animals are likely to be continuously freezing after repeated shock presentations in that context. It would also explain why there is less of a suppression in activity over the course of the recall session, because there is less freezing as well during recall compared with conditioning.

      While it is possible that the BNST may be tracking activity, we believe it is not purely tracking mobility state. For instance, while freezing increases across tone exposures in Part fear regardless of sex, males show an increase while females show a reduction in BNST response during tone 5 (Fig 2K). The data the reviewer refers to showing the inverse relationship with BNST activity and freezing would have suggested the opposite response if it were purely tracking the mobility state of the animal. This is also the case with BNST<sup>CRF</sup> activity to first and last tone during recall. Despite the suppression of activity over the course of recall (Fig 5K), we see an increase in BNST<sup>CRF</sup> tone response when comparing tone 1 and 6 in males and a decrease in females (Fig 6M), again suggesting the BNST is responding to more than just activity.

      A mechanistic hypothesis linking BNST CRF neurons, the behavioral effects observed after fear conditioning, and manipulation of CRF itself are not clearly addressed here.

      We disagree with this assertion. The data suggests a model in which males respond with increased arousal and Part fear males show persistent activation of the BNST and BNST<sup>CRF</sup> neurons during fear conditioning and recall while female Part fear mice show the opposite response. This female response differs from what the field believes to be the role of the BNST in sustained fear. Additionally, we show that CRF knockdown is not involved in fear differentiation or fear expression in males, while it enhances fear learning and recall in females. We have reworded the manuscript to highlight these novel findings.

      Reviewer #2 (Public Review):

      This study examined the role of CRF neurons in the BNST in both phasic and sustained fear in males and females. The authors first established a differential fear paradigm whereby shocks were consistently paired with tones (Full) or only paired with tones 50% of the time (Part), or controls who were exposed to only tones with no shocks. Recall tests established that both Full and Part conditioned male and female mice froze to the tones, with no difference between the paradigms. Additional studies using the NSF and startle test, established that neither fear paradigm produced behavioral changes in the NSF test, suggesting that these fear paradigms do not result in an increase in anxiety-like behavior. Part fear conditioning, but not Full, did enhance startle responses in males but not females, suggesting that this fear paradigm did produce sustained increases in hypervigilance in males exclusively.

      Thank you for this clear summary of the behavioral work.

      Photometry studies found that while undifferentiated BNST neurons all responded to shock itself, only Full conditioning in males lead to a progressive enhancement of the magnitude of this response. BNST neurons in males, but not females, were also responsive to tone onset in both fear paradigms, but only in Full fear did the magnitude of this response increase across training. Knockdown of CRF from the BNST had no effect on fear learning in males or females, nor any effect in males on fear recall in either paradigm, but in females enhanced both baseline and tone-induced freezing only in Part fear group. When looking at anxiety following fear training, it was found in males that CRF knockdown modulated anxiety in Part fear trained animals and amplified startle in Fully trained males but had no effect in either test in females. Using 1P imaging, it was found that CRF neurons in the BNST generally decline in activity across both conditioning and recall trials, with some subtle sex differences emerging in the Part fear trained animals in that in females BNST CRF neurons were inhibited after both shock and omission trials but in males this only occurred after shock and not omission trials. In recall trials, CRF BNST neuron activity remained higher in Part conditioned mice relative to Full conditioned mice.

      Overall, this is a very detailed and complex study that incorporates both differing fear training paradigms and males and females, as well as a suite of both state of the art imaging techniques and gene knockdown approaches to isolate the role and contributions of CRF neurons in the BNST to these behavioral phenomena. The strengths of this study come from the thorough approach that the authors have taken, which in turn helped to elucidate nuanced and sex specific roles of these neurons in the BNST to differing aspects of phasic and sustained fear. More so, the methods employed provide a strong degree of cellular resolution for CRF neurons in the BNST. In general, the conclusions appropriately follow the data, although the authors do tend to minimize some of the inconsistencies across studies (discussed in more depth below), which could be better addressed through discussion of these in greater depth. As such, the primary weakness of this manuscript comes largely from the discussion and interpretation of mixed findings without a level of detail and nuance that reflects the complexity, and somewhat inconsistency, across the studies. These points are detailed below:

      - Given the focus on CRF neurons in the BNST, it is unclear why the photometry studies were performed in undifferentiated BNST neurons as opposed to CRF neurons specifically (although this is addressed, to some degree, subsequently with the 1P studies in CRF neurons directly). This does limit the continuity of the data from the photometry studies to the subsequent knockdown and 1P imaging studies. The authors should address the rationale for this approach so it is clear why they have moved from broader to more refined approaches.

      The reviewer raises a good point.  We did some preliminary photometry studies with BNST CRF neurons and found that there was poor time locked signal. We reasoned that this was due to the heterogeneity of the cell activity, as we saw in our previous publication (Yu et al). Because of this, we moved to the 1p imaging work in place of continued BNST CRF photometry. We have also reworded the manuscript to better discuss the complexities and inconsistencies in findings across the studies.

      - The CRF KD studies are interesting, but it remains speculative as to whether these effects are mediated locally in the BNST or due to CRF signaling at downstream targets. As the literature on local pharmacological manipulation of CRF signaling within the BNST seems to be largely performed in males, the addition of pharmacological studies here would benefit this to help to resolve if these changes are indeed mediated by local impairments in CRF release within the BNST or not. While it is not essential to add these experiments, the manuscript would benefit from a more clear description of what pharmacological studies could be performed to resolve this issue.

      We agree with the reviewer that the addition of this experiment would be highly informative for differentiating the role of CRF in the BNST. This is something that will need to be considered moving forward and we have added this as a point of discussion.

      - While I can appreciate the authors perspective, I think it is more appropriate to state that startle correlates with anxiety as opposed to outright stating that startle IS anxiety. Anxiety by definition is a behavioral cluster involving many outputs, of which avoidance behavior is key. Startle, like autonomic activation, correlates with anxiety but is not the same thing as a behavioral state of anxiety (particularly when the startle response dissociates from behavior in the NSF test, which more directly tests avoidance and apprehension). Throughout the manuscript the use of anxiety or vigilance to describe startle becomes interchangeable, but then the authors also dissociate these two, such as in the first paragraph of the discussion when stating that the Part fear paradigm produces hypervigilance in males without influencing fear or anxiety-like behaviors. The manuscript would benefit from harmonization of the language used to operationally define these behaviors and my recommendation would be to remain consistent with the description that startle represents hypervigilance and not anxiety, per se.

      The reviewer raises an excellent point, we have clarified in the revised manuscript.

      - The interpretation of the anxiety data following CRF KD is somewhat confusing. First, while the authors found no effect of fear training on behavior in the NSF test in the initial studies, now they do, however somewhat contradictory to what one would expect they found that Full fear trained males had reduced latency to feed (indicative of an anxiolytic response), which was unaltered by CRF KD, but in Part fear (which appeared to have no effect on its own in the NSF test), KD of CRF in these animals produced an anxiolytic effect. Given that the Part fear group was no different from control here it is difficult to interpret these data as now CRF KD does reduce latency to feed in this group, suggesting that removal of CRF now somehow conveys an anxiolytic response for Part fear animals. In the discussion the authors refer to this outcome as CRF KD "normalizing" the behavior in the NSF test of Part fear conditioned animals as now it parallels what is seen after Full fear, but given that the Part fear animals with GFP were no different then controls (and neither of these fear training paradigms produced any effect in the NSF test in the first arm of studies), it seems inappropriate to refer to this as "normalization" as it is unclear how this is now normalized. Given the complexity of these behavioral data, some greater depth in the discussion is required to put these data in context and describe the nuance of these outcomes, in particular a discussion of possible experimental factors between the initial behavioral studies and those in the CRF KD arm that could explain the discrepancy in the NSF test would be good (such as the inclusion of surgery, or other factors that may have differed between these experiments). These behavioral outcomes are even more complex given that the opposite effect was found in startle whereby CRF KD amplified startle in Full trained animals. As such, this portion of the discussion requires some reworking to more adequately address the complexity of these behavioral findings.

      The reviewer raises a good point, and we agree that there are many inconsistencies in the behaviors. We believe it is still good to show these results but have expanded the manuscript on potential reasons for these behavioral inconsistencies.

      Reviewer #3 (Public Review):

      Hon et al. investigated the role of BNST CRF signaling in modulating phasic and sustained fear in male and female mice. They found that partial and full fear conditioning had similar effects in both sexes during conditioning and during recall. However, males in the partially reinforced fear conditioning group showed enhanced acoustic startle, compared to the fully reinforced fear conditioning group, an effect not seen in females. Using fiber photometry to record calcium activity in all BNST neurons, the authors show that the BNST was responsive to foot shock in both sexes and both conditioning groups. Shock response increased over the session in males in the fully conditioned fear group, an effect not observed in the partially conditioned fear group. This effect was not observed in females. Additionally, tone onset resulted in increased BNST activity in both male groups, with the tone response increasing over time in the fully conditioned fear group. This effect was less pronounced in females, with partially conditioned females exhibiting a larger BNST response. During recall in males, BNST activity was suppressed below baseline during tone presentations and was significantly greater in the partially conditioned fear group. Both female groups showed an enhanced BNST response to the tone that slowly decayed over time. Next, they knocked CRF in the BNST to examine its effect on fear conditioning, recall and anxiety-like behavior after fear. They found no effect of the knockdown in either sex or group during fear conditioning. During fear recall, BNST CRF knockdown lead to an increase in freezing in only the partially conditioned females. In the anxiety-like behavior tasks, BNST CRF knockdown lead to increased anxiolysis in the partially reinforced fear male, but not in females. Surprisingly, BNST CRF knockdown increased startle response in fully conditioned, but not partially conditioned males. An effect not observed in either female group. In a final set of experiments, the authors single photon calcium imaging to record BNST CRF cell activity during fear conditioning and recall. Approximately, 1/3 of BNST CRF cells were excited by shock in both sexes, with the rest inhibited and no differences were observed between sexes or group during fear conditioning. During recall, BNST CRF activity decreased in both sexes, an effect pronounced in male and female fully conditioned fear groups.

      Overall, these data provide novel, intriguing evidence in how BNST CRF neurons may encode phasic and sustained fear differentially in males and females. The experiments were rigorous.

      We thank you for this positive review of our manuscript.

      Recommendations for the authors:

      Reviewer #1 (Recommendations For The Authors):

      There are several graphs representing different analyses of (presumably) the same group of subjects, but which have different N/group. For example, in Figure 2:

      (1) Fig 2P seems to have n=10 in Part Male group (Peak), but 2Q only has n=9 in Part Male group (AUC)

      (2) Fig 2S seems to have n=10 in Part Female group (Peak), but 2T only has n=7 in Part Female group (AUC)

      (3) Fig 2G (Tone Resp) has n=6 Full Males but 2F (Tone Resp), 2H (Shock Resp), and 2I (Shock Resp) have n=7 Full Males

      (4) Fig 2K (Tone Resp) has n=7 Full Females but 2L (Tone Resp), 2M (Shock Resp), and 2N (Shock Resp) have n=8 Full Females

      (5) Fig 2L (Tone Resp) has n=9 Part Females but 2K (Tone Resp), 2M (Shock Resp), and 2N (Shock Resp) have n=10 Part Females

      It's possible that this is just due to overlapping individual data points which are made harder to see due to the low resolution of the figures. If so, this can be easily rectified. However, there may also be subjects missing from some analyses which must be clarified or corrected.

      We thank you for catching these. We have gone through and fixed any issues with data points and have added statistics and exclusions in datasets to figure legends to further explain inconsistencies.

      Regarding statistical tests:

      (2) Data in Figs 2G and 2I should be analyzed using a two-way RM ANOVA.

      We have now included sex as a factor in most of our analysis and are now using appropriate statistical tests.

      (3) Data in Fig 3K should be analyzed using a two-way RM ANOVA.

      We are now using appropriate statistical tests.

      Calcium activity in response to the shock during conditioning and in response to the tone during recall should be included in Figure 5. Given partial and full animals also receive unequal presentations of the cue, it would be useful to see the effects trial by trial or normalized to the first 3 presentations only.

      The reviewer raises a great point. We have changed this figure and have now added the response to shock and tones. Since we are most interested in the difference between sustained and phasic fear, we decided to compare tone 3 in Full fear and tone 4 in Part fear, which differ in the ambiguity of their cue and only have one tone difference.

      Histology maps should be included for all experiments depicting viral spread and implant location for all animals, in addition to the included representative histology images. These can be placed in the supplement.

      We agree this is helpful. While we have confirmed all of the experiments are hits, the tissue is no longer in condition for this analysis.

      Referring to the quantification of peaks in fiber photometry and cellular resolution calcium imaging data as "spikes" is a bit misleading given the inexact relationship between GCAMP sensor dynamics/calcium binding and neuronal action potentials, perhaps calling it "event" frequency would be more clear.

      We have changed the references of spikes to events as suggested.

      The legend for Figure 2S is mislabeled as A.

      Thank you for catching this mistake, it has been fixed.

      The methods refer to CRFR1 fl/fl animals but it seems no experiments used these animals, only CRF fl/fl.

      We have fixed this, thank you.

      Reviewer #2 (Recommendations For The Authors):

      As stated in the public review, while I think the addition of local pharmacological studies blocking CRF1 and 2 receptors in the BNST in both males and females, done under the same conditions as all of the other testing herein, would help to resolve some of the speculation of interpreting the CRF KD data, I dont think these studies are essential to do, but it would be good for the authors to more explicitly state what studies could be done and how they could facilitate interpretation of these data.

      Thank you for this suggestion. We have added this discussion into the manuscript.

      Asides from this, my other recommendations for the authors are to more clearly address the discrepancies in behavioral outcomes across studies and explicitly describe their rationale for the sequence of experiments performed and to harmonize their operationalization of how they define anxiety.

      Again, we appreciate these great suggestions. We have added more discussion on the behavioral discrepancies as well as rationale for the experiments. We have also changed the wording to remain consistent that the NSF test relates to anxiety and the Startle test relates to vigilance.

      - In Figure 2, Panel S is listed as Panel A in the caption and should be corrected.

      Thank you for catching this mistake, we have fixed it.

      Reviewer #3 (Recommendations For The Authors):

      My biggest concerns I have regard the interpretations and some conclusions from this data set, which I have stated below.

      (1) It was surprising to see minimal and somewhat conflicting behavioral effects due to BNST CRF knockdown. The authors provide a representative image and address this in the conclusion. They mention the role of local vs projection CRF circuits as well as the role of GABA. I don't think those experiments are necessary for this manuscript. However, it may be worthwhile to see through in situ hybridization or IHC, to see BNST CRF levels after both full and partial conditioned fear paradigms. Additionally, it would help to see a quantification of the knockdown of the animals.

      Thank you for these great suggestions. We will consider these for future experiments. We piloted out some CRF sensor experiments to probe this, but it was unclear if the signal to noise for the sensor was sufficient. We hope to do more of this in the future if we ever manage to get funding for this work.

      The authors can add a figure showing deltaF/F changes from control.

      We did not have control mice in these in-vivo experiments Our main interests lie in understanding the differences in Full and Part Fear conditioning paradigms specifically.

      (2) Related to the previous point, it was surprising to see an effect of the CRF deletion in the full fear group compared to the partial fear in the acoustic startle task. To strengthen the conclusion about differential recruitment of CRF during phasic and sustained fear, the experiment in my previous point could help elucidate that. Conversely, intra-BNST administration of a CRF antagonist into the BNST before the acoustic startle after both conditioning tasks could also help. Or patch from BNST CRF neurons after the conditioning tasks to measure intrinsic excitability. Not all these experiments are needed to support the conclusion, it's some examples.

      We thank the reviewer for these suggestions and agree that these are important experiments. We will consider this in future experiments exploring the role of BNST CRF in fear conditioning.

      (3) In Figure 5 F and K, the authors report data combined for both part and full fear conditioning. Were there any differences between the number of excited or inhibited neurons b/t the conditioning groups?

      We are only looking at the first shock exposure in these figures. These were combined because the first tone and shock exposure is identical in Full and Part fear conditioning. Differences in these behavioral paradigms emerge after Tone 3 exposure, where Part fear does not receive a shock while Full fear does.

      Also, can the authors separate male and female traces in Fig 5 E and P?

      Traces in Fig E are from females only. We did not include male traces because males and females had identical responses to first shock, and we felt only one trace was needed as an example. Traces in Figure P are from males. We did not show female traces because females did not show differential effects from baseline to end.

      (4) Also, regarding the calcium imaging data, what was the average length of a transient induced by shock? Were there any differences between the sexes?

      We have many cells in each condition, and the length of traces after shock were all different and hard to quantify, as for example, sometimes cells were active before shock and thus trace length would be difficult to quantify. Therefore, to keep consistency and reduce ambiguity regarding trace lengths, we focused on keeping the time consistent across mice and focused on the 10 second window post shock to be consistent across conditions.

    1. Reviewer #2 (Public review):

      Strengths:

      The authors have done a nice job providing additional data in response to reviewer feedback. I appreciate that accuracy plots are now included, as well as a separate analysis where differences in parameter estimates are performed for participants whose accuracy data were above chance levels. I also appreciate the new figure with the sphere ROIs for each participant, as they help us appreciate anatomical variability in the peak response separately for each task.

      I have four concerns related to the weaknesses of the study:

      (1) Although the results still hold when removing participants whose accuracy was 50% or less, a major limitation of this study is that participants made a button press response only to the last trial in a block. This is problematic because a participant could get all trials in a block correct except for the last one, or a participant could get all trials in a block wrong, and performance would be considered equivalent-as a consequence, it is not possible for one to know if participants who are at chance are performing differently from participants who are not at chance, and it is not possible to control for variance in reaction time (a concern also raised by reviewer 3).

      (2) My second concern relates to the way in which the data are interpreted based on thresholding. There is above-threshold activation in the left SMG for all tasks except the fluid cognition task. The z-scores associated with significant voxels in Figure 3 are very strong (minimum z is 6). If one were to relax the threshold of the group level maps to, e.g., p < .001, uncorrected, FDR q < .05, or FWER of .10, there will be overlapping voxels outside the SMG. The discussion of the left SMG in the manuscript is prominent and narrowly construed-the left SMG is discussed as if it were 'the' region: "This confirms that the technical-reasoning network depends upon the recruitment of the left area PF, even if additional cognitive processes involving other peripheral brain areas can be engaged depending on the task" (pp. 9). My intuition is there will be numerous other areas of overlap when using a threshold that is still highly significant (e.g., z = 3 or 4). So, for proponents of the technical reasoning hypothesis, is there a counterfactual or alternative brain area/network/system not in the left SMG?

      (3) I like the new Figure 6 because it shows variability in the location of the peak coordinate at the level of single participants. And, indeed, there's considerable variability that is typical when localizing ROIs in single participants. My concern is the level at which hypothesis testing is performed. An independent SMG ROI is used to extract parameter estimates and correlate responses between tasks to show a pattern of correlation that comports with a technical reasoning model of left SMG function. This is a fine approach but it does not rule out the so-called 'same region different function' interpretation because it relies on correlation-one cannot reverse infer that the left SMG is carrying out the same function across different tasks because the response in that area is more strongly correlated between certain tasks. This finding points to that possibility and makes interesting predictions for future studies to pursue, but it cannot tell us whether common functions in the left SMG are involved in each task. E.g., one interesting prediction for future studies is to test if patients with lesions to this site are disproportionately more inaccurate in the experimental condition of the mechanical problem solving task, the psychotechnical task, the mentalizing task, but not the fluid cognition task.

      (4) I appreciated the approach to testing the adjacency interpretation by showing the sphere and peak Y coordinate across the tasks. It is interesting that across the groups, there is no difference in the peak Y coordinate of the psychotechnical task and both conditions of the mentalizing task, whereas the peak Y coordinate in the fluid intelligence task is more anterior in the post-central gyrus across participants (why is that?). But why restrict the analysis to just the Y coordinate? A rigorous way to test the adjacency hypothesis is to compute Euclidean distance among X, Y, and Z coordinates between any two tasks collected in the same participant. One can then test if the Euclidean distance between, e.g., the psychotechnical task and one condition of the mentalizing task is smaller than the Euclidean distance between the psychotechnical task and the fluid cognition task. Similarly, one can test whether Euclidean distance between the INT and PHY conditions of the mentalizing task is smaller than the Euclidean distance between the INT and psychotechnical task or PHY and psychotechnical task. There is no justification to restrict this analysis to the anterior-posterior dimension only.

    2. Reviewer #3 (Public review):

      The authors have responded very thoughtfully to many of the points raised, and the revised manuscript will make a useful contribution to our understanding of some of the computations performed by area PF. In particular, the investigators' addition of analyses of peak activations, their additional clarifications that area PF is likely to be part of a larger network concerned with technical reasoning, and their responses to the reviewers' concerns about differential task difficulty have strengthened the conclusions that can be drawn from the study.

      The authors' response does not completely mitigate the concern noted by all 3 reviewers that the control tasks were easier than their corresponding experimental tasks (for everything but the fluid cognition task). The specific trouble with this issue can be appreciated by looking at Figure 4A, for example, which shows that area PF was activated for many individuals in both the control task and the experimental mechanical problem-solving task, but more so for the latter. Since the experimental task was harder (and more trial time was likely spent on task trying to solve it), the concern remains that area PF was driven harder by the experimental task in part due to the more challenging nature of that task.

      The revised manuscript counters that the fluid cognition task was also harder than its control condition, yet did not activate PF more than its control condition. But this response seems to sidestep the central point of the reviewers' concerns: the fundamental computations that underlie the technical reasoning tasks may also be present in the respective (non technical-reasoning-based) control tasks and drive area PF activations to greater or lesser degrees based on how much they tax those computations. The fact that the fluid cognition experimental task and control task are not differentially difficult does not mitigate this concern, it just suggests that neither of those tasks tap the same fundamental computations, whatever they may be. (As an added note, Figures 2 and 4 show that both the PHYS-only and INT+PHYS mentalizing tasks only weakly activated PF, and both of these tasks were easier than the other technical cognition tasks).

      The new ROI analysis with removal of subjects who performed below 50% in the revised manuscript is somewhat helpful, but there are two remaining issues: 1) chance performance is defined by a binomial test in this case, so scores somewhat above 50% may still be at chance depending on the number of items, and thus there may have been subjects who were not removed who could not perform the tasks; 2) it would have been convincing to include accuracy as a covariate in the modeling of BOLD parameter estimates for the remaining above-chance subjects to ensure that all reported effects remain once differential task difficulty is taken into account. It also appears that the legend for Figure S2, which indicates that the figure includes just subjects who performed at or below 50%, may not be correct; does the figure instead show data from subjects who performed at or above 50%?

      Despite these remaining concerns, there are many aspects of this revised study that render it a useful contribution that will likely spur further research in this very interesting area.

    3. Author response:

      The following is the authors’ response to the original reviews

      Reviewer #1 (Public Review):

      Summary:

      In this study, Osiurak and colleagues investigate the neurocognitive basis of technical reasoning. They use multiple tasks from two neuroimaging studies and overlap analysis to show that the area PF is central for reasoning, and plays an essential role in tool-use and non-tool-use physical problem-solving, as well as both conditions of mentalizing task. They also demonstrate the specificity of the technical reasoning and find that the area PF is not involved in the fluid-cognition task or the mentalizing network (INT+PHYS vs. PHYS-only). This work suggests an understanding of the neurocognitive basis of technical reasoning that supports advanced technologies.

      Strengths:

      -The topic this study focuses on is intriguing and can help us understand the neurocognitive processes involved in technical reasoning and advanced technologies.

      -The researchers obtained fMRI data from multiple tasks. The data is rich and encompasses the mechanical problem-solving task, psychotechnical task, fluid-cognition task, and mentalizing task.

      -The article is well written.

      We sincerely thank Reviewer 1 for their positive and very helpful comments, which helped us improve the MS. Thank you.

      Weaknesses:

      - Limitations of the overlap analysis method: there are multiple reasons why two tasks might activate the same brain regions. For instance, the two tasks might share cognitive mechanisms, the activated regions of the two tasks might be adjacent but not overlapping at finer resolutions, or the tasks might recruit the same regions for different cognition functions.

      Thus, although overlap analysis can provide valuable information, it also has limitations.

      Further analyses that capture the common cognitive components of activation across different

      tasks are warranted, such as correlating the activation across different tasks within subjects for a region of interest (i.e. the PF).

      We thank Reviewer 1 for this comment. We added new analyses to address the two alternative interpretations stressed here by Reviewer 1, namely, the same-region-but-differentfonction interpretation and the adjacency interpretation. The new analyses ruled out both alternative interpretations, thereby reinforcing our interpretation.

      “The conjunction analysis reported was subject to at least two key limitations that needed to be overcome to assure a correct interpretation of our findings. The first was that the tasks could recruit the same regions for different cognition functions (same-region-but-different-function interpretation). The second was that the activated regions of the different tasks could be adjacent but did not overlap at finer resolutions (adjacency interpretation). We tested the same-region-but-different-function interpretation by conducting additional ROI analyses, which consisted of correlating the specific activation of the left area PF (i.e., difference in terms of mean Blood-Oxygen Level Dependent [BOLD] parameter estimates between the experimental condition minus the control condition) in the psychotechnical task, the fluid-cognition task, and the PHYS-Only and INT+PHYS conditions of the mentalizing task. This analysis did not include the mechanical problem-solving task because the sample of participants was not the same for this task. As shown in Fig. 5, we found significant correlations between all the tasks that were hypothesized as recruiting technical reasoning, i.e., the psychotechnical task and the PHYS-Only and INT+PHYS conditions of the mentalizing task (all p < .05). By contrast, no significant correlation was obtained between these three tasks and the fluid-cognition task (all p > .15). This finding invalidates the same-region-but-different-function interpretation by revealing a coherent pattern in the activation of the left area PF in situations in which participants were supposed to reason technically. We examined the adjacency interpretation by analysing the specific locations of individual peak activations within the left area PF ROI for the mechanical problemsolving task, the psychotechnical task, the fluid-cognition task, and the PHYS-Only and INT+PHYS conditions of the mentalizing task. These peaks, which corresponded to the maximum value of activation obtained for each participant within the left area PF ROI, are reported in Fig. 6. As can be seen, the peaks of the fluid-cognition task were located more anteriorly, in the left area PFt (Parietal Ft) and the postcentral cortex, compared to the peaks of the other four tasks, which were more posterior, in the left area PF. Statistical analyses based on the y coordinates of the individual activation peaks confirmed this description (Fig. 6). Indeed, the y coordinates of the peaks of the mechanical problem-solving task, the psychotechnical task and the PHYS-Only and INT+PHYS conditions of the mentalizing task were posterior to the y coordinates of the peaks of the fluid-cognition task (all p < .05), whereas no significant differences were reported between the four tasks (all p > .05). These findings speak against the adjacency interpretation by revealing that participants recruited the same part of the left area PF to perform tasks involving technical reasoning.” (p. 11-13)

      Control tasks may be inadequate: the tasks may involve other factors, such as motor/ actionrelated information. For the psychotechnical task, fluid-cognition task, and mentalizing task, the experiment tasks need not only care about technical-cognition information but also motor-related information, whereas the control tasks do not need to consider motor-related information (mainly visual shape information). Additionally, there may be no difference in motor-related information between the conditions of the fluid-cognition task. Therefore, the regions of interest may be sensitive to motor-related information, affecting the research conclusion.

      We thank Reviewer 1 for this comment. We added a specific section in the discussion that addresses this limitation.

      “The second limitation concerns the alternative interpretation that the left area PF is not central to technical reasoning but to the storage of sensorimotor programs about the prototypical manipulation of common tools. Here we show that the left area PF is recruited even in situations in which participants do not have to process common manipulable tools. For instance, some items of the psychotechnical task consisted of pictures of tractor, boat, pulley, or cannon. The fact that we found a common activation of the left area PF in such tasks as well as in the mechanical problem-solving task, in which participants could nevertheless simulate the motor actions of manipulating novel tools, indicates that this brain area is not central to tool manipulation but to physical understanding. That being said, some may suggest that viewing a boat or a cannon is enough to incite the simulation of motor actions, so our tasks were not equipped to distinguish between the manipulation-based approach and the reasoning-based approach. We have already shown that the left area PF is more involved in tasks that focus on the mechanical dimension of the tool-use action (e.g., the mechanical interaction between a tool and an object) than its motor dimension (i.e., the interaction between the tool and the effector [e.g., 24, 40]). Nevertheless, we recognize that future research is still needed to test the predictions derived from these two approaches.” (p. 18-19)

      -Negative results require further validation: the cognitive results for the fluid-cognition task in the study may need more refinement. For instance, when performing ROI analysis, are there any differences between the conditions? Bayesian statistics might also be helpful to account for the negative results.

      We agree that our negative results required further validation. We conducted the ROI analyses suggested by Reviewer 1, which confirmed the initial whole-brain analyses.

      “Region of interest (ROI) results. We conducted additional analyses to test the robustness of our findings. One of our results was that we did not report any specific activation of the left area PF in the fluid-cognition task contrary to the mechanical problem-solving task, the psychotechnical task, and the PHYS-Only and INT+PHYS conditions of the mentalizing task. However, this negative result needed exploration at the ROI level. Therefore, we created a spherical ROI of the left area PF with a radius of 12 mm in the MNI standard space (–59; –31; 40). This ROI was literature-defined to ensure the independence of its selection (40). ROI results are shown in Fig. 4. The analyses confirmed the results obtained with the whole-brain analyses by indicating a greater activation of the left area PF in the mechanical problem-solving task, the psychotechnical task, and the PHYS-Only and INT+PHYS conditions of the mentalizing task (all p < .001), but not in the fluid-cognition task (p \= .35).” (p. 10-11)

      Reviewer #1 (Recommendations For The Authors):

      (1) I may not fully grasp some of the arguments. In the abstract, what does the term "intermediate-level" mean, and why is it an intermediate-level state? In the sentence "the existence of a specific cognitive module in the human brain dedicated to materiality", I cannot see a clear link between technical cognition and the word "materiality".

      We used the term materiality to refer to a potential human trait that allows us to shape the physical world according to our ends, by using, making tools and transmiting them to others. This is a reference to Allen et al. (2020; PNAS): “We hope this empirical domain and modeling framework can provide the foundations for future research on this quintessentially human trait: using, making, and reasoning about tools and more generally shaping the physical world to our ends” (p. 29309). Scientists (including archaeologists, economists, psychologists, neuroscientists) interested in human materiality have tended to focus on how we manipulate things according to our thought (motor cognition) or how we conceptualize our behaviour to transmit it to others (language, social cognition). However, little has been said on the intermediate level, that is, technical cognition. We added the term “technical cognition” here, which should help to make the connection more quickly.

      “Yet, little has been said about the intermediate-level cognitive processes that are directly involved in mastering this materiality, that is, technical cognition.” (p. 2)

      (2) The introduction could provide more details on why the issue of "generalizability and specificity" is important to address, to clarify the significance of the research question.

      We followed this comment and added a sentence to explain why it is important to address this research question. Again, we thank Reviewer 1 for their helpful comments.

      “Here we focus on two key aspects of the technical-reasoning hypothesis that remain to be addressed: Generalizability and specificity. If technical reasoning is a specific form of reasoning oriented towards the physical world, then it should be implicated in all (the generalizability question) and only (the specificity question) the situations in which we need to think about the physical properties of our world.” (p. 5)

      Reviewer #2 (Public Review):

      Summary:

      The goal of this project was to test the hypothesis that a common neuroanatomic substrate in the left inferior parietal lobule (area PF) underlies reasoning about the physical properties of actions and objects. Four functional MRI (fMRI) experiments were created to test this hypothesis. Group contrast maps were then obtained for each task, and overlap among the tasks was computed at the voxel level. The principal finding is that the left PF exhibited differentially greater BOLD response in tasks requiring participants to reason about the physical properties of actions and objects (referred to as technical reasoning). In contrast, there was no differential BOLD response in the left PF when participants engaged in fMRI variant of the Raven's progressive matrices to assess fluid cognition.

      Strengths:

      This is a well-written manuscript that builds from extensive prior work from this group mapping the brain areas and cognitive mechanisms underlying object manipulation, technical reasoning, and problem-solving. Major strengths of this manuscript include the use of control conditions to demonstrate there are differentially greater BOLD responses in area PF over and above the baseline condition of each task. Another strength is the demonstration that area PF is not responsive in tasks assessing fluid cognition - e.g., it may just be that PF responds to a greater extent in a harder condition relative to an easy condition of a task. The analysis of data from Task 3 rules out this alternative interpretation. The methods and analysis are sufficiently written for others to replicate the study, and the materials and code for data analysis are publicly available.

      We sincerely thank Reviewer 2 for their precious comments, which helped us improve the MS. 

      Weaknesses:

      The first weakness is that the conclusions of the manuscript rely on there being overlap among group-level contrast maps presented in Figure 2. The problem with this conclusion is that different participants engaged in different tasks. Never is an analysis performed to demonstrate that the PF region identified in e.g., participant 1 in Task 2 is the same PF region identified in Participant 1 in Task 4.

      We added new analyses that demonstrated that “the PF region identified in e.g., participant 1 in Task 2 is the same PF region identified in Participant 1 in Task 4”. We thank Reviewer 2 for this comment, because these new analyses reinforced our interpretation.

      “The conjunction analysis reported was subject to at least two key limitations that needed to be overcome to assure a correct interpretation of our findings. The first was that the tasks could recruit the same regions for different cognition functions (same-region-but-different-function interpretation). The second was that the activated regions of the different tasks could be adjacent but did not overlap at finer resolutions (adjacency interpretation). We tested the same-region-but-different-function interpretation by conducting additional ROI analyses, which consisted of correlating the specific activation of the left area PF (i.e., difference in terms of mean Blood-Oxygen Level Dependent [BOLD] parameter estimates between the experimental condition minus the control condition) in the psychotechnical task, the fluid-cognition task, and the PHYS-Only and INT+PHYS conditions of the mentalizing task. This analysis did not include the mechanical problem-solving task because the sample of participants was not the same for this task. As shown in Fig. 5, we found significant correlations between all the tasks that were hypothesized as recruiting technical reasoning, i.e., the psychotechnical task and the PHYS-Only and INT+PHYS conditions of the mentalizing task (all p < .05). By contrast, no significant correlation was obtained between these three tasks and the fluid-cognition task (all p > .15). This finding invalidates the same-region-but-different-function interpretation by revealing a coherent pattern in the activation of the left area PF in situations in which participants were supposed to reason technically. We examined the adjacency interpretation by analysing the specific locations of individual peak activations within the left area PF ROI for the mechanical problemsolving task, the psychotechnical task, the fluid-cognition task, and the PHYS-Only and INT+PHYS conditions of the mentalizing task. These peaks, which corresponded to the maximum value of activation obtained for each participant within the left area PF ROI, are reported in Fig. 6. As can be seen, the peaks of the fluid-cognition task were located more anteriorly, in the left area PFt (Parietal Ft) and the postcentral cortex, compared to the peaks of the other four tasks, which were more posterior, in the left area PF. Statistical analyses based on the y coordinates of the individual activation peaks confirmed this description (Fig. 6). Indeed, the y coordinates of the peaks of the mechanical problem-solving task, the psychotechnical task and the PHYS-Only and INT+PHYS conditions of the mentalizing task were posterior to the y coordinates of the peaks of the fluid-cognition task (all p < .05), whereas no significant differences were reported between the four tasks (all p > .05). These findings speak against the adjacency interpretation by revealing that participants recruited the same part of the left area PF to perform tasks involving technical reasoning.” (p. 11-13)

      A second weakness is that there is a variance in accuracy between tasks that are not addressed. It is clear from the plots in the supplemental materials that some participants score below chance (~ 50%). This means that half (or more) of the fMRI trials of some participants are incorrect. The methods section does not mention how inaccurate trials were handled. Moreover, if 50% is chance, it suggests that some participants did not understand task instructions and were systematically selecting the incorrect item.

      It is true that the experimental conditions were more difficult than the control conditions, with some participants who performed at or below 50% in the experimental conditions. We added a section in the MS to stress this aspect. To examine whether this potential difficulty effect biased our interpretation, we conducted new ROI analyses by removing all the participants who performed at or below the chance level. These analyses revealed the same results as when no participant was excluded, suggesting that this did not bias our interpretation.

      “As mentioned above, the experimental conditions of all the tasks were more difficult than their control conditions. As a result, the specific activation of the left area PF documented above could simply reflect that this area responds to a greater extent in a harder condition relative to an easy condition of a task. This interpretation is nevertheless ruled out by the results obtained with the fluid-cognition task. We did not report a specific activation of the left area PF in this task while its experimental condition was more difficult than its control condition. To test more directly this effect of difficulty, we conducted new ROI analyses by removing all the participants who performed at or below 50% (Fig. S2). These new analyses replicated the initial analyses by showing a greater activation of the left area PF in the mechanical problem-solving task, the psychotechnical task, and the PHYS-Only and INT+PHYS conditions of the mentalizing task (all p < .001), but not in the fluid-cognition task (p \= .48). In sum, the ROI analyses corroborated the wholebrain analyses and ruled out the potential effect of difficulty.” (p. 11)

      A third weakness is related to the fluid cognition task. In the fMRI task developed here, the participant must press a left or right button to select between 2 rows of 3 stimuli while only one of the 3 stimuli is the correct target. This means that within a 10-second window, the participant must identify the pattern in the 3x3 grid and then separately discriminate among 6 possible shapes to find the matching stimulus. This is a hard task that is qualitatively different from the other tasks in terms of the content being manipulated and the time constraints.

      We acknowledge that the fluid-cognition task involved a design that differed from the other tasks. However, this was also true for the other tasks, as the design also differed between the mechanical problem-solving task, the psychotechnical task, and the mentalizing task. Nevertheless, despite these distinctions, we found a consistent activation of the left area PF in these tasks with different designs including in the psychotechnical task, which seemed as difficult as the fluid-cognition task.

      “Region of interest (ROI) results. We conducted additional analyses to test the robustness of our findings. One of our results was that we did not report any specific activation of the left area PF in the fluid-cognition task contrary to the mechanical problem-solving task, the psychotechnical task, and the PHYS-Only and INT+PHYS conditions of the mentalizing task. However, this negative result needed exploration at the ROI level. Therefore, we created a spherical ROI of the left area PF with a radius of 12 mm in the MNI standard space (–59; –31; 40). This ROI was literature-defined to ensure the independence of its selection (40). ROI results are shown in Fig. 4. The analyses confirmed the results obtained with the whole-brain analyses by indicating a greater activation of the left area PF in the mechanical problem-solving task, the psychotechnical task, and the PHYS-Only and INT+PHYS conditions of the mentalizing task (all p < .001), but not in the fluid-cognition task (p \= .35).” (p. 10-11)

      In sum, this is an interesting study that tests a neuro-cognitive model whereby the left PF forms a key node in a network of brain regions supporting technical reasoning for tool and non-tool-based tasks. Localizing area PF at the level of single participants and managing variance in accuracy is critically important before testing the proposed hypotheses.

      We thank Reviewer 2 for this positive evaluation and their suggestions. As detailed in our response, our revision took into consideration both the localization of the left area PF at the level of single participants and the variance in accuracy. 

      Reviewer #2 (Recommendations For The Authors):

      Did the fMRI data undergo high-pass temporal filtering prior to modeling the effects of interest? Participants engaged in a long (17-24 minutes) run of fMRI data collection. Highpass filtering of the data is critically important when managing temporal autocorrelation in the fMRI response (e.g., see Shinn et al., 2023, Functional brain networks reflect spatial and temporal autocorrelation. Nature Neuroscience).

      Yes. We added this information.

      “Regressors of non-interest resulting from 3D head motion estimation (x, y, z translation and three axes of rotation) and a set of cosine regressors for high-pass filtering were added to the design matrix.” (p. 25-26)

      Including scales in Figure 2 would help the reader interpret the magnitude of the BOLD effects.

      We added this information in Figure 3 (Figure 2 in the initial version of the MS).

      It was difficult to inspect the small thumbnail images of the task stimuli in Figure 1. Higher resolution versions of those stimuli would help facilitate understanding of the task design and trial structure.

      We changed both Figure 1 and Figure S1.

      Reviewer #3 (Public Review):

      Summary:

      This manuscript reports two neuroimaging experiments assessing commonalities and differences in activation loci across mechanical problem-solving, technical reasoning, fluid cognition, and "mentalizing" tasks. Each task includes a control task. Conjunction analyses are performed to identify regions in common across tasks. As Area PF (a part of the supramarginal gyrus of the inferior parietal lobe) is involved across 3 of the 4 tasks, the investigators claim that it is the hub of technical cognition.

      Strengths:

      The aim of finding commonalities and differences across related problem-solving tasks is a useful and interesting one.

      The experimental tasks themselves appear relatively well-thought-out, aside from the concern that they are differentially difficult.

      The imaging pipeline appears appropriate.

      We thank Reviewer 3 for their constructive comments, which helped us improve the MS.

      Weaknesses:

      (1) Methodological

      As indicated in the supplementary tables and figures, the experimental tasks employed differ markedly in 1) difficulty and 2) experimental trial time. Response latencies are not reported (but are of additional concern given the variance in difficulty). There is concern that at least some of the differences in activation patterns across tasks are the result of these fundamental differences in how hard various brain regions have to work to solve the tasks and/or how much of the trial epoch is actually consumed by "on-task" behavior. These difficulty issues should be controlled for by 1) separating correct and incorrect trials, and 2) for correct trials, entering response latency as a regressor in the Generalized Linear Models, 3) entering trial duration in the GLMs.

      We thank Reviewer 3 for this comment. It is true that the experimental conditions were more difficult than the control conditions, with some participants who performed at or below 50% in the experimental conditions. We added a section in the MS to stress this aspect. We could not conduct new analyses by separating correct and incorrect trials because, for each task, participants had to respond only on the last item of the block. Therefore, we did not record a response for each event. Nevertheless, we could examine whether this potential difficulty effect biased our interpretation, by conducting new ROI analyses in which we removed all the participants who performed at or below the chance level. These analyses revealed the same results as when no participant was excluded, suggesting that this did not bias our interpretation. 

      “As mentioned above, the experimental conditions of all the tasks were more difficult than their control conditions. As a result, the specific activation of the left area PF documented above could simply reflect that this area responds to a greater extent in a harder condition relative to an easy condition of a task. This interpretation is nevertheless ruled out by the results obtained with the fluid-cognition task. We did not report a specific activation of the left area PF in this task while its experimental condition was more difficult than its control condition. To test more directly this effect of difficulty, we conducted new ROI analyses by removing all the participants who performed at or below 50% (Fig. S2). These new analyses replicated the initial analyses by showing a greater activation of the left area PF in the mechanical problem-solving task, the psychotechnical task, and the PHYS-Only and INT+PHYS conditions of the mentalizing task (all p < .001), but not in the fluid-cognition task (p \= .48). In sum, the ROI analyses corroborated the wholebrain analyses and ruled out the potential effect of difficulty.” (p. 11)

      A related concern is that the control tasks also differ markedly in the degree to which they were easier and faster than their corresponding experimental task. Thus, some of the control tasks seem to control much better for difficulty and time on task than others. For example, the control task for the psychotechnical task simply requires the indication of which array contains a simple square shape (i.e., it is much easier than the psychotechnical task), whereas the control task for mechanical problem-solving requires mentally fitting a shape into a design, much like solving a jigsaw puzzle (i.e., it is only slightly easier than the experimental task).

      It is true that some control conditions could be easier than other ones. These differences reinforced the common activation found in the left area PF in the tasks hypothesized as involving technical reasoning, because this activation survived irrespective of the differences in terms of experimental design. For us, the rationale is the same as for a meta-analysis, in which we try to find what is common to a great variety of tasks. The only detrimental consequence we identified here is that this difference explained why we did not report a specific activation of the left area PF in the fluid-cognition task, as if the left area PF was more responsive when the task was difficult. This possibility assumes that the experimental condition of the fluid-cognition task is much more difficult than its control condition compared to what can be seen in the other tasks. As Reviewer 2 stressed in Point 1, this interpretation is unlikely, because the differences between the experimental and control conditions were similar to the fluid-cognition task in the mechanical problem-solving and psychotechnical tasks. In addition, again, the new ROI analyses in which we removed all the participants who performed at or below the chance level in expetimental conditions reproduced our initital results.

      (2) Theoretical 

      The investigators seem to overlook prior research that does not support their perspective and their writing seems to lack scientific objectivity in places. At times they over-reach in the claims that can be made based on the present data. Some claims need to be revised/softened.

      As this comment is also mentioned below, please find our response to it below.

      Reviewer #3 (Recommendations For The Authors):

      (1) Because of the high level of detail, Figures 1 and S2 (particularly the mentalizing task and mechanical problem-solving task, and their controls) are very hard to parse, even when examined relatively closely. It is suggested that these figures be broken down into separate panels for Experiment 1 and Experiment 2 to facilitate understanding.

      We changed both Figure 1 and Figure S1.

      (2) The behavioral data (including response latencies) should be reported in the main results section of the paper and not in a supplement.

      The behavioural data are now reported in the main results. We did not report response latencies because participants were not prompted to respond as quickly as possible.

      “Behavioural results. All the behavioural results are given in Fig. 2. As shown, scores were higher in the experimental conditions than for the control conditions for all the tasks (all p < .05). In other words, the experimental conditions were more difficult than the control conditions. This difference in terms of difficulty can also be illustrated by the fact that some participants performed at or below the chance level in the experimental conditions whereas none did so in the control conditions.” (p. 8)

      (3) The investigators seem to overlook prior research that does not support their perspective and their writing seems to lack scientific objectivity in places. At times they over-reach in the claims that can be made based on the present data. For example, claims that need to be revised/softened include:

      Abstract: "Area PF... can work along with social-cognitive skills to resolve day-to-day interactions that combine social and physical constraints". This statement is overly speculative.

      This statement is based on the fact that we reported a combined activation of the technical-reasoning network and the mentalizing network in the INT+PHYS condition of the mentalizing task. This suggests that both networks need to work together for solving a day-today problem in which both the physical constraints of the situation and the intention of the individual must be integrated. Our findings replicated previous ones with a similar task (e.g., Brunet et al. 2000; Völlm et al., 2006), in which the authors gave an interpretation similar to ours in considering that this task requires understanding physical and social causes. Perhaps that the reference to the results of the mentalizing task was not explicit enough. We added “dayto-day” before “problem” in the part of the discussion in which we discuss this possibility to make this aspect clearer.

      “In broad terms, the results of the mentalizing task indicate that causal reasoning has distinct forms and that it recruits distinct networks of the human brain (Social domain: Mentalizing; Physical domain: Technical reasoning), which can nevertheless interact together to solve day-to-day problems in which several domains are involved, such as in the INT+PHYS condition of the mentalizing task.” (p. 16)

      Introduction: "The manipulation-based approach... remains silent on the more general cognitive mechanisms...that must also encompass the use of unfamiliar or novel tools". This statement seems to be based on an overly selective literature review. There are a number of studies in which the relationship between a novel and familiar tool selection/use has been explored (e.g., Buchman & Randerath, 2017; Mizelle & Wheaton, 2010; Silveri & Ciccarelli, 2009; Stoll, Finkel et al., 2022; Foerster, 2023; Foerster, Borghi, & Goslin, 2020; Seidel, Rijntjes et al., 2023).

      We thank Reviewer 3 for this comment. Even if we accept the idea that we possess specific sensorimotor programs about tool manipulation, it remains that these programs cannot explain how an individual decides to bend a wire to make a hook or to pour water in a recipient to retrieve a target. As a matter of fact, such behaviour has been reported in nonhuman animals, such as crows (Weir et al., 2002, Nature) or orangutans (Mendes et al., 2007, Biology Letters). In these studies, the question is whether these nonhuman animals understand the physical causes or not, but the question of sensorimotor programs is never addressed (to our knowledge). This is also true in developmental studies on tool use (e.g., Beck et al., 2011, Cognition; Cutting et al., 2011, Journal of Experimental Child Psychology). This is what we meant here, that is, the manipulation-based approach is not equipped to explain how people solve physical problems by using or making tools – or any object – or by building constructions or producing technical innovations. However, we agree that some papers have been interested in exploring the link between common and novel tool use and have suggested that both could recruit common sensorimotor programs. It is noteworthy that these studies do not test the predictions from the manipulation-based approach versus the reasoning-based approach, so both interpretations are generally viable as stressed by Seidel et al. (2023), one of the papers recommended by Reviewer 3.

      “Apparently, the presentation of a graspable object that is recognizable as a tool is sufficient to provoke SMG activation, whether one tends to see the function of SMG to be either “technical reasoning” (Osiurak and Badets 2016; Reynaud et al. 2016; Lesourd et al. 2018; Reynaud et al. 2019) or “manipulation knowledge” (Sakreida et al. 2016; Buxbaum 2017; Garcea et al. 2019b).” (Seidel et al., 2023; p. 9)

      Regardless, as suggested by Reviewer 3, these papers deserve to be cited and this part needed to be rewritten to insist on the “making, construction, and innovation” dimension more than on the “unfamiliar and novel tool use” dimension to avoid any ambiguity.

      “This manipulation-based approach has provided interesting insights (12–16) and even elegant attempts to explain how these sensorimotor programs could support the use of both unfamiliar or novel tools (17–20), but remains silent on the more general cognitive mechanisms behind human technology that include the use of common and unfamiliar or novel tools but must also encompass tool making, construction behaviour, technical innovations, and transmission of technical content.” (p. 3)

      Introduction: "Here we focus on two important questions... to promote the technicalreasoning hypothesis as a comprehensive cognitive framework..."(italics added). This and other similar statements should be rewritten as testable scientific hypotheses rather than implying that the point of the research is to promote the investigators' preferred view.

      We agree that our phrasing could seem inappropriate here. What we meant here is that the technical-reasoning hypothesis could become an interesting framework for the study of the cognitive bases of human technology only if we are able to verify some of its key facets. As suggested, we rewrote this part. We also rewrote the abstract and the first paragraph of the discussion.

      “Here we focus on two key aspects of the technical-reasoning hypothesis that remain to be addressed: Generalizability and specificity. If technical reasoning is a specific form of reasoning oriented towards the physical world, then it should be implicated in all (the generalizability question) and only (the specificity question) the situations in which we need to think about the physical properties of our world.” (p. 5)

      Introduction: The Goldenberg and Hagmann paper cited actually shows that familiar tool use may be based either on retrieval from semantic memory or by inferring function from structure (mechanical problem solving); in other words, the investigators saw a role for both kinds of information, and the relationship between mechanical problem solving and familiar tool use was actually relatively weak. This requires correction.

      We disagree with Reviewer 3 on this point. The whole sentence is as follows:

      “This silence has been initially broken by a series of studies initiated by Goldenberg and Hagmann (9), which has documented a behavioural link in left brain-damaged patients between common tool use and the ability to solve mechanical problems by using and even sometimes making novel tools (e.g., extracting a target out from a box by bending a wire to create a hook) (9, 17).” (p. 3-4)

      We did not mention the interpretations given by Goldenberg and Hagmann about the link with the pantomime task, but only focused on the link they reported between common tool use and novel tool use. This is factual. In addition, we also disagree that the link between common tool use and novel tool use was weak.

      “The hypothesis put forward in the introduction predicts that knowledge about prototypical tool use assessed by pantomime of tool use and the ability to infer function from structure assessed by novel tool selection can both contribute to the use of familiar tools. Indeed results of both tests correlated signicantly with the use of familiar tools pantomime of tool use: r \= 0.77, novel tool selection: r \= 0.62; both P < 0.001), but there was also a signicant correlation between the two tests r \= 0.64, P < 0.001).” (Goldenberg & Hagmann, 1998; p. 585)

      As can be seen in this quote, they reported a significant correlation between novel tool selection and the use of familiar tools. It is also noteworthy that the novel tool selection test and the pantomime test correlated together. Georg Goldenberg told one of the authors (F. Osiurak; personal communication) that this result incited him to revise its idea that pantomime could assess “semantic knowledge”, which explains why he did not use it again as a measure of semantic knowledge. Instead, he preferred to use a classical semantic matching task in his 2009 Brain paper with Josef Spatt, in which they found a clearer dissociation between semantic knowledge and common/novel tool use not only at the behavioral level but also at the cerebral level.

      Introduction: Please expand and clarify this sentence "However, this involvement seems to be task-dependent, contrary to the systematic involvement of left are PF. The IFG and LOTC activations observed in prior studies are of interest as well. Were they indeed all taskdependent in these studies?

      We agree that this sentence is confusing. We meant that, in the studies reported just above in the paragraph, these regions were not systematically reported contrary to the left area PF. As we think that this information was not crucial for the logic of the paper, we preferred to remove it. 

      Introduction: If implicit mechanical knowledge is acquired through interactions with objects, how is that implicit knowledge conveyed to pass on the material culture to others?

      We thank Reviewer 3 for this comment. Although mechanical knowledge is implicit, it can be indirectly transmitted to other individuals, as shown in two papers we published in Nature Human Behaviour (Osiurak et al., 2021) and Science Advances (Osiurak et al., 2022). Actually, verbal teaching is not the only way to transmit information. There are many other ways of transmitting information such as gestural teaching (e.g., pointing the important aspects of a task to make them salient to the learner), observation without teaching (i.e., when we observe someone unbeknown to them) or reverse engineering (i.e., scrutinizing an artifact made by someone else). We have shown that even in reverse-engineering conditions, participants can benefit from what previous participants have done to increase their understanding of a physical system. In other words, all these forms of transmission allow the learners to understand new physical relationships without waiting that these relationships randomly occur in the environment. There is a wide literature on social learning, which describes very well how knowledge can be transmitted without using explicit communication. In fact, it is very likely that such forms of transmission were already present in our ancestors, allowing them to start accumulating knowledge without using symbolic language. We did not add this information in the MS because we think that this was a little bit beyond the scope of the MS. Nevetheless, we cited relevant literature on the topic to help the reader find it if interested in the topic.

      “Yet, recent accounts have proposed that non-social cognitive skills such as causal understanding or technical reasoning might have played a crucial role in cumulative technological culture (6, 29, 66). Support for these accounts comes from micro-society experiments, which have demonstrated that the improvement of technology over generations is accompanied by an increase in its understanding (67, 68), or that learners’ technical-reasoning skills are a good predictor of cumulative performance in such micro-societies (33, 69).” (p. 19)

      What distinguishes this implicit mechanical knowledge from stored knowledge about object manipulation? Are these two conceptualizations really demonstrably (testably) different?

      We agree that it is complex to distinguish between these two hypotheses as suggested by Seidel et al. (2023) cited above (see Reviewer 3 Point 8). We have conducted several studies to test the opposite predictions derived from each hypothesis. The main distinction concerns the understanding of physical materials and forces, which is central to the technical-reasoning hypothesis but not to the manipulation-based approach. Indeed, sensorimotor programs about tool manipulation are not assumed to contain information about physical materials and forces. In the present study, the understanding of physical materials and forces was needed in the four tasks hypothesized as requiring technical reasoning, i.e., the mechanical problem-solving task, the psychotechnical task and the PHYS-Only and INT+PHYS conditions of the mentalizing task. We can illustrate this aspect with items of each of these tasks. Figure 1A is of the mechanical problem-solving task. 

      As explained in the MS, participants had memorized the five possible tools before the scanner session. Thus, for 4 seconds, they had to imagine which of these tools could be used to extract the target out from the box. We did so to incit them to reason about mechanical solutions based on the physical properties of the problem. Then, they had 3 seconds to select the tool with the appropriate shape, here the right one. In this case, the motor action remains the same (i.e., pulling). Another illustration can be given, with the psychotechnical task (Figure 1B).

      In this task, the participant had to reason as to whether the boat-tractor connection was better in the left picture or in the right picture. This needs to reason about physical forces, but there is no need to recruit sensorimotor programs about tool manipulation. Finally, a last example can be given with the PHYS-Only condition of the mentalizing task (but the logic is the same for the INT+PHYS condition except that the character’s intentions must also be taken into consideration) Figure 1D).

      Here the participant must reason about which picture shows what is physically possible. In this task, there is no need to recruit sensorimotor programs about tool manipulation. In sum, what is common between these three tasks is the requirement to reason about physical materials and forces. We do not ignore that motor actions could be simulated in the mechanical problemsolving task, but no motor action needed to be simulated in the other three tasks. Therefore, what was common between all these tasks was the potential involvement of technical reasoning but not of sensorimotor programs about tool manipulation. Of course, an alternative is to consider that motor actions are always needed in all the situations, including situations where no “manipulable tool” is presented, such as a tractor and a boat, a pulley, or a cannon. We cannot rule out this alternative, which is nevertheless, for us, prejudicial because it implies that it becomes difficult to test the manipulation-based approach as motor actions would be everywhere. We voluntarily decided not to introduce a debate between the reasoning-based approach and the manipulation-based approach and preferred a more positive writing by stressing the insights from the present study. Note that we stressed the merits of the manipulation-based approach in the introduction because we sincerely think that this approach has provided interesting insights. However, we voluntarily did not discuss the debate between the two approaches. Given Reviewer 3’s comment (see also Reviewer 1 Point 2), we understand and agree that some words must be nevertheless said to discuss how the manipulation-based approach could interpret our results, thus stressing the potential limitations of our interpretations. Therefore, we added a specific section in the discussion in which we discussed this aspect in more details.

      “The second limitation concerns the alternative interpretation that the left area PF is not central to technical reasoning but to the storage of sensorimotor programs about the prototypical manipulation of common tools. Here we show that the left area PF is recruited even in situations in which participants do not have to process common manipulable tools. For instance, some items of the psychotechnical task consisted of pictures of tractor, boat, pulley, or cannon. The fact that we found a common activation of the left area PF in such tasks as well as in the mechanical problem-solving task, in which participants could nevertheless simulate the motor actions of manipulating novel tools, indicates that this brain area is not central to tool manipulation but to physical understanding. That being said, some may suggest that viewing a boat or a cannon is enough to incite the simulation of motor actions, so our tasks were not equipped to distinguish between the manipulation-based approach and the reasoning-based approach. We have already shown that the left area PF is more involved in tasks that focus on the mechanical dimension of the tool-use action (e.g., the mechanical interaction between a tool and an object) than its motor dimension (i.e., the interaction between the tool and the effector [e.g., 24, 40]). Nevertheless, we recognize that future research is still needed to test the predictions derived from these two approaches.” (p. 18-19)

      Introduction and throughout: The framing of left Area PF as a special area for technical reasoning is overly reductionistic from a functional neuroanatomic perspective in that it ignores a large relevant literature showing that the region is involved with many other tasks that seem not to require anything like technical cognition. Indeed, entering the coordinates - 56, -29, 36 (reported as the peak coordinates in common across the studied tasks) in Neurosynth reveals that 59 imaging studies report activations within 3 mm of those coordinates; few are action-related (a brief review indicated studies of verbal creativity, texture processing, reading, somatosensory processing, stress reactions, attentional selection etc). Please acknowledge the difficulty of claiming that a large brain region should be labeled the brain's technical reasoning area when it seems to also participate in so much else. The left IPL (including area PF) is densely connected to the ventral premotor cortex, and this network is activated in language and calculation tasks as well as tool use tasks (e.g., Matsumoto, Nair, et al., 2012). What other constructs might be able to unite this disparate literature, and are any of these alternative constructs ruled out by the present data? Lacking this objective discussion, the manuscript does read as a promotion of the investigators' preferred viewpoint.

      We thank Reviewer 3 for this comment. As stressed in the initial version of the MS, we did not write that the left area PF is sufficient but central to the network that allows us to reason about the physical world. Regardless, we agree that an objective discussion was needed on this aspect to help the reader not misunderstand our purpose. We added a section in this aspect as suggested. 

      “Before concluding, we would like to point out two potential limitations of the present study. The first limitation concerns the fact that the literature has documented the recruitment of the left area PF in many neuroimaging experiments in which there was no need to reason about physical events (e.g., language tasks). This can be easily illustrated by entering the left area PF coordinates in the Neurosynth database.

      This finding could be enough to refute the idea that this brain area is specific to technical reasoning. Although this limitation deserves to be recognized, it is also true for many other findings. For instance, sensory or motor brain regions such as the precentral or the postcentral cortex have been found activated in many non-motor tasks, the visual word form area in non-language tasks, or the Heschl’s gyrus in nonmusical tasks. This remains a major challenge for scientists, the question being how to solve these inconsistencies that can result from statistical errors or stress that considerable effort is needed to understand the very functional nature of these brain areas. Thus, understanding that the left area PF is central to physical understanding can be viewed as a first essential step before discovering its fundamental function, as suggested by the functional polyhedral approach (56).” (p. 18)

      Discussion: The discussion of a small cluster in the IFG (pars opercularis) that nearly survived statistical correction is noteworthy in light of the above point. This further underscores the importance of discussing networks and not just single brain regions (such as area PF) when examining complex processes. The investigators note, "a plausible hypothesis is that the left IFG integrates the multiple constraints posed by the physical situation to set the ground for a correct reasoning process, such as it could be involved in syntactic language processing". In fact, the hypothesis that the IFG and SMG are together related to resolving competition has been previously proposed, as has the more specific hypothesis that the SMG buffers actions and that the context-appropriate action is then selected by the IFG (e.g., Buxbaum & Randerath, 2018). The parallels with the way the SMG is engaged with competing lexical or phonological alternatives (e.g., Peramunage, Blumstein et al., 2011) have also been previously noted.

      We added the Buxbaum and Randerath (2018)’s reference in this section.

      “The functional role of the left IFG in the context of tool use has been previously discussed (24) and a plausible hypothesis is that the left IFG integrates the multiple constraints posed by the physical situation to set the ground for a correct reasoning process, such as it could be involved in syntactic language processing (for a somewhat similar view, see [51]).” (p. 16-17)

      Introduction and Discussion: Please clarify how the technical reasoning network overlaps with or is distinct from the tool-use network reported by many previous investigators.

      We added a couple of sentences in the discussion to clarify this point.

      “It should be clear here that we do not advocate the localizationist position simply stating that activation in the left area PF is the necessary and sufficient condition for technical reasoning. We rather defend the view according to which it requires a network of interacting brain areas, one of them – and of major importance – being the left area PF. This allows the engagement of different configurations of cerebral areas in different technical-reasoning tasks, but with a central process acting as a stable component: The left area PF. Thus, when people intend to use physical tools, it can work in concert with brain regions specific to object manipulation and motor control, thereby forming another network, the tool-use network. It can also interact with brain regions specific to intentional gestures to form a “social-learning” network that allows people to enhance their understanding about the physical aspects of a technical task (e.g., the making of a tool) through communicative gestures such as pointing gestures (42). The major challenge for future research is to specify the nature of the cognitive process supported by the left area PF and that might be involved in the broad understanding of the physical world.” (p. 14)

      Discussion: All of the experimental tasks require a response from a difficult choice in an array, and all of the tasks except for the fluid cognition task are likely to require prediction or simulation of a motion trajectory-whether an embodied or disembodied trajectory is unclear. The Discussion does mention the related (but distinct) idea of an "intuitive physics engine", a "kind of simulator", Please clarify how this study can rule out these alternative interpretations of the data. If the study cannot rule out these alternatives, the claims of the study (and the paper title which labels PF as a technical cognition area) should be scaled back considerably. 

      We thank Reviewer 3 for this comment. The authors of the papers on intuitive physics engine or associative learning do not suggest that these processes are embodied. As discussed above, we clarified our perspective on the role of the left area PF and hope that these modifications help the reader better understand it. We warmly thank Reviewer 3 for their comments, which considerably helped us improve the MS.

    1. Author response:

      The following is the authors’ response to the original reviews

      Public Reviews:

      Reviewer #1 (Public review):

      Summary:

      This manuscript by Kaya et al. studies the effect of food consumption on hippocampal sharp wave ripples (SWRs) in mice. The authors use multiple foods and forms of food delivery to show that the frequency and power of SWRs increase following food intake, and that this effect depends on the caloric content of food. The authors also studied the effects of the administration of various food-intake-related hormones on SWRs during sleep, demonstrating that ghrelin negatively affects SWR rate and power, but not GLP1, insulin, or leptin. Finally, the authors use fiber photometry to show that GABAergic neurons in the lateral hypothalamus, increase activity during a SWR event.

      Strengths:

      The experiments in this study seem to be well performed, and the data are well presented, visually. The data support the main conclusions of the manuscript that food intake enhances hippocampal SWRs. Taken together, this study is likely to be impactful to the study of the impact of feeding on sleep behavior, as well as the phenomena of hippocampal SWRs in metabolism.

      Weaknesses:

      Details of experiments are missing in the text and figure legends. Additionally, the writing of the manuscript could be improved.

      We thank the reviewer for their favorable assessment of the work and its potential impact. We have added all requested details in the text and figure legends and revised the wording of the manuscript to improve its clarity.

      Reviewer #2 (Public review):

      Summary:

      Kaya et al uncover an intriguing relationship between hippocampal sharp wave-ripple production and peripheral hormone exposure, food intake, and lateral hypothalamic function. These findings significantly expand our understanding of hippocampal function beyond mnemonic processes and point a direction for promising future research.

      Strengths:

      Some of the relationships observed in this paper are highly significant. In particular, the inverse relationship between GLP1/Leptin and Insulin/Ghrelin are particularly compelling as this aligns well with opposing hormone functions on satiety.

      Weaknesses:

      I would be curious if there were any measurable behavioral differences that occur with different hormone manipulations.

      We thank the reviewer for their favorable assessment of the work and its contribution to our understanding of non-mnemonic hippocampal function. Whether there are behavioral differences that occur following administration of the different hormones is a great question, yet unfortunately our study design did not include fine behavioral monitoring to the degree that would allow answering it. While some previous studies have partially addressed the behavioral consequences of the delivery of these hormones (and we reference these studies in our Discussion), how these changes may interact with the hippocampal and hypothalamic effects we observe is a very interesting next step.

      Reviewer #3 (Public review):

      Summary:

      The manuscript by Kaya et al. explores the effects of feeding on sharp wave-ripples (SWRs) in the hippocampus, which could reveal a better understanding of how metabolism is regulated by neural processes. Expanding on prior work that showed that SWRs trigger a decrease in peripheral glucose levels, the authors further tested the relationship between SWRs and meal consumption by recording LFPs from the dorsal CA1 region of the hippocampus before and after meal consumption. They found an increase in SWR magnitude during sleep after food intake, in both food restricted and ad libitum fed conditions. Using fiber photometry to detect GABAergic neuron activity in the lateral hypothalamus, they found increased activity locked to the onset of SWRs. They conclude that the animal's satiety state modulates the amplitude and rate of SWRs, and that SWRs modulate downstream circuits involved in regulating feeding. These experiments provide an important step forward in understanding how metabolism is regulated in the brain. However, currently, the paper lacks sufficient analyses to control for factors related to sleep quality and duration; adding these analyses would further support the claim that food intake itself, as opposed to sleep quality, is primarily responsible for changes in SWR activity. Adding this, along with some minor clarifications and edits, would lead to a compelling case for SWRs being modulated by a satiety state. The study will likely be of great interest in the field of learning and memory while carrying broader implications for understanding brain-body physiology.

      Strengths:

      The paper makes an innovative foray into the emerging field of brain-body research, asking how sharp wave-ripples are affected by metabolism and hunger. The authors use a variety of advanced techniques including LFP recordings and fiber photometry to answer this question. Additionally, they perform comprehensive and logical follow-up experiments to the initial food-restricted paradigm to account for deeper sleep following meal times and the difference between consumption of calories versus the experience of eating. These experiments lay the groundwork for future studies in this field, as the authors pose several follow-up questions regarding the role of metabolic hormones and downstream brain regions.

      We thank the reviewer for their appreciation and constructive review of the work.

      Weaknesses:

      Major comments:

      (1) The authors conclude that food intake regulates SWR power during sleep beyond the effect of food intake on sleep quality. Specifically, they made an attempt to control for the confounding effect of delta power on SWRs through a mediation analysis. However, a similar analysis is not presented for SWR rate. Moreover, this does not seem to be a sufficient control. One alternative way to address this confound would be to subsample the sleep data from the ad lib and food restricted conditions (or high calorie and low calorie, etc), to match the delta power in each condition. When periods of similar mean delta power (i.e. similar sleep quality) are matched between datasets, the authors can then determine if a significant effect on SWR amplitude and rate remains in the subsampled data.

      This is an important point that we believe we addressed in a few complementary ways. First, the mediation analysis we implemented measures the magnitude and significance of the contribution of food on SWR power after accounting for the effects of delta power, showing a highly significant food-SWR contribution. While the objective of subsampling is similar, mediation is a more statistically robust approach as it models the relationship between food, SWR power, and delta power in a way that explicitly accounts for the interdependence of these variables. Further, subsampling introduces the risk of losing statistical power by reducing the sample size, due to exclusion of data that might contain relevant and valuable information. Mediation analysis, on the other hand, uses the full dataset and retains statistical power while modeling the relationships between variables more holistically. However, as we were not satisfied with a purely analytical approach to test this issue, we carried out a new set of experiments in ad-libitum fed mice, where there is no concern of food restriction impairing sleep quality in the presleep session. In these conditions food amount also significantly correlated with, and showed significant mediation of, the SWR power change. Finally, we acknowledge and discuss this point in the Discussion, highlighting that given the known relationship between cortical delta and SWRs, it is challenging to fully disentangle these signals. 

      (2) Relatedly, are the animals spending the same amount of time sleeping in the ad lib vs. food restricted conditions? The amount of time spent sleeping could affect the probability of entering certain stages of sleep and thus affect SWR properties. A recent paper (Giri et al., Nature, 2024) demonstrated that sleep deprivation can alter the magnitude and frequency of SWRs. Could the authors quantify sleep quantity and control for the amount of time spent sleeping by subsampling the data, similar to the suggestion above?

      Following the reviewer’s comment, we have quantified and compared the amount of time spent in NREM sleep in the Pre and Post session pairs in which the animals were food restricted, with 0-1.5 g of chow given between the sleep sessions. We found that there was no significant difference in the amount of time spent in NREM sleep in the Pre and Post sessions. We have added this result to the Results section of the manuscript and as a new Supplementary Fig. 1. 

      Additionally, we have added details to the Methods section that were missing in the original submission that are relevant to this point. Specifically, within the sleep sessions, the ongoing sleep states were scored using the AccuSleep toolbox (https://github.com/zekebarger/AccuSleep) using the EEG and EMG signals. NREM periods were detected based on high EEG delta power and low EMG power, REM periods were detected based on high EEG theta power and low EMG power, and Wake periods were detected based on high EMG power. Importantly, only NREM periods were included for subsequent SWR detection, quantification and analyses (in particular, reported SWR rates reflect the number of SWRs per second of NREM sleep). 

      (3) Plot 5I only reports significance but does not clearly show the underlying quantification of LH GABAergic activity. Upon reading the methods for how this analysis was conducted, it would be informative to see a plot of the pre-SWR and post-SWR integral values used for the paired t-test whose p-values are currently shown. For example, these values could be displayed as individual points overlaid on a pair of boxand-whisker plots of the pre- and post-distribution within the session (perhaps for one example session per mouse with the p-value reported, to supplement a plot of the distribution of p-values across sessions and mice). If these data are non-normal, the authors should also use a non-parametric statistical test.

      We have generated the summary plots the reviewer requested and have now included them in Supplementary Fig. 2. 

      Minor comments:

      (4) A brief explanation (perhaps in the discussion) of what each change in SWR property (magnitude, rate, duration) could indicate in the context of the hypothesis may be helpful in bridging the fields of metabolism and memory. For example, by describing the hypothesized mechanistic consequence of each change, could the authors speculate on why ripple rate may not increase in all the instances where ripple power increases after feeding? Why do the authors speculate that ripple duration does not increase, given that prior work (Fernandez-Ruiz et al. 2019) has shown that prolonged ripples support enhanced memory?

      This is an interesting point and we have added a section to the Discussion to discuss it (pg. 17, last paragraph)

      (5) The authors suggest that "SWRs could modulate peripheral metabolism" as a future implication of their work. However, the lack of clear effects from GLP-1, leptin and insulin complicates this interpretation. It might be informative for readers if the authors expanded their discussion of what specific role they speculate that SWRs could play in regulating metabolism, given these negative results.

      We have added a section to the Discussion proposing potential reasons for this point (pg. 16, last paragraph)

      Recommendations for the authors:  

      Reviewer #1 (Recommendations for the authors):

      Major Comments:

      (1) The experiments involve very precise windows of time for sleeping and eating that seem impossible to control. For example, the authors state that for the experiments in Figure 1, there was a 2-h sleep period, followed by a 1-h feeding period, followed by another 2-h sleep period. Without sleep deprivation procedures or other environmental manipulations, how can these periods be so well-defined? Even during the inactive period, mice typically don't sleep for 2-h bouts at once, and the addition of food would not likely lead to an exact 1-h period of wakefulness in the middle. The validity of these experimental times would be more believable if the authors provided much more data on these sessions. For example, the authors could provide a table or visual display of data for the actual timing of the pre-sleep, eating, and post-sleep phases with exact time measurements and/or visual display of sleep versus wakefulness.

      This is an important point, which we were not clear enough about in the original submission. While the durations of the Pre-sleep, Wake and Post-sleep sessions were indeed 2 h, 1 h and 2 h respectively, the animals did not actually sleep during the entirety of the sleep sessions. Importantly, we performed sleep state scoring on all sessions, and only analyzed identified NREM sleep for all SWR analyses. Following the reviewer’s comment (and that of Reviewer 1), we have quantified and compared the amount of time spent in NREM sleep in the Pre and Post session pairs in which the animals were food restricted and 0-1.5 g of chow were given between the sleep sessions. We found that there was no significant difference in the amount of time spent in NREM sleep in the Pre and Post sessions. We have added this result to the Results section of the manuscript and as a new Supplementary Fig. 1. 

      Additionally, we have added details to the Methods section that were missing in the original submission that are relevant to this point. Specifically, within the sleep sessions, the ongoing sleep states were scored using the AccuSleep toolbox (https://github.com/zekebarger/AccuSleep) using the EEG and EMG signals. NREM periods were detected based on high EEG delta power and low EMG power, REM periods were detected based on high EEG theta power and low EMG power, and Wake periods were detected based on high EMG power. Importantly, only NREM periods were included for subsequent SWR detection, quantification and analyses (in particular, reported SWR rates reflect the number of SWRs per second of NREM sleep). 

      (2) I may have missed this (although I tried searching in the text and figure legend), but the authors did not state the difference between green versus red bar colors in Figure 1 C-E. For Figures 1 F-J, do the individual dots represent both the test (fed) animals and control animals, or just the test animals?

      We thank the reviewer for the opportunity to clarify these points. Red bars in Fig. 1C-E represent the SWR changes observed following delivery of equal or more than 0.5 g of chow, while the green bars represent the changes observed following delivery of less than 0.5 g. Fig. 1F-J includes both the experimental and control animals- the control animals appearing as having received 0 food amount. This information has now been added to the figure legend.

      (3) For the jello experiments in Figure 3, was there only 1 trial per animal? Previous studies show that animals learn the caloric value of jello after subsequent trials, so whether or not multiple trials took place in each animal is important for interpretation of the results.

      In Figure 3, the datapoints within each panel represent different animals and this information has now been added to the figure legend. Nevertheless, the animals were previously habituated to all foods, including regular jello, sugar-free jello and chocolate. While we consider it unlikely that this prior experience was sufficient to underlie the differential effects on SWRs, we cannot fully rule out the possibility that it provided some ability to predict the caloric value and consequences of the different foods. We have added details to the acknowledgement of this point in the Discussion (pg. 17, second paragraph).

      (4) The experiments in Figure 5 are informative but don't relate to the experiments in the rest of the study. It is difficult to interpret their meaning given that these experiments take place over seconds while the other experiments take place over hours. Some attempt should be made to bridge these experiments over the timescales relevant for the behaviors studied in Figures 1-4.

      We have now further acknowledged and discussed the point that our investigation is limited to the timescale of seconds around SWRs, and thus identified a potential communication channel, but whether and how this communication changes across hours following feeding remains for future studies (pg. 18, second paragraph).

      (5) Figure 5B should depict the x-axis in seconds, not an arbitrary set of times from a recording.

      We have replaced these with a time scale bar.

      Minor Comments:

      (6) The writing of the manuscript can be improved in many places:

      Sometimes the writing could be more precise. For example, the Abstract states: "hippocampal sharp wave ripples (SWRs)... have been shown to influence peripheral glucose metabolism." Could this be written in a more informative way, rather than just staying "has been shown to influence?" A few more words would provide a lot more information. Similarly, at the end of the Introduction: "we set out to test the hypothesis that SWRs are modulated following meal times as part of the systems-level response to changing metabolic needs." This is not a strong hypothesis... could it be written to boldly state how the SWRs will be modulated (increase or decrease) and provide more assertive information?

      The writing can be grandiose at times. Phrases such as "life is a continuous journey" or "the hypothalamus is a master regulator of homeostasis" are a bit sophomoric and too colloquial.

      Finally, a representative recording should be referred to as just that-a "representative recording," as opposed to a "snippet," which is also colloquial. This word is used in the figure legends to Figures 1 and 5, and misspelled as "sinpper" in Figure 1

      We have reworded all these sentences and phrases to make them clearer, more concrete and more formal.

      (7) The methods state that the study used both male and female mice. Were they used in equal numbers across experiments?

      Only one female was used in the final dataset, and we have corrected the wording accordingly.

      Reviewer #2 (Recommendations for the authors):

      Great paper!

      Thanks!

      Reviewer #3 (Recommendations for the authors):

      Below are some minor requests for clarification, including in figures:

      (1) Fig. 5H y-axis should say "normalized dF/F."

      Done

      (2) Fig. 1B is missing a y-axis label. It may be clearer to display separate y-axis scale bars for each component (SWR envelope, ripple-filtered amplitude, etc).

      Done

      (3) Please include labels for brain areas and methodological components in Fig. 5A.

      Done

      (4) Should Fig. 5B have the same y-axis or scale bars as 1B?

      We have edited the figure labels and legends to be visually similar

      (5) In Fig. 5J, is the y-axis a count of sessions?

      Yes, we have added that to the y-axis label

      (6) Could the authors please clarify whether the sugar-free jello was sweetened with an artificial sweetener? If so, this is a robust control for the rewarding nature of the two jellos, so a quick clarification would highlight this strength of the experiment.

      We thank the reviewer for this great point. Indeed, the sugar free jello contained artificial sweeteners (Aspartame and Acesulfame Potassium). We have added this information to the Results and Methods.

      (7) It appears in Fig. 5 that there may be a reliable dip in activity **at** the time of SWR onset, followed by the increase afterward, as shown in the example FP trace and the individual ripple-triggered traces. Is this indeed the case, and does this dip fall significantly below baseline? This characterization would be interesting, but I acknowledge is not necessarily crucial to the study to include.

      This would indeed be an interesting finding, but upon examination and statistical testing, we found that this is not the case. We believe this may appear as such due to the normalization of the traces.

      (8) The authors mention a reduction in ripple rate following insulin under food restriction as the only significant effect for insulin, GLP-1, and leptin, yet there was also a significant increase (at p<0.05) in ripple duration for GLP-1 in the ab lib condition. Is this not considered noteworthy?

      This is a fair point and we have reworded the description of this result to simply state that there were no robust, consistent, dose-dependent effects of GLP-1, leptin and insulin on SWR attributes.

    1. Author response:

      The following is the authors’ response to the original reviews

      Public reviews:

      Reviewer #1 (Public review):

      Summary:

      This very interesting manuscript proposes a general mechanism for how activating signaling proteins respond to species-specific signals arising from a variety of stresses. In brief, the authors propose that the activating signal alters the structure by a universal allosteric mechanism.

      Strengths:

      The unitary mechanism proposed is appealing and testable. They propose that the allosteric module consists of crossed alpha-helical linkers with similar architecture and that their attached regulatory domains connect to phosphatases or other molecules through coiled-coli domains, such that the signal is transduced via rigidifying the alpha helices, permitting downstream enzymatic activity. The authors present genetic and structural prediction data in favor of the model for the system they are studying, and stronger structural data in other systems.

      Weaknesses:

      The evidence is indirect - targeted mutations, structural predictions, and biochemical data. Therefore, these important generalizable conclusions are not buttressed by impeccable data, which would require doing actual structures in B. subtilis, confirming experiments in other organisms, and possibly co-evolutionary coupling. In the absence of such data, it is not possible to rule out variant models.

      We thank the reviewer for their feedback. A challenge of studying flexible proteins is that it is often not possible to directly obtain high resolution structural data. For the case of B. subtilis RsbU, the independent experimental approaches we applied (including two unbiased genetic screens, targeted mutagenesis, SAXS, enzymology, and structure prediction, which includes evolutionary coupling) converged upon a model for activation, which we feel is well supported. Frustratingly, our attempts at determining high resolution experimental structures have been unsuccessful, which we think is due to the flexibility of the proteins revealed by our SAXS experiments. For example, we collected X-ray diffraction data from crystals of a fragment of B. subtilis RsbU containing the N-terminal domain and linker in which the linker was almost entirely disordered in the maps. We agree that doing experiments in other organisms would be valuable next steps to test the hypothesis that this coiled-coil based transduction mechanism is conserved across species, and will modify the text to differentiate this more speculative section of the manuscript.

      We have modified the abstract to read:

      “This coiled-coil linker transduction mechanism additionally suggests a resolution to the mystery of how shared sensory domains control serine/threonine phosphatases, diguanylate cyclases and histidine kinases.”

      We have modified the results to read:

      "These predictions suggest a testable hypothesis that RsbP is controlled through an activation mechanism similar to that of RsbU (Fig. 5A)”

      “From this analysis, we speculate that linker-mediated phosphatase domain dimerization is an evolutionarily conserved, adaptable mechanism to control PPM phosphatase activity.”

      Based on this critique (and the critiques of the other reviewers), we plan to do energetic analysis of the predicted coiled coils from the enzymes we analyzed from other species and to incorporate this into the manuscript.

      We have modified the results to read:

      Consistent with a model in which the stability of the linker plays a conserved regulatory role, the AlphaFold2 models for many of the predicted structures have unfavorable polar residues buried in the coiled-coil interface (positions a and d, for which non-polar residues are most favorable) (Figure 5 – figure supplement 2).”

      Finally, in the manuscript, we have highlighted that this mechanism is not the only mechanism for activation of other proteins with effector domains connected to linkers, but rather one of many mechanisms (Fig 5G). The reviewer additionally made helpful suggestions about the text in detailed comments that we will incorporate as appropriate.

      Reviewer #2 (Public review):

      Summary:

      While bacteria have the ability to induce genes in response to specific stresses, they also use the General Stress Response (GSR) to deal with growth conditions that presumably include a larger range of stresses (for instance, stationary phase growth). The activation of GSR-specific sigma factors is frequently at the heart of the induction of a GSR. Given the range of stresses that can lead to GSR induction, the regulatory inputs are frequently complex. In B. subtilis, the stressosome, a multi-protein complex, contains a set of proteins that, upon appropriate stresses, initiate partner switching cascades that free the sigma B sigma factor from an anti-sigma. The focus here is on the mode of activation of RsbU, a serine/threonine phosphatase of the PPM family, leading to sigB activation. RbsT, a component of the degradosome interacts with RsbU upon stress, activating the phosphatase activity. Once active, RsbU dephosphorylates its target (RsbV, an anti-antisigma), which in turn binds the anti-sigma. The conclusion is that flexible linker domains upstream of the phosphatase domain are the target for activation, via binding of proteins to the N-terminal domain, resulting in a crossed-linker dimeric structure. The authors then use the information on RsbU to suggest that parallel approaches are used to activate PPM phosphatases for the GSR response in other bacteria. (Biology vs. Mechanism, evolution?)

      Strengths and Weaknesses:

      Many of these have to do with clarifying what was done and why. This includes the presentation and content of the figures.

      One issue relates to the background and context. A bit more information on the stresses that release RsbT would be useful here. The authors might also consider a figure showing the major conclusions and parallels for SpoIIE activation and possibly other partner switches that are discussed, introducing the switch change more clearly to set the stage for the work here (and the generalization). There are a lot of players to keep track of.

      We plan to carefully review the manuscript to improve the clarity of presentation and background. In particular, we thank the reviewer for pointing out the missing information about the release of RsbT from the stressosome. We will incorporate this information into the introduction and provide an additional figure.

      We have added the following text to the introduction:

      “RsbT is sequestered in a megadalton stress sensing complex called the stressosome, and is released to bind RsbU in response to specific stress signals including ethanol, heat, acid, salt, and blue light”

      We have added a new figure panel (2C) that shows the model for how Q94L, M166V, and RsbT binding induce conformational change of the PPM domain to recruit metal cofactor and activate RsbU (analogous, but slightly different from the mechanism for SpoIIE).

      The reviewer additionally provided detailed helpful comments that we will incorporate in the text and figures.

      Reviewer #3 (Public review):

      Summary:

      The authors present a study building on their previous work on activation of the general stress response phosphatase, RsbU, from Bacillus subtilis. Using computed structural models of the RsbU dimer the authors map previously identified activating mutations onto the structure and suggest further protein variants to test the role of the predicted linker helix and the interaction with RsbT on the activation of the phosphatase activity.

      Using in vivo and in vitro activity assays, the authors demonstrate that linker variants can constitutively activate RsbU and increase the affinity of the protein for RsbT, thus showing a link between the structure of the linker region and RsbT binding.

      Small angle X-ray scattering experiments on RsbU variants alone, and in complex with RsbT show structural changes consistent with a decreased flexibility of the RsbU protein, which is hypothesised to indicate a disorder-order transition in the linker when RsbT binds. This interpretation of the data is consistent with the biochemical data presented by the authors.

      Further computed structure models are presented for other protein phosphates from different bacterial species and the authors propose a model for phosphatase activation by partner binding. They compare this to the activation mechanisms proposed for histidine kinase two-component systems and GGDEF proteins and suggest the individual domains could be swapped to give a toolkit of modular parts for bacterial signalling.

      Strengths:

      The key mutagenesis data is presented with two lines of evidence to demonstrate RsbU activation - in vivo sigma-b activation assays utilising a beta-galactosidase reporter and in vitro activity assays against the RsbV protein, which is the downstream target of RsbU. These data support the hypothesis for RsbT binding to the RsbU linker region as well as the dimerisation domain to activate the RsbU activity.

      Weaknesses:

      Small angle scattering curves are difficult to unambiguously interpret, but the authors present reasonable interpretations that fit with the biochemical data presented. These interpretations should be considered as good models for future testing with other methods - hydrogen/deuterium exchange mass spectrometry, would be a good additional method to use, as exchange rates in the linker region would be affected significantly by the disorder/order transition on RsbT binding.

      We agree with the reviewer that the SAXS data has inherent ambiguity due to the nature of the measurement. However, SAXS is one of the best techniques to directly assess conformational flexibility. Our scattering data for RsbU have multiple signatures of flexibility supporting a high confidence conclusion. While the scattering data support a reduction in flexibility for the RsbT/RsbU complex, we agree that a high resolution structure would be valuable. However the combination of the scattering data with our biochemical and genetic data supports the validity of the AlphaFold predicted model. We thank the reviewer for the suggestion of future hydrogen/deuterium exchange experiments that would be complementary, but which we feel are beyond the scope of this work.

      The interpretation of the computed structure models should be toned down with the addition of a few caveats related to the bias in the models returned by AlphaFold2. For the full-length models of RsbU and other phosphatase proteins, the relationship of the domains to each other is likely to be the least reliable part of the models - this is apparent from the PAE plots shown in Supplementary Figure 8. Furthermore, the authors should show models coloured by pLDDT scores in an additional supplementary figure to help the reader interpret the confidence level of the predicted structures.

      We thank the reviewer for suggestions on how to clarify the discussion of AlphaFold models. We will decrease the emphasis on the computed models in the text and will add figures with the models colored by the pLDDT scores to aid in the interpretation.

      We have modified the text of the Abstract: “This coiled-coil linker transduction mechanism additionally suggests a resolution to the mystery of how shared sensory domains control serine/threonine phosphatases, diguanylate cyclases and histidine kinases.”

      We have modified the text of the Results: “These predictions suggest a testable hypothesis that RsbP is controlled through an activation mechanism similar to that of RsbU (Fig. 5A).”

      “From this analysis, we speculate that linker-mediated phosphatase domain dimerization is an evolutionarily conserved, adaptable mechanism to control PPM phosphatase activity”

      We have also added Figure 1 – figure supplement 2 with the AlphaFold2 models colored by the pLDDT scores.

      Recommendations for the authors:

      Reviewer #1 (Recommendations for the authors):

      Baral and colleagues investigate the regulatory mechanisms of the General Stress Response (GSR) in Bacillus subtilis, focusing on the phosphatase RsbU and its regulation by the protein RsbT. The GSR is a critical adaptive mechanism that allows bacteria to survive under various stress conditions by reshaping their physiology through a broad transcriptional response. RsbU, a key player in the GSR, facilitates the activation of the transcription factor SigB by dephosphorylating RsbV. This activation is mediated through a partner-switching mechanism involving RsbT. Baral and colleagues use a combination of genetic screening, structural predictions via AlphaFold2, and biophysical techniques such as SAXS and MALS to present a model for how RsbT regulates RsbU. Key findings include the identification of specific amino acid substitutions that enhance RsbU activity, the role of the α-helical linker in RsbU dimerization and activation, and the potential broader conservation of these mechanisms across bacterial species. However, as described below, additional work is required to solidify the results.

      Major Points

      (1) The manuscript is misnamed--it dissects a single step of the signal-transduction pathway regulating the general stress response. Instead, it is rather seeking a generalizable mechanism for kinase -phosphatase interactions across stresses.

      We have edited the title to “A General Mechanism for Initiating the General Stress Response in Bacteria” to reflect that that this study addresses the initiating event of the general stress response.

      (2) The genetic screen likely has limitations in detecting all possible variants that could affect RsbU activity. The readout is specific to σ^B activation, and the focus on specific amino acid substitutions may overlook other significant regions or mechanisms involved in the regulation of RsbU, particularly those involving RsbV and RsbT.

      Our screens were specifically designed to identify features of RsbU that contribute to regulation. Importantly, RsbU does not have any known targets other than RsbV and the downstream σ<sup>B</sup> response but agree that substitutions in either RsbV or RsbT could influence RsbU activation. In principle our suppressor screen with RsbU<sup>Y28I</sup> could have identified RsbT variants (rsbT was mutagenized in this screen), but we did not identify any such variants in the screen. We conducted a separate screen (published elsewhere) that specifically addressed how RsbU recognizes RsbV.

      (3) The authors largely focus on the biochemical and structural aspects of RsbU regulation. There is limited discussion on the broader functional implications of these findings in the context of bacterial physiology and stress response. Incorporating more in vivo studies to show how these mechanisms impact bacterial survival and adaptation would provide a more comprehensive understanding.

      We appreciate this comment, but did not conduct additional studies of survival and adaptation because the phenotypes of σ<sup>B</sup> deletion in B. subtilis under laboratory conditions are relatively mild and therefore difficult to assay. Future studies to address this in other systems could be highly informative.

      (4) The results primarily support the model of linker-mediated dimerization and rigidity. However, other potential regulatory mechanisms or interacting partners might also play significant roles in RsbU activation. A more thorough exploration of these possibilities would strengthen the study's conclusions.

      One of the major advantages of RsbU as a model for initiation of the general stress response is that the system is discreet with all evidence pointing to there being a single primary input (RsbT) and output (dephosphorylation of RsbV). While there are other possible variations on the system (for example RsbU may be directly activated by manganese stress), we focused on this system precisely because of its simplicity.

      (5) While the study presents evidence for the conservation of the described mechanism across different species, this assumption is based on structural predictions and limited experimental data. Broader experimental validation across diverse bacterial species would be necessary to substantiate this claim. Coevolution coupling along with conservation/evolutionary studies could be considered.

      We have altered the language in the paper to emphasize where we are making inferences from predictions that are therefore more speculative. We agree that a more detailed analysis of the evolutionary coupling would likely be fruitful. We note that these couplings are the major driving force of AlphaFold predictions, suggesting that these couplings contributed to the models that we analyzed.

      (6) The reliance on AlphaFold2 for structural predictions introduces potential biases and uncertainties inherent in computational models. Experimental validation of these models through additional techniques such as cryo-EM or X-ray crystallography would strengthen the conclusions.

      We agree with this point, which is why we performed extensive analysis and validation of the models for RsbU using SAXS, genetics, and biochemistry. The proposed techniques are made more challenging by flexibility and heterogeneity, which we detected in our experiments. Our attempts thus far at experimental structure determination are consistent with this being a major technical hurdle.

      (7) SAXS data provide low-resolution structural information, and the interpretation of flexibility versus rigidification might be overemphasized in its interpretation. This part of the study was difficult to interpret. Improving readability by breaking down the text into sections with clear headings for each figure panel and clarifying descriptions of the panels and methods would help. Complementary high-resolution techniques could provide a more definitive view of the linker's conformational changes.

      We have modified the presentation of the figures to clarify the SAXS analysis. The fact that the SAXS analysis suggests flexibility rather than a discrete inactive conformation means that high-resolution techniques may not be appropriate for this system.

      (8) The study primarily focuses on the model where RsbT binding rigidifies the RsbU linker. Alternative hypotheses, such as subtle conformational adjustments without complete rigidification, are not extensively explored or ruled out.

      Our analysis of the SAXS data strongly suggests that a subtle conformational change could not account for the scattering data that we obtained. We have modified the text to clarify this point.

      “Indicative of significant deviation between the RsbU structure in solution to the AlphaFold2 model, the scattering intensity profile (I(q) vs. q) was a poor fit (χ<sup>2</sup> 12.53) to a profile calculated from the AlphaFold2 model of an RsbU dimer using FoXS (Schneidman-Duhovny et al. 2016; Schneidman-Duhovny et al. 2013) (Fig. 4A). We therefore assessed the SAXS data for the RsbU dimer for features that report on flexibility (Kikhney & Svergun 2015). First, the scattering intensity data lacked distinct features caused by the multi-domain structure of RsbU from the AlphaFold2 model (Fig.4A).”

      (9) Future studies should aim to validate the AlphaFold2 predictions with high-resolution structural techniques. This would provide definitive evidence for the proposed conformational states of RsbU with and without RsbT.

      The fact that the SAXS analysis suggests flexibility rather than a discrete inactive conformation means that high-resolution techniques may not be appropriate for this system.

      (10) Investigating the RsbU-RsbT interaction in vivo using techniques like FRET, co-immunoprecipitation, or live-cell imaging would provide a more comprehensive understanding of their functional dynamics in a cellular context.

      We appreciate the reviewer’s suggestions for future experiments.

      (11) Exploring and testing alternative models of RsbU activation, such as partial rigidification or different modes of conformational change, would strengthen the conclusions.

      While our data strongly support that a flexible-to-rigid transition controls RsbU activation, we agree that it is possible that other mechanisms of linker modification could control other phosphatases and we discuss this at some length in the discussion.

      (12) The figure legends are quite dense and could benefit from some streamlining.

      We have edited the figure legends for clarity and length.

      Reviewer #2 (Recommendations for the authors):

      (1) Activation assays (Figures 1, 3, S2) are presented here as blue or white spots (reflecting a reporter activity). While off and on these are fairly clear, it is more difficult to compare the degree of activity (for instance that rsbU<sup>Q94L</sup> is more active than M166V). It would also be good to clearly present in the text the logic of asking if the mutant is RsbT independent or not (and the interpretation of that). Quantitative assays of these would be very useful.

      We chose not to perform quantitative-LacZ assays here because of several complications to interpreting these results that we encountered in our previously published study (Ho and Bradshaw, 2021). However, the level of blue pigmentation shown in Figure 1B for RsbU Q94L and RsbU M166V is qualitatively different, making the comparison possible. Most importantly, we observed cell density dependent changes in LacZ activity in the absence of rsbT for rsbU<sup>M166V</sup> expressing cells, meaning that comparisons between strains would be difficult. Additionally, we found that it was important to make a chromosomal replacement of rsbU to see the full effect of the M166V substitution. However, we were not able to construct a similar rsbU<sup>Q94L</sup> strain, likely because the high level σ<sup>B</sup> activity is lethal (we were able to construct this strain when σ<sup>B</sup> was deleted but only obtained strains with additional loss-of-function mutations in RsbU when σ<sup>B</sup> was present.

      We have modified the text to explain the logic of identifying RsbT independent variants: “We previously conducted a genetic screen (Ho & Bradshaw 2021) to identify features of RsbU that are important for phosphatase regulation by isolating gain-of-function variants that are active in the absence of RsbT.”

      (2) Explain Figure S8 graphs: as much as Alphafold is now in use, the authors should provide some further explanation of what is shown here. Blue (low error) is good, presumably. What are the A, B, C, and D sections showing? Different parts of a given letter region (and between them)? What is the x-axis? Is the top-ranked model used in every case in the text? How different are these models? The Methods section could be used for some of this (but doesn't in its current form). This also becomes important for the models generated later in the paper (Figure S7), which look rather different here.

      We have modified figure S8 to include additional labels and have added structures with the pLDDT scores shown. We have additionally modified the figure legends and methods to provide the requested information.

      (3) Figure 1C, D, Figure S2: amino acid ends of linker domains could be shown (text discusses 83-97 the linker as a two-turn coiled coil; Q94 is pretty close to the end of this coiled-coil? Figure S2 is even less clear - addresses of other amino acids would help, and or an added sequence showing the full linker and coiled-coil region). Some explanation for positions for readers to focus on for full coiled-coil would be useful in the legend of Figure S2. How strong a coiled-coil prediction is there for this region?

      We have added the sequence of the coiled-coil regions to the figures with numbering. For these analyses we used the Socket2 program, which analyzes a PDB file to identify coiled-coil regions and thus does not provide a confidence score. However, inspection of the sequence and the confidence scores of the AlphaFold2 models indicates that the coiled-coil regions are not ideal, consistent with this being a regulatory feature.

      Is it clear that the fully inactive proteins are still properly folded and soluble?

      In the case of RsbU, our biophysical analysis indicates that the inactive form of the protein is soluble. While phosphatase activity is substantially reduced, our unpublished comparison of single- and multiple-turnover reactions in the absence of RsbT indicates that nearly all of the enzyme is active.

      Finally, are there other positions that would also be expected, from this model, to stabilize the coiled-coil and thus bypass the requirement for RsbT? If so, it would be good to test these. Is it the burial of amino acid at position 94 that is important, or the ability to form crossed helices?

      Because of how short the predicted coiled-coil region is, we did not identify any obvious positions that would likely have the same effect as Q94 substitution. We considered making helix-breaking mutations, which would be predicted to block RsbU activation, but favored analysis of the wildtype protein because of limitations in interpreting the effects of loss-of-function mutations.

      (4) Figure 2A, RsbT binding to RsbU: It was not entirely clear to this reviewer why one would expect the RsbT binding, not needed for activation, to be increased by the mutation that stabilizes the crossed alpha helices. The change is impressive but doesn't the lack of a need for RsbT suggest that this mutation bypasses the normal mechanism? (Is dimerization enuf? Or other protein cross helices?).

      We have modified the text to clarify this point: “One prediction of our hypothesis that RsbT stabilizes the crossed alpha helices of the RsbU dimer, is that RsbT should bind more tightly to rsbU<sup>Q94L</sup> than to RsbU because the coiled-coil conformation that RsbT binds would be more energetically favorable.” Another way of putting this is that if the Q94L substitution activates RsbU through an on-pathway mechanism, RsbT must bind more tightly.

      (5) Figure 3A, Figure S3: Please label the yellow (interface) residues in RsbU and RsbT in Fig. S3 and the green (suppressor) spheres in Figure 3A.

      We have added labels to the figures as suggested.

      If RbsT interacts with the N-terminal dimerization domain and linker, why were residues 174 and 178 (from PPM domain) shown to be implicated in binding?

      The fact that residues in the switch region suppress a mutation that decreases RsbT binding suggests that this region is part of an allosteric network that links RsbT binding, the linker, and dimerization of the phosphatase domains. For example, any substitution that promotes a conformation of the phosphatase domain that is more favorable for dimerization would also promote RsbT binding. However, the precise details of how each mutation fits into this network is not clear and we have therefore chosen to not specify a particular model to avoid over interpreting our data.

      Are these marked in Figure S3?

      We have added labels to make this clear.

      Are these part of a dimerization interface in the C-terminal domain? Are any/all of these RsbU mutants suppressed by Q94L, as one might predict (apparently Y28I is since Q94L was again identified)?

      We chose to focus on Y28I because it was the best studied previously, but we would predict that Q94L would suppress other RsbT binding mutations.

      (6) Line 191-192: Is it surprising that no suppressors were isolated in RsbT?

      We didn’t have a preconception of whether or not it would be possible to identify similar suppressors in RsbT. Explanations for why we did not identify such suppressors could include that RsbT may be destabilized more easily by substitution, that RsbT is more constrained because it has other interaction partners, or that the particular substitutions that would suppress Y28I are less common by the PCR mutagenesis strategy we used.

      (7) Figure 3: Would the same mutants arise if the screen had been done in the absence of RsbT? Was RsbT-dependent tested for the rsbU alleles?

      Our prediction is that we would not have identified any of these mutations except for Q94L in the absence of rsbT. We tested a few of the alleles and found them all to be rsbT dependent, but did not systematically test all of the alleles and therefore did not include this analysis in the manuscript.

      Given the findings earlier in the paper for Q94L, suggesting that this stabilizes the coiled-coil and shows some activity in the absence of RsbT, it seems that the interpretation of other mutants in this region (and Q94L itself) as evidence that RsbT contacts the linker directly and that contact is necessary for activation may be an overinterpretation. If these are in fact RsbT independent, they support the importance of the linker (do they further stabilize coiled-coil formation?), rather than the role of RsbT here. Are G92 and T89 on the outside of the coiled-coil? If Q94 is buried, is it qualitatively different from these others?

      G92 and T89 are predicted to be exposed. The fact that these mutations are near Q94 is part of the reason that we focused on R91 and the predicted contact with D92 of RsbT as another approach to validate the predicted interface.

      (8) Figure 3C addresses the issue of direct interaction of RsbT with the RsbU linker to some extent, given that RsbU R91E doesn't appear to have a lot of activity without RsbT. It would be helped by telling the reader what the R91 contact is initially.

      We have modified the text to clarify this point: “To test the model that RsbT activates RsbU by directly interacting with the linker to dimerize the RsbU phosphatase domains, we introduced a charge swap at position R91 that would abolish a predicted salt-bridge with RsbT D92 (Fig. 3C).”

      (9) Figure 4 and the discussion of it in the text is not likely to be easily understandable for many readers. Aside from providing a bit more explanation of what these analyses are showing, it would be useful to start the whole section (or maybe even much earlier in the paper) with the information found on lines 261-264, that other studies show that the N-terminus dimerizes stably on its own (and is it known that the C-terminus does not?). Then the discussion of the alternative models early in this section would be clearer.

      We have updated the introduction to emphasize this point “RsbU has an N-terminal four-helix bundle domain that dimerizes RsbU and is also the binding site for RsbT, which activates RsbU as a phosphatase (Fig. 1C,D) (Delumeau et al. 2004).”

      We have also added clarification to the model presented at the beginning of this section: “A second possibility is that inactive RsbU is dimerized by the N-terminal domains but that the linkers of inactive RsbU are flexible and that the phosphatase domains only interact with each other when RsbT orders the linkers into a crossing conformation.”

      Is the dimerization of the N-terminal domains previously determined similar/the same as what is seen in the AlphaFold models used here (or the AlphaFold dimerization derived primarily from that data?).

      Yes, the dimerization in the AlphaFold models matches closely to the published structure.

      (10) Discussion and Figure 5: The final part of this work predicts AlphaFold models for a set of other phosphatases involved in initiating GSR across bacterial species, and suggests that linked-mediated phosphatase dimerization is the critical factor to activate the phosphatase. Clearly, this is the most speculative but interesting aspect of the paper. A number of possible questions are suggested by some of this:

      a. Do any of the activating mutants In RsbU and RsbP in the PPM domain (that apparently improve dimerization and thus activation) do a similar job in the other modeled proteins?

      This is an interesting question, but unfortunately most of these proteins have not been biochemically characterized. We highlight examples of RsbP and E. coli RssB for which similar activating mutations have been characterized.

      b. The legend (Figure 5G) suggests that all of the linker combinations will be coiled-coils, but that they will undergo different types of activating (and dimerizing?) transitions. Is that in fact what is being proposed here?

      Yes, this is our working hypothesis.

      c. If there is no dimerization (as noted, only weak dimerization has been reported for E. coli RssB), does that generalize the model to there are linkers and their structures are important? At the least, would the folding up of the E. coli RssB linker with antiadaptor binding be considered another mode of signal transduction or rather some sort of storage form?

      Interestingly, the P. aeruginosa RssB constitutively dimerizes, suggesting the E. coli is the outlier.

      d. Would the "toolkit" model, in which different changes occur in the linker regions, suggest that the interacting proteins are going to be critical for the type of linker changes that will be important? Or something about the nature of the linkers themselves?

      This is an interesting question that we cannot yet answer. We have chosen to focus on the possible flexibility of this mechanism and anticipate that a variety of mechanisms will be used.

      e. Given the extensive comparison to E. coli RssB, the authors might consider a figure to clarify the relative domain architecture, sequences that are akin to switch regions, and others important to the discussion here.

      We tried to highlight this in Figure 5C including coloring the regions similar to the switch regions.

      Reviewer #3 (Recommendations for the authors):

      Given the caveats noted above related to the reliability of computed structure models, I would recommend the authors make the following additions/modifications to their manuscript:

      (1) The authors should show alpha fold models coloured by pLDDT scores in an additional supplementary figure to help the reader interpret the confidence level of the predicted structures.

      We have added these models to figure 1 – figure supplement 2.

      (2) Because of the points mentioned above the authors should tone down the generalisation relating to the activation mechanism of this family of phosphatases presented in the discussion.

      We have modified the paper throughout to emphasize where we are speculating.

    1. Author response:

      The following is the authors’ response to the previous reviews

      Public Reviews:

      Reviewer #1:

      Summary:

      Kimura et al performed a saturation mutagenesis study of CDKN2A to assess functionality of all possible missense variants and compare them to previously identified pathogenic variants. They also compared their assay result with those from in silico predictors.

      Strengths:

      CDKN2A is an important gene that modulate cell cycle and apoptosis; therefore it is critical to accurately assess functionality of missense variants. Overall, the paper reads well and touches upon major discoveries in a logical manner.

      Weaknesses:

      The paper lacks proper details for experiments and basic data, leaving the results less convincing. Analyses are superficial and does not provide variant-level resolution. Many of which were addressed during the revision process.

      Comments on revisions:

      The manuscript was improved during the revision process.

      We thank the reviewer for their comments. We are grateful for the opportunity to provide additional information and data to clarify our approach and study results.

      Reviewer #2:

      Summary:

      This study describes a deep mutational scan across CDKN2A using suppression of cell proliferation in pancreatic adenocarcinoma cells as a readout for CDKN2A function. The results are also compared to in silico variant predictors currently utilized by the current diagnostic frameworks to gauge these predictors' performance. The authors also functionally classify CDKN2A somatic mutations in cancers across different tissues.

      Review:

      The goal of this paper was to perform functional classification of missense mutations in CDKN2A in order to generate a resource to aid in clinical interpretation of CDKN2A genetic variants identified in clinical sequencing. In our initial review, we concluded that this paper was difficult to review because there was a lack of primary data and experimental detail. The authors have significantly improved the clarity, methodological detail and data exposition in this revision, facilitating a fuller scientific review. Based on the data provided we do not think the functional characterization of CDKN2A variants is robust or complete enough to meet the stated goal of aiding clinical variant interpretation. We think the underlying assay could be used for this purpose but different experimental design choices and more replication would be required for these data to be useful. Alternatively, the authors could also focus on novel CDKN2A variants as there seems to be potential gain of function mutations that are simply lumped into "neutral" that may have important biological implications.

      Major concerns:

      Low experimental concordance. The p-value scatter plot (Figure 2 Figure Supplement 3A) across 560 variants shows low collinearity indicating poor replicability. These data should be shown in log2fold changes, but even after model fitting with the gamma GLM still show low concordance which casts strong doubt on the function scores.

      Concordance among non-significant p-values is generally low because most of the signal comes from random variability across repeats. If the observed log2 fold change between the repeats is entirely due to noise, one would expect two repeated p-values to behave like independent random uniforms. True concordance is typically more evident in significant p-values because they reflect consistent effects above random noise. Functionally deleterious variants are called when their associated p-value is significant. To confirm this statement, a scatter plot with the log2 normalized fold change was added in Figure 2 Supplement 3C. We see low concordance between repeats in the log2 normalized fold changes centered around 0, corresponding to log log2 normalized changes mainly due to noise. The concordance increases as the variants become significant. One can notice that the correlation coefficient between duplicate assay results was almost identical between the model-based p-values and log2normalized fold change (Figure 2-figure supplement 3A and 3C, Appendix 1-table 4, and Appendix 1-table 6). Also, importantly, no variant was functionally deleterious in one replicate and functionally neutral in another, implying a perfect concordance in calls if we exclude variants that were called indeterminate in one of the two repeats. Finally, of variants with discordant classifications, only 6/560 repeats (1.1%) were functionally deleterious (significant p-value) in one replicate and of indeterminate function in another. We have updated the text as follows:

      “Of variants with discordant classifications, 6 (1.1%) were functionally deleterious in one replicate and of indeterminate function in another. While 102 variants (18.2%) were functionally neutral in one replicate and of indeterminate function in another. Importantly, no variant that was functionally deleterious in one replicate and functionally neutral in another (Appendix 1 -table 4). Furthermore, the correlation coefficient between duplicate assay results was similar using the gamma GLM and log2 normalized fold change (Figure 2-figure supplement 3A and 3C).”

      The more detailed methods provided indicate that the growth suppression experiment is done in 156 pools with each pool consisting of the 20 variants corresponding to one of the 156 aa positions in CKDN2A. There are several serious problems with this design.

      Batch effects in each of the pools preventing comparison across different residues. We think this is a serious design flaw and not standard for how these deep mutational scans are done. The standard would be to combine all 156 pools in a single experiment. Given the sequencing strategy of dividing up CDKN2A into 3 segments, the 156 pools could easily have been collapsed into 3 (1 to 53, 54 to 110, 111 to 156). This would significantly minimize variation in handling between variants at each residue and would be more manageable for performance of further replicates of the screen for reproducibility purposes. The huge variation in confluency time 16-40 days for each pool suggest that this batch effect is a strong source of variation in the experiment.

      While there is variation in time to confluency between different amino acid residues, we do not anticipate this batch effect to significantly affect variant classifications in our study. For example, our results were generally consistent with previous classifications. All synonymous variants (one per residue) and benchmark benign variants assayed were classified as functionally neutral. Furthermore, of benchmark pathogenic variants assayed, none were classified as functionally neutral. 84% were classified as functionally deleterious and 16 percent were classified as indeterminate function.

      Lack of experimental/biological replication: The functional assay was only performed once on all 156 CDKN2A residues and was repeated for only 28 out of 156 residues, with only ~80% concordance in functional classification between the first and second screens. This is not sufficiently robust for variant interpretation. Why was the experiment not performed more than once for most aa sites?

      In our study we determined functional classifications for all CDKN2A missense variants while assessing variability with replicates across 28 residues. Of these variants, only 6 (1.1%) were functionally deleterious in one replicate and of indeterminate function in another. Furthermore, no variant was functionally deleterious in one replicate and functionally neutral in another (Appendix 1 -table 4).  As noted above, we provided additional context in the manuscript.

      For the screen, the methods section states that PANC-1 cells were infected at MOI=1 while the standard is an MOI of 0.3-0.5 to minimize multiple variants integrating into a single cell. At an MOI =1 under a Poisson process which captures viral integration, ~25% of cells would have more than 1 lentiviral integrant. So in 25% of the cells the effect of a variant would be confounded by one or more other variants adding noise to the assay.

      As noted previously, we are not able to differentiate effects due to multiple viral integrations per cells. However, we do not anticipate multiple viral integrations to significantly affect variant classifications in our study as our results are consistent with previous classifications, as described above.

      While the authors provide more explanation of the gamma GLM, we strongly advise that the heatmap and replicate correlations be shown with the log2 fold changes rather than the fit output of the p-values.

      Thank you for the suggestion. As noted, we provide additional explanation in the manuscript about why we classified variants using a gamma GLM. Using a gamma GLM, classification thresholds were determined using the change in representation of 20 non-functional barcodes in a pool of PANC-1 cells stably expressing CDKN2A after a period of in vitro proliferation. Our variant classifications were therefore not based on assay outputs for previously reported – benchmark – pathogenic or begin variants to determine thresholds. We strongly prefer using p-values and classifications using the gamma GLM in the manuscript. However, comparison of assay outputs using a gamma GLM and log2 fold change are included in the manuscript. Read counts, log2 fold change, and classifications based on log2 fold change are presented in the manuscript, for all variants. Readers who wish to use these data may do so and we refer them to the manuscript text, Appendix 1 -table 4, Appendix 1 -table 6, and Figure 2 -figure supplement 2.

      In this study, the authors only classify variants into the categories "neutral", "indeterminate", or "deleterious" but they do not address CDKN2A gain-of-function variants that may lead to decreased proliferation. For example, there is no discussion on variants at residue 104, whose proliferation values mostly consist of higher magnitude negative log2fold change values. These variants are defined as neutral but from the one replicate of the experiment performed, they appear to be potential gain-of-function variants.

      We have added a comment to the discussion to highlight that we did not identify potential gain-of-function variants. Specifically:

      “We classified CDKN2A missense variants using a gamma GLM, as either functionally deleterious, indeterminate functional or functionally neutral. However, we did not classify variants that may have gain-of-function effects, resulting in decreased representation in the cell pool. Future studies are necessary to determine the prevalence and significance of CDKN2A gain-of-function variants.”

      Minor concerns:

      The differentiation between variants of "neutral" and "indeterminate" function seems unnecessary and it seems like there are too many variants that fall into the "indeterminate" category. The authors seem to have set numerical thresholds for CDKN2A function using benchmark variants of known function. While the benchmark variants are important as a frame of reference for the "dynamic range" of the assay, their function scores should not necessarily be used to define hard cutoffs of whether a variant's function score can be interpreted.

      We did not utilize benchmark variants to define thresholds for functional classifications using a gamma GLM. This is one of the strengths of using a gamma GLM model for classification. As explained in our manuscript, classification thresholds were determined using the change in representation of 20 non-functional barcodes in a pool of PANC-1 cells stably expressing CDKN2A after a period of in vitro proliferation. Our variant classifications were therefore not based on assay outputs for previously reported – benchmark – pathogenic or begin variants. While not required when using a gamma GLM, we included indeterminate classifications, which are not uncommon.

      Figure 2 supplement 2 - on the x-axis, should "intermediate" be "indeterminate"?

      This, and a similar typographical error in Figure 2 -figure supplement 3, has been corrected.

    1. Author response:

      The following is the authors’ response to the previous reviews

      Public Reviews:

      Reviewer #2 (Public Review):

      This study elucidates the toxic effects of the lipid aldehyde trans-2-hexadecenal (t-2-hex). The authors show convincingly that t-2-hex induces a strong transcriptional response, leads to proteotoxic stress and causes the accumulation of mitochondrial precursor proteins in the cytosol.

      The data shown are of high quality and well-controlled. The genetic screen for mutants that are hyper-and hypo-sensitive to t-2-hex is elegant and interesting, even if the mechanistic insights from the screen are rather limited. Moreover, the authors show evidence that t-2-hex affects subunits of the TOM complex. However, they do not formally demonstrate that the lipidation of a TOM subunit is responsible for the toxic effect of t-2-hex. A t-2-hex-resistant TOM mutant was not identified. Nevertheless, this is an interesting and inspiring study of high quality. The connection of proteostasis, mitochondrial biogenesis and sphingolipid metabolism is exciting and will certainly lead to many follow-up studies.

      Reviewer #3 (Public Review):

      Summary:

      The authors investigate the effect of high concentrations of the lipid aldehyde trans-2-hexadecenal (t-2-hex) in a yeast deletion strain lacking the detoxification enzyme. Transcriptomic analyses as global read out reveal that a large range of cellular functions across all compartments are affected (transcriptomic changes affect 1/3 of all genes). The authors provide additional analyses, from which they built a model that mitochondrial protein import caused by modification of Tom40 is blocked.

      Our initial transcriptomic study with high doses of t-2-hex in a detoxifying mutant as an experimental approach is only a starting experiment and was aimed to identify as many determinants of t-2-hex toxicity as possible as stated in the manuscript. From this, we developed multiple independent approaches in wild-type (and mutant) cells at low t-2-hex concentrations, demonstrating that proteostasis and mitochondrial protein trafficking are physiologically important targets of the pro-apoptotic lipid. Specifically, proteostasis-specific PACE reporters are robustly induced in a detoxification mutant by 5mM t-2-hex (Figure 3D,E) and significantly induced by 10 mM t-2-hex in detoxification competent wild type cells (new Figure 3F).

      We do not propose Tom40 as the lipid's primary target, while we show that several subunits of the TOM (and TIM) complex are directly targeted by low t-2-hex concentrations in vitro (Figure 8B), and Tom20 and Tom70 are important for lipid toxicity (Figure 8D) and mitochondrial protein trafficking in vivo (Suppl. Figure 2).

      Strengths:

      Global analyses (transcriptomic and functional genomics approach) to obtain an overview of changes upon yeast treatment with high doses of t-2-hex.

      Weaknesses:

      The use of high concentrations of t-2-hex in combination with a deletion of the detoxifying enzyme Hfd1 limits the possibility to identify physiological relevant changes. From the hundreds of identified targets the authors focus on mitochondrial proteins, which are not clearly comprehensible from the data.

      The initial transcriptomic study with high doses of t-2-hex in a detoxifying mutant is a starting experiment and was aimed to identify as many determinants of t-2-hex toxicity as possible as stated in the manuscript. As stated (page 4), genes up-regulated (>2 log2FC) by t-2-hex were selected and subjected to GO category enrichment analysis (Supplemental Table 1). We found that “Mitochondrial organization” was the most numerous GO group activated by t-2-hex.  Among the strongly t-2-hex induced genes encoding mitochondrial proteins, CIS1 represented the most inducible gene with a known mitochondrial function. Cis1 is the central protein of the MitoCPR pathway, which is specifically induced upon and protects from mitochondrial protein import stress. We further show that proteostasis and mitochondrial protein trafficking are physiologically important targets at low t-2-hex doses in several independent experimental approaches: proteostasis-specific PACE reporters are robustly induced in a detoxification mutant by 5mM t-2-hex (Figure 3D,E) and significantly induced by 10mM t-2-hex in detoxification competent wild type cells (new Figure 3F); mitochondrial pre-protein accumulation is induced by 10mM t-2-hex in wild type cells (Figure 5G); several subunits of the TOM and TIM complexes are lipidated by low (10mM) t-2-hex doses in wild type cell extracts (Figure 8B), mitochondrial import assays with mt-GFP in intact yeast wild type cells reveal that t-2-hex significantly inhibits import at low (5mM) t-2-hex concentrations (new Suppl. Figure 1). 5-10mM t-2-hex applied here is considerably lower than the published data in human cells with ³ 25mM on intact cells or cell extracts (Jarugumilli et al. 2018).

      The main claim of the manuscript that t-2-hex targets the TOM complex and inhibits mitochondrial protein import is not supported by experimental data as import was not experimentally investigated. The observed accumulation of precursor proteins could have many other reasons (e.g. dissipation of membrane potential, defects in mitochondrial presequence proteases, defects in cytosolic chaperones, modification of mitochondrial precursors by t-2-hex rendering them aggregation prone and thus non-import competent). However, none of these alternative explanations have been experimentally addressed or discussed in the manuscript.

      We have now performed additional experiments, alternative to the pre-protein quantifications, showing that t-2-hex specifically inhibits mitochondrial protein import. We investigated the effect of t-2-hex on mitochondrial protein import using flow cytometric GFP assays in live yeast cells. Specifically, we compared the expression and maturation of GFP targeted either to the cytosol or the mitochondrial matrix and show that low doses of t-2-hex (≥5 μM) significantly inhibited mt-GFP activity compared to cytosolic GFP in wild-type cells (new Supplemental Figure 1B). In contrast, this inhibition was not observed with the saturated derivative, t-2-hex-H2. Flow cytometric rhodamine123 assays revealed that t-2-hex did not alter ΔΨm within the concentration range that efficiently inhibits mt-GFP activity (new Supplemental Figure 1C). Alternative t-2-hex effects such as the direct modification of mitochondrial pre-proteins or cytosolic chaperones, potentially making the precursors prone to aggregation, are less likely, as the mitochondrial and cytosolic GFP used in these import studies differ only by the small, cysteine-free PreSu9 pre-peptide. This information is now included in the Results and Discussion sections.

      Furthermore, many of the results have been reported before (interaction of Tom22 and Tom70 with Hfd1) or observed before (TOM40 as target of t-2-hex in human cells).

      The interaction of Tom22 or Tom70 with Hfd1 has been only reported in high throughput pull-down studies in yeast (Opalinski et al., 2018 and Burri et al., 2006), and no functional connection between Hfd1 lipid detoxification and TOM has been investigated. Here we corroborate these high throughput results by targeted pull-down experiments, which strengthens the new finding that Hfd1 functionally interacts with the TOM complex. Tom40 has been found to be lipidated by high t-2-hex concentrations in human cell extracts in high throughput in vitro proteomic studies (Jarugumilli et al., 2018), but no functional connection between human TOM and t-2-hex has been investigated so far. Here we corroborate these high throughput results by targeted experiments, which strengthens the new findings that t-2-hex and TOM interact functionally.

      Recommendations for the authors:

      Reviewer #2 (Recommendations For The Authors):

      Congratulations on this exciting study. Even if some of the mechanistic details will have to be addressed in further studies (which of the modified sites are physiologically relevant; which sites are modified in vivo without external addition of t-2-hex) this study is inspiring and opens a new direction of mitochondrial research. I therefore fully support publication of this nice study in its current form.

      Reviewer #3 (Recommendations For The Authors):

      Two of the reviewers pointed out that the observation of precursors in whole cell extract is not sufficient to draw conclusions on mitochondrial protein import rates. The authors did not provide any new experiments but argued that a recent publication (Weidberg and Amon, 2018) had used the same readout for this conclusion. Why this manuscript was accepted with this statement is not known to this reviewer, but it does not change the fact, that the conclusion is not valid. Many alternative explanations are possible (see public review) and the claim that the import competence of the TOM complex is affected upon t-2-hex treatment is not appropriate.

      We have now performed new experiments addressing the inhibition of mitochondrial protein import by t-2-hex as an alternative to our precursor accumulation assays. We compared the induced expression of cytosolic and mitochondrial GFP by flow cytometry as a quantitative mitochondrial import assay (Sirk et al., Cytometry A. 2003 Nov; 56(1) 15-22). Low doses of t-2-hex (≥5 μM) significantly inhibited mt-GFP activity as compared to cytosolic GFP in wild-type cells (new Supplemental Figure 1B). This inhibition of mitochondrial GFP is independent of mitochondrial membrane potential perturbation (new Supplemental Figure 1C) and alternative t-2-hex effects, such as the direct modification of the mtGFP precursor or cytosolic chaperones are less likely, as the mitochondrial and cytosolic GFP used in these import studies differ only by the small, cysteine-free PreSu9 pre-peptide.

      The first sentence of the abstract states that t-2-hex „induces mitochondrial dysfunction in a conserved manner from yeast to human". I find two issues with this statement: 1) if the mechanism is known what is the question addressed in the present manuscript and 2) the second sentence of the results fully contradicts the above sentence „In human cells, t-2-hex causes mitochondrial dysfunction by directly stimulating Bax-oligomerisation at the outer mitochondrial membrane. In yeast, however, t-2-hex efficiently interferes with mitochondrial function and cell growth in a Bax independent manner."

      We agree that the first sentence was misleading, this has been fixed now in the revised version.

      The first reviewer requested a repetition of key experiments with lower concentrations and the authors provided additional in vitro data, however, for this, 10 uM is still very high. To gain valuable and physiological relevant data the initial transcriptomic analysis should be repeated with a low amount and in a wild-type yeast background.

      Published t-2-hex chemoproteomic experiments on human cell extracts were performed at higher concentrations (>25mM) and human Bax is hardly lipidated by 10mM t-2-hex (Jarugumilli et al., 2018), therefore the in vitro lipidation data provided in our study should be considered a low t-2-hex dose. The initial transcriptomic study with high doses of t-2-hex in a detoxifying mutant is a starting experiment and was aimed at identifying as many determinants of t-2-hex toxicity as possible. Building on this, we further show that proteostasis and mitochondrial protein trafficking, the relevant cellular functions for our study, are physiologically important targets at low t-2-hex doses in several independent experimental approaches: proteostasis-specific gene expression is robustly induced in a detoxification mutant by 5mM t-2-hex (Figure 3D,E) and significantly induced by 10mM t-2-hex in detoxification competent wild type cells (new Figure 3F); mitochondrial pre-protein accumulation is induced by 10mM t-2-hex in wild type cells (Figure 5G); several subunits of the TOM and TIM complexes are lipidated by low (10mM) t-2-hex doses in vitro in wild type extracts (Figure 8B), mitochondrial import assays with mt-GFP in intact yeast wild type cells reveal that t-2-hex significantly inhibits import at low (5mM) t-2-hex concentrations (new Suppl. Figure 1).

      As already stated above there are many alternative explanations for the observed accumulation of precursor proteins, e.g. the decreased proteasome activity could be cause and not consequence. Also, the modification of precursors directly upon translation in the cytosol could likely impact on their further transport and result in direct aggregation in the cytosol.

      As mentioned above, we have now corroborated the t-2-hex specific mitochondrial protein import defect by alternative in vivo experiments, which are not dependent on the accumulation of mitochondrial precursors. We have tested now the possibility that decreased proteasome activity could indirectly inhibit mitochondrial import. This is not the case because a rpn4 mutant with impaired proteasomal activity induces normal mtGFP levels (new Suppl. Figure 1D). We cannot exclude that the modification of precursors by t-2-hex upon translation might additionally impact on the transport of some mitochondrial pre-proteins. However, mitochondrial and cytosolic GFP used in the import studies only differ in the small cysteine-free PreSu9 pre-peptide making it very unlikely that precursor lipidation is secondarily responsible for the observed import defect.

      Many of the comments after first reviewing the manuscript were not addressed experimentally although many of the suggested experiments are easy to perform. I can only encourage the authors to provide more experimental support and controls, as the claims are currently not sufficiently supported.

      In the two revisions of our manuscript, we have included several control experiments to better link the pro-apoptotic lipid t-2-hex with mitochondrial import stress. These include: in vitro lipidation of TOM/TIM subunits by low t-2-hex concentrations, t-2-hex tolerance and recovery of mitochondrial protein import in specific tom mutants, inhibition of mitochondrial protein import (pre-protein and mtGFP assays) by low t-2-hex doses independently on mitochondrial membrane potential and proteasome activity, and induction of proteostasis specific gene expression by low t-2-hex doses.

    1. Reviewer #1 (Public review):

      Summary:

      The authors have developed self-amplifying RNAs (saRNAs) encoding additional genes to suppress dsRNA-related inflammatory responses and cytokine release. Their results demonstrate that saRNA constructs encoding anti-inflammatory genes effectively reduce cytotoxicity and cytokine production, enhancing the potential of saRNAs. This work is significant for advancing saRNA therapeutics by mitigating unintended immune activation.

      Strengths:

      This study successfully demonstrates the concept of enhancing saRNA applications by encoding immune-suppressive genes. A key challenge for saRNA-based therapeutics, particularly for non-vaccine applications, is the innate immune response triggered by dsRNA recognition. By leveraging viral protein properties to suppress immunity, the authors provide a novel strategy to overcome this limitation. The study presents a well-designed approach with potential implications for improving saRNA stability and minimizing inflammatory side effects.

      Weaknesses:

      (1) Impact on Cellular Translation:

      The authors demonstrate that modified saRNAs with additional components enhance transgene expression by inhibiting dsRNA-sensing pathways. However, it is unclear whether these modifications influence global cellular translation beyond the expression of GFP and mScarlet-3 (which are encoded by the saRNA itself). Conducting a polysome profiling analysis or a puromycin labeling assay would clarify whether the modified saRNAs alter overall translation efficiency. This additional data would strengthen the conclusions regarding the specificity of dsRNA-sensing inhibition.

      (2) Stability and Replication Efficiency of Long saRNA Constructs:

      The saRNA constructs used in this study exceed 16 kb, making them more fragile and challenging to handle. Assessing their mRNA integrity and quality would be crucial to ensure their robustness.<br /> Furthermore, the replicative capacity of the designed saRNAs should be confirmed. Since Figure 4 shows lower inflammatory cytokine production when encoding srIkBα and srIkBα-Smad7-SOCS1, it is important to determine whether this effect is due to reduced immune activation or impaired replication. Providing data on replication efficiency and expression levels of the encoded anti-inflammatory proteins would help rule out the possibility that reduced cytokine production is a consequence of lower replication.

      (3) Comparative Data with Native saRNA:

      Including native saRNA controls in Figures 5-7 would allow for a clearer assessment of the impact of additional genes on cytokine production. This comparison would help distinguish the effect of the encoded suppressor proteins from other potential factors.

      (4) In vivo Validation and Safety Considerations:

      Have the authors considered evaluating the in vivo potential of these saRNA constructs? Conducting animal studies would provide stronger evidence for their therapeutic applicability. If in vivo experiments have not been performed, discussing potential challenges - such as saRNA persistence, biodistribution, and possible secondary effects-would be valuable.

      (5) Immune Response to Viral Proteins:

      Since the inhibitors of dsRNA-sensing proteins (E3, NSs, and L*) are viral proteins, they would be expected to induce an immune response. Analyzing these effects in vivo would add insight into the applicability of this approach.

      (6) Streamlining the Discussion Section:

      The discussion is quite lengthy. To improve readability, some content - such as the rationale for gene selection-could be moved to the Results section. Additionally, the descriptions of Figure 3 should be consolidated into a single section under a broader heading for improved coherence.

    2. Author response:

      Reviewer #1 (Public review):

      Summary:

      The authors have developed self-amplifying RNAs (saRNAs) encoding additional genes to suppress dsRNA-related inflammatory responses and cytokine release. Their results demonstrate that saRNA constructs encoding anti-inflammatory genes effectively reduce cytotoxicity and cytokine production, enhancing the potential of saRNAs. This work is significant for advancing saRNA therapeutics by mitigating unintended immune activation.

      Strengths:

      This study successfully demonstrates the concept of enhancing saRNA applications by encoding immune-suppressive genes. A key challenge for saRNA-based therapeutics, particularly for non-vaccine applications, is the innate immune response triggered by dsRNA recognition. By leveraging viral protein properties to suppress immunity, the authors provide a novel strategy to overcome this limitation. The study presents a well-designed approach with potential implications for improving saRNA stability and minimizing inflammatory side effects.

      We thank Reviewer #1 for their thorough review and for recognizing both the significance of our work and the potential of our strategy to expand saRNA applications beyond vaccines.

      Weaknesses:

      (1) Impact on Cellular Translation:

      The authors demonstrate that modified saRNAs with additional components enhance transgene expression by inhibiting dsRNA-sensing pathways. However, it is unclear whether these modifications influence global cellular translation beyond the expression of GFP and mScarlet-3 (which are encoded by the saRNA itself). Conducting a polysome profiling analysis or a puromycin labeling assay would clarify whether the modified saRNAs alter overall translation efficiency. This additional data would strengthen the conclusions regarding the specificity of dsRNA-sensing inhibition.

      We thank the reviewer for this helpful insight and suggestion. We aim to conduct a puromycin labelling assay to clarify the effect of the various saRNA constructs on translation efficiency.

      (2) Stability and Replication Efficiency of Long saRNA Constructs:

      The saRNA constructs used in this study exceed 16 kb, making them more fragile and challenging to handle. Assessing their mRNA integrity and quality would be crucial to ensure their robustness.

      Furthermore, the replicative capacity of the designed saRNAs should be confirmed. Since Figure 4 shows lower inflammatory cytokine production when encoding srIkBα and srIkBα-Smad7-SOCS1, it is important to determine whether this effect is due to reduced immune activation or impaired replication. Providing data on replication efficiency and expression levels of the encoded anti-inflammatory proteins would help rule out the possibility that reduced cytokine production is a consequence of lower replication.

      This is another very helpful comment. We will conduct an analysis of saRNA integrity and quality by denaturing gel electrophoresis. To examine replicative capacity of the saRNA constructs, we aim to conduct RT-qPCR experiments.

      (3) Comparative Data with Native saRNA:

      Including native saRNA controls in Figures 5-7 would allow for a clearer assessment of the impact of additional genes on cytokine production. This comparison would help distinguish the effect of the encoded suppressor proteins from other potential factors.

      Thank you for your suggestion. We will implement this change in the next version of the manuscript.

      (4) In vivo Validation and Safety Considerations:

      Have the authors considered evaluating the in vivo potential of these saRNA constructs? Conducting animal studies would provide stronger evidence for their therapeutic applicability. If in vivo experiments have not been performed, discussing potential challenges - such as saRNA persistence, biodistribution, and possible secondary effects-would be valuable.

      (5) Immune Response to Viral Proteins:

      Since the inhibitors of dsRNA-sensing proteins (E3, NSs, and L*) are viral proteins, they would be expected to induce an immune response. Analyzing these effects in vivo would add insight into the applicability of this approach.

      We recognize the importance of in vivo studies and immune cell responses and plan to incorporate in vivo imaging in future studies to investigate these interactions, as well as examining delivery of various cargoes via saRNA to determine potential therapeutic benefits in different animal models of inflammatory pain, but such studies are beyond the scope of this current investigation. As suggested by the reviewer, we will incorporate a section on potential challenges of in vivo saRNA work in the revised manuscript.

      (6) Streamlining the Discussion Section:

      The discussion is quite lengthy. To improve readability, some content - such as the rationale for gene selection-could be moved to the Results section. Additionally, the descriptions of Figure 3 should be consolidated into a single section under a broader heading for improved coherence.

      Thank you for your suggestions, we will make these changes in the next revision.

      Reviewer #2 (Public review):

      Summary:

      Lim et al. have developed a self-amplifying RNA (saRNA) design that incorporates immunomodulatory viral proteins, and show that the novel design results in enhanced protein expression in vitro in mouse primary fibroblast-like synoviocytes. They test constructs including saRNA with the vaccinia virus E3 protein and another with E3, Toscana virus NS protein and Theiler's virus L protein (E3 + NS + L), and another with srIκBα-Smad7-SOCS1. They have also tested whether ML336, an antiviral, enables control of transgene expression.

      Strengths:

      The experiments are generally well-designed and offer mechanistic insight into the RNA-sensing pathways that confer enhanced saRNA expression. The experiments are carried out over a long timescale, which shows the enhance effect of the saRNA E3 design compared to the control. Furthermore, the inhibitors are shown to maintain the cell number, and reduce basal activation factor-⍺ levels.

      We thank Reviewer #2 for their detailed assessment and recognition of the mechanistic insights provided by our study.

      Weaknesses:

      One limitation of this manuscript is that the RNA is not well characterized; some of the constructs are quite long and the RNA integrity has not been analyzed. Furthermore, for constructs with multiple proteins, it's imperative to confirm the expression of each protein to confirm that any therapeutic effect is from the effector protein (e.g. E3, NS, L). The ML336 was only tested at one concentration; it is standard in the field to do a dose-response curve. These experiments were all done in vitro in mouse cells, thus limiting the conclusion we can make about mechanisms in a human system.

      We agree that these are weaknesses of our work. We plan to address some of these weaknesses by performing a dose response curve for ML336, examining saRNA integrity through denaturing gel electrophoresis, and will also aim to provide additional evidence for effects of effector proteins through RT-qPCR. We are also looking into testing these constructs in patient-derived FLS.

    1. Voici un sommaire de la vidéo avec des indications temporelles basées sur le déroulement du contenu :

      • Introduction (Début de la vidéo) : L'introduction est faite par Elena, la fondatrice de Toadhouse Games. Elle explique que ce tutoriel est conçu pour les débutants qui n'ont aucune connaissance en codage et que les premières vidéos seront gratuites sur YouTube. Elle présente Ren'Py comme un moteur de roman visuel utilisé par des milliers de créateurs.

      • Qu'est-ce que Ren'Py ? (Environ 0:00 - 1:00) : Ren'Py est un moteur pour créer des romans visuels et de la fiction interactive. Bien qu'il fonctionne avec du code Python, il n'est pas nécessaire de savoir coder pour l'utiliser. Le logiciel fournit tout ce dont vous avez besoin, y compris des éditeurs de texte.

      • Téléchargement de Ren'Py (Environ 1:00 - 2:00) : Il faut se rendre sur ri.org et cliquer sur le bouton de téléchargement. Différentes versions sont disponibles pour Windows, Mac, Linux, Android et iOS. Une fois le fichier téléchargé, il faut l'exécuter et extraire les fichiers dans le dossier de votre choix.

      • Ouverture et présentation du lanceur Ren'Py (Environ 2:00 - 4:00) : Dans le dossier extrait, double-cliquez sur l'application Ren'Py (l'icône avec un anime) pour ouvrir le lanceur. Le lanceur affiche les projets ouverts (tutoriel et question par défaut) et les fichiers associés à chaque projet. Sur la droite, l'option "script" permet d'accéder aux fichiers de code, qui peuvent être édités dans un éditeur de texte comme Atom. Ren'Py peut télécharger et installer Atom pour vous.

      • Exploration des fichiers du projet (Environ 4:00 - 5:00) : Le dossier "game" contient tous les fichiers du jeu (audio, musique, images, etc.). Un raccourci vers le dossier "images" est également disponible. Le fichier "script" contient le code du jeu, y compris les dialogues, les transitions, la musique et les scènes. Les options et les écrans (screens) permettent de personnaliser l'apparence du jeu.

      • Construction et distribution du jeu (Environ 5:00 - 5:30) : L'option "build distributions" permet de créer une version jouable de votre jeu pour la partager avec d'autres sur différentes plateformes comme PC, Linux, Mac, itch.io ou Steam.

      • Exercice pratique avec le projet "The Question" (Environ 5:30 - 8:00) : Il est recommandé de sélectionner le projet "the question" et de lancer le projet pour jouer au jeu. Ensuite, ouvrez le script du projet "the question". L'exercice consiste à jouer au jeu tout en regardant le code correspondant dans l'éditeur de texte. Cela permet de comprendre comment le code contrôle le déroulement du jeu (musique, scènes, dialogues, choix). Il est possible de faire de petites modifications dans le script et de recharger le jeu pour voir les changements.

      • Présentation de Scrivener (Environ 8:00 - 9:00) : Scrivener est un logiciel optionnel qui peut être utilisé pour écrire le dialogue et organiser le contenu de votre roman visuel. Un modèle Ren'Py pour Scrivener créé par Toadhouse Games est disponible. Scrivener propose des conseils d'écriture de base et des modèles pour les profils de personnages et le code Ren'Py.

      • Conclusion (Environ 9:00 - Fin de la vidéo) : Elena encourage les spectateurs à commencer à expérimenter avec Ren'Py en modifiant le projet "the question". Des tutoriels plus avancés sur les "flags" et les choix seront proposés ultérieurement. Des ressources d'aide sont disponibles sur Twitter, par e-mail (teamtoadhouse@gmail.com), sur les subreddits et les forums Ren'Py, ainsi que sur le Discord de Toadhouse Games.

    1. Voici un sommaire avec des indications temporelles basées sur le déroulement de la vidéo :

      • Introduction (Début) : Elena Linaire, fondatrice et directrice créative de Team Toad House et Toad House Games, présente le studio et annonce un game jam de visual novels sur itch.io.

      Elle mentionne des ateliers animés par des professionnels de Toad House pour aider à la création de visual novels. L'objectif est de rendre la création de jeux accessible aux débutants.

      • Téléchargement de Ren'Py (Environ 2-3 minutes) : Elena explique comment télécharger Ren'Py depuis le site renpy.org.

      Elle précise que Ren'Py est un moteur de jeu open source et gratuit spécialement conçu pour les visual novels. Elle cite d'autres moteurs de jeu comme Unity, Unreal, Game Maker, Godot et Twine, notant qu'ils sont adaptés à différents types de jeux.

      Elle souligne que la connaissance de Python n'est pas nécessaire pour utiliser Ren'Py, bien que le "pi" dans Ren'Py fasse référence à Python.

      • Documentation et Ressources (Environ 4-5 minutes) :

      Elena mentionne que le site de Ren'Py contient de la documentation, qui est parfois considérée comme peu conviviale.

      Elle recommande également le serveur Discord Ren'Py et le forum Lumisoft comme ressources d'aide.

      La documentation couvre les bases et les utilisations plus spécifiques de Ren'Py, y compris les systèmes de dates, de monnaie et d'inventaire.

      • Lancement et Création d'un Nouveau Projet (Environ 6-10 minutes) :

      Elena montre l'interface du lanceur Ren'Py, affichant des projets existants comme ceux de Toad House Games.

      Elle explique comment créer un nouveau projet, choisir la langue (avec des options comme le pig latin) et sélectionner un éditeur de texte (recommandant Adam, qui peut être téléchargé directement depuis Ren'Py).

      Il est possible de choisir la résolution du projet, avec 1280x720 comme valeur par défaut, et un schéma de couleurs clair ou foncé pour l'interface (GUI).

      • Structure des Fichiers d'un Projet (Environ 11-13 minutes) :

      Elena présente la structure des dossiers créés pour un nouveau projet Ren'Py, notamment les dossiers game (contenant images, audio, gui, saves), audio cache et autres.

      Elle explique que le fichier script.rpy est l'endroit où le code du jeu est écrit.

      Elle montre comment remplacer l'icône de l'application et modifier les éléments de l'interface graphique dans le dossier gui.

      • Jeu Ren'Py par Défaut et Code de Base (Environ 14-16 minutes) :

      Elena lance le projet par défaut de Ren'Py pour montrer les fonctionnalités intégrées comme les sauvegardes, les chargements, les préférences (volume, plein écran, saut, etc.) et l'écran "À propos". Elle exécute le court jeu par défaut pour illustrer la structure de base : arrière-plan (bg), sprites et dialogue.

      Elle ouvre ensuite le fichier script.rpy dans Adam pour montrer le code correspondant, expliquant les déclarations de personnages (define) et le point de départ du jeu (label start).

      • Définir des Personnages et Écrire du Dialogue (Environ 17-19 minutes) : Elena explique comment définir des personnages avec un nom et une couleur de texte.

      Elle montre comment écrire du dialogue en utilisant le nom du personnage défini. Elle aborde la question de la gestion de plusieurs personnages avec des noms similaires.

      • Outil Narratif Scrivener (Environ 19-22 minutes) : Elena présente Scrivener comme un outil utile pour la planification et l'écriture du récit d'un visual novel, permettant d'organiser l'intrigue, les dialogues et même d'intégrer des éléments de code de base avant de les copier-coller dans Ren'Py.

      • Narration et Positionnement du Texte (Environ 22-24 minutes) : Elena explique comment gérer le texte de narration (sans nom de personnage), souvent utilisé pour les pensées internes.

      Elle mentionne les deux modes de texte principaux dans Ren'Py : en bas de l'écran et en plein écran (NVL). Elle déconseille de placer le texte narratif ailleurs à cause de la complexité du code.

      • Choix, Sauts (Jump), Appels (Call) et Drapeaux (Flags) (Environ 25-41 minutes) :

      Elena démontre comment créer des choix (menu) dans Ren'Py, en utilisant les mots-clés menu, les options et les actions à entreprendre (texte, jump vers un autre label, call à un autre label).

      Elle explique la différence entre jump (saut sans retour) et call (appel avec retour après un return).

      Elle introduit le concept de drapeaux (flags), qui sont des variables utilisées pour suivre les décisions du joueur et influencer le déroulement de l'histoire (default nom_du_drapeau = False, \$ nom_du_drapeau = True, if nom_du_drapeau:).

      Elle montre comment les drapeaux peuvent être utilisés avec des instructions if pour afficher du contenu conditionnel.

      • Analyse des Jeux Tutoriel et "The Question" (Environ 41-47 minutes) :

      Elena examine le jeu tutoriel inclus avec Ren'Py, soulignant ses fonctionnalités (sauvegarde, chargement, préférences, rollback, historique) et son contenu éducatif sur les bases de Ren'Py.

      Elle explore ensuite le jeu d'exemple "The Question", attirant l'attention sur l'analyse du jeu du point de vue d'un développeur (apparence des sprites, positionnement, expressions, choix).

      Elle montre comment le code du jeu "The Question" utilise la définition de personnages avec des couleurs de texte personnalisées (codes hexadécimaux), les drapeaux et la structure des labels pour créer des choix et des embranchements narratifs.

      • Exemple de Code de "Good Looking Home Cooking" (Environ 47-1 heure 17 minutes) :

      Elena présente le code de son jeu "Good Looking Home Cooking" comme un exemple plus complexe, montrant l'utilisation de définitions pour les sons, les curseurs et les personnages (avec des propriétés comme la couleur et le texte alternatif pour la synthèse vocale).

      Elle explique l'utilisation de variables pour suivre les choix importants (drapeaux) et comment ces drapeaux sont utilisés dans les menus et les instructions conditionnelles pour créer différents embranchements et fins.

      Elle illustre l'utilisation de transformations et de positionnement pour les sprites, les transitions (dissolve, fade to black), et la création d'une séquence de crédits animée à partir d'une image PNG défilante. Elle discute des différentes fins possibles dans un visual novel (bonne, mauvaise, tiède).

      • Taille des Images et Arrière-Plans (Environ 1 heure 17 minutes - 1 heure 21 minutes) :

      Elena aborde la question de la taille des images, expliquant qu'elle est déterminée par essai et erreur pour correspondre à la résolution du projet et à l'aspect souhaité. Elle montre des exemples d'arrière-plans filtrés dans le jeu Ren'Py "Karashojo".

      • Conseils de Gestion du Temps et Conclusion (Environ 1 heure 21 minutes - Fin) :

      Elena termine en donnant des conseils pour la gestion du temps et éviter le "crunch" pendant le game jam, notamment en se fixant des mini-échéances régulières et en étant réaliste quant à la portée du projet.

      Elle encourage les participants à utiliser des outils comme Scrivener et à rejoindre la communauté Discord pour trouver de l'aide et des collaborateurs.

      Elle rappelle l'importance d'inclure la mention légale de Ren'Py dans le jeu.

    1. Voici un sommaire avec des horodatages (timestamps) de la vidéo, mettant en évidence les points les plus importants en gras et en français :

      • [0:00] Introduction et bienvenue à la chaîne Toadhouse par une artiste invitée, Thunderbird Paints, qui est l'artiste principale et directrice artistique de Toadhouse Games. Elle prévoit de parler de la conception des environnements, des personnages et potentiellement de l'interface utilisateur pour les romans visuels.
      • [1:00] Début de la discussion sur la conception des personnages et des environnements pour les romans visuels, en commençant par les personnages car ils sont la base d'un roman visuel.
      • [2:00] L'importance d'avoir une idée du scénario avant de concevoir les personnages. L'artiste lance l'idée de créer un roman visuel fictif avec des thèmes rétro, comme un décor de foire Renaissance ou de western cowboy.
      • [3:00] Choix d'un thème de roman visuel sur le thème de Donjons et Dragons (D&D). Discussion sur le personnage principal : statique ou adaptable (avec un générateur de personnages). Le chat vote pour un personnage principal spécifique.
      • [3:00] Détails du personnage principal : un barde avec un accent texan, non binaire. Discussion sur la race du personnage (humain, demi-orc, demi-elfe).
      • [4:00] Application de la discussion sur la conception des personnages concernant les scènes spéciales (CG) et l'interaction avec d'autres personnages, en tenant compte de l'esthétique et des constructeurs de personnages.
      • [5:00] Principes de base pour dessiner un personnage, en commençant généralement par une illustration de référence (turnaround). Importance de considérer l'esthétique de base et le potentiel de cosplay.
      • [6:00] Début du croquis d'un personnage demi-orc non binaire, en se concentrant sur la forme générale et les épaules. Discussion sur les méthodes de coloration (traits et remplissage ou une seule couche).
      • [7:00] Continuation du croquis de référence (turnaround), en veillant à l'alignement des différentes parties du corps. Utilisation de guides pour la hauteur et la variance des types de corps et des formes.
      • [8:00] Esquisse des traits du visage, en commençant par le nez d'orc. Importance de l'hydratation (mention du robot à eau). Discussion sur l'influence d'un chapeau de cowboy sur la coiffure et l'utilisation de références.
      • [9:00] Ajout de détails à la tenue, comme un gilet. Discussion sur le fait que les romans visuels ne doivent pas nécessairement être romantiques et qu'ils peuvent inclure des mini-jeux et des scénarios non linéaires.
      • [10:00] Poursuite de l'ajout de détails à la tenue (jambières, éperons). Mention de romans visuels comme Persona. Pause thé avec l'apparition des chats.
      • [11:00] L'inspiration du jour liée à la citation du sachet de thé. Continuation du dessin des bottes de cowboy. Discussion sur la quantité d'art de personnage typiquement visible dans un roman visuel (buste ou taille).
      • [12:00] Importance d'avoir des sprites de corps entier pour les zooms et les effets, ainsi que pour les scènes spéciales. Ajout d'éléments à la tenue (sac, bretelles).
      • [13:00] Finalisation de la silhouette et ajout d'une guitare. Réglage de la forme du chapeau. Début de la stylisation de la tenue.
      • [13:00] Séparation des calques pour le croquis de face et de dos. Ajout d'une ceinture. Le chat approuve le design.
      • [14:00] Début de la coloration du personnage, en utilisant une seule couche pour la plupart de l'illustration conceptuelle. Discussion sur les couleurs de peau des tieffelins dans D&D.
      • [15:00] Discussion sur la personnalisation des personnages dans les jeux et la création d'éléments de personnalisation.
      • [15:00] Retour à la définition des traits du visage, en se concentrant sur la forme du visage, les expressions et les couleurs.
      • [16:00] Informations généralement fournies lors de la création d'un design de personnage (personnalité, historique, apparence physique, couleur préférée). Le rôle de l'artiste dans la traduction de la personnalité en esthétique visuelle.
      • [17:00] Fin de la coloration de la tenue principale. Transition vers la conception des environnements. Importance de la posture pendant le dessin et suggestion d'étirements.
      • [17:00] Couleur de la guitare inspirée de l'ancienne guitare de l'artiste. Couleur de la ceinture.
      • [18:00] Couleur du bord du chapeau et des bottes. Choix de la couleur du pantalon pour créer un schéma de couleurs cohérent. Discussion sur la modification des couleurs.
      • [19:00] Célébration des abonnements offerts à la chaîne. Ajout d'un détail de chemise dépassant.
      • [20:00] Discussion sur la création de sprites de personnages et l'importance des expressions faciales. Le nombre d'expressions nécessaires (généralement six ou sept).
      • [21:00] Croquis rapide d'une guitare. L'idée de la conception prime sur le détail parfait.
      • [21:00] Ajout rapide d'orteils. L'artiste utilise le même pinceau pour la plupart du concept.
      • [22:00] Ajout de boutons. Idée d'utiliser l'asymétrie dans la conception des tenues pour montrer qu'un personnage est légèrement déséquilibré.
      • [23:00] Exemples de modifications de la tenue pour créer différents aspects (manches, capes, gants).
      • [23:00] Retour à la conception de base et discussion sur les expressions faciales (colère, tristesse).
      • [24:00] Question sur le stockage des expressions faciales et des poses corporelles. L'artiste montre des exemples de son travail sur "Call Me Sarah" et "Good Looking Home Cooking", où les corps et les expressions sont généralement conservés séparément pour permettre des changements dans le langage corporel.
      • [26:00] Conseils pour accélérer la conception des personnages pour un délai court (nombre d'expressions limité, poses simples, réutilisation d'éléments).
      • [27:00] Démonstration rapide de changement d'expressions faciales en modifiant des éléments comme les yeux, la bouche et les sourcils, tout en conservant le même corps.
      • [28:00] Transition vers la conception des environnements après une courte pause.
      • [29:00] Introduction à la conception des environnements, perspective et outils (pinceaux de perspective). Importance de placer la ligne d'horizon au milieu pour un roman visuel.
      • [30:00] Croquis rapide d'un coin de rue médiéval, en utilisant des formes simples pour les bâtiments. Utilisation de la touche Shift pour tracer des lignes droites.
      • [31:00] Ajout d'éléments comme des arbres et un magasin général. Discussion sur l'alignement des personnages et des arrière-plans pour qu'ils s'intègrent mieux.
      • [32:00] Utilisation de références photographiques pour la taille des bâtiments par rapport aux personnages (exemple d'Hershey Park). Ajustement de la taille et de l'emplacement des éléments d'arrière-plan.
      • [33:00] Zoom sur une partie de l'esquisse d'arrière-plan pour montrer un arrière-plan potentiel pour un roman visuel. Début du blocage des couleurs de l'environnement.
      • [34:00] Poursuite du blocage des couleurs (bâtiments, arbres). Importance de créer des calques séparés pour les éléments d'arrière-plan pour faciliter les modifications.
      • [35:00] Incident technique avec le stylet de la tablette. Reprise et continuation du blocage des couleurs.
      • [36:00] Présentation d'exemples d'environnements précédemment créés et promotion de l'événement annuel "Environment-tober" pour la pratique des environnements.
      • [37:00] Retour à l'esquisse de l'environnement, suppression des lignes de perspective et début de l'ajout de détails et de considérations sur la source de lumière.
      • [38:00] Démonstration de la création de différentes ambiances lumineuses (crépuscule, matin) en dupliquant le calque de couleurs de base et en utilisant des modes de fusion et des couleurs différents.
      • [39:00] Importance des ombres pour les bâtiments et le paysage. Démonstration rapide de la création d'une scène de nuit avec des étoiles.
      • [40:00] Finalisation de la démonstration de la création d'une scène de nuit. Remarques de clôture, excuses pour ne pas avoir abordé l'interface utilisateur, invitation à poser des questions sur Discord.
      • [41:00] Message de fin et salutations.

    Annotators

    1. AbstractMicrobiome-based disease prediction has significant potential as an early, non-invasive marker of multiple health conditions linked to dysbiosis of the human gut microbiota, thanks in part to decreasing sequencing and analysis costs. Microbiome health indices and other computational tools currently proposed in the field often are based on a microbiome’s species richness and are completely reliant on taxonomic classification. A resurgent interest in a metabolism-centric, ecological approach has led to an increased understanding of microbiome metabolic and phenotypic complexity revealing substantial restrictions of taxonomy-reliant approaches. In this study, we introduce a new metagenomic health index developed as an answer to recent developments in microbiome definitions, in an effort to distinguish between healthy and unhealthy microbiomes, here in focus, inflammatory bowel disease (IBD). The novelty of our approach is a shift from a traditional Linnean phylogenetic classification towards a more holistic consideration of the metabolic functional potential underlining ecological interactions between species. Based on well-explored data cohorts, we compare our method and its performance with the most comprehensive indices to date, the taxonomy-based Gut Microbiome Health Index (GMHI), and the high dimensional principal component analysis (hiPCA)methods, as well as to the standard taxon-, and function-based Shannon entropy scoring. After demonstrating better performance on the initially targeted IBD cohorts, in comparison with other methods, we retrain our index on an additional 27 datasets obtained from different clinical conditions and validate our index’s ability to distinguish between healthy and disease states using a variety of complementary benchmarking approaches. Finally, we demonstrate its superiority over the GMHI and the hiPCA on a longitudinal COVID-19 cohort and highlight the distinct robustness of our method to sequencing depth. Overall, we emphasize the potential of this metagenomic approach and advocate a shift towards functional approaches in order to better understand and assess microbiome health as well as provide directions for future index enhancements. Our method, q2-predict-dysbiosis (Q2PD), is freely available (https://github.com/Kizielins/q2-predict-dysbiosis).

      This work has been peer reviewed in GigaScience (https://doi.org/10.1093/gigascience/giaf015), which carries out open, named peer-review. These reviews are published under a CC-BY 4.0 license and were as follows:

      Reviewer 2: Saritha Kodikara

      In this study, the authors present a novel metagenomic health index designed to differentiate between healthy and unhealthy microbiomes. This area of research is crucial for developing a non-invasive, cost-effective method to assess patient health status. However, I have several suggestions that I believe will enhance the study and address some key points.

      Main Comments:

      1.) The study would benefit from additional post-analysis to provide greater depth. Although the authors applied their approach to several diseases, they did not elaborate on the significance of individual microbiome features across different diseases. For instance, the GMHI parameters were identified as least important in IBD—does this observation hold universally across all diseases analysed?

      2.) The index Q2D performed worse in AGP1 compared to HMP2 and AGP2. Is there a specific reason for this discrepancy? For example, does the index underperform in the heterogeneous functional landscape presented in AGP1 (Figure 2C)? An explanation for the reduced performance in this cohort would provide valuable insights into the method's performance under varying conditions.

      3.) It would be beneficial to make all processed data and relevant scripts available in a GitHub repository to ensure that the results presented in the paper can be replicated by other researchers.

      4.) When attempting to run the script available at https://github.com/Kizielins/q2-predict-dysbiosis, I encountered an error related to the scikit-learn version. The script appears to be compatible with version 1.2.2, whereas I was using version 1.4.2. Please consider updating the script or providing instructions for resolving version compatibility issues.

      5.) The rationale behind considering only positive correlations when calculating the index is unclear. It would be helpful to clarify why negative correlations were excluded from the index calculations.

      6.) In analysing longitudinal alterations, did the authors account for dependencies from previous time points Q2D index? If not, how do these longitudinal alterations differ from those observed in independent studies?

      7.) For each dataset analysed, additional details would be useful, such as the number of samples, species, functions, core functions, and the number of species remaining after applying the MDFS algorithm.

      8.) On Page 13, the authors state that they chose GMHI as their benchmark because hiPCA and Shannon entropy produced worse results for the HMP2 cohort. However, Supplementary Table 3 indicates that Shannon entropy had a lower p-value than GMHI in the Mann-Whitney U test.

      Minor comments:

      1) Page 11 Original: "Collecting information on feature importance at every iteration of the cross-validation procedure model, we consistently identified the two GMHI parameters as the least important (Figure 5b)." Suggested: "Collecting information on feature importance at every iteration of the cross-validation procedure model, we consistently identified the two GMHI parameters as the least important (Figure 4b??)."

      2) Page 12 Original: "Most importantly, Q2PD produced visually the highest scores for all healthy in comparison to unhealthy cohorts." Suggested: "Most importantly, Q2PD produced visually the highest median?? scores for all healthy in comparison to unhealthy cohorts."

      3) Page 12 Original: "Q2PD was also the only index to produce a statistically significant difference between Healthy and Obese in HMP2" Suggested: "Q2PD was also the only index to produce a statistically significant difference between Healthy and Obese in AGP2??"

      4) Page 14 Original: "The Q2PD important in all datasets that were included in its training and validation, specifically AGP_1, AGP_2 and HMP2 (Table 1, Supplementary Figure 7)." Suggested: "The Q2PD important in all datasets that were included in its training and validation, specifically AGP_1, AGP_2 and HMP2 (Table 1, Supplementary Figure 8??)."

    2. AbstractMicrobiome-based disease prediction has significant potential as an early, non-invasive marker of multiple health conditions linked to dysbiosis of the human gut microbiota, thanks in part to decreasing sequencing and analysis costs. Microbiome health indices and other computational tools currently proposed in the field often are based on a microbiome’s species richness and are completely reliant on taxonomic classification. A resurgent interest in a metabolism-centric, ecological approach has led to an increased understanding of microbiome metabolic and phenotypic complexity revealing substantial restrictions of taxonomy-reliant approaches. In this study, we introduce a new metagenomic health index developed as an answer to recent developments in microbiome definitions, in an effort to distinguish between healthy and unhealthy microbiomes, here in focus, inflammatory bowel disease (IBD). The novelty of our approach is a shift from a traditional Linnean phylogenetic classification towards a more holistic consideration of the metabolic functional potential underlining ecological interactions between species. Based on well-explored data cohorts, we compare our method and its performance with the most comprehensive indices to date, the taxonomy-based Gut Microbiome Health Index (GMHI), and the high dimensional principal component analysis (hiPCA)methods, as well as to the standard taxon-, and function-based Shannon entropy scoring. After demonstrating better performance on the initially targeted IBD cohorts, in comparison with other methods, we retrain our index on an additional 27 datasets obtained from different clinical conditions and validate our index’s ability to distinguish between healthy and disease states using a variety of complementary benchmarking approaches. Finally, we demonstrate its superiority over the GMHI and the hiPCA on a longitudinal COVID-19 cohort and highlight the distinct robustness of our method to sequencing depth. Overall, we emphasize the potential of this metagenomic approach and advocate a shift towards functional approaches in order to better understand and assess microbiome health as well as provide directions for future index enhancements. Our method, q2-predict-dysbiosis (Q2PD), is freely available (https://github.com/Kizielins/q2-predict-dysbiosis).

      This work has been peer reviewed in GigaScience (https://doi.org/10.1093/gigascience/giaf015), which carries out open, named peer-review. These reviews are published under a CC-BY 4.0 license and were as follows:

      Reviewer 1: Vanessa Marcelino

      The manuscript proposes a new method to distinguish between healthy and diseased human gut microbiomes. The topic is timely, as to date, there is no consensus on what constitutes a healthy microbiome. The key conceptual advance of this study is the integration of functional microbiome features to define health. Their new computational approach, q2-predict-dysbiosis (Q2PD), is open source and available on GitHub.

      While the manuscript is conceptually innovative and interesting for the scientific community, there are several major limitations in the current version of this study.

      1. To develop the Q2PD, they define features associated with health by comparing it with microbiome samples from IBD patients. There are many more non-healthy/dysbiotic phenotypes beyond IBD, therefore it is not accurate to use IBD as synonymous of dysbiosis as done throughout this version of the paper.

      2. The study initially tests the performance of Q2PD against other gut microbiome health indexes (GMHI and hiPCA) using the same data that was used to select the health-associated features of Q2PD. Model performance should be assessed on independent data. On a separate analysis, they do use different datasets (from GMHI and hiPCA), but these datasets seem to be incomplete - GMHI and hiPCA publications have included 10 or more disease categories, and it is unclear why only 4 categories are shown in this study.

      3. While Q2PD does provide visible improvements in differentiating some diseases from healthy phenotypes, the accuracy and sensitivity of Q2PD isn't clear. To adopt Q2PD, I would like to know what are the chances that the classification results will be correct.

      4. There is very little documentation on how to use Q2PD. What are the expect outputs for example, do we need to chose a threshold to define health? Is the method completely dependent on Humann and Metaphlan outputs, or other formats are accepted? The test data contain some samples with zero counts. I got an error when trying it with the test data (ValueError: node array from the pickle has an incompatible dtype…).

      Therefore, I recommend including a range of disease categories to develop Q2PD and use independent datasets to validate the model in terms of accuracy and sensitivity. Alternatively, consider focusing this contribution on IBD. Making the code more user friendly will drastically increase the adoption of Q2PD by the community.

      Please also use page and line numbers when submitting the next version. Other suggestions:

      Abstract: I recommend replacing 'attributed' with 'linked', as 'attributed' suggests that dysbiosis may be causing (rather than reflecting) disease.

      Results: Please indicate what it is meant by 'function' here - it will be good to clarify that this method uses Metaphlan's read-based approach to identify metabolic pathways. What is used, pathway completeness or abundance?

      Results regarding Figure 3a are difficult to interpret. Is 'non-negatively correlated' the same as 'positively correlated'? What does the colour gradient represent - their abundance in those groups, or the strength of their correlation?

      "We observed that the prevalence of the pairs positively correlated in health was higher than in a number of disease-associated groups (Figure 3b)" . This is a very generalised statement considering that only half of the comparisons were significant. How co-occurring species were selected?

      "To test this, we compared the contributions of MDFS-identified species to "core functions" in different groups (Supplementary Figure 4)." How was this comparison made, based on species correlations? The caption of these figures could include more detail - it just says 'Top species contributions to functions.' but how do you define 'top' ? What do the colours represent?

      'This finding was congruent with our earlier suspicions of functional plasticity; modulation of function and thus altered connectivity in the interaction network, shifting towards less abundant, non-core functions upon perturbation of homeostasis.' This is reasonable, but I don't understand how you can draw this conclusion from these figures where there seems to be no significant difference between health and disease.

      Section 'Testing q2-predict-dysbiosis, GMHI and hiPCA accuracy of prediction for healthy and IBD individuals'

      What is the difference between fraction of "core functions" found the fraction of "core functions" among all functions?

      "Most importantly, Q2PD produced visually the highest scores for all healthy in comparison to unhealthy cohorts" . This was not statistically significant. In fact, GMHI finds more significant differences between health and disease than Q2PD.

      Sup. Figure 7 - would be informative to add the name/description of these metabolites not just their ID).

      'Although the threshold of 0.6 as determinant of health by the Q2PD was not applicable to the new datasets'. Does the threshold to define health with Q2PD change depending on the dataset? What are the implications of this for the applicability of this index?

      Effects of sequencing depth - this is a very good addition to the paper, the effects of sequencing depth can be profound but are ignored in most studies, so I commend the authors for doing this here. It would be even better, in my opinion, if this was done with the same datasets used to test/compare Q2PD with other methods, as using a different dataset here adds a new layer of confounding factors.

      'the GMHI and the hiPCA produced the opposite trend, wrongly indicating patient recovery.' The difference here is striking, what is driving this trend?

      The Gut Microbiome Wellness Index 2 (GMWI2) is now published. I don't think it needs to be part of the benchmarking, but it could be acknowledged/cited here.

      Methods: More information on how the data was processed is needed - how were the abundance tables normalized? Which output from Humann was used for downstream analyses?

      To ensure reproducibility, please provide the scripts/code used for analyses and figures.

    1. AbstractBackground Spiders generally exhibit robust starvation resistance, with hunting spiders, represented by Heteropoda venatoria, being particularly outstanding in this regard. Given the challenges posed by climate change and habitat fragmentation, understanding how spiders adjust their physiology and behavior to adapt to the uncertainty of food resources is crucial for predicting ecosystem responses and adaptability.Results We sequenced the genome of H. venatoria and, through comparative genomic analysis, discovered significant expansions in gene families related to lipid metabolism, such as cytochrome P450 and steroid hormone biosynthesis genes. We also systematically analyzed the gene expression characteristics of H. venatoria at different starvation resistance stages and found that the fat body plays a crucial role during starvation in spiders. This study indicates that during the early stages of starvation, H. venatoria relies on glucose metabolism to meet its energy demands. In the middle stage, gene expression stabilizes, whereas in the late stage of starvation, pathways for fatty acid metabolism and protein degradation are significantly activated, and autophagy is increased, serving as a survival strategy under extreme starvation. Additionally, analysis of expanded P450 gene families revealed that H. venatoria has many duplicated CYP3 clan genes that are highly expressed in the fat body, which may help maintain a low-energy metabolic state, allowing H. venatoria to endure longer periods of starvation. We also observed that the motifs of P450 families in H. venatoria are less conserved than those in insects, which may be related to the greater polymorphism of spider genomes.Conclusions This research not only provides important genetic and transcriptomic evidence for understanding the starvation mechanisms of spiders but also offers new insights into the adaptive evolution of arthropods.

      This work has been peer reviewed in GigaScience (https://doi.org/10.1093/gigascience/giaf019), which carries out open, named peer-review. These reviews are published under a CC-BY 4.0 license and were as follows:

      Reviewer 2: Sandra Correa-Garhwal

      The manuscript "Genomic and transcriptomic analyses of Heteropoda venatoria reveal the expansion of P450 family for starvation resistance in spider" uses comparative genomics to study the underlying mechanisms of starvation resistance. I appreciate that the authors have produced a high-quality genome for an RTA species. The methods are sound and some interesting gene families are highlighted as key factors in starvation resistance.

      One primary concern I have relates to the study's setup and hypothesis. As currently written, the study comes across as a fishing expedition rather than a focused research project. Although the introduction is informative, it lacks a clear rationale for including this particular species. The reasoning only becomes apparent at the end of the gene family expansion and contraction section. Additionally, I am unsure if being an active hunter makes feeding more unpredictable compared to web-based prey capture. I recommend incorporating this information into the introductory paragraph to better establish the context for the analysis. While terms like "autophagy" and "energy homeostasis" are appropriate for a scientific audience, consider briefly defining them for clarity, especially if the intended audience might not be familiar with all the terminology. Although authors mention that there is no high-quality genome sequence for H. venatoria, it could be helpful to elaborate on why this is significant for understanding starvation resistance. A brief explanation of how genomic data could enhance understanding of the molecular mechanisms involved would strengthen this point. The conclusion provides a clear goal for your study, but it could be more impactful. You might want to emphasize the broader implications of your research findings for ecological conservation and biodiversity. End with a statement about the importance of understanding these mechanisms in the context of preserving ecosystems and addressing challenges posed by climate change.

      For the discussion, while the content is detailed, some parts feel slightly repetitive or could be more concise. For instance, the description of P450 gene expression could be streamlined by removing redundant mentions of their role in metabolic rate regulation. Example: In the discussion section "Interestingly, we found that some P450 families are expanded in H. venatoria, and most P450 genes are more highly expressed in the fat body than in other tissues…" This point is later reiterated in the sentence about other spider species. These ideas could be combined for efficiency. The paragraph about the phylogenetic analysis of the CYP3 clan could be shortened. While it is an interesting finding, some of the details (like the number of genes or proteins) might be better suited for the main text rather than a summary. Focusing more on the functional implications of these duplications would keep the reader engaged. Though the findings are well-explained, the broader significance could be emphasized more explicitly. For example, why is understanding these mechanisms important for the field of arachnid biology, evolutionary biology, or even practical applications (e.g., pest control, conservation)? You could add a closing sentence that ties everything together and highlights the broader relevance of the findings, such as the evolutionary or ecological importance of these adaptations in spiders.

      Other comments: Last paragraph of the introduction: When introducing Heteropoda venatoria, please spell out the species name the first time that is used. The sentence "However, these findings indicate that H. venatoria does not feed in a stable manner and often experiences periods of starvation." Does not fit the rest of the text. Finding from what study? Transcription design for starvation resistance in H. venatoria section: First sentence: What samples? confusing to start like this. Please add information about the samples. You could delete "the samples of H. venatoria were subjected to" it will read better. Are all 23 CYP# clan genes on chromosome 4 tandemly arrayed? Figure 4 - add more information about the figure. For pannel C, What do the red lines show? Grey? Numbers in the circles? While I know what they represent, other readers might not. The finding that H. venatoria chromosomes have undergone lots of chromosomal fragmentation is very interesting, and it is clearly shown on the figure. Which is why I think that more detail is needed. In this sentence "In Uloborus diversus, members of this subfamily are located on Chr5 and an unanchored scaffold." You need to specify which members. Figure 5 - Include a description of the tissues. What is Epi? Ducts? Tail?

    2. AbstractBackground Spiders generally exhibit robust starvation resistance, with hunting spiders, represented by Heteropoda venatoria, being particularly outstanding in this regard. Given the challenges posed by climate change and habitat fragmentation, understanding how spiders adjust their physiology and behavior to adapt to the uncertainty of food resources is crucial for predicting ecosystem responses and adaptability.Results We sequenced the genome of H. venatoria and, through comparative genomic analysis, discovered significant expansions in gene families related to lipid metabolism, such as cytochrome P450 and steroid hormone biosynthesis genes. We also systematically analyzed the gene expression characteristics of H. venatoria at different starvation resistance stages and found that the fat body plays a crucial role during starvation in spiders. This study indicates that during the early stages of starvation, H. venatoria relies on glucose metabolism to meet its energy demands. In the middle stage, gene expression stabilizes, whereas in the late stage of starvation, pathways for fatty acid metabolism and protein degradation are significantly activated, and autophagy is increased, serving as a survival strategy under extreme starvation. Additionally, analysis of expanded P450 gene families revealed that H. venatoria has many duplicated CYP3 clan genes that are highly expressed in the fat body, which may help maintain a low-energy metabolic state, allowing H. venatoria to endure longer periods of starvation. We also observed that the motifs of P450 families in H. venatoria are less conserved than those in insects, which may be related to the greater polymorphism of spider genomes.Conclusions This research not only provides important genetic and transcriptomic evidence for understanding the starvation mechanisms of spiders but also offers new insights into the adaptive evolution of arthropods.

      This work has been peer reviewed in GigaScience (https://doi.org/10.1093/gigascience/giaf019), which carries out open, named peer-review. These reviews are published under a CC-BY 4.0 license and were as follows:

      Reviewer 1: Hui Xiang

      In this study, the authors deciphered the chromosome-level genome of a RTA spider Heteropoda venatoria with large body size and generated comprehensive comparative transcriptomes of fat body and whole body among CK and starvation status. Generally, this study added important genomic and transcriptomic data of spiders and provided some cues in understanding the molecular changes during starvation. However, the organization of the manuscript is quite problematic. 1. As to the Results section, please be concise and highlight the main results,avoiding accumulating complex results. Do not present too many statements in terms of introduction and discussion in Results. Do not raise too many hypotheses in the results. 2. As for the involvement of the Hippo signaling pathway in lipid metabolism regulation, the cited literature and mentioned genes are not related to the results of this study. As for the analysis of P450 results, the descriptions of structural analysis are quite complex and difficult to understand. The authors did not explain clearly the relationship between the expansion of P450 genes and hunger resistance in the results of this study. 3. The author's analyses of DEG enrichment results in transcriptome analysis is confusing. Firstly,I can't agree with the authors in that "During the early stage of starvation (from CK to 2 W), many genes, specifically those involved in oxidative phosphorylation and thermogenesis pathways, were up-regulated (Fig. 2E). These findings indicate that during the early starvation stage, energy metabolism in H. venatoria occurs regularly, with sufficient supply of energy." There are a batch of DEGs between 2W and CK, and a lot of pathways involved in neurodegeneration related pathways. How to explain these changes? Secondly, as to 4W to 8W, I can not understand the relationship of down-regulation of hippo signaling pathway to the authors' speculation that "H. venatoria may reduce its cellular glucose uptake and utilization to adjust to the food-scarce environment.", as this pathway involved in lipid metabolism, as the authors stated. Thirdly, from 14 W to 19 W, pathways such Lysosome and apoptosis were down-regulated instead of up-regulated. So how the authors thought autophagy became more active? 4. "We speculate that during the evolution of spider genomes, two types of repeat sequences, TcMar and LTR sequences, had a significant impact on the size of spider genomes. Interestingly, we found that in H. venatoria chromosomes, regions with a high proportion of repeats also presented an increase in GC content (Fig. 1B)" The author's conclusion that high repeat region has higher CG content is based on Fig1B alone, which is too arbitrary. They needs more solid evidence and more detailed analysis. For example, the GC content of TE region could be compared with that of whole genome, and the GC content of gene region. The significance of the relevant results should be explained. In addition, the author should make a more convincing discussion of this result based on the more literature. 5. "We gathered genomic data and annotations for one scorpion and seven chromosome-level spider genomes using the scorpion as an outgroup [35-42]"。Many spider genomes have been published at the chromosomal level. What were the principles behind the spider genomes the authors selected in this study? 6. "Transcriptome design for starvation resistance in H. venatoria" in Results should be partially moved Methods and here the authors should straightforwardly highlighted the results . 7. I can't understand the significance of Fig 2C. The authors did not explain it in the manuscript, either. 8. "The PCA results from both the fat body and whole-body transcriptomes indicated that H. venatoria transcriptome at 19 weeks of starvation was markedly distinct from that at other stages (Fig. 2A, B). Consequently, we conducted a differential analysis of the transcriptome at 19 weeks." Please clarify how the comparative transcriptomes were conducted for differential analysis. 9. The language should be polished.

    1. \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} }  \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \newcommand{\id}{\mathrm{id}} \newcommand{\Span}{\mathrm{span}} ( \newcommand{\kernel}{\mathrm{null}\,}\) \newcommand{\range}{\mathrm{range}\,} \newcommand{\RealPart}{\mathrm{Re}} \newcommand{\ImaginaryPart}{\mathrm{Im}} \newcommand{\Argument}{\mathrm{Arg}} \newcommand{\norm}[1]{\| #1 \|} \newcommand{\inner}[2]{\langle #1, #2 \rangle} \newcommand{\Span}{\mathrm{span}} \newcommand{\id}{\mathrm{id}} \newcommand{\Span}{\mathrm{span}} \newcommand{\kernel}{\mathrm{null}\,} \newcommand{\range}{\mathrm{range}\,} \newcommand{\RealPart}{\mathrm{Re}} \newcommand{\ImaginaryPart}{\mathrm{Im}} \newcommand{\Argument}{\mathrm{Arg}} \newcommand{\norm}[1]{\| #1 \|} \newcommand{\inner}[2]{\langle #1, #2 \rangle} \newcommand{\Span}{\mathrm{span}} \newcommand{\AA}{\unicode[.8,0]{x212B}} \newcommand{\vectorA}[1]{\vec{#1}}      % arrow \newcommand{\vectorAt}[1]{\vec{\text{#1}}}      % arrow \newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} }  \newcommand{\vectorC}[1]{\textbf{#1}}  \newcommand{\vectorD}[1]{\overrightarrow{#1}}  \newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}}  \newcommand{\vectE}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}} \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} }  \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \newcommand{\avec}{\mathbf a} \newcommand{\bvec}{\mathbf b} \newcommand{\cvec}{\mathbf c} \newcommand{\dvec}{\mathbf d} \newcommand{\dtil}{\widetilde{\mathbf d}} \newcommand{\evec}{\mathbf e} \newcommand{\fvec}{\mathbf f} \newcommand{\nvec}{\mathbf n} \newcommand{\pvec}{\mathbf p} \newcommand{\qvec}{\mathbf q} \newcommand{\svec}{\mathbf s} \newcommand{\tvec}{\mathbf t} \newcommand{\uvec}{\mathbf u} \newcommand{\vvec}{\mathbf v} \newcommand{\wvec}{\mathbf w} \newcommand{\xvec}{\mathbf x} \newcommand{\yvec}{\mathbf y} \newcommand{\zvec}{\mathbf z} \newcommand{\rvec}{\mathbf r} \newcommand{\mvec}{\mathbf m} \newcommand{\zerovec}{\mathbf 0} \newcommand{\onevec}{\mathbf 1} \newcommand{\real}{\mathbb R} \newcommand{\twovec}[2]{\left[\begin{array}{r}#1 \\ #2 \end{array}\right]} \newcommand{\ctwovec}[2]{\left[\begin{array}{c}#1 \\ #2 \end{array}\right]} \newcommand{\threevec}[3]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \end{array}\right]} \newcommand{\cthreevec}[3]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \end{array}\right]} \newcommand{\fourvec}[4]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]} \newcommand{\cfourvec}[4]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]} \newcommand{\fivevec}[5]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]} \newcommand{\cfivevec}[5]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]} \newcommand{\mattwo}[4]{\left[\begin{array}{rr}#1 \amp #2 \\ #3 \amp #4 \\ \end{array}\right]} \newcommand{\laspan}[1]{\text{Span}\{#1\}} \newcommand{\bcal}{\cal B} \newcommand{\ccal}{\cal C} \newcommand{\scal}{\cal S} \newcommand{\wcal}{\cal W} \newcommand{\ecal}{\cal E} \newcommand{\coords}[2]{\left\{#1\right\}_{#2}} \newcommand{\gray}[1]{\color{gray}{#1}} \newcommand{\lgray}[1]{\color{lightgray}{#1}} \newcommand{\rank}{\operatorname{rank}} \newcommand{\row}{\text{Row}} \newcommand{\col}{\text{Col}} \renewcommand{\row}{\text{Row}} \newcommand{\nul}{\text{Nul}} \newcommand{\var}{\text{Var}} \newcommand{\corr}{\text{corr}} \newcommand{\len}[1]{\left|#1\right|} \newcommand{\bbar}{\overline{\bvec}} \newcommand{\bhat}{\widehat{\bvec}} \newcommand{\bperp}{\bvec^\perp} \newcommand{\xhat}{\widehat{\xvec}} \newcommand{\vhat}{\widehat{\vvec}} \newcommand{\uhat}{\widehat{\uvec}} \newcommand{\what}{\widehat{\wvec}} \newcommand{\Sighat}{\widehat{\Sigma}} \newcommand{\lt}{<} \newcommand{\gt}{>} \newcommand{\amp}{&} \definecolor{fillinmathshade}{gray}{0.9} Table of contents Four Examples of Quotes and ParaphrasesQuoting in MLA StyleParaphrasing in MLA StyleQuoting in APA StyleParaphrasing in MLA Style The real “art” to research writing is using quotes and paraphrases from evidence effectively in order to support your point. There are certain “rules,” dictated by the rules of style you are following, such as the ones presented by the MLA or the ones presented by the APA. There are certain “guidelines” and suggestions, like the ones I offer in the previous section and the ones you will learn from your teacher and colleagues.

      The text explains that writing a research paper is a skill that involves intelligently using quotations and paraphrases to support an argument. Research is not only about gathering information, but also using it effectively to make a point. Another key point is the difference between "rules" and "guidelines." Style guides like MLA or APA impose strict formatting rules, but also offer general writing tips that can vary based on suggestions from an instructor or classmates. This shows that writing a research paper requires both strict adherence to rules and clear, well-structured ideas.

    1. The role of Jesus is to bring gnosis (knowledge) of the Father and thereby to remove sin. According to Theodotus, when the risen Christ breathed his Spirit into the apostles, "He blew away the dust (=ignorance) like ashes and removed it, but he kindled and made alive the spark" (Excerpts of Theodotus 3:2). The Savior is the one who "takes away the sin of the world" (Herakleon Fragment 10). According to the Interpretation of Knowledge, "when the great Son was sent after his little brothers, he spread abroad the edict of the Father and proclaimed it, opposing all. And he removed the old bond of debt, the one of condemnation. And this is the edict that was: Those who reckoned themselves slaves have become condemned in Adam. They have been brought from death, received forgiveness for their sins and been redeemed." (Interpretation of Knowledge 14:28-38) Gnosis (knowledge) of the Father removes the power of sin. Those who have gnosis (knowledge) are theoretically free of sin. The Gospel of Philip says "The one who has knowledge is a free person. But the free person does not sin, for the one who sins is a slave of sin " (Gospel of Philip 77:15-18 cf.. John 8:34). Spiritual people by definition do not sin. Through knowledge they die with regard to sin and are raised up again with Christ (Excepts of Theodotus 77:1). Gnosis (knowledge) eliminates their inner demons and gives them a "pure heart" which allows them to lead a sinless existence (Valentinus Fragment 2/H). In Valentinian theology, matter itself is derived from and in some ways identical with ignorance. Valentinus claimed that the person who received gnosis (knowledge) could purge himself of matter! He describes this process in the Gospel of Truth: "It is within Unity that each one will attain himself; within knowledge (gnosis) he will purify himself from multiplicity into Unity, consuming matter (=ignorance) within himself like a fire and darkness by light, death by life" (Gospel of Truth 25:10-20). The person who has gnosis (knowledge) is thus freed of his sinful material self. This is why Valentinians claimed that spiritual people are "saved by nature" (Excerpts of Theodotus 55:3, cf.. Ep 7:8, Ireneus Against Heresies 1:6:4, Thessalonians 2:13), and that it is "impossible for them to fall prey to corruption" (Ireneus Against Heresies 1:6:2 cf.. 1 John 3:9). They were said to have the knowledge of God's will that allows them to lead a sinless existence (cf.. Gospel of Truth 22:9-11, Interpretation of Knowledge 9:31-33) and become "illuminators in the midst of mortal men" (Letter of Peter to Philip 137:6-9). The idea that the redeemed person is theoretically set free from the power of sin is derived from the New Testament, particularly the letters of Saint Paul. In the Epistle to the Romans he says, "Our old sinful self has been put to death with Christ on the cross, in order that the power of the sinful self might be destroyed" (Romans 5:6). Later in the same letter he says, "You have been set free from sin" (Romans 6:22). Similar ideas occur in the First Epistle of John, "Whoever is a child of God does not continue to sin, for God's very nature is in him" (1 John 3:9) Even though gnosis (knowledge) makes the person theoretically sinless, Valentinian theologians emphasize that the elect must abstain from sin (e.g. Letter to Flora 33:5:13, Gospel of Philip 66:21-23 ). Therefore, as it says in the Gospel of Philip, "Let each us burrow for the root of evil that is within and root it up from his or her heart. It will be rooted up when it is recognized" (Gospel of Philip 83:18-21). As Theodotus says about the sinful nature, "Be well disposed to it, not nourishing it and strengthening it by the power to commit sin, but putting it to death here and now, and thereby showing it as transitory by our refraining from evil." (Excerpts of Theodotus 52:2). It was assumed that if a person continued to sin, they had not truly attained gnosis (knowledge). Returning to the key passage in the Gospel of Philip, "The one who has knowledge is a free person. But the free person does not sin, for the one who sins is a slave of sin " (Gospel of Philip 77:15-18 cf.. John 8:34). Salvation is only guaranteed for the truly spiritual i.e. those who do not sin. As is says elsewhere in the Gospel of Philip, "Those who have become free....and then sold themselves back into slavery cannot become free again" (Gospel of Philip 79:14-17) Again this logic is derived from New Testament sources. For example in the First Epistle of John it says, "Whoever lives in union with Christ does not continue to sin; but whoever continues to sin has never seen him or known him" (1 John 3:6). Similarly in Paul, redemption is contingent on the person remaining free from sin: "Freedom is what we have-Christ has set us free! Stand then as free people and do not allow yourselves to become slaves (i.e. of sin) again" (Galatians 5:1). The notion that the ignorant sinful nature must be put to death occurs frequently in Saint Paul's writings. For example in the Epistle to the Romans it says, "If you live according to your human nature, you are going to die: but if by the Spirit you put to death your sinful actions, you will live" (Romans 8:13).

      "Saved by nature" IS NOT Christian.

      "True discipleship of God means one will no longer sin"…present in SS too?

    1. In Maps, for example, there are actions for things like avoid busy roads, avoid tolls, volume level, directions on radio, and more.

      If "Avoid Highways" is included in these toggles, this has the potential to spare me a lot of headache.

    2. Apple now provides summaries of an app's customer reviews to get a quicker overview of what people think of it.

      Legitimately offensive that they would add this despite having left everything about App Store Reviews to rot since they were first introduced?

    3. For Apple News+ subscribers, there's now an ‌Apple News‌ Food section. The Food section features tens of thousands of recipes, stories about restaurants, tips for healthy eating, kitchen essentials, and more.

      Foodism.

    1. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The authors discovered that the splicing machinery and nuclear baskets are sometimes in close proximity using Nup153 as a representative for the nuclear basket. They characterize this interaction using several different methods and propose that NUP153 is required to assemble the splicing machinery on genes that are transcribed in the nuclear periphery, which would supporting the gene gating model.

      The manuscript is well written and structured and the experiments are carefully conducted and analyzed.

      Significance

      The impression that I get from this manuscript is that we are looking at rather rare events with a small effect size. A definitive proof that the splicing machinery really assembles in the vicinity of NPCs docked via NUP153 is lacking. To assist in the revision process I will raise some questions to discuss but also propose some additional experiments to substantiate the claims.

      1. It is not clear what NUP153 really binds to and which domain is important. The experiments shown suggest proximity and indirect interactions (Co-IPs), but it is not clear whether NUP153 binds to DNA, RNA or a specific splicing factor. The bioID experiment might label splicing factors because they are cargos that pass through the pores during import, or it labels splicing factors that remain bound to spliced mRNPs during mRNP export. For example DDX39b, also called UAP56 is an important subunit of the TREX complex and involved the final packaging of mRNPs at the NPC. In my opinion, this protein is not a good choice. Also, the negative control that was used in the experiments is a potassium channel in the plasmamembrane, which can exclude that signal occurs by chance. But it would have been better to use a nuclear protein as control to exclude these possibilities. In line with a rare event, the Co-IP signals are very weak and barely higher than the GFP control. They should be repeated in the presence of RNase to confirm that this interaction occurs on nascent RNA during splicing and not e.g. to recycle or reroute splicing factors or during import.
      2. I do not understand why a NUP would be required to recruit or tether splicing factors to peripheral genes. Usually, splicing factors hitchhike on transcribing polymerase II or they are delivered by nuclear speckles which could happen also at the periphery. The authors should co-stain with a nuclear speckle marker to exlcude this possibility.
      3. What would be the advantage to splice in the vicinity of the pore? Given that genes with long introns take a long time to be transcribed, splicing would block the pores for hours and would prevent other activities. It would also be possible that splicing does occur at the periphery but NUP153 picks the mRNPs up at a later stage.
      4. One major drawback of the story is that the authors use very long-term depletion of NUP153 via shRNAs that will definitely screw up the import of many nuclear proteins; and a splicing inhibitor that has broad effects on nuclear architecture. Degron lines of Nup153 exist and should be used to substantiate at least some of the conclusions. Alternatively, a NUP153 mutant without zinc finger or IDR could be used to prevent DNA binding or basket association.

      Specific comments:

      Abstract: suggesting that a fraction of splicing occurs at the NPC. speckle-distant splicing events, it should be nuclear speckles, term not explained

      Line 20: Super enhancers not introduced

      Line 39: Splicing and the different spliceosomal subcomplexes needs more explanation and introduction to understand the selection of proteins that were used in the study.

      Line 65: Choice of controls. LckN18 The genes should be written once in full and their choice should be explained better.

      Line 123: The authors state: 'However, we detected a lower number of interacting spots between Nup98/DDX39b than with its Nup153 counterpart; a similar trend is followed between Nup98 and SF3A1 (Fig. S2g), suggesting that the interaction between splicing proteins and Nup98 might be further apart within the NPC structure. A more likely explanation is that both proteins are shed from the mRNP at the basket as they are not shuttling with the mRNA and should not enter the pore.

      Line 131: Mention right at the beginning of the sentence which splicing proteins were imaged here.

      Line 141: Nuclear speckles are still not properly introduced. Why is SF3A1 not expected to be in nuclear speckles? It Should be co-stained for nuclear speckle markers.

      Lane 149: PlaB drastically changes the nuclear architecture. The reduced interactions can have also different reasons. The authors should image the distribution of the investigated factors in the presence of PlaB. Again the Co-IPs should be performed in the presence of RNase to confirm that the observed interactions depend on RNA.

      Line 181: The requirement of Nup153 to tether the splicing machinery to the NPC is not convincing from the presented data. The knockdown is way too long and the changes are tiny. Wouldn't it be better to use a Nup153 mutant without Zinc knuckle or IDR to show that now the splicing factors interactions are lost? Alternatively degron lines should be used.

      Line 186: I do not understand the logic why NUP153 needs to bind to chromatin to fullfil its function in splicing. It could also bind to RNA with its zinc knuckle or IDRs. The authors should perform iCLIP or RIP to exclude this possibility? I also do not understand the logic to look to look for proximity to repressive LADs as a criterion, while investigating a function of NUP153 in splicing which requests actively transcribing genes. This has to be better motivated. Excluding the nucleoplasmic pool of NUP153 removes important data points that might be functionally relevant.

      Line 187: The entire paragraph on Dam-ID and all subsequent genome-wide analyses is way too densely written and hard to understand for non-experts. Analysis tools or thresholds are rarely given and it is unclear how the different data sets have been made and by who, what they mean and how they have been integrated. Chromatin patterns and expression profiles are very unspecific terms. Many of the used terms have not been introduced properly. The metaplots show very small differences. In the end I am not sure what we have learned from all the data integration. Is NUP153 bound to DNA, to nucleosomes, to nascent RNA or to splicing factors?

      Lane 232: Unclear what NUP153 introns are. Is the entire gene where NUP153 binds to considered or only the intron with a NUP153 peak.

      Lane 242: Again, the shRNA knockdown performed in this manuscript is way to long to observe a direct effect on splicing at the pore, which occurs at the level of minutes. Degrons should be used here to confirm this observation.

      Line 249: More negative controls are needed for genes not bound by NUP153 for the splicing analysis. RNA-Seq analyzed for intron retention could be helpful.

      There is quite some typos and missing words in the text.

    1. Review coordinated by Life Science Editors Foundation Reviewed by: Dr. Angela Andersen, Life Science Editors Foundation & Life Science Editors Potential Conflicts of Interest: None

      PUNCHLINE This preprint reveals a novel mechanism by which nitric oxide (NO) regulates lipid uptake in endothelial cells through nitrosation of the fatty acid transporter CD36. In conditions where endothelial NO is elevated, CD36 is modified at key cysteine residues, which prevents its localization to the plasma membrane and thus inhibits lipid uptake. This nitrosation-based regulation protects endothelial cells from lipid overload while increasing circulating serum lipids. The findings establish a dynamic and reversible regulatory axis—Cav1 → NO → CD36—that links vascular signaling to metabolic control.

      BACKGROUND Endothelial cells (ECs), which line the blood vessels, are uniquely positioned as gatekeepers of metabolic exchange—controlling the delivery of nutrients, including fatty acids, from the bloodstream into peripheral tissues. In the setting of metabolic syndrome, a state of chronic nutrient excess, this finely tuned lipid transport system becomes dysregulated. Excessive lipid uptake by ECs leads to lipid accumulation, mitochondrial dysfunction, and progressive endothelial impairment, which in turn contributes to the pathogenesis of cardiovascular and metabolic diseases.

      While ECs appear to possess intrinsic protective mechanisms to buffer against lipid overload, the molecular basis of these protective responses is poorly understood. The authors sought to uncover how ECs might actively limit lipid uptake under stress, and what upstream signals trigger this protective state.

      QUESTIONS ADDRESSED How do endothelial cells protect themselves from lipid-induced dysfunction during nutrient excess? What regulatory mechanisms allow them to limit lipid uptake under stress?

      SUMMARY Reduced endothelial Cav1 leads to increased NO production and, in turn, nitrosation of CD36. This modification prevents CD36 from reaching the plasma membrane, blocking lipid uptake into ECs. As a result, circulating serum lipids increase, but endothelial function is preserved. These findings define NO-mediated nitrosation as a new mechanism of post-translational regulation of CD36, with implications for endothelial health in metabolic disease.

      KEY RESULTS Cav1 is downregulated in endothelial cells in obesity Figures 1A–F, Supplementary Fig. 1A–F Goal: Identify genes affected by obesity that regulate endothelial lipid uptake. Outcome: Single-cell RNA-seq of mouse and human adipose tissues reveals consistent downregulation of Cav1 in all endothelial subtypes during obesity (Fig. 1A–F). Supplementary Fig. 1 shows quality control, cell-type identification, and confirmation of downregulated Cav1 expression in both species (Supp. Fig. 1A–F).

      Loss of Cav1 increases circulating lipids and decreases EC lipid uptake Figures 2A–K, Supplementary Fig. 2A–E Goal: Determine the physiological effect of Cav1 loss on lipid homeostasis. Outcome: EC-specific Cav1 knockout mice have elevated serum triglycerides, cholesterol, and LDL (Fig. 2E–G), but reduced lipid droplet accumulation in ECs (Fig. 2J). They maintain normal weight and show improved glucose tolerance (Fig. 2H–L). Supplementary Fig. 2 confirms successful endothelial deletion of Cav1 (Supp. Fig. 2A–C) and shows that hyperlipidemia is not due to differences in dietary intake or lipid absorption (Supp. Fig. 2D–E).

      Loss of Cav1 elevates NO and suppresses lipid uptake Figures 3A–J, 4A–F, Supplementary Fig. 3A–E Goal: Test whether elevated NO mediates lipid uptake defects. Outcome: Cav1 knockout increases serum nitrate/nitrite (Fig. 3J), reflecting elevated NO. Pharmacologic NO inhibition with L-NAME restores lipid uptake in HAMECs (Fig. 4A) and mouse aorta (Fig. 4B), and reduces serum lipids (Fig. 4C–D). Deletion of eNOS in ECs—but not in RBCs—rescues the phenotype (Fig. 4E–F). Supplementary Fig. 3 shows that Cav1 knockout does not impair vasodilatory responses to acetylcholine and confirms NO elevation by multiple readouts (Supp. Fig. 3A–E).

      CD36 mediates endothelial lipid uptake and is regulated by NO Figures 5A–F, Supplementary Fig. 4A–D Goal: Identify whether CD36 is necessary and sufficient for NO-regulated lipid uptake. Outcome: CD36 localizes to Cav1-enriched domains (Fig. 5A) and is required for lipid uptake (Fig. 5C). NO donors suppress CD36-mediated lipid uptake in HEK293T cells (Fig. 5D), and NO induces CD36 nitrosation (Fig. 5E). Pharmacologic CD36 inhibition abolishes the effect of eNOS deletion (Fig. 5F). Supplementary Fig. 4 confirms that CD36 expression is unaltered by NO or Cav1 loss, suggesting the effect is post-translational (Supp. Fig. 4A–D).

      CD36 is nitrosated at cysteines 3 and 466, disrupting palmitoylation and lipid uptake Figures 6A–D, 7A–F, Supplementary Fig. 5A–C Goal: Identify the functional nitrosation sites and their impact on trafficking. Outcome: CD36 is nitrosated at C3 and C466. Mutating these residues abolishes nitrosation and restores lipid uptake despite NO exposure (Fig. 6C–D). Nitrosation prevents palmitoylation of CD36 (Fig. 7D), explaining the loss of plasma membrane localization (Fig. 7B–F). Supplementary Fig. 5 shows quantification of CD36 localization shifts (Supp. Fig. 5A–C).

      Nitrosation restricts CD36 to the ER and blocks its trafficking Figures 7A–F, Supplementary Fig. 6A–D Goal: Understand the subcellular localization of nitrosated CD36. Outcome: In Cav1-deficient or NO-treated ECs, CD36 remains in the endoplasmic reticulum (ER) (Fig. 7B, 7E). L-NAME restores membrane localization (Fig. 7C, 7F). Supplementary Fig. 6 provides further co-localization data with ER and Golgi markers and quantifies trafficking defects (Supp. Fig. 6A–D).

      NO protects ECs from lipid-induced mitochondrial dysfunction and impaired vasodilation Figures 8A–C, Supplementary Fig. 7A–E Goal: Determine the physiological consequence of NO-CD36 signaling. Outcome: Lipid exposure combined with L-NAME leads to mitochondrial dysfunction (Fig. 8A) and impaired vasodilation (Fig. 8C), both rescued by NO. Supplementary Fig. 7 shows full mitochondrial stress test profiles and validation of mitochondrial protein levels (Supp. Fig. 7A–E).

      STRENGTHS Defines a novel mechanism linking Cav1, NO, and CD36 to lipid homeostasis

      Broadens our understanding of endothelial metabolic self-regulation

      Identifies post-translational nitrosation as a reversible toggle on lipid uptake

      Uses elegant genetic models, in vivo functional assays, and biochemical rigor

      Links vascular NO signaling to metabolic adaptation

      Highly relevant to metabolic syndrome, lipedema, and vascular disease

      FUTURE WORK & EXPERIMENTAL DIRECTIONS Investigate whether modulating CD36 nitrosation could be therapeutic in hyperlipidemia

      Study whether this mechanism contributes to sex differences in metabolic disease

      Explore its role in other vascular beds and tissue-specific lipid handling

      Test implications for lipedema, a fat-distribution disorder involving endothelial dysfunction

      Define how palmitoylation and nitrosation are balanced or dynamically regulated

      AUTHORSHIP NOTE This review was drafted with the assistance of ChatGPT (OpenAI) to help organize and articulate key ideas clearly and concisely. I provided detailed prompts, interpretations, and edits to ensure the review reflects an expert understanding of the biology and the paper’s contributions. The final version has been reviewed and approved by me.

      FINAL TAKEAWAY This preprint reframes endothelial cells as active regulators of systemic metabolism. By showing that nitrosation of CD36 suppresses lipid uptake and preserves endothelial function under metabolic stress, the authors reveal a previously unrecognized mechanism of cellular protection. This discovery expands our understanding of how ECs maintain homeostasis in nutrient-rich environments and opens new directions for treating lipid-associated diseases like obesity, lipedema, and atherosclerosis.

    1. Click Here to View Resources Used in This Article “Ecology of Cape Cod National Seashore | U.S. Geological Survey.” USGS.gov, https://www.usgs.gov/geology-and-ecology-of-national-parks/ecology-cape-cod-national-seashore. Accessed 27 January 2025. ↩︎ “Year-Round Population – Data Cape Cod.” Data Cape Cod, Cape Cod Commission, https://datacapecod.org/pf/barnstable-county-year-round-population/. Accessed 29 January 2025. ↩︎Coffey, Denise. “5.5 million visitors spend close to $800 million playing on the Cape.” Cape Cod Times, Cape Cod Times, 23 December 2023, https://www.capecodtimes.com/story/business/economy/2023/12/23/5-5-million-visitors-spend-close-to-800-million-playing-on-the-cape/71877224007/. Accessed 31 January 2025. ↩︎ “Tourism to Cape Cod National Seashore contributes $730 million to local economy – Cape Cod National Seashore (U.S.” National Park Service, 28 August 2024, https://www.nps.gov/caco/learn/news/tourism-to-cape-cod-national-seashore-contributes-$730-million-to-local-economy.htm. Accessed 27 January 2025. ↩︎ “”OUR”STORY: 400 Years of Wampanoag History.” Plymouth 400, https://www.plymouth400inc.org/our-story-exhibit-wampanoag-history/. Accessed 27 January 2025. ↩︎ “Beyond Vacationland: The Native American Cape Cod Story – Project 562 Blog – Project 562 – beyond Vacationland: The Native American Cape Cod Story.” Project 562, Project 562, www.project562.com/blog/beyond-vacationland-the-native-american-cape-cod-story. Accessed 28 Jan. 2025.  ↩︎ “Cape Cod History.” The New York Times Web Archive, https://archive.nytimes.com/www.nytimes.com/fodors/fdrs_feat_44_2.html. Accessed 27 January 2025. ↩︎ “The Land Called Nauset.” The Land Called Nauset | Orleans, MA, Orleans Massachusetts, www.town.orleans.ma.us/1309/The-Land-Called-Nauset. Accessed 28 Jan. 2025.  ↩︎ “Cape Cod History.” The New York Times Web Archive, https://archive.nytimes.com/www.nytimes.com/fodors/fdrs_feat_44_2.html. Accessed 27 January 2025. ↩︎  “”OUR”STORY: 400 Years of Wampanoag History.” Plymouth 400, https://www.plymouth400inc.org/our-story-exhibit-wampanoag-history/. Accessed 27 January 2025. ↩︎  Andrews, Charles M. “King Philip’s War of New England – Legends of America.” Legends of America, Legends of America, www.legendsofamerica.com/king-philips-war/. Accessed 28 Jan. 2025.  ↩︎Coffey, Denise. “5.5 million visitors spend close to $800 million playing on the Cape.” Cape Cod Times, Cape Cod Times, 23 December 2023, https://www.capecodtimes.com/story/business/economy/2023/12/23/5-5-million-visitors-spend-close-to-800-million-playing-on-the-cape/71877224007/. Accessed 31 January 2025. ↩︎ “Year-Round Population – Data Cape Cod.” Data Cape Cod, Cape Cod Commission, https://datacapecod.org/pf/barnstable-county-year-round-population/. Accessed 29 January 2025. ↩︎Ibid ↩︎Ibid ↩︎ “Barnstable County, MA Religion.” BestPlaces, https://www.bestplaces.net/religion/county/massachusetts/barnstable. Accessed 27 January 2025 ↩︎ “About the County.” Barnstable County, https://www.capecod.gov/county-government/about-the-county/. Accessed 27 January 2025. ↩︎Ibid ↩︎Ibid ↩︎  “About the County.” Barnstable County, https://www.capecod.gov/county-government/about-the-county/. Accessed 27 January 2025. ↩︎Ibid ↩︎Ibid ↩︎  “Employment and Wages by Industry.” Data Cape Cod, Cape Cod Commission , 9 Dec. 2024, datacapecod.org/pf/employment-wages-by-industry/.  ↩︎ “Tourism to Cape Cod National Seashore contributes $730 million to local economy – Cape Cod National Seashore (U.S.” National Park Service, 28 August 2024,  ↩︎ “Labor Force and Earnings.” Data Cape Cod, Cape Cod Commission, 27 Dec. 2024, datacapecod.org/pf/labor-force-and-earnings/.  ↩︎Ibid ↩︎ “Census Profile: Sandwich Town, Barnstable County, MA.” Census Reporter, U.S Census Bureau , 2023, censusreporter.org/profiles/06000US2500159735-sandwich-town-barnstable-county-ma/#:~:text=20%2C422%20Population,2023%205%2Dyear%20unless%20noted.  ↩︎  “Eastham – Housing Profile.” Housing Profile: Eastham, Cape Cod Commission, capecodcommission.org/resource-library/file/?url=/dept/commission/team/Website_Resources/housing/town_profiles/Eastham-Housing-Profile.pdf. Accessed 29 Jan. 2025.  ↩︎ “Employment & Unemployment.” Data Cape Cod, Cape Cod Commission, 9 Dec. 2024, datacapecod.org/pf/unemployment/.  ↩︎ “Vulnerable Populations on Cape Cod.” Data Cape Cod, Cape Cod Commission, 26 Dec. 2024, datacapecod.org/pf/vulnerable-populations-on-cape-cod/.  ↩︎ “Ecology of Cape Cod National Seashore | U.S. Geological Survey.” USGS.gov, https://www.usgs.gov/geology-and-ecology-of-national-parks/ecology-cape-cod-national-seashore. Accessed 27 January 2025. ↩︎   II. Agriculture’s Roots on Cape Cod, Association to Preserve Cape Cod, apcc.org/wp-content/uploads/2020/04/section-2.pdf. Accessed 29 Jan. 2025.  ↩︎Ibid ↩︎ “Overview.” State of the Waters: Cape Cod, Association to Preserve Cape Cod, 9 Dec. 2024, capecodwaters.org/overview/#:~:text=Commercial%20and%20recreational%20fishing%20and,resources%20depending%20on%20many%20factors.  ↩︎ “Animals – Cape Cod National Seashore (U.S.” National Park Service, 30 January 2018, https://www.nps.gov/caco/learn/nature/animals.htm. Accessed 27 January 2025. ↩︎ “About Us.” About Us | Cape Cod Commission, Cape Cod Commission , www.capecodcommission.org/about-us/ccc-overview/. Accessed 28 Jan. 2025.  ↩︎ “Outdoor Activities – Cape Cod National Seashore (U.S.” National Park Service, 14 August 2024, https://www.nps.gov/caco/planyourvisit/outdooractivities.htm. Accessed 27 January 2025. ↩︎ “Fees & Passes.” National Parks Service, U.S. Department of the Interior, www.nps.gov/caco/planyourvisit/fees.htm. Accessed 29 Jan. 2025.  ↩︎ “Cape Cod National Seashore Parking Fees, Passes, & Free Entry (2022).” Cape Days National Seashore, Cape Days, www.capedays.com/cape-cod-national-seashore-parking.html. Accessed 28 Jan. 2025.  ↩︎ “Welcome to the Parking Department.” Welcome to the Parking Department | Town of Provincetown, MA – Official Website, Town of Provincetown, www.provincetown-ma.gov/81/Parking#:~:text=MacMillan%20Pier%20parking%20lot%20$4.00,spots%20are%20$2.00%20per%20hour. Accessed 28 Jan. 2025.  ↩︎  “Cape Cod School Districts and Schools.” Jack Conway, Jack Conway, www.jackconway.com/cape-cod-school-districts-and-schools/. Accessed 28 Jan. 2025.  ↩︎  “Town Profiles.” Data Cape Cod, Cape Cod Commission, 17 Feb. 2022, datacapecod.org/pf/town-profiles/.  ↩︎ “School Enrollment by Race and Gender – Data Cape Cod.” Data Cape Cod, https://datacapecod.org/pf/school-enrollment/. Accessed 27 January 2025. ↩︎Ibid ↩︎ “Fitch Affirms Cape Cod Healthcare’s (MA) IDR at ‘AA-’; Outlook Stable.” Fitch Ratings: Credit Ratings & Analysis for Financial Markets, Fitch Ratings, 4 May 2023, www.fitchratings.com/research/us-public-finance/fitch-affirms-cape-cod-healthcare-ma-idr-at-aa-outlook-stable-04-05-2023#:~:text=CCHC%20benefits%20from%20its%20dominant,budget%20cuts%20or%20program%20modifications.  ↩︎  “Cape Cod Health Care Services.” Jack Conway, https://www.jackconway.com/cape-cod-health-care-services/. Accessed 27 January 2025. ↩︎Ibid ↩︎ “Model Large-Scale Solar Bylaw.” Model Large-Scale Solar Bylaw | Cape Cod Commission, Cape Cod Commission , www.capecodcommission.org/our-work/model-solar-bylaw/. Accessed 28 Jan. 2025.  ↩︎

      Smart way to feature the citations

    1. Reviewer #2 (Public review):

      Summary:

      The revised paper by Kim et al. reports two disease mutations in proBMP4, S91C and E93G, disrupt the FAM20C phosphorylation site at Ser91, blocking the activation of proBMP4 homodimers, while still allowing BMP4/7 heterodimers to function. Analysis of DMZ explants from Xenopus embryos expressing the proBMP4 S91C or E93G mutants showed reduced expression of pSmad1 and tbxt1. The expert amphibian tissue transplant studies were expanded to in vivo studies in Bmp4S91C/+ and Bmp4E93G/+ mice, highlighting the impact of these mutations on embryonic development, particularly in female mice, consistent with patient studies. Additionally, studies in mouse embryonic fibroblasts (MEFs) demonstrated that the mutations did not affect proBMP4 glycosylation or ER-to-Golgi transport but appeared to inhibit the furin-dependent cleavage of proBMP4 to BMP4. Based on these findings and AI modeling using AlphaFold of proBMP4, the authors speculate that pSer91 influences access of furin to its cleavage site at Arg289AlaLysArg292 in a new "Ideas and Speculation" section. Overall, the authors addressed the reviewers' comments, improving the presentation.

      Strengths:

      The strengths of this work continue to lie in the elegant Xenopus and mouse studies that elucidate the impact of the S91C and E93G disease mutations on BMP signaling and embryonic development. Including an "Ideas and Speculation" subsection for mechanistic ideas reduces some shortcomings regarding the analysis of the underlying mechanisms.

      Weaknesses:

      (Minor) In Figure S1 and lines 165-174 and 179-180, the authors should consider that, unlike the wild-type protein (Ser), which can be reversibly phosphorylated or dephosphorylated, phosphomimic mutations are locked into mimicking either the phosphorylated state (Asp) or the non-phosphorylated state (Ala). Consequently, if the S91D mutant exhibits lower activity than WT, it could imply that S91D interferes with other regulatory constraints, as the authors suggest. However, it may also be inhibiting activation. Therefore, caution is warranted when comparing S91D with S91C to conclude that Ser91 phosphorylation increases BMP4 activity. While additional experiments are not necessary, further consideration is essential.

      In Figure 4, panels A, E, and I, the proBMP bands in the mouse embryonic lysates and MEFs expressing the mutations show a clear size shift. Are these shifts a cause or a consequence of the lack of cleavage? Regardless, the size shifts should be explicitly noted.

      (Minor) In line 314, the authors should consider modifying the wording to: "is required for modulating proprotein convertase..."

      (Minor) In lines 394-399, the authors cleverly speculate that pS91 interacts with Arg289-the essential P4 arginine for furin processing. If so, this interaction could hinder the cleavage of proBMP4, as indicated by the results in Figure S1. The discussion would benefit from considering that, contrary to their favored model, dephosphorylation at Ser91 might actually facilitate cleavage.

    2. Author response:

      The following is the authors’ response to the original reviews

      Public Reviews:

      Reviewer #1 (Public review): Summary:

      The authors demonstrate that two human preproprotein human mutations in the BMP4 gene cause a defect in proprotein cleavage and BMP4 mature ligand formation, leading to hypomorphic phenotypes in mouse knock-in alleles and in Xenopus embryo assays.

      Strengths:

      They provide compelling biochemical and in vivo analyses supporting their conclusions, showing the reduced processing of the proprotein and concomitant reduced mature BMP4 ligand protein from impressively mouse embryonic lysates. They perform excellent analysis of the embryo and post-natal phenotypes demonstrating the hypomorphic nature of these alleles. Interesting phenotypic differences between the S91C and E93G mutants are shown with excellent hypotheses for the differences. Their results support that BMP4 heterodimers act predominantly throughout embryogenesis whereas BMP4 homodimers play essential roles at later developmental stages.

      Weaknesses:

      (1) A control of BMP7 alone in the Xenopus assays seems important to excludeBMP7 homodimer activity in these assays.

      We and other have shown that BMP7 homodimers have weak or no activity while BMP4/7 heterodimers single at a much higher level than either BMP4 or BMP7 homodimers in Xenopus ectodermal and mesodermal cells. We have expanded the description of these published findings in the results section (lines 182-187). We have also added representative examples of experiments in which BMP4 and BMP7 alone controls are included (new Fig. S2). Since the level of activity of BMP7 + BMP4 variants is equivalent to that of BMP7 + WT BMP4, this cannot be accounted for by BMP7 homodimers.

      (2) The Discussion could be strengthened by more in-depth explanations of how BMP4 homodimer versus heterodimer signaling is supported by the results, so that readers do not have to think it all through themselves. Similarly, a discussion of why the S91C mutant has a stronger phenotype than E93G early in the Discussion would be helpful or least mention that it will be addressed later.

      We have revised the discussion as suggested by the reviewer. Please see responses to recommendations 2-4 below.

      Reviewer #1 (Recommendations for the authors):

      (1) A control of BMP7 injection alone seems missing when comparing the BMP4/7 variants. BMP4 in the embryo assays presented in Fig 1. Is it not possible that the activity observed is BMP7 homodimers, perhaps due to inhibited heterodimer formation by the BMP4 variant?

      Multiple published studies have shown that BMP7 homodimers have weak or no activity in Xenopus ectodermal and mesodermal cells, and that ½ dose of RNA encoding BMP4 and BMP7 together signals at a higher level than does a full dose of RNA encoding either BMP4 or BMP7 alone. We have expanded our description of these published findings (lines 182-187), have included additional details about RNA doses that were injected (line 156, 175, 182) and have added representative examples of experiments in which BMP4 and BMP7 controls were included in a new Figure (Fig. S2).

      (2) In reading the Discussion, I was continually thinking of the stronger phenotype of the S91C mutant compared to the E93G one, although both are discussed together throughout most of the Discussion. Only at the end of the Discussion is the stronger phenotype of S91C discussed with a compelling explanation for the stronger phenotype, not related to the phosphorylation site function. I wonder if it would be better placed earlier in Discussion or at least mentioned the difference in phenotypes that will be discussed later.

      We have moved the possible explanation of differences between Bmp4<sup>S91C</sup> and Bmp4<sup>E93G</sup> mutants to immediately follow the introductory paragraph of the results section.

      (3) Along these same lines, why is it that the E93G exhibits rather normal cleavage at E10.5? Might the mechanisms of cleavage vary in different contexts with phosphorylation-dependent cleavage not functioning at early stages of development? I believe the hypothesis is that it is cleaved due to heterodimerization with BMP7. More discussion of this excellent hypothesis should be provided with clear statements, rather than inferences, if I'm understanding this correctly. For example, I had to read 3 times the first sentence of the last paragraph on p.14 before I understood it. Better to break that sentence down and the one that follows it, so it is easier to understand.

      We have rewritten and expanded the paragraphs describing phenotypic and biochemical evidence for defective homodimer but not heterodimer signaling as suggested (lines 343-375). We have also more explicitly stated the possibility that normal cleavage of BMP4<sup>E93G</sup> in embryonic lystates may be due to a predominance of BMP4/7 heterodimers in early embryonic stages or spatiotemporal differences in phosphorylation-dependent cleavage of BMP4 homodimers (lines 369-372)

      (4) Similarly the last paragraph of the Discussion mentions that the authors provide evidence of BMP4 homodimer signaling. I agree with the authors, but I had to think through the evidence myself. Better if the authors clearly explain the evidence that points to this, as this is a very good point of

      See response to point 3, above. Thank you for these useful suggestions.

      (5) Last sentence, first paragraph on p.11 should be qualified for the E93G mutant to E13.5, since it was normal at E10.5 regarding Figure 4 results.

      Thank you for pointing this out. It has been corrected.

      (6) Skip the PC acronym, since it is only repeated once in the text and hard to remember almost 10 pages later when it is used again.

      We have corrected this.

      (7) In the Discussion, a typo in "a single intramolecular disulfide bond that stabilizes the dimer", should be 'intermolecular'.

      Thank you for catching our switch in the use of inter- and intramolecular. We have corrected this (lines 334-335).

      (8) At times the E93G mutant is referred to having early lethality, often in conjunction with S91C, while other times it is referred to as late lethality. Considering that the homozygotes die postnatally after weaning, most would consider it late lethality. In contrast S91C is indeed an early lethal.

      We have changed the wording in the introduction to state that “mice carrying Bmp4<sup>S91C</sup> or Bmp4<sup>E93G</sup> knock in mutations show embryonic or enhanced postnatal lethality, respectively,… (lines 141-143)” and have removed the word “early” from the title.

      Reviewer #2 (Public review): Summary:

      Kim et al. report that two disease mutations in proBMP4, Ser91Cys and Glu93Gly, which disrupt the Ser91 FAM20C phosphorylation site, block the activation of proBMP4 homodimers. Consequently, analysis of DMZ explants from Xenopus embryos expressing the proBMP4 S91C or E93G mutants showed reduced pSmad1 and tbxt1 expression. The block in BMP4 activity caused by the mutations could be overcome by co-expression of BMP7, suggesting that the missense mutations selectively affect the activity of BMP4 homodimers but not BMP4/7 heterodimers. The expert amphibian tissue transplant studies were extended to in vivo studies in Bmp4S91C/+ and Bmp4E93G/+ mice, demonstrating the impact of these mutations on embryonic development, particularly in female mice, in line with patient studies. Finally, studies in MEFs revealed that the mutations did not affect proBMP4 glycosylation or ER-to-Golgi transport but appeared to inhibit the furin-dependent cleavage of proBMP4 to BMP4. Based on these findings and AI (AlphaFold) modeling of proBMP4, the authors speculate that pSer91 influences access of furin to its cleavage site at Arg289AlaLysArg292.

      Strengths:

      The Xenopus and mouse studies are valuable and elegantly describe the impact of the S91C and E93G disease mutations on BMP signaling and embryonic development.

      Weaknesses:

      The interpretation of how the mutations may disturb the furin-mediated cleavage of proBMP4 is underdeveloped and does not consider all of their data. Understanding how pS91 influences the furin-dependent cleavage at Arg292 seems to be the crux of this work and thus warrants more consideration. Specifically:

      (1) Figure S1 may be significantly more informative than implied. The authors report that BMP4S91D activates pSmad1 only incrementally better than S91C and much less than WT BMP4. However, Fig. S1B does not support the conclusion on page 7 (numbering beginning with title page); "these findings suggest that phosphorylation of S91 is required to generate fully active BMP4 homodimers". The authors rightly note that the S91C change likely has manifold effects beyond inhibiting furin cleavage. The E93G change may also affect proBMP4 beyond disturbing FAM20C phosphorylation. Additional mutation analyses would strengthen the work.

      The major goal of generating and comparing the activity of the S91D mutant with S91C was to control for phosphorylation independent defects cause by the deleterious introduction of a cysteine residue, which might cause aberrant disulfide bonding. We opted to introduce S91D since “phosphomimics” can sometimes approximate the phosphorylated state. S91D has significantly higher activity than S91C (p<0.01) and has a less significant loss of activity (p<0.05) than does S91C (<p<0.0001) relative to wild type BMP4 (Fig. S1), consistent with deleterious effects of the cysteine residue and supporting a possible explanation for the more severe phenotype of S91C vs E93G mice. We have rewritten this section to clarify our interpretation (lines 165-174)and have changed our statement that our activity data “suggest the importance of phosphorylation” to a statement that they are consistent with this possibility (lines 179-180). We do not believe that further mutational analysis using activity assays in Xenopus would shed light on how or whether phosphorylation affects proteolytic activation of BMP4.

      (2) These findings in Figure S1 are potentially significant because they may inform how proBMP4 is protected from cleavage during transit through the TGN and entry into peripheral cellular compartments. Intriguing modeling studies in Figure 6 suggest that pSer91 is proximal to the furin cleavage site. Based on their presentation, pSer91 may contact Arg289, the critical P4 residue at the furin site. If so, might that suggest how pS91 may prevent furin cleavage, thus explaining why the S91D mutation inhibits processing as presented, and possibly how proBMP4 processing is delayed until transit to distal compartments (perhaps activated by a change in the endosomal microenvironment or a Ser91 phosphatase)? Have the authors considered or ruled out these possibilities? In addition to additional mutation analyses of the FAM20C site, moving the discussion of this model to an "Ideas and Speculation" subsection may be warranted.

      The model shown in Fig. 6B proposes the possibility that phosphorylation unmasks (rather than preventing) the furin cleavage motif due to the proximity of Ser91 to the cleavage site (lines 399-402). If S91D truly mimicked phosphorylation, we would predict it would facilitate processing rather than inhibiting it. We do not have data comparing cleavage of S91D relative to wild type BMP4 and have not generated knock in S91D mice to test this idea. While the reviewers questions are intriguing, they cannot be answered by mutational analysis of the FAM20C site and are beyond the scope of the current studies that sought to understand the impact of human pS91C and pE93G mutations and cell biological implications. We have moved the models to an “Ideas and Speculation” subsection as suggested (lines 377-414) since these models are meant to provoke further thought rather than provide definitive answers based on our data.

      (3) The lack of an in vitro protease assay to test the effect of the S91 mutations on furin cleavage is problematic.

      Although we routinely perform in vitro cleavage assays with recombinant furin, we don’t believe they would be informative on how S91 phosphorylation or mutation of this residue impacts cleavage since in vitro synthesized substrate used in these assays is neither dimerized not post-translationally modified, and cleavage would be tested in isolation from the endogenous trafficking environment that we propose influences cleavage.

      Reviewer #2 (Recommendations for the authors):

      (1) The impact of BMPS91A should be determined and paired with the S91D phosphomimic data to reveal if it causes proBMP4 to be cleaved prematurely and disturbs pSmad1 expression. Data for S93G should also be included.

      Our major goal in comparing the activity of S91D with S91C was to control for phosphorylation independent defects cause by the deleterious introduction of a cysteine residue in S91C, which might cause aberrant disulfide bonding. We opted to introduce S91D since “phosphomimics” can sometimes approximate the phosphorylated state. We note that S91D has significantly higher activity than S91C, consistent with deleterious effects of the cysteine residue and supporting a possible explanation for the more severe phenotype of S91C vs E93G mice. We have revised the wording of this section to clarify this. Our models predict that S91D would be cleaved more efficiently than S91C or S91A, if it really mimics the endogenous phosphorylated state, rather than being cleaved prematurely. Our biochemical analysis compares cleavage of endogenous BMP4 in wild type and mutant MEFs. Generation of S91D, S91A or S93G mutant mice to compare cleavage is beyond the scope of the current work.

      (2) Is the distance between pS91 and Arg289 close enough to form a hydrogen bond? If so, might this interaction influence furin access?

      AI modeling does not provide high probability prediction of structures surrounding the furin motif (see Fig. S7) and thus we cannot comment on whether or not these residues are close enough to form a hydrogen bond. We have revised the wording of the discussion to state “This simple model building indicates the possibility of direct contact between pSer91 and Arg289, and that phosphorylation is required for furin to access the cleavage site, although we note that predictions surrounding the furin motif represent low probability conformations (Fig. S7) (lines 399-402).”

      (3) The genotypes in Figure 2 are labeled awkwardly. Consider labeling the headers for the three subsections of panels (A-F, G-L, and M-O) differently.

      We have revised Fig. 2 to clarify that the three subsections of panels are distinct, and to emphasize that the middle subsection represents views of the right and left side of the same embryo.

      (4) The tables should be reformatted. As is, the labeling is frequently cut off, and the numbers of expected and observed progeny should both be stated to aid the reader.

      We thank the reviewer for noting the formatting errors in the tables, which we have corrected. We have also changed the tables so that normal or abnormal mendelian distributions are reported as numbers of observed/expected progeny rather than numbers/percent observed progeny.

      Reviewer #3 (Public review):

      Summary:

      The authors describe important new biochemical elements in the synthesis of a class of critical developmental signaling molecules, BMP4. They also present a highly detailed description of developmental anomalies in mice bearing known human mutations at these specific elements.

      Strengths:

      Exceptionally detailed descriptions of pathologies occurring in mutant mice. Novel findings regarding the interaction of propeptide phosphorylation and convertase cleavage, both of which will move the field forward. Provocative hypothesis regarding furin access to cleavage sites, supported by Alphafold predictions.

      Weaknesses:

      Figure 6A presents two testable models for pre-release access of furin to cleavage sites since physical separation of enzyme from substrate only occurs in one model; could immunocytochemistry resolve?

      Available reagents are not sensitive enough to detect endogenous furin and BMP4 with high resolution. Because PC/substrate interactions are transient, whereas the bulk of furin and BMP4 is distributed throughout the secretory pathway, it is not possible to co-immunolocalize furin and BMP4 in vivo at present. Studies using more advanced cell biological techniques such along with tagged proteins may enable us to test these hypotheses in the future.

      Reviewer #3 (Recommendations for the authors):

      This interesting paper presents new data on an important family of developmental signaling molecules, BMPs. Mutations at FAM20C consensus sites within BMP prodomains are known to cause birth defects. The authors have here explored differential effects of human mutations on hetero- and homodimer activity and maturation, issues that may well arise during human development. In addition to demonstrating the profound effect of these mutations on development in Xenopus and mice, the authors also show differential processing of BMP4 precursors bearing these mutations in MEF cells prepared from mutant embryos. Finally, they show that FAM20C plays a role in BMP4 prodomain processing with quite differing outcomes in homo- vs heterodimers, which they suggest is due to structural differences impacting furin access. While this latter idea remains speculative due to the lack of crystal structures (models are based on Alphafold) it is a highly promising line of work.

      The data are beautifully presented and will be of clear interest to all developmental biologists. Certain cell biology results may also extrapolate to other phosphorylated precursor molecules undergoing the interesting (and as yet unexplained) phenomenon of convertase cleavage immediately before secretion, for example, FGF23. I have only a few minor comments regarding the presentation, which is remarkably clear.

      (1) The introduction of BMP7 in the Abstract is abrupt. It should be described as a preferred dimerization partner for BMP4.

      Thank you for noting this. We have revised the first sentence of the abstract to better introduce BMP7(lines 49-50).

      (2) In Figure 1A, what is the small light green box?

      This is a small fragment released from the prodomain by the second cleavage. We have clarified this in the introduction (lines 112-114) and in the legend to Figure 1 (lines 758-759).

      (3) In the Discussion it might be relevant to mention that FAM20C propeptide is not cleaved by convertases but by S1P (Chen 2021).

      We have added this information to clarify (lines 394-396).

      (4) Figure 3, define VSD; Figure 5, Endo H removes sugars only from immature (nonsialylated) sugars, not from all chains as implied. More importantly, EndoH and PNGase remove N-linked sugars, yet Results refer only to O-linked glycosylation.

      Thank you for noting these oversights. We have defined VSD in Figure 3. We have also revised the headers for Fig. 5 and for the relevant subsection of the results to include N-linked glycosylation and note in the results that EndoH removes only immature N-linked carbohydrates (lines 301-304).

      (5) Figure 5- for clarity, I suggest it be broken up into two larger panels labeled "Embryos" and "MEFs"

      Thank you for this suggestion, we have subdivided the Figure into two panels.

      (6) Figure 6A presents two testable models for pre-release access of furin to cleavage sites since the physical separation of the enzyme from substrate only occurs in one model; could confocal immunocytochemistry resolve?

      Available reagents are not sensitive enough to detect endogenous furin and BMP4 with high resolution and PC/substrate interactions are transient whereas the bulk of both furin and BMP4 is in transit through the secretory pathway. For these reasons it is not possible to co-immunolocalize furin and BMP4 in vivo. Future studies using advanced cell biological techniques may enable us to test these hypotheses in the future.

    1. Reviewer #2 (Public review):

      Summary:

      In this paper, the authors first examined lens phenotypes in mice with Le-Cre-mediated knockdown (KD) of all the four FGFR (FGFR1-4), and found that pERK signals, Jag1 and foxe3 expression are absent or drastically reduced, indicating that FGF signaling is essential for lens induction. Next, the authors examined lens phenotypes of FGFR1/2-KD mice and found that lens fiber differentiation is compromised, and that proliferative activity and cell survival are also compromised in lens epithelium. Interestingly, Kras activation rescues defects in lens growth and lens fiber differentiation in FGFR1/2-KD mice, indicating that Ras activation is a key step for lens development, downstream of FGF signaling. Next, the authors examined the role of Frs2, Shp2 and Grb2 in FGF signaling for lens development. They confirmed that lens fiber differentiation is compromised in FGFR1/3-KD mice combined with Frs2-dysfunctional FGFR2 mutants, which is similar to lens phenotypes of Grb2-KD mice. However, lens defects are milder in mice with Shp2YF/YF and Shp2CS mutant alleles, indicating that involvement of Shp2 is limited for the Grb2 recruitment for lens fiber differentiation. Lastly, the authors showed new evidence on the possibility that another adapter protein, Shc1, promotes Grb2 recruitment independent of Frs2/Shp2-mediated Grb2 recruitment.

      Strengths:

      Overall, the manuscript provides valuable data on how FGFR activation leads to Ras activation through the adapter platform of Frs2/Shp2/Grb2, which advances our understanding on complex modification of FGF signaling pathway. The authors applied a genetic approach using mice, whose methods and results are valid to support the conclusion. The discussion also well summarizes the significance of their findings.

      Weaknesses:

      The authors found that the new adaptor protein Shc1 is involved in Grb2 recruitment in response to FGF receptor activation. However, the main data on Shc1 are only histological sections and statistical evaluation of lens size. Cellular-level evidence on Shc1 makes the authors' conclusion more convincing.

      Comments on latest version:

      In the 2nd revised version of the manuscript, the authors responded to my recommendation to show the number of biological replicates for Prox1 and αA-crystallin (Fig. 1F) and conductedstatistical analysis for pmTOR, and pS6 (Supplementary figure 1B).

      The authors also explained why the animals are no longer available for the additional experiments that I requested. I may understand the situation, but hope that the authors will investigate the cellular-level evidence on Shc1 in more detail and report it maybe as another paper in future.

    2. Author response:

      The following is the authors’ response to the previous reviews

      Public Reviews:

      Reviewer #1 (Public review):

      Summary:

      This manuscript uses the eye lens as a model to investigate basic mechanisms in the Fgf signaling pathway. Understanding Fgf signaling is of broad importance to biologists as it is involved in the regulation of various developmental processes in different tissues/organs and is often misregulated in disease states. The Fgf pathway has been studied in embryonic lens development, namely with regards to its involvement in controlling events such as tissue invagination, vesicle formation, epithelium proliferation and cellular differentiation, thus making the lens a good system to uncover the mechanistic basis of how the modulation of this pathway drives specific outcomes. Previous work has suggested that proteins, other than the ones currently known (e.g., the adaptor protein Frs2), are likely involved in Fgfr signaling. The present study focuses on the role of Shp2 and Shc1 proteins in the recruitment of Grb2 in the events downstream of Fgfr activation.

      Strengths:

      The findings reveal that the juxtamembrane region of the Fgf receptor is necessary for proper control of downstream events such as facilitating key changes in transcription and cytoskeleton during tissue morphogenesis. The authors conditionally deleted all four Fgfrs in the mouse lens that resulted in molecular and morphological lens defects, most importantly, preventing the upregulation of the lens induction markers Sox2 and Foxe3 and the apical localization of F-actin, thus demonstrating the importance of Fgfrs in early lens development, i.e. during lens induction. They also examined the impact of deleting Fgfr1 and 2, on the following stage, i.e. lens vesicle development, which could be rescued by expressing constitutively active KrasG12D. By using specific mutations (e.g. Fgfr1ΔFrs lacking the Frs2 binding domain and Fgfr2LR harboring mutations that prevent binding of Frs2), it is demonstrated that the Frs2 binding site on Fgfr is necessary for specific events such as morphogenesis of lens vesicle. Further, by studying Shp2 mutations and deletions, the authors present a case for Shp2 protein to function in a context-specific manner in the role of an adaptor protein and a phosphatase enzyme. Finally, the key surprising finding from this study is that downstream of Fgfr signaling, Shc1 is an important alternative pathway - in addition to Shp2 - involved in the recruitment of Grb2 and in the subsequent activation of Ras. The methodologies, namely, mouse genetics and state-of-the-art cell/molecular/biochemical assays are appropriately used to collect the data, which are soundly interpreted to reach these important conclusions. Overall, these findings reveal the flexibility of the Fgf signaling pathway and it downstream mediators in regulating cellular events. This work is expected to be of broad interest to molecular and developmental biologists.

      Weaknesses:

      A weakness that needs to be discussed is that Le-Cre depends on Pax6 activation, and hence its use in specific gene deletion will not allow evaluation of the requirement of Fgfrs in the expression of Pax6 itself. But since this is the earliest Cre available for deletion in the lens, mentioning this in the discussion would make the readers aware of this issue.

      Reviewer #2 (Public review):

      Summary

      I have reviewed the revised manuscript submitted by Wang et al., which is entitled "Shc1 cooperates with Frs2 and Shp2 to recruit Grb2 in FGF-induced lens development". In this paper, the authors first examined lens phenotypes in mice with Le-Cre-mediated knockdown (KD) of all four FGFR (FGFR1-4), and found that pERK signals, Jag1 and foxe3 expression are absent or drastically reduced, indicating that FGF signaling is essential for lens induction. Next, the authors examined lens phenotypes of FGFR1/2-KD mice and found that lens fiber differentiation is compromised and that proliferative activity and cell survival are also compromised in lens epithelium. Interestingly, Kras activation rescues defects in lens growth and lens fiber differentiation in FGFR1/2-KD mice, indicating that Ras activation is a key step for lens development, downstream of FGF signaling. Next, the authors examined the role of Frs2, Shp2 and Grb2 in FGF signaling for lens development. They confirmed that lens fiber differentiation is compromised in FGFR1/3-KD mice combined with Frs2-dysfunctional FGFR2 mutants, which is similar to lens phenotypes of Grb2-KD mice. However, lens defects are milder in mice with Shp2YF/YF and Shp2CS mutant alleles, indicating that involvement of Shp2 is limited for the Grb2 recruitment for lens fiber differentiation. Lastly, the authors showed new evidence on the possibility that another adapter protein, Shc1, promotes Grb2 recruitment independent of Frs2/Shp2-mediated Grb2 recruitment.

      Strength

      Overall, the manuscript provides valuable data on how FGFR activation leads to Ras activation through the adapter platform of Frs2/Shp2/Grb2, which advances our understanding on complex modification of FGF signaling pathway. The authors applied a genetic approach using mice, whose methods and results are valid to support the conclusion. The discussion also well summarizes the significance of their findings.

      Weakness

      The authors found that the new adaptor protein Shc1 is involved in Grb2 recruitments in response to FGF receptor activation. However, the main data on Shc1 are only histological sections and statistical evaluation of lens size. In the revised manuscript, the authors did not answer my major concern that cellular-level data are missing, which is not fully enough to support their main conclusion on the involvement of Shc1 in Grb2 recruitment of FGF signaling for lens development. Since the title of this manuscript is that Shc1 cooperates with Frs2 and Shp2 to recruit Grb2 in FGF-induced lens development, it is important to provide the cellular-level evidence on Shc1.

      Reviewer #3 (Public review):

      Summary:

      The manuscript entitled "Shc1 cooperates with Frs2 and Shp2 to recruit Grb2 in FGF-induced lens development" by Wang et al., investigates the molecular mechanism used by FGFR signaling to support lens development. The lens has long been known to depend on FGFR-signaling for proper development. Previous investigations have demonstrated the FGFR signaling is required for embryonic lens cell survival and for lens fiber cell differentiation. The requirement of FGFR signaling for lens induction has remained more controversial as deletion of both Fgfr1 and Fgfr2 during lens placode formation does not prevent the induction of definitive lens markers such as FOXE3 or αA-crystallin. Here the authors have used the Le-Cre driver to delete all four FGFR genes from the developing lens placode demonstrating a definitive failure of lens induction in the absence of FGFR-signaling. The authors focused on FGFR1 and FGFR2, the two primary FGFRs present during early lens development and demonstrated that lens development could be significantly rescued in lenses lacking both FGFR1 and FGFR2 by expressing a constitutively active allele of KRAS. They also showed that the removal of pro-apoptotic genes Bax and Bak could also lead to a substantial rescue of lens development in lenses lacking both FGFR1 and FGFR2. In both cases, the lens rescue included both increased lens size and the expression of genes characteristic of lens cells.

      Significantly the authors concentrated on the juxtamembrane domain, a portion of the FGFRs associated with FRS2. Previous investigations have demonstrated the importance of FRS2 activation for mediating a sustained level of ERK activation. FRS2 is known to associate both with GRB2 and SHP2 to activate RAS. The authors utilized a mutant allele of Fgfr1, lacking the entire juxtamembrane domain (Fgfr1ΔFrs) and an allele of Fgfr2 containing two-point mutations essential for Frs2 binding (Fgfr2LR). When combining three floxed alleles and leaving only one functional allele (Fgfr1ΔFrs or Fgfr2LR) the authors got strikingly different phenotypes. When only the Fgfr1ΔFrs allele was retained, the lens phenotype matched that of deleting both Fgfr1 and Fgfr2. However, when only the Fgfr2LR allele was retained the phenotype was significantly milder, primarily affecting lens fiber cell differentiation, suggesting that something other than FRS2 might be interacting with the juxtamembrane domain to support FGFR signaling in the lens. The authors also deleted Grb2 in the lens and showed that the phenotype was similar to that of the lenses only retaining the Fgfr2LR allele, resulting a failure of lens fiber cell differentiation and decreased lens cell survival. However, mutating the major tyrosine phosphorylation site of GRB2 did not affect lens development. The authors additionally investigated the role of SHP2 in lens development by either deleting SHP2 or by making mutations in the SHP2 catalytic domain. The deletion of the SHP2 phosphatase activity did not affect lens development as severely as total loss of SHP2 protein, suggesting a function for SHP2 outside of its catalytic activity. Although the loss of Shc1 alone has only a slight effect on lens size and pERK activation in the lens, the authors showed that the loss of Shc1 exacerbated the lens phenotype in lenses lacking both Frs2 and Shp2. The authors suggest that SHC1 binds to the FGFR juxtamembrane domain allowing for the recruitment of GRB2 in independently of FRS2.

      Strengths:

      (1) The authors used a variety of genetic tools to carefully dissect the essential signals downstream of FGFR signaling during lens development.

      (2) The authors made a convincing case that something other than FRS2 binding mediates FGFR signaling in the juxtamembrane domain.

      (3) The authors demonstrated that despite the requirement of both the adaptor function and phosphatase activity of SHP2 are required for embryonic survival, neither of these activities is absolutely required for lens development.

      (4) The authors provide more information as to why FGFR loss has a phenotype much more severe than the loss of FRS2 alone during lens development.

      (5) The authors followed up their work analyzing various signaling molecules in the context of lens development with biochemical analyses of FGF-induced phosphorylation in murine embryonic fibroblasts (MEFs).

      (6) In general, this manuscript represents a Herculean effort to dissect FGFR signaling in vivo with biochemical backing with cell culture experiments in vitro.

      Weaknesses:

      (1) The authors demonstrate that the loss of FGFR1 and FGFR2 can be compensated by a constitutive active KRAS allele in the lens and suggest that FGFRs largely support lens development only by driving ERK activation. However, the authors also saw that lens development was substantially rescued by preventing apoptosis through the deletion of BAK and BAX. To my knowledge, the deletion of BAK and BAX should not independently activate ERK. The authors do not show whether ERK activation is restored in the BAK/BAX deficient lenses. Do the authors suggest the FGFR3 and/or FGFR4 provide sufficient RAS and ERK activation for lens development when apoptosis is suppressed? Alternatively, is it the survival function of FGFR-signaling as much as a direct effect on lens differentiation?

      (2) Do the authors suggest that GRB2 is required for RAS activation and ultimately ERK activation? If so, do the authors suggest that ERK activation is not required for FGFR-signaling to mediate lens induction? This would follow considering that the GRB2 deficient lenses lack a problem with lens induction.

      (3) The increase in p-Shc is only slightly higher in the Cre FGFR1f/f FGFR2r/LR than in the FGFR1f/Δfrs FGFR2f/f. Can the authors provide quantification?

      (4) The authors have not shown directly that Shc1 binds to the juxtamembrane region of either Fgfr1 or Fgfr2.

      Recommendations for the authors:

      Reviewer #2 (Recommendations for the authors):

      In the revised manuscript, the authors have responded to my recommendations to revise the original manuscript, except for three suggestions below.

      (1) The original recommendation: Results (page 6, line 8): The authors mentioned "we observed .... expression of Foxe3 in ...mutant lens cells (Figure 1E, arrows). However, Foxe3-expressing lens cells are a very small population in Figure 1E. It is important to state the decreased number of Foxe3-expressing lens cells in FGFR1/2 mutants. In addition, I would like to request the authors to show histograms indicating sample size and statistical analysis for marker expression: Foxe3 (Figure 1E), Prox1 and aA-crystallin (Fig. 1F), cyclin D1 and TUNEL (Fig. 1G) and pmTOR and pS6 (Supplementary figure 1B).

      Author's response: We added a statement indicating that the number of Foxe3-expressing cells is reduced in FGFR1/2 mutants, which is now quantified in Fig. 1H. Quantifications for Cyclin D1 and TUNEL are now shown in Fig. 1I and J, respectively. However, we chose not to quantify Prox1, αA-crystallin, pmTOR, and pS6, as the FGFR1/2 mutants showed no staining for these markers.<br /> My recommendation: Although the authors have responded to revise the quantification of Foxe3-expressing cells, Cyclin D1 and TUNEL, they did not conduct statistical analysis of Prox1, αA-crystallin, pmTOR, and pS6, because of absence of these marker signals. I understand that no signal makes statistical analysis no meaningful. However, it is still important to indicate how many the authors repeated experiments to confirm the same result. Please indicate the number of biological replicates or independent experiments in the figure legends, for example "Biological replicates, n=3" or "Three independent experiments show similar results". As for pS6 labeling, there seems to be a weak signal in Supplementary Figure 1B, so please show statistical analysis to indicate its histogram.

      We have added the number of biological replicates for Prox1 and αA staining in the legend of Fig.1. The review is correct that there is weak staining of pS6, and also pmTOR. The quantification of pS6 and pmTOR staining are now shown in Supplementary Fig. 1C and D.

      (2) The original recommendation: Results (page 6, line 19- page 7, line 6): The authors showed that inducible expression of constitutive active Kras, KrasG12D, using Le-Cre, recovered lens size to the half level of wild-type control. However, in the lens of mice with Le-Cre; FGFR1/2f/f; LSL-KrasG12D, pERK was detected in the most posterior edge of the lens fiber core, whereas pERK was detected in the broader area of the lens in control. Furthermore, pMEK was detected in the whole lens of mice with Le-Cre; FGFR1/2f/f; and LSL-KrasG12D, whereas pMEK was detected only in the lens epithelial cells at the equator. So, the spatial profile of pERK and pMEK expression was different from those of wild-type, although the authors observed that Prox1 and Crystallin expression are normally induced in the lens of mice with Le-Cre; FGFR1/2f/f; LSL-KrasG12D. I wonder whether the lens normally develops in mice with Le-Cre; LSL-KrasG12D? Is the lens growth enhanced in mice with Le-Cre; LSL-KrasG12D? Please add the panels of mice with Le-Cre; LSL-KrasG12D in Figure 2B and 2C. In addition, I wonder whether apoptosis is suppressed in the lens of mice with Le-Cre; FGFR1/2f/f; LSL-KrasG12D?

      Authors' response: Response: As we previously reported (Developmental Biology 355, 2011, 12-20), Le-Cre; LSL-KrasG12D did not lead to enhanced lens growth. While we agree that including images of Le-Cre; LSL-KrasG12D as controls in Fig. 2B and C and evaluating apoptosis in Le-Cre; FGFR1/2f/f; LSL-KrasG12D mutants would be appropriate, we regretfully no longer have these animals available to conduct these experiments.

      My recommendation: I would like to suggest the authors conduct these experiments again, because the recovery of lens formation by Bax/Bak KD in Fgfr1/2 KD mice (Fig. 2F) suggests that KrasG12D activates the AKT-mediated cell survival pathway as well as that MEK/MAPK pathway downstream of FGF signaling pathway. Regarding the availability of mouse strains, in general, it is necessary to keep animal strains available for sincere response to reviewers' suggestions. Please clarify why these strains are now not available and justify the reason in the response to reviewers' recommendations.

      We acknowledge the reviewer's suggested experiments. However, our research utilized multiple mouse strains that are costly to maintain, a challenge that was exacerbated during and after the COVID-19 pandemic. Unfortunately, we no longer have access to the specific mouse strains required to conduct these additional studies.

      (3) The original recommendation: Figures 7E, and 7F: The authors showed that lens morphology and lens size evaluation in genetic combinations: control, Frs2/Shc1 KD, Frs2/Shp2 KD, and Frs2/Shp2/Shc1 KD. However, I would like to request the authors to show more detailed data in these genetic combinations, for example, pERK, foxe3, Maf, Prox1, Jag1, p57, cyclin D3, g-crystallin, and TUNEL.

      Authors' response: Unfortunately, we no longer have these mutant mice to perform these detailed staining.

      My recommendation: As I mentioned in the statement on weakness above, it is important to provide the cellular-level evidence to support the main conclusion on the involvement of Shc1 in Grb2 recruitment of FGF signaling for lens development, because this is the main novel finding in this manuscript. Regarding the availability of mouse strains, it is generally necessary to keep animal strains available for sincere response to reviewers' suggestions. Please clarify why these strains are now not available and justify the reason in the response to the reviewers' suggestions.

      We regret that we did not anticipate these experiments suggested by the reviewer. Unfortunately, we are unable to perform these studies as we no longer maintain the required mouse strains in our colony.

      Reviewer #3 (Recommendations for the authors):

      The changes made by the authors improved the manuscript. I have no further suggestions.

    1. Author response:

      The following is the authors’ response to the original reviews

      Public Reviews:

      Reviewer #1 (Public Review):

      Summary:

      In their manuscript, Kong Fang et al describe a robust pipeline for the isolation of small extracellular vesicles through a combination of size exclusion chromatography and miniaturized density gradient separation. Subsequently, they prove that the method is reproducible and suitable for small-volume operations while at the same time not compromising the quality of vesicles.

      Strengths:

      The paper narrates a robust method for purifying high-quality sEVs from small amounts of blood plasma. They also demonstrate that through this approach, they can derive sEVs without compromising the protein composition, integrity of the vesicles, or contamination with other proteins or lipids.

      Weaknesses:

      The paper is a nice summary of how to enrich sEVs from blood samples. Although well performed and substantiated with data, the paper primarily deals with method development and optimisation.

      We agree with the reviewer's assessment that this paper primarily focuses on the development and optimization of a method. Using this robust technique for isolating small extracellular vesicles (sEVs) from small blood volumes, our future research will investigate sEVs isolated from clinical samples, with a particular focus on their role in various diseases.

      Reviewer #2 (Public Review):

      Summary:

      In this work, the authors manage to optimize a simple and rapid protocol using SEC followed by DGCU to isolate sEVs with adequate purity and yield from small volumes of plasma. Isolated fractions containing sEVs using SEC, DGCU, SEC-DGCU, and DGCU-SEC are compared in terms of their yield, purity surface protein profile, and RNA content. Although the combined use of these methodologies has already been evaluated in previous works, the authors manage to adapt them for the use of small volumes of plasma, which allows working in 1.5 mL tubes and reducing the centrifugation time to 2 hours.

      The authors finally find that although both the SEC-DGCU and DGCU-SEC combinations achieve isolates with high purity, the SEC-DGCU combination results in higher yields.

      This work provides an interesting tool for the rapid obtention of sEVs with sufficient yield and purity for detailed characterization which could be very useful in research and clinical therapy.

      Strengths:

      - The work is well-written and organized.

      - The authors clearly state the problem they want to address, that is, optimizing a method that allows sEV to be isolated from small volumes of plasma.

      - Although these methodologies have been tested in previous works, the authors manage to isolate sEVs of high purity and good performance through a simple and fast methodology.

      - The characteristics of all isolated fractions are exhaustively analyzed through various state-of-the-art methodologies.

      - They present a good interpretation of the results obtained through the methodologies used.

      Weaknesses:

      - Lack of references that support some of the results obtained.

      - Although this work focuses on comparing different techniques and their combinations to find an optimal option, the authors do not use any statistical method that reliably shows the differences between these techniques, except when repeatability is measured.

      We appreciate the reviewer's insightful comments and will incorporate the suggested missing references. We acknowledge that we did not perform statistical analyses when comparing the differences among the three methods. Nevertheless, the superiority of the SEC-DGUC method is evident from observations based on several independent characterization methods, including Cryo-EM, TEM, western blot, and total RNA quantification.

      Firstly, repeated Cryo-EM observations consistently confirm that the SEC-alone method shows severe lipoprotein contamination while the SEC-DGUC method drastically reduces such lipoprotein contamination. In comparing the SEC-DGUC and DGUC-SEC methods, multiple independent characterization methods showed that the SEC-DGUC method yields significantly greater quantity of sEVs: 1) The western blot experiment showed much higher signal intensity for all four tested sEV markers (CD9, CD63, CD81, and TSG101), with estimated concentrations approximately 2.1, 2.1, 4.7, and 4.2 times higher than the DGUC-SEC method. 2) The total RNA analysis showed that SEC-DGUC-1 contained more than 4 times the total amount of RNA compared to DGUC-SEC-PF. 3) Establishing the normalization baseline, particle size distributions in SEC-DGUC-1 and DGUC-SEC-PF measured by TEM were found to be similar, suggesting comparable purity and distribution of the captured sEVs. For comparison purposes, within each independent characterization method, the same plasma source and total plasma volume were used, while across different methods, different plasma sources were used. These independent characterization methods have consistently demonstrated the superiority of the SEC-DGUC method over the DGUC-SEC or SEC-alone methods.

      Recommendations for the authors:

      Reviewer #1 (Recommendations For The Authors):

      In my opinion, this work is elegantly designed and supported by data, which would motivate more studies related to blood-derived microvesicles in the context of infectious and systemic diseases. Overall, the manuscript is well-written and explained in sufficient detail. I only have minor comments.

      (1) Recruitment of volunteers for blood/plasma collection: there is a need for a statement that this was in accordance with ethical and biosafety regulations of the Institute/Clinic.

      We added two sentences at the beginning of the Blood Collection section (under Materials and methods): “All procedures involving peripheral blood specimens were approved by the Singapore National Health Group Domain Specific Review Board (the central ethics committee) and were mutually recognized by the Nanyang Technological University Institutional Review Board (IRB#2018/00671). All blood specimens were de-identified prior to their use in the experiments.”

      (2) Since this is a method development and validation article, it would be good to include an image of the iodixanol gradient with the high-density sEV zone, after centrifugation.

      We have incorporated an image after centrifugation in Supplementary Figure 3.

      (3) Although several sEV markers are shown in Figure 7A, flotillin is missing in this figure which was part of Figure 6B. Does flotillin show a different pattern? Flotillin is a DRM-associated marker, and hence may behave differently, would be interesting to add any insights.

      We appreciate the reviewer’s careful observation. In Figure 6B, Flotillin was used to confirm the presence of sEVs in different density zones. However, for the purpose of comparing the yield between the SEC-DGUC and DGUC-SEC methods, as shown in Figure 7A, Flotillin was not included in the western blot analysis. No obvious pattern changes were observed in other sEV markers tested in both Figures 6B and 7A.   

      (4) Methods section of LC/MS analysis- which protein database was used for protein identification?

      We added the following sentence at the end of the LC/MS analysis section: “The protein database used for protein identification was Uniprot Human.”

      Reviewer #2 (Recommendations For The Authors):

      In line 43 some references are needed.

      We added this reference: EL Andaloussi, S., Mäger, I., Breakefield, X. et al. Extracellular vesicles: biology and emerging therapeutic opportunities. Nat Rev Drug Discov 12, 347–357 (2013). https://doi.org/10.1038/nrd3978

      In line 107, please avoid using short forms such as "it's".

      We have revised that to “it is.”

      In line 153: "...separates low-density particles from those of high density, but a considerable amount of..." the word "but" should not be in the sentence.

      We have removed “but” in this sentence.

      In line 181 the authors establish that "Notably, SEC-PF exhibited a high level of ApoB and low expression of sEV markers." Is there any explanation for this?

      SEC-PF represents the eluate from the SEC step, collected before the DGUC step. This fraction contains a mixture of lipoproteins and sEVs. Due to the overwhelming abundance of lipoproteins compared to sEVs, the western blot predictably shows a high level of ApoB with minimal expression of sEV markers. This highlights that SEC alone effectively reduces plasma protein content but does not efficiently remove lipoproteins. Figure 6C further illustrates this point, as cryo-EM images of SEC-PF reveal the presence of sEVs, which are vastly outnumbered by lipoproteins.

      In line 198, the sentence "Theoretically, the DGUC-SEC protocol should also effectively isolate sEVs from plasma" need to be supported by references.

      See for instance:

      - Holcar M, Ferdin J, Sitar S, Tušek-Žnidarič M, Dolžan V, Plemenitaš A, Žagar E, Lenassi M. 2020. Enrichment of plasma extracellular vesicles for reliable quantification of their size and concentration for biomarker discovery. Sci Rep 10:21346. doi:10.1038/s41598-020-78422-y.

      - Jia Y, Yu L, Ma T, Xu W, Qian H, Sun Y, Shi H. 2022. Small extracellular vesicles isolation and separation: Current techniques, pending questions and clinical applications. Theranostics 12:6548-6575. doi:10.7150/thno.74305

      - Vergauwen G, Dhondt B, Van Deun J, De Smedt E, Berx G, Timmerman E, Gevaert K, Miinalainen I, Cocquyt V, Braems G, Van den Broecke R, Denys H, De Wever O, Hendrix A. 2017. Confounding factors of ultrafiltration and protein analysis in extracellular vesicle research. Sci Rep 7:2704. doi:10.1038/s41598-017-02599-y

      We have added this reference: Holcar M, Ferdin J, Sitar S, Tušek-Žnidarič M, Dolžan V, Plemenitaš A, Žagar E, Lenassi M. 2020. Enrichment of plasma extracellular vesicles for reliable quantification of their size and concentration for biomarker discovery. Sci Rep 10:21346. https://doi.org/10.1038/s41598-020-78422-y.  

      In line 309 the authors establish that "NTA measured size distributions displayed well-overlapped histograms of particles". It is possible for the authors to analyze this overlapping using some statistical test as a chi-squared test?

      We have conducted a statistical analysis of the histogram similarities using the Jensen-Shannon Divergence (JSD) method. This is reflected in the manuscript under the results section, “Repeatability and reliability of the SEC-DGUC protocol”, where we state: “We then compared size distributions for each plasma fraction using Jensen-Shannon Divergence (JSD). The JSD values, which are well below 0.1 (Figure 10B), indicate a consistent population of isolated particles, as further supported by Supplementary Figure 8.” Additionally, we included JSD values in the legend of Figure 10B: “JSD values for SEC-DGUC-1 to 4 are 0.015, 0.006, 0.001, and 0.002, indicating strong similarities among the histograms.” These additions demonstrate the robustness and repeatability of the SEC-DGUC protocol.

      In line 360, "lasts ~ 16 hours or more." This statement needs a reference that supports this time.

      We have added this reference: Vergauwen, G. et al. Robust sequential biophysical fractionation of blood plasma to study variations in the biomolecular landscape of systemically circulating extracellular vesicles across clinical conditions. J Extracell Vesicles 10, e12122 (2021).

      In line 399, the reference format is different from the previously used format.

      This is corrected. We thank the reviewer for this careful examination.

      Line 466: This sentence is not quite clear. It can be understood that for every 0.5 mL of plasma, 2 mL of particle fraction are obtained and that for 6 mL of plasma, this method will give a total volume of 24 mL. However, it is not clear what is meant by the fact that it has been concentrated to 6 mL. While one can assume that those final 6 mL concentrates come from the initial 24 mL, perhaps the way this sentence was worded was not appropriate. I would recommend rewriting it for a simpler interpretation of how this method was performed.

      We have changed the sentence to: “For the DGUC experiment using the 12 ml tube, 24 ml of PFs were obtained from 6 ml of plasma and subsequently concentrated to 6 ml. The 6 ml of concentrated PFs were then transferred to a Beckman Coulter ultra-clear centrifuge tube (344059, Beckman Coulter, USA) for further processing.”

      Line 519: The authors established a second dilution to avoid absorbance values above 1.2. Is there any justification for this value, taking into account that the Lambert-Beer law presents more precision in the absorbance range of 0.2 to 0.8?

      We have added this reference: https://diagnostic.serumwerk.com/wp-content/uploads/2021/05/V05-Serumwerk.pdf

      Line 519-520: "Also included were water and 0.25 M sucrose as blanks". Perhaps authors could consider rephrasing this sentence.

      We have changed the sentence to: “The absorbance measurements were made against water and 0.25 M sucrose blanks.”

      In line 520, the sentence must say "each sample was made by triplicate".

      We have changed the sentence to: “Each sample was prepared by triplicate to reduce error.” We thank the reviewer for this suggestion.

      Line 673: The phrase "0.1% formic acid in 100% ACN" would be better, in my opinion, if it said "0.1% formic acid in ACN".

      Yes, these two expressions have the same meaning. However, to ensure clarity, we have updated the description to “0.1% formic acid in ACN.”. We thank reviewer for this suggestion.

      Supplementary Figure 1: in the Figure caption there is an error in the numbering: at the end, where it is written (E), it should be (F). Please, correct this.

      We have made the necessary correction and sincerely appreciate the reviewer’s attentiveness.

      Supplementary Figure 5: Some sEVs are hard to visualize due to poor image resolution. Is there any possibility for the authors to enhance these images?

      We thank the reviewer for this valuable comment. To improve the visual clarity of the images, we have opted to display four sub-figures instead of nine.

    1. Reviewer #1 (Public review):

      Summary:

      In this manuscript entitled "Molecular dynamics of the matrisome across sea anemone life history", Bergheim and colleagues report the prediction, using an established sequence analysis pipeline, of the "matrisome" - that is, the compendium of genes encoding constituents of the extracellular matrix - of the starlet sea anemone Nematostella vectensis. Re-analysis of an existing scRNA-Seq dataset allowed the authors to identify the cell types expressing matrisome components and different developmental stages. Last, the authors apply time-resolved proteomics to provide experimental evidence of the presence of the extracellular matrix proteins at three different stages of the life cycle of the sea anemone (larva, primary polyp, adult) and show that different subsets of matrisome components are present in the ECM at different life stages with, for example, basement membrane components accompanying the transition from larva to primary polyp and elastic fiber components and matricellular proteins accompanying the transition from primary polyp to the adult stage.

      Strengths:

      The ECM is a structure that has evolved to support the emergence of multicellularity and different transitions that have accompanied the complexification of multicellular organisms. Understanding the molecular makeup of structures that are conserved throughout evolution is thus of paramount importance.

      The in-silico predicted matrisome of the sea anemone has the potential to become an essential resource for the scientific community to support big data annotation efforts and understand better the evolution of the matrisome and of ECM proteins, an important endeavor to better understand structure/function relationships. This study is also an excellent example of how integrating datasets generated using different -omic modalities can shed light on various aspects of ECM metabolism, from identifying the cell types of origins of matrisome components using scRNA-Seq to studying ECM dynamics using proteomics.

      Weaknesses:

      My concerns pertain to the three following areas of the manuscript:

      (1) In-silico definition of the anemone matrisome using sequence analysis:

      a) While a similar computational pipeline has been applied to predict the matrisome of several model organisms, the authors fail to provide a comprehensive definition of the anemone matrisome: In the text, the authors state the anemone matrisome is composed of "551 proteins, constituting approximately 3% of its proteome (see page 6, line 14), but Figure 1 lists 829 entries as part of the "curated" matrisome, Supplementary Table S1 lists the same 829 entries and the authors state that "Here, we identified 829 ECM proteins that comprise the matrisome of the sea anemone Nematostella vectensis" (see page 17, line 10). Is the sea anemone matrisome composed of 551 or 829 genes? If we refer to the text, the additional 278 entries should not be considered as part of the matrisome, but what is confusing is that some are listed as glycoproteins and the "new_manual_annotation" proposed by the authors and that refer to the protein domains found in these additional proteins suggest that in fact, some could or should be classified as matrisome proteins. For example, shouldn't the two lectins encoded by NV2.3951 and NV2.3157 be classified as matrisome-affiliated proteins? Based on what has been done for other model organisms, receptors have typically been excluded from the "matrisome" but included as part of the "adhesome" for consistency with previously published matrisome; the reviewer is left wondering whether the components classified as "Other" / "Receptor" should not be excluded from the matrisome and moved to a separate "adhesome" list.

      In addition to receptors, the authors identify nearly 70 glycoproteins classified as "Other". Here, does other mean "non-matrisome" or "another matrisome division" that is not core or associated? If the latter, could the authors try to propose a unifying term for these proteins? Unfortunately, since the authors do not provide the reasons for excluding these entries from the bona fide matrisome (list of excluding domains present, localization data), the reader is left wondering how to treat these entries.

      Overall, the study would gain in strength if the authors could be more definitive and, if needed, even propose novel additional matrisome annotations to include the components for now listed as "Other" (as was done, for example, for the Drosophila or C. elegans matrisomes).

      b) It is surprising that the authors are not providing the full currently accepted protein names to the entries listed in Supplementary Table S1 and have used instead "new_manual_annotation" that resembles formal protein names. This liberty is misleading. In fact, the "new_manual_annotation" seems biased toward describing the reason the proteins were positively screened for through sequence analysis, but many are misleading because there is, in fact, more known about them, including evidence that they are not ECM proteins. The authors should at least provide the current protein names in addition to their "new_manual_annotations".

      c) To truly serve as a resource, the Table should provide links to each gene entry in the Stowers Institute for Medical Research genome database used and some sort of versioning (this could be added to columns A, B, or D). Such enhancements would facilitate the assessment of the rigor of the list beyond the manual QC of just a few entries.

      d) Since UniProt is the reference protein knowledge database, providing the UniProt IDs associated with the predicted matrisome entries would also be helpful, giving easy access to information on protein domains, protein structures, orthology information, etc.

      e) In conclusion, at present, the study only provides a preliminary draft that should be more rigorously curated and enriched with more comprehensive and authoritative annotations if the authors aspire the list to become the reference anemone matrisome and serve the community.

      (2) Proteomic analysis of the composition of the mesoglea during the sea anemone life cycle:

      a) The product of 287 of the 829 genes proposed to encode matrisome components was detected by proteomics. What about the other ~550 matrisome genes? When and where are they expressed? The wording employed by the authors (see line 11, page 13) implies that only these 287 components are "validated" matrisome components. Is that to say that the other ~550 predicted genes do not encode components of the ECM? This should be discussed.

      b) Can the authors comment on how they have treated zero TMT values or proteins for which a TMT ratio could not be calculated because unique to one life stage, for example?

      c) Could the authors provide a plot showing the distribution of protein abundances for each matrisome category in the main figure 4? In mammals, the bulk of the ECM is composed of collagens, followed by fibrillar ECM glycoproteins, the other matrisome components being more minor. Is a similar distribution observed in the sea anemone mesoglea?

      d) Prior proteomic studies on the ECM of vertebrate organisms have shown the importance of allowing certain post-translational modifications during database search to ensure maximizing peptide-to-spectrum matching. Such PTMs include the hydroxylation of lysines and prolines that are collagen-specific PTMs. Multiple reports have shown that omitting these PTMs while analyzing LC-MS/MS data would lead to underestimating the abundance of collagens and the misidentification of certain collagens. The authors may want to re-analyze their dataset and include these PTMs as part of their search criteria to ensure capturing all collagen-derived peptides.

      e) The authors should ensure that reviewers are provided with access to the private PRIDE repository so the data deposited can also be evaluated. They should also ensure that sufficient meta-data is provided using the SRDF format to allow the re-use of their LC-MS/MS datasets.

      (3) Supplementary tables:

      The supplementary tables are very difficult to navigate. They would become more accessible to readers and non-specialists if they were accompanied by brief legends or "README" tabs and if the headers were more detailed (see, for example, Table S2, what does "ctrl.ratio_Larvae_rep2" exactly refer to? Or Table S6 whose column headers using extensive abbreviations are quite obscure). Similarly, what do columns K to BX in Supplementary Table S1 correspond to? Without more substantial explanations, readers have no way of assessing these data points.

    1. Reviewer #1 (Public review):

      Liu et al., present glmSMA, a network-regularized linear model that integrates single-cell RNA-seq data with spatial transcriptomics, enabling high-resolution mapping of cellular locations across diverse datasets. Its dual regularization framework (L1 for sparsity and generalized L2 via a graph Laplacian for spatial smoothness) demonstrates robust performance of their model and offers novel tools for spatial biology, despite some gaps in fully addressing spatial communication.

      Overall, the manuscript is commendable for its comprehensive benchmarking across different spatial omics platforms and its novel application of regularized linear models for cell mapping. I think this manuscript can be improved by addressing method assumptions, expanding the discussion on feature dependence and cell type-specific biases, and clarifying the mechanism of spatial communication.

      The conclusions of this paper are mostly well supported by data, but some aspects of model development and performance evaluation need to be clarified and extended.

      (1) What were the assumptions made behind the model? One of them could be the linear relationship between cellular gene expression and spatial location. In complex biological tissues, non-linear relationships could be present, and this would also vary across organ systems and species. Similarly, with regularization parameters, they can be tuned to balance sparsity and smoothness adequately but may not hold uniformly across different tissue types or data quality levels. The model also seems to assume independent errors with normal distribution and linear additive effects - a simplification that may overlook overdispersion or heteroscedasticity commonly observed in RNA-seq data.

      (2) The performance of glmSMA is likely sensitive to the number and quality of features used. With too few features, the model may struggle to anchor cells correctly due to insufficient discriminatory power, whereas too many features could lead to overfitting unless appropriately regularized. The manuscript briefly acknowledges this issue, but further systematic evaluation of how varying feature numbers affect mapping accuracy would strengthen the claims, particularly in settings where marker gene availability is limited. A simple way to show some of this would be testing on multiple spatial omics (imaging-based) platforms with varying panel sizes and organ systems. Related to this, based on the figures, it also seems like the performance varies by cell type. What are the factors that contribute to this? Variability in expression levels, RNA quantity/quality? Biases in the panel? Personally, I am also curious how this model can be used similarly/differently if we have a FISH-based, high-plex reference atlas. Additional explanation around these points would be helpful for the readers.

      (3) Application 3 (spatial communication) in the graphical abstract appears relatively underdeveloped. While it is clear that the model infers spatial proximities, further explanation of how these mappings translate into insights into cell-cell communication networks would enhance the biological relevance of the findings.

      (4) What is the final resolution of the model outputs? I am assuming this is dictated by the granularity of the reference atlas and the imposed sparsity via the L1 norm, but if there are clear examples that would be good. In figures (or maybe in practice too), cells seem to be assigned to small, contiguous patches rather than pinpoint single-cell locations, which is a pragmatic compromise given the inherent limitations of current spatial transcriptomics technologies. Clarification on the precise spatial scale (e.g., pixel or micrometer resolution) and any post-mapping refinement steps would be beneficial for the users to make informed decisions on the right bioinformatic tools to use.

    2. Reviewer #2 (Public review):

      Summary:

      The author proposes a novel method for mapping single-cell data to specific locations with higher resolution than several existing tools.

      Strengths:

      The spatial mapping tests were conducted on various tissues, including the mouse cortex, human PDAC, and intestinal villus.

      Weaknesses:

      (1) Although the researchers claim that glmSMA seamlessly accommodates both sequencing-based and image-based spatial transcriptomics (ST) data, their testing primarily focused on sequencing-based ST data, such as Visium and Slide-seq. To demonstrate its versatility for spatial analysis, the authors should extend their evaluation to imaging-based spatial data.

      (2) The definition of "ground truth" for spatial distribution is unclear. A more detailed explanation is needed on how the "ground truth" was established for each spatial dataset and how it was utilized for comparison with the predicted distribution generated by various spatial mapping tools.

      (3) In the analysis of spatial mapping results using intestinal villus tissue, only Figure 3d supports their findings. The researchers should consider adding supplemental figures illustrating the spatial distribution of single cells in comparison to the ground truth distribution to enhance the clarity and robustness of their investigation.

      (4) The spatial mapping tests were conducted on various tissues, including the mouse cortex, human PDAC, and intestinal villus. However, the original anatomical regions are not displayed, making it difficult to directly compare them with the predicted mapping results. Providing ground truth distributions for each tested tissue would enhance clarity and facilitate interpretation. For instance, in Figure 2a and Supplementary Figures 1 and 2, only the predicted mapping results are shown without the corresponding original spatial distribution of regions in the mouse cortex. Additionally, in Figure 3c, four anatomical regions are displayed, but it is unclear whether the figure represents the original spatial regions or those predicted by glmSMA. The authors are encouraged to clarify this by incorporating ground truth distributions for each tissue.

      (5) The cell assignment results from the mouse hippocampus (Supplementary Figure 6) lack a corresponding ground truth distribution for comparison. DG and CA cells were evaluated solely based on the gene expression of specific marker genes. Additional analyses are needed to further validate the robustness of glmSMA's mapping performance on Slide-seq data from the mouse hippocampus.

      (6) The tested spatial datasets primarily consist of highly structured tissues with well-defined anatomical regions, such as the brain and intestinal villus. It remains unclear whether glmSMA can be effectively applied to tissue types where anatomical regions are not distinctly separated, such as liver tissue. Further evaluation of such tissues would help determine the method's broader applicability.

    3. Author response:

      Reviewer #1 (Public review):

      Summary:

      Liu et al., present glmSMA, a network-regularized linear model that integrates single-cell RNA-seq data with spatial transcriptomics, enabling high-resolution mapping of cellular locations across diverse datasets. Its dual regularization framework (L1 for sparsity and generalized L2 via a graph Laplacian for spatial smoothness) demonstrates robust performance of their model and offers novel tools for spatial biology, despite some gaps in fully addressing spatial communication.

      Overall, the manuscript is commendable for its comprehensive benchmarking across different spatial omics platforms and its novel application of regularized linear models for cell mapping. I think this manuscript can be improved by addressing method assumptions, expanding the discussion on feature dependence and cell type-specific biases, and clarifying the mechanism of spatial communication.

      The conclusions of this paper are mostly well supported by data, but some aspects of model development and performance evaluation need to be clarified and extended.

      We thank the reviewer for their thoughtful comments. We will clarify the model assumptions and the feature selection process to make it more understandable. To clarify, the performance of glmSMA does not depend on cell type. For some rare cell types, the small number of cells can lead to a drop in performance. To better illustrate our results and reduce cell type-specific biases, we will shuffle and randomly sample the cell types.

      (1) What were the assumptions made behind the model? One of them could be the linear relationship between cellular gene expression and spatial location. In complex biological tissues, non-linear relationships could be present, and this would also vary across organ systems and species. Similarly, with regularization parameters, they can be tuned to balance sparsity and smoothness adequately but may not hold uniformly across different tissue types or data quality levels. The model also seems to assume independent errors with normal distribution and linear additive effects - a simplification that may overlook overdispersion or heteroscedasticity commonly observed in RNA-seq data.

      Thank you for this comment. We acknowledge that the non-linear relationships can be present in complex tissues and may not be fully captured by a linear model. 

      Our choice of a linear model was guided by an investigation of the relationship in the current datasets, which include intestinal villus, mouse brain, and fly embryo.

      There is a linear correlation between expression distance and physical distance [Nitzan et al]. Within a given anatomical structure, cells in closer proximity exhibit more similar expression patterns. In tissues where non-linear relationships are more prevalent—such as the human PDAC sample—our mapping results remain robust. We acknowledge that we have not yet tested our algorithm in highly heterogeneous regions like the liver, and we plan to include such analyses in future work if necessary. Regarding the regularization parameters, we agree that the balance between sparsity and smoothness is sensitive to tissue-specific variation and data quality. In our current implementation, we explored a range of values to find robust defaults.

      (2) The performance of glmSMA is likely sensitive to the number and quality of features used. With too few features, the model may struggle to anchor cells correctly due to insufficient discriminatory power, whereas too many features could lead to overfitting unless appropriately regularized. The manuscript briefly acknowledges this issue, but further systematic evaluation of how varying feature numbers affect mapping accuracy would strengthen the claims, particularly in settings where marker gene availability is limited. A simple way to show some of this would be testing on multiple spatial omics (imaging-based) platforms with varying panel sizes and organ systems. Related to this, based on the figures, it also seems like the performance varies by cell type. What are the factors that contribute to this? Variability in expression levels, RNA quantity/quality? Biases in the panel? Personally, I am also curious how this model can be used similarly/differently if we have a FISH-based, high-plex reference atlas. Additional explanation around these points would be helpful for the readers.

      Thank you for this thoughtful comment. The performance of our method is indeed sensitive to the number and quality of selected features. To optimize feature selection, we employed multiple strategies, including Moran’s I statistic, identification of highly variable genes, and the Seurat pipeline to detect anchor genes linking the spatial transcriptomics data with the reference atlas. The number of selected markers depends on the quality of the data. For high-quality datasets, fewer than 100 markers are typically sufficient for accurate prediction. To address this more clearly, we will revise the manuscript to include detailed descriptions of our feature selection process and demonstrate how varying the number of selected features impacts performance.

      We evaluated our method across diverse tissue types and platforms—including Slide-seq, 10x Visium, and Virtual-FISH—which represent both sequencing-based and imaging-based spatial transcriptomics technologies. Our model consistently achieved strong performance across these settings. It's worth noting that the performance of other methods, such as CellTrek [Wei et al] and novoSpaRc [Nitzan et al], also depends heavily on feature selection. In particular, performance degrades substantially when fewer features are used.

      We do not believe that the observed performance is directly influenced by cell type composition. Major cell types are typically well-defined, and rare cell types comprise only a small fraction of the dataset. For these rare populations, a single misclassification can disproportionately impact metrics like KL divergence due to small sample size. However, this does not necessarily indicate a systematic cell type–specific bias in the mapping. To mitigate this issue, we will implement shuffling and sampling procedures to reduce potential bias introduced by rare cell types.

      (3) Application 3 (spatial communication) in the graphical abstract appears relatively underdeveloped. While it is clear that the model infers spatial proximities, further explanation of how these mappings translate into insights into cell-cell communication networks would enhance the biological relevance of the findings.

      Thank you for this valuable feedback. We agree that further elaboration on the connection between spatial proximity and cell–cell communication would enhance the biological interpretation of our results. While our current model focuses on inferring spatial relationships, we may provide some cell-cell communications in the future.

      (4) What is the final resolution of the model outputs? I am assuming this is dictated by the granularity of the reference atlas and the imposed sparsity via the L1 norm, but if there are clear examples that would be good. In figures (or maybe in practice too), cells seem to be assigned to small, contiguous patches rather than pinpoint single-cell locations, which is a pragmatic compromise given the inherent limitations of current spatial transcriptomics technologies. Clarification on the precise spatial scale (e.g., pixel or micrometer resolution) and any post-mapping refinement steps would be beneficial for the users to make informed decisions on the right bioinformatic tools to use.

      Thank you for the comment. For each cell, our algorithm generates a probability vector that indicates its likely spatial assignment along with coordinate information. We will include the resolution and the number of cells assigned to each spot in future versions. In our framework, each cell is mapped to one or more spatial locations with associated probabilities. Depending on the amount of regularization through L1 and L2 norms, a cell may be localized to a small patch or distributed over a broader domain. For the 10x Visium data, we applied a repelling algorithm to enhance visualization [Wei et al]. If a cell’s original location is already occupied, it is reassigned to a nearby neighborhood to avoid overlap. The users can also see the entire regularization path by varying the penalty terms. 

      Nitzan M, Karaiskos N, Friedman N, Rajewsky N. Gene expression cartography. Nature. 2019;576(7785):132-137. doi:10.1038/s41586-019-1773-3

      Wei, R. et al. (2022) ‘Spatial charting of single-cell transcriptomes in tissues’, Nature Biotechnology, 40(8), pp. 1190–1199. doi:10.1038/s41587-022-01233-1. 

      Reviewer #2 (Public review):

      Summary:

      The author proposes a novel method for mapping single-cell data to specific locations with higher resolution than several existing tools.

      Thank you for recognizing our contribution. Our goal was to develop a method that achieves higher spatial resolution in mapping single-cell data compared to existing tools. We are encouraged by the results and will continue to refine the approach to improve accuracy and generalizability across platforms and tissue types.

      Strengths:

      The spatial mapping tests were conducted on various tissues, including the mouse cortex, human PDAC, and intestinal villus.

      Thank you for this comment. We believe that evaluating our method across diverse tissue types—such as the mouse cortex, human PDAC, and intestinal villus—demonstrates its robustness and broad applicability. We plan to continue expanding these evaluations to additional tissue contexts and species to further validate the method’s generalizability.

      Weakness:

      (1) Although the researchers claim that glmSMA seamlessly accommodates both sequencing-based and image-based spatial transcriptomics (ST) data, their testing primarily focused on sequencing-based ST data, such as Visium and Slide-seq. To demonstrate its versatility for spatial analysis, the authors should extend their evaluation to imaging-based spatial data.

      Thank you for the comment. We have tested our algorithm on the virtual FISH dataset from the fly embryo, which serves as an example of image-based spatial omics data. However, such datasets often contain a limited number of available genes. To address this, we will conduct additional testing on image-based data if needed. The Allen Brain Atlas provides high-quality ISH data, and we can select specific brain regions from this resource to further evaluate our algorithm if necessary [Lein et al]. Currently, we plan to focus more on the 10x Visium platform, as it supports whole-transcriptome profiling and offers a wide range of tissue samples for analysis.

      (2) The definition of "ground truth" for spatial distribution is unclear. A more detailed explanation is needed on how the "ground truth" was established for each spatial dataset and how it was utilized for comparison with the predicted distribution generated by various spatial mapping tools.

      Thank you for the comment. To clarify how ground truth is defined across different tissues, we provide the following details. Direct ground truth for cell locations is often unavailable in scRNA-seq data due to experimental constraints. To address this, we adopted alternative strategies for estimating ground truth in each dataset:

      - 10x Visium Data: We used the cell type distribution derived from spatial transcriptomics (ST) data as a proxy for ground truth. We then computed the KL divergence between this distribution and our model's predictions for performance assessment.

      - Slide-seq Data: We validated predictions by comparing the expression of marker genes between the reconstructed and original spatial data.

      - Fly Embryo Data: We used predicted cell locations from novoSpaRc as a reference for evaluating our algorithm.

      These strategies allowed us to evaluate model performance even in the absence of direct cell location data. In addition, we can apply multiple evaluation strategies within a single dataset.

      (3) In the analysis of spatial mapping results using intestinal villus tissue, only Figure 3d supports their findings. The researchers should consider adding supplemental figures illustrating the spatial distribution of single cells in comparison to the ground truth distribution to enhance the clarity and robustness of their investigation.

      Thank you for the comment. We will include additional details for this dataset in the supplementary figures. As the intestinal villus is a relatively simple tissue, most existing algorithms performed well on it. For this reason, we did not initially provide extensive details in the main text.

      (4) The spatial mapping tests were conducted on various tissues, including the mouse cortex, human PDAC, and intestinal villus. However, the original anatomical regions are not displayed, making it difficult to directly compare them with the predicted mapping results. Providing ground truth distributions for each tested tissue would enhance clarity and facilitate interpretation. For instance, in Figure 2a and Supplementary Figures 1 and 2, only the predicted mapping results are shown without the corresponding original spatial distribution of regions in the mouse cortex. Additionally, in Figure 3c, four anatomical regions are displayed, but it is unclear whether the figure represents the original spatial regions or those predicted by glmSMA. The authors are encouraged to clarify this by incorporating ground truth distributions for each tissue.

      Thank you for the comment. To improve visualization, we will include anatomical structures alongside the mapping results in the next version, wherever such structures are available (e.g., mouse brain cortex, human PDAC sample, etc.). Regions will be color-coded to enhance clarity and make the spatial organization easier to interpret.

      (5) The cell assignment results from the mouse hippocampus (Supplementary Figure 6) lack a corresponding ground truth distribution for comparison. DG and CA cells were evaluated solely based on the gene expression of specific marker genes. Additional analyses are needed to further validate the robustness of glmSMA's mapping performance on Slide-seq data from the mouse hippocampus.

      Thank you for the comment. The ground truth for DG and CA cells was not available. To better evaluate the model's performance, we will compute the KL divergence between the original and predicted cell type distributions, following the same approach used for the 10x Visium dataset.

      (6) The tested spatial datasets primarily consist of highly structured tissues with well-defined anatomical regions, such as the brain and intestinal villus. Anatomical regions are not distinctly separated, such as liver tissue. Further evaluation of such tissues would help determine the method's broader applicability.

      Thank you for the comment. We have already tested our algorithm on the fly embryo, where anatomical structures are not well defined or clearly separated. If needed, we can further apply glmSMA to more complex tissues such as the liver. To clarify the role of anatomical structures in our model: glmSMA does not require anatomical information as input. Instead, it leverages a distance matrix between cells to apply L2 norm regularization. Despite the absence of anatomical information, the model still demonstrates strong performance. We will include results to illustrate its effectiveness without anatomical input. Additionally, we plan to evaluate the model on tissues where anatomical regions are not clearly delineated.

      Lein, E., Hawrylycz, M., Ao, N. et al. Genome-wide atlas of gene expression in the adult mouse brain. Nature 445, 168–176 (2007). https://doi.org/10.1038/nature05453

      Reviewer #3 (Public review):

      Summary:

      The authors aim to develop glmSMA, a network-regularized linear model that accurately infers spatial gene expression patterns by integrating single-cell RNA sequencing data with spatial transcriptomics reference atlases. Their goal is to reconstruct the spatial organization of individual cells within tissues, overcoming the limitations of existing methods that either lack spatial resolution or sensitivity.

      Strengths:

      (1) Comprehensive Benchmarking:

      Compared against CellTrek and Novosparc, glmSMA consistently achieved lower Kullback-Leibler divergence (KL divergence) scores, indicating better cell assignment accuracy.

      Outperformed CellTrek in mouse cortex mapping (90% accuracy vs. CellTrek's 60%) and provided more spatially coherent distributions.

      (2) Experimental Validation with Multiple Real-World Datasets:

      The study used multiple biological systems (mouse brain, Drosophila embryo, human PDAC, intestinal villus) to demonstrate generalizability.

      Validation through correlation analyses, Pearson's coefficient, and KL divergence support the accuracy of glmSMA's predictions.

      We thank reviewer #3 for their positive feedback and thoughtful recommendations.

      Weaknesses:

      (1) The accuracy of glmSMA depends on the selection of marker genes, which might be limited by current FISH-based reference atlases.

      We agree that the accuracy of glmSMA is influenced by the selection of marker genes, and that current FISH-based reference atlases may offer a limited gene set. To address this, we incorporate multiple feature selection strategies, including highly variable genes and spatially informative genes (e.g., via Moran’s I), to optimize performance within the available gene space. As more comprehensive reference atlases become available, we expect the model’s accuracy to improve further.

      (2) glmSMA operates under the assumption that cells with similar gene expression profiles are likely to be physically close to each other in space which not be true under various heterogeneous environments.

      While this assumption effectively captures spatial continuity in many cases, we acknowledge that it may not hold across all biological contexts. To address this, we plan to refine our regularization strategy and evaluate the model's performance in heterogeneous tissue regions.

    1. Reviewer #1 (Public review):

      Summary:

      Kwon et al present a very well-conducted and well-written sieve analysis of rotavirus infections in a passive surveillance network in the US, considering how relative vaccine efficacy changes with genetic distance from the vaccine strains including the whole genome. The results are compelling, supported by a number of sensitivity analyses, and the manuscript is generally easy to follow.

      Strengths:

      (1) The underlying study base, a surveillance network across multiple sites in the US.

      (2) The use of a test-negative design, which is well established for rotavirus, to estimate vaccine efficacy.

      (3) The use of genetic distance to measure differences between infecting and vaccine strains, and the innovative use of k-means clustering to make results more interpretable.

      (4) The secondary and sensitivity analyses that provide additional context and support for the primary findings.

      Weaknesses:

      (1) As identified by the authors, there is a limited sample size for the analysis of RV1 (monovalent rotavirus vaccine).

      (2) Sieve analyses were originally designed for randomized trials, in which setting their key assumptions are more likely to be met. There is little discussion in this paper of how those assumptions might be violated and what effect that might have on the results. The authors have access to some important confounders, but I believe some more discussion on potential biases in this observational study is warranted.

    2. Author response:

      Public Reviews

      Reviewer #1 (Public review):

      Summary:

      Kwon et al present a very well-conducted and well-written sieve analysis of rotavirus infections in a passive surveillance network in the US, considering how relative vaccine efficacy changes with genetic distance from the vaccine strains including the whole genome. The results are compelling, supported by a number of sensitivity analyses, and the manuscript is generally easy to follow.

      Strengths:

      (1) The underlying study base, a surveillance network across multiple sites in the US.

      (2) The use of a test-negative design, which is well established for rotavirus, to estimate vaccine efficacy.

      (3) The use of genetic distance to measure differences between infecting and vaccine strains, and the innovative use of k-means clustering to make results more interpretable.

      (4) The secondary and sensitivity analyses that provide additional context and support for the primary findings.

      Weaknesses:

      (1) As identified by the authors, there is a limited sample size for the analysis of RV1 (monovalent rotavirus vaccine).

      (2) Sieve analyses were originally designed for randomized trials, in which setting their key assumptions are more likely to be met. There is little discussion in this paper of how those assumptions might be violated and what effect that might have on the results. The authors have access to some important confounders, but I believe some more discussion on potential biases in this observational study is warranted.

      We appreciate the reviewer’s positive comments and the opportunity to discuss the application of sieve analysis in observational vaccine effectiveness studies, contrasting it with its traditional use in clinical trials assessing vaccine efficacy. We fully acknowledge the reviewer's point that sieve analysis was originally developed for, and is most frequently employed in, randomized controlled trials (RCTs).

      Sieve analysis, as defined by Gilbert et al. (2001), has the following core assumptions: (A1) uniform susceptibility to infection for all participants except for vaccine-induced strain-specific effects; (A2) equal exposure (for each strain s = 1,…,K ) distribution between vaccine groups; and (A3), constant strain prevalence. RCTs ensure these through randomization. However, our observational design is vulnerable to violating these assumptions, especially A1 and A3. To address A1 and A3, we adjusted for age (in years), sample collection year, and clinical setting (i.e., outpatient, inpatient, ED), aiming to account for both individual-level and temporal variations.

      A2 is particularly challenging in observational settings. We found that study site was correlated with both vaccination status (main predictor) and the strain distribution, potentially violating A2. However, adjusting for study site reversed the expected association. Upon further reflection, we realized that the site-specific differences in strain distributions likely reflect the population-level effect of vaccination, which we believe outweighs the potential confounding by study site as an independent cause of both individual-level vaccination status and strain distributions irrespective of vaccination. Thus, adjusting for site would have obscured this genuine population-level effect, and therefore we elected not to do so. We will include further discussion of this point in the revised manuscript.

      Our study demonstrates the unique capacity of sieve analysis to disentangle individual- and population-level effects on vaccine effectiveness in observational settings. We will expand on these considerations, including the potential biases inherent to observational studies and the rationale for our analytical choices, within the discussion section of the revised manuscript.

      Reviewer #2 (Public review):

      Summary:

      This study introduces a new metric for assessing the efficacy of rotavirus vaccines through the genetic distance clustering of strains. The authors analyzed variations in vaccine protection using whole genome sequencing.

      Strengths:

      Evaluating vaccine efficacy using whole genome sequencing can enhance our understanding of how pathogen evolution influences disease transmission and control.

      Weaknesses:

      While the study proposed a new method for evaluating vaccine efficacy using genetic information, its weaknesses arise from the insufficient evidence that analyses based on whole genome sequencing are more reliable than those that rely solely on VP7 and VP4 genotypes.

      Though most cases received the RV5 vaccine (n=119 compared to n=30 for RV1), Figure 2 and the primary focus of the paper concentrate on RV1, as the authors identified a stronger association with genetic distance.

      Additionally, it is unclear whether the difference between the two groups (j=0 versus j=1) is statistically significant for the analysis based on genetic distance to the RV1 strain, as well as for that based on minimum genetic distance to any of the RV5 vaccine strains. In both cases, the confidence intervals show substantial overlap

      The authors do not seem to have used a criterion for model selection based on the number of clusters; therefore, k=2 may not represent the optimal number of clusters, particularly in relation to the genetic distance associated with the RV5 vaccine (Figure 1B), which does not appear to show a bimodal distribution.

      Finally, outcomes for RV1 are highly associated with both homotypic and heterotypic antibody responses (Supplemental Figure 10), which have already been shown to impact vaccine effectiveness (The Pediatric Infectious Disease Journal 40(12):p 1135-1143, 2021, doi:10.1097/INF.0000000000003286). Given this strong association, the benefit of using genetic distance is unclear, as the GxPx genotype serves as a good proxy for genetic similarity. 

      We sincerely appreciate reviewer's careful consideration of our manuscript and their constructive suggestions for improvement.

      Regarding the comparison of whole-genome sequencing with traditional VP7/VP4 genotyping, we concur that a more explicit comparison would strengthen our findings. To this end, we plan to incorporate the direct comparison of genetic distance (GD) and genotype-specific vaccine effectiveness (VE) analyses into the main text. Additionally, we will conduct an analysis of VE based on homotypic, partially heterotypic, and fully heterotypic genotype groupings. This will provide a clearer demonstration of the potential added value of GD in refining VE estimates, particularly for future applications. Given the potential for reassortment among the rotavirus gene segments, our analysis highlights that relying solely on the VP7/VP4 genotype can at times be misleading. 

      Regarding k-means clustering, we wish to clarify that the selection of k=2 was not arbitrary. It was determined using the elbow method on the total within-sum-of-squares (using the fviz_nbclust function in the factoextra R package, with n=5000 bootstrapping). While we acknowledge that other methods, such as silhouette and gap statistics, may yield different optimal cluster numbers, we prioritized maximizing group sample sizes. We will explicitly state this model selection criterion within the methods section of the revised manuscript.

      We acknowledge the reviewer’s concern regarding the overlapping confidence intervals and the statistical significance of the differences between the VE for the j=0 and j=1 groups. One way to address this would be to modify our analysis. Instead of two separate logistic regression models (controls vs j=0 cases, and controls vs j=1 cases), we could employ a multinomial logistic regression model with three categories: controls (reference), j=0 cases, and j=1 cases, then conduct Wald test to directly compare the regression slopes for the j=0 and j=1 cases against controls. We intend to explore this approach in the revised manuscript, which will provide a more rigorous assessment of differences in VE by accounting for the relationship between groups within a single model.

      Reviewer #3 (Public review):

      Overall, this is an outstanding paper. It presents a novel approach to estimating rotavirus vaccine efficacy; is clearly written and presented; and has implications for this vaccine specifically as well as type-specific vaccine evaluation more generally. The analytical framework is a creative and there is rigorous use of data and statistical approaches. It has long been argued that rotavirus immunity/vaccine performance operates beyond the scale of G/P genotyping. This paper is the first to demonstrate that convincingly, using data on all 11 viral genes and whole genome sequence analysis. I have only minor comments that I recommend should be addressed.

      We sincerely thank the reviewer for their highly positive assessment of our manuscript. We will carefully address their minor comments and incorporate their recommendations in the revised manuscript, which we believe will further enhance the clarity and impact of our study.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reply to the Reviewers

      I would like to thank the reviewers for their comments and interest in the manuscript and the study.

      Reviewer #1

      1. I would assume that there are RNA-seq and/or ChIP-seq data out there produced after knockdown of one or more of these DBPs that show directional positioning.

      The directional positioning of CTCF-binding sites at chromatin interaction sites was analyzed by CRISPR experiment (Guo Y et al. Cell 2015). We found that the machine learning and statistical analysis showed the same directional bias of CTCF-binding motif sequence and RAD21-binding motif sequence at chromatin interaction sites as the experimental analysis of Guo Y et al. (lines 229-253, Figure 3b, c, d and Table 1). Since CTCF is involved in different biological functions (Braccioli L et al. Essays Biochem. 2019 ResearchGate webpage), the directional bias of binding sites may be reduced in all binding sites including those at chromatin interaction sites (lines 68-73). In our study, we investigated the DNA-binding sites of proteins using the ChIP-seq data of DNA-binding proteins and DNase-seq data. We also confirmed that the DNA-binding sites of SMC3 and RAD21, which tend to be found in chromatin loops with CTCF, also showed the same directional bias as CTCF by the computational analysis.

      __2. Figure 6 should be expanded to incorporate analysis of DBPs not overlapping CTCF/cohesin in chromatin interaction data that is important and potentially more interesting than the simple DBPs enrichment reported in the present form of the figure. __

      Following the reviewer's advice, I performed the same analysis with the DNA-binding sites that do no overlap with the DNA-binding sites of CTCF and cohesin (RAD21 and SMC3) (Fig. 6 and Supplementary Fig. 4). The result showed the same tendency in the distribution of DNA-binding sites. The height of a peak on the graph became lower for some DNA-binding proteins after removing the DNA-binding sites that overlapped with those of CTCF and cohesin. I have added the following sentence on lines 435 and 829: For the insulator-associated DBPs other than CTCF, RAD21, and SMC3, the DNA-binding sites that do not overlap with those of CTCF, RND21, and SMC3 were used to examine their distribution around interaction sites.

      3. Critically, I would like to see use of Micro-C/Hi-C data and ChIP-seq from these factors, where insulation scores around their directionally-bound sites show some sort of an effect like that presumed by the authors - and many such datasets are publicly-available and can be put to good use here.

      As suggested by the reviewer, I have added the insulator scores and boundary sites from the 4D nucleome data portal as tracks in the UCSC genome browser. The insulator scores seem to correspond to some extent to the H3K27me3 histone marks from ChIP-seq (Fig. 4a and Supplementary Fig. 3). We found that the DNA-binding sites of the insulator-associated DBPs were statistically overrepresented in the 5 kb boundary sites more than other DBPs (Fig. 4d). The direction of DNA-binding sites on the genome can be shown with different colors (e.g. red and green), but the directionality of insulator-associated DNA-binding sites is their overall tendency, and it may be difficult to notice the directionality from each binding site because the directionality may be weaker than that of CTCF, RAD21, and SMC3 as shown in Table 1 and Supplementary Table 2. We also observed the directional biases of CTCF, RAD21, and SMC3 by using Micro-C chromatin interaction data as we estimated, but the directionality was more apparent to distinguish the differences between the four directions of FR, RF, FF, and RR using CTCF-mediated ChIA-pet chromatin interaction data (lines 287 and 288).

       I found that the CTCF binding sites examined by a wet experiment in the previous study may not always overlap with the boundary sites of chromatin interactions from Micro-C assay (Guo Y et al. *Cell* 2015). The chromatin interaction data do not include all interactions due to the high sequencing cost of the assay, and include less long-range interactions due to distance bias. The number of the boundary sites may be smaller than that of CTCF binding sites acting as insulators and/or some of the CTCF binding sites may not be locate in the boundary sites. It may be difficult for the boundary location algorithm to identify a short boundary location. Due to the limitations of the chromatin interaction data, I planned to search for insulator-associated DNA-binding proteins without using chromatin interaction data in this study.
      
       I discussed other causes in lines 614-622: Another reason for the difference may be that boundary sites are more closely associated with topologically associated domains (TADs) of chromosome than are insulator sites. Boundary sites are regions identified based on the separation of numerous chromatin interactions. On the other hand, we found that the multiple DNA-binding sites of insulator-associated DNA-binding proteins were located close to each other at insulator sites and were associated with distinct nested and focal chromatin interactions, as reported by Micro-C assay. These interactions may be transient and relatively weak, such as tissue/cell type, conditional or lineage-specific interactions.
      
       Furthermore, I have added the statistical summary of the analysis in lines 372-395 as follows: Overall, among 20,837 DNA-binding sites of the 97 insulator-associated proteins found at insulator sites identified by H3K27me3 histone modification marks (type 1 insulator sites), 1,315 (6%) overlapped with 264 of 17,126 5kb long boundary sites, and 6,137 (29%) overlapped with 784 of 17,126 25kb long boundary sites in HFF cells. Among 5,205 DNA-binding sites of the 97 insulator-associated DNA-binding proteins found at insulator sites identified by H3K27me3 histone modification marks and transcribed regions (type 2 insulator sites), 383 (7%) overlapped with 74 of 17,126 5-kb long boundary sites, 1,901 (37%) overlapped with 306 of 17,126 25-kb long boundary sites. Although CTCF-binding sites separate active and repressive domains, the limited number of DNA-binding sites of insulator-associated proteins found at type 1 and 2 insulator sites overlapped boundary sites identified by chromatin interaction data. Furthermore, by analyzing the regulatory regions of genes, the DNA-binding sites of the 97 insulator-associated DNA-binding proteins were found (1) at the type 1 insulator sites (based on H3K27me3 marks) in the regulatory regions of 3,170 genes, (2) at the type 2 insulator sites (based on H3K27me3 marks and gene expression levels) in the regulatory regions of 1,044 genes, and (3) at insulator sites as boundary sites identified by chromatin interaction data in the regulatory regions of 6,275 genes. The boundary sites showed the highest number of overlaps with the DNA-binding sites. Comparing the insulator sites identified by (1) and (3), 1,212 (38%) genes have both types of insulator sites. Comparing the insulator sites between (2) and (3), 389 (37%) genes have both types of insulator sites. From the comparison of insulator and boundary sites, we found that (1) or (2) types of insulator sites overlapped or were close to boundary sites identified by chromatin interaction data.
      

      4. The suggested alternative transcripts function, also highlighted in the manuscripts abstract, is only supported by visual inspection of a few cases for several putative DBPs. I believe this is insufficient to support what looks like one of the major claims of the paper when reading the abstract, and a more quantitative and genome-wide analysis must be adopted, although the authors mention it as just an 'observation'.

      According to the reviewer's comment, I performed the genome-wide analysis of alternative transcripts where the DNA-binding sites of insulator-associated proteins are located near splicing sites. The DNA-binding sites of insulator-associated DNA-binding proteins were found within 200 bp centered on splice sites more significantly than the other DNA-binding proteins (Fig. 4e and Table 2). I have added the following sentences on lines 405 - 412: We performed the statistical test to estimate the enrichment of insulator-associated DNA-binding sites compared to the other DNA-binding proteins, and found that the insulator-associated DNA-binding sites were significantly more abundant at splice sites than the DNA-binding sites of the other proteins (Fig 4e and Table 2; Mann‒Whitney U test, p value 5. Figure 1 serves no purpose in my opinion and can be removed, while figures can generally be improved (e.g., the browser screenshots in Figs 4 and 5) for interpretability from readers outside the immediate research field.

      I believe that the Figure 1 would help researchers in other fields who are not familiar with biological phenomena and functions to understand the study. More explanation has been included in the Figures and legends of Figs. 4 and 5 to help readers outside the immediate research field understand the figures.

      6. Similarly, the text is rather convoluted at places and should be re-approached with more clarity for less specialized readers in mind.

      Reviewer #2's comments would be related to this comment. I have introduced a more detailed explanation of the method in the Results section, as shown in the responses to Reviewer #2's comments.

      Reviewer #2

      1. Introduction, line 95: CTCF appears two times, it seems redundant.

      On lines 91-93, I deleted the latter CTCF from the sentence "We examine the directional bias of DNA-binding sites of CTCF and insulator-associated DBPs, including those of known DBPs such as RAD21 and SMC3".

      2. Introduction, lines 99-103: Please stress better the novelty of the work. What is the main focus? The new identified DPBs or their binding sites? What are the "novel structural and functional roles of DBPs" mentioned?

      Although CTCF is known to be the main insulator protein in vertebrates, we found that 97 DNA-binding proteins including CTCF and cohesin are associated with insulator sites by modifying and developing a machine learning method to search for insulator-associated DNA-binding proteins. Most of the insulator-associated DNA-binding proteins showed the directional bias of DNA-binding motifs, suggesting that the directional bias is associated with the insulator.

       I have added the sentence in lines 96-99 as follows: Furthermore, statistical testing the contribution scores between the directional and non-directional DNA-binding sites of insulator-associated DBPs revealed that the directional sites contributed more significantly to the prediction of gene expression levels than the non-directional sites. I have revised the statement in lines 101-110 as follows: To validate these findings, we demonstrate that the DNA-binding sites of the identified insulator-associated DBPs are located within potential insulator sites, and some of the DNA-binding sites in the insulator site are found without the nearby DNA-binding sites of CTCF and cohesin. Homologous and heterologous insulator-insulator pairing interactions are orientation-dependent, as suggested by the insulator-pairing model based on experimental analysis in flies. Our method and analyses contribute to the identification of insulator- and chromatin-associated DNA-binding sites that influence EPIs and reveal novel functional roles and molecular mechanisms of DBPs associated with transcriptional condensation, phase separation and transcriptional regulation.
      

      3. Results, line 111: How do the SNPs come into the procedure? From the figures it seems the input is ChIP-seq peaks of DNBPs around the TSS.

      On lines 121-124, to explain the procedure for the SNP of an eQTL, I have added the sentence in the Methods: "If a DNA-binding site was located within a 100-bp region around a single-nucleotide polymorphism (SNP) of an eQTL, we assumed that the DNA-binding proteins regulated the expression of the transcript corresponding to the eQTL".

      4. Again, are those SNPs coming from the different cell lines? Or are they from individuals w.r.t some reference genome? I suggest a general restructuring of this part to let the reader understand more easily. One option could be simplifying the details here or alternatively including all the necessary details.

      On line 119, I have included the explanation of the eQTL dataset of GTEx v8 as follows: " The eQTL data were derived from the GTEx v8 dataset, after quality control, consisting of 838 donors and 17,382 samples from 52 tissues and two cell lines". On lines 681 and 865, I have added the filename of the eQTL data "(GTEx_Analysis_v8_eQTL.tar)".

      5. Figure 1: panel a and b are misleading. Is the matrix in panel a equivalent to the matrix in panel b? If not please clarify why. Maybe in b it is included the info about the SNPs? And if yes, again, what is then difference with a.

      The reviewer would mention Figure 2, not Figure 1. If so, the matrices in panels a and b in Figure 2 are equivalent. I have shown it in the figure: The same figure in panel a is rotated 90 degrees to the right. The green boxes in the matrix show the regions with the ChIP-seq peak of a DNA-binding protein overlapping with a SNP of an eQTL. I used eQTL data to associate a gene with a ChIP-seq peak that was more than 2 kb upstream and 1 kb downstream of a transcriptional start site of a gene. For each gene, the matrix was produced and the gene expression levels in cells were learned and predicted using the deep learning method. I have added the following sentences to explain the method in lines 133 - 139: Through the training, the tool learned to select the binding sites of DNA-binding proteins from ChIP-seq assays that were suitable for predicting gene expression levels in the cell types. The binding sites of a DNA-binding protein tend to be observed in common across multiple cell and tissue types. Therefore, ChIP-seq data and eQTL data in different cell and tissue types were used as input data for learning, and then the tool selected the data suitable for predicting gene expression levels in the cell types, even if the data were not obtained from the same cell types.

      6. Line 386-388: could the author investigate in more detail this observation? Does it mean that loops driven by other DBPs independent of the known CTCF/Cohesin? Could the author provide examples of chromatin structural data e.g. MicroC?

      As suggested by the reviewer, to help readers understand the observation, I have added Supplementary Fig. S4c to show the distribution of DNA-binding sites of "CTCF, RAD21, and SMC3" and "BACH2, FOS, ATF3, NFE2, and MAFK" around chromatin interaction sites. I have modified the following sentence to indicate the figure on line 501: Although a DNA-binding-site distribution pattern around chromatin interaction sites similar to those of CTCF, RAD21, and SMC3 was observed for DBPs such as BACH2, FOS, ATF3, NFE2, and MAFK, less than 1% of the DNA-binding sites of the latter set of DBPs colocalized with CTCF, RAD21, or SMC3 in a single bin (Fig. S4c).

       In Aljahani A et al. *Nature Communications* 2022, we find that depletion of cohesin causes a subtle reduction in longer-range enhancer-promoter interactions and that CTCF depletion can cause rewiring of regulatory contacts. Together, our data show that loop extrusion is not essential for enhancer-promoter interactions, but contributes to their robustness and specificity and to precise regulation of gene expression. Goel VY et al. *Nature Genetics* 2023 mentioned in the abstract: Microcompartments frequently connect enhancers and promoters and though loss of loop extrusion and inhibition of transcription disrupts some microcompartments, most are largely unaffected. These results suggested that chromatin loops can be driven by other DBPs independent of the known CTCF/Cohesin.
      
      I added the following sentence on lines 569-577: The depletion of cohesin causes a subtle reduction in longer-range enhancer-promoter interactions and that CTCF depletion can cause rewiring of regulatory contacts. Another group reported that enhancer-promoter interactions and transcription are largely maintained upon depletion of CTCF, cohesin, WAPL or YY1. Instead, cohesin depletion decreased transcription factor binding to chromatin. Thus, cohesin may allow transcription factors to find and bind their targets more efficiently. Furthermore, the loop extrusion is not essential for enhancer-promoter interactions, but contributes to their robustness and specificity and to precise regulation of gene expression.
      
       FOXA1 pioneer factor functions as an initial chromatin-binding and chromatin-remodeling factor and has been reported to form biomolecular condensates (Ji D et al. *Molecular Cell* 2024). CTCF have also found to form transcriptional condensate and phase separation (Lee R et al. *Nucleic acids research* 2022). FOS was found to be an insulator-associated DNA-binding protein in this study and is potentially involved in chromatin remodeling, transcription condensation, and phase separation with the other factors such as BACH2, ATF3, NFE2 and MAFK. I have added the following sentence on line 556: FOXA1 pioneer factor functions as an initial chromatin-binding and chromatin-remodeling factor and has been reported to form biomolecular condensates.
      

      7. In general, how the presented results are related to some models of chromatin architecture, e.g. loop extrusion, in which it is integrated convergent CTCF binding sites?

      Goel VY et al. Nature Genetics 2023 identified highly nested and focal interactions through region capture Micro-C, which resemble fine-scale compartmental interactions and are termed microcompartments. In the section titled "Most microcompartments are robust to loss of loop extrusion," the researchers noted that a small proportion of interactions between CTCF and cohesin-bound sites exhibited significant reductions in strength when cohesin was depleted. In contrast, the majority of microcompartmental interactions remained largely unchanged under cohesin depletion. Our findings indicate that most P-P and E-P interactions, aside from a few CTCF and cohesin-bound enhancers and promoters, are likely facilitated by a compartmentalization mechanism that differs from loop extrusion. We suggest that nested, multiway, and focal microcompartments correspond to small, discrete A-compartments that arise through a compartmentalization process, potentially influenced by factors upstream of RNA Pol II initiation, such as transcription factors, co-factors, or active chromatin states. It follows that if active chromatin regions at microcompartment anchors exhibit selective "stickiness" with one another, they will tend to co-segregate, leading to the development of nested, focal interactions. This microphase separation, driven by preferential interactions among active loci within a block copolymer, may account for the striking interaction patterns we observe.

       The authors of the paper proposed several mechanisms potentially involved in microcompartments. These mechanisms may be involved in looping with insulator function. Another group reported that enhancer-promoter interactions and transcription are largely maintained upon depletion of CTCF, cohesin, WAPL or YY1. Instead, cohesin depletion decreased transcription factor binding to chromatin. Thus, cohesin may allow transcription factors to find and bind their targets more efficiently (Hsieh TS et al. *Nature Genetics* 2022). Among the identified insulator-associated DNA-binding proteins, Maz and MyoD1 form loops without CTCF (Xiao T et al. *Proc Natl Acad Sci USA* 2021 ; Ortabozkoyun H et al. *Nature genetics* 2022 ; Wang R et al. *Nature communications* 2022). I have added the following sentences on lines 571-575: Another group reported that enhancer-promoter interactions and transcription are largely maintained upon depletion of CTCF, cohesin, WAPL or YY1. Instead, cohesin depletion decreased transcription factor binding to chromatin. Thus, cohesin may allow transcription factors to find and bind their targets more efficiently. I have included the following explanation on lines 582-584: Maz and MyoD1 among the identified insulator-associated DNA-binding proteins form loops without CTCF.
      
       As for the directionality of CTCF, if chromatin loop anchors have some structural conformation, as shown in the paper entitled "The structural basis for cohesin-CTCF-anchored loops" (Li Y et al. *Nature* 2020), directional DNA binding would occur similarly to CTCF binding sites. Moreover, cohesin complexes that interact with convergent CTCF sites, that is, the N-terminus of CTCF, might be protected from WAPL, but those that interact with divergent CTCF sites, that is, the C-terminus of CTCF, might not be protected from WAPL, which could release cohesin from chromatin and thus disrupt cohesin-mediated chromatin loops (Davidson IF et al. *Nature Reviews Molecular Cell Biology* 2021). Regarding loop extrusion, the 'loop extrusion' hypothesis is motivated by in vitro observations. The experiment in yeast, in which cohesin variants that are unable to extrude DNA loops but retain the ability to topologically entrap DNA, suggested that in vivo chromatin loops are formed independently of loop extrusion. Instead, transcription promotes loop formation and acts as an extrinsic motor that extends these loops and defines their final positions (Guerin TM et al. *EMBO Journal* 2024). I have added the following sentences on lines 543-547: Cohesin complexes that interact with convergent CTCF sites, that is, the N-terminus of CTCF, might be protected from WAPL, but those that interact with divergent CTCF sites, that is, the C-terminus of CTCF, might not be protected from WAPL, which could release cohesin from chromatin and thus disrupt cohesin-mediated chromatin loops. I have included the following sentences on lines 577-582: The 'loop extrusion' hypothesis is motivated by in vitro observations. The experiment in yeast, in which cohesin variants that are unable to extrude DNA loops but retain the ability to topologically entrap DNA, suggested that in vivo chromatin loops are formed independently of loop extrusion. Instead, transcription promotes loop formation and acts as an extrinsic motor that extends these loops and defines their final positions.
      
       Another model for the regulation of gene expression by insulators is the boundary-pairing (insulator-pairing) model (Bing X et al. *Elife* 2024) (Ke W et al. *Elife* 2024) (Fujioka M et al. *PLoS Genetics* 2016). Molecules bound to insulators physically pair with their partners, either head-to-head or head-to-tail, with different degrees of specificity at the termini of TADs in flies. Although the experiments do not reveal how partners find each other, the mechanism unlikely requires loop extrusion. Homologous and heterologous insulator-insulator pairing interactions are central to the architectural functions of insulators. The manner of insulator-insulator interactions is orientation-dependent. I have summarized the model on lines 559-567: Other types of chromatin regulation are also expected to be related to the structural interactions of molecules. As the boundary-pairing (insulator-pairing) model, molecules bound to insulators physically pair with their partners, either head-to-head or head-to-tail, with different degrees of specificity at the termini of TADs in flies (Fig. 7). Although the experiments do not reveal how partners find each other, the mechanism unlikely requires loop extrusion. Homologous and heterologous insulator-insulator pairing interactions are central to the architectural functions of insulators. The manner of insulator-insulator interactions is orientation-dependent.
      

      8. Do the authors think that the identified DBPs could work in that way as well?

      The boundary-pairing (insulator-pairing) model would be applied to the insulator-associated DNA-binding proteins other than CTCF and cohesin that are involved in the loop extrusion mechanism (Bing X et al. Elife 2024) (Ke W et al. Elife 2024) (Fujioka M et al. PLoS Genetics 2016).

       Liquid-liquid phase separation was shown to occur through CTCF-mediated chromatin loops and to act as an insulator (Lee, R et al. *Nucleic Acids Research* 2022). Among the identified insulator-associated DNA-binding proteins, CEBPA has been found to form hubs that colocalize with transcriptional co-activators in a native cell context, which is associated with transcriptional condensate and phase separation (Christou-Kent M et al. *Cell Reports* 2023). The proposed microcompartment mechanisms are also associated with phase separation. Thus, the same or similar mechanisms are potentially associated with the insulator function of the identified DNA-binding proteins. I have included the following information on line 554: CEBPA in the identified insulator-associated DNA-binding proteins was also reported to be involved in transcriptional condensates and phase separation.
      

      9. Also, can the authors comment about the mechanisms those newly identified DBPs mediate contacts by active processes or equilibrium processes?

      Snead WT et al. Molecular Cell 2019 mentioned that protein post-transcriptional modifications (PTMs) facilitate the control of molecular valency and strength of protein-protein interactions. O-GlcNAcylation as a PTM inhibits CTCF binding to chromatin (Tang X et al. Nature Communications 2024). I found that the identified insulator-associated DNA-binding proteins tend to form a cluster at potential insulator sites (Supplementary Fig. 2d). These proteins may interact and actively regulate chromatin interactions, transcriptional condensation, and phase separation by PTMs. I have added the following explanation on lines 584-590: Furthermore, protein post-transcriptional modifications (PTMs) facilitate control over the molecular valency and strength of protein-protein interactions. O-GlcNAcylation as a PTM inhibits CTCF binding to chromatin. We found that the identified insulator-associated DNA-binding proteins tend to form a cluster at potential insulator sites (Fig. 4f and Supplementary Fig. 3c). These proteins may interact and actively regulate chromatin interactions, transcriptional condensation, and phase separation through PTMs.

      10. Can the author provide some real examples along with published structural data (e.g. the mentioned micro-C data) to show the link between protein co-presence, directional bias and contact formation?

      Structural molecular model of cohesin-CTCF-anchored loops has been published by Li Y et al. Nature 2020. The structural conformation of CTCF and cohesin in the loops would be the cause of the directional bias of CTCF binding sites, which I mentioned in lines 539 - 543 as follows: These results suggest that the directional bias of DNA-binding sites of insulator-associated DBPs may be involved in insulator function and chromatin regulation through structural interactions among DBPs, other proteins, DNAs, and RNAs. For example, the N-terminal amino acids of CTCF have been shown to interact with RAD21 in chromatin loops.

       To investigate the principles underlying the architectural functions of insulator-insulator pairing interactions, two insulators, Homie and Nhomie, flanking the *Drosophila even skipped *locus were analyzed. Pairing interactions between the transgene Homie and the eve locus are directional. The head-to-head pairing between the transgene and endogenous Homie matches the pattern of activation (Fujioka M et al. *PLoS Genetics* 2016).
      

      Reviewer #3

      Major Comments:

      1. Some of these TFs do not have specific direct binding to DNA (P300, Cohesin). Since the authors are using binding motifs in their analysis workflow, I would remove those from the analysis.

      When a protein complex binds to DNA, one protein of the complex binds to the DNA directory, and the other proteins may not bind to DNA. However, the DNA motif sequence bound by the protein may be registered as the DNA-binding motif of all the proteins in the complex. The molecular structure of the complex of CTCF and Cohesin showed that both CTCF and Cohesin bind to DNA (Li Y et al. Nature 2020). I think there is a possibility that if the molecular structure of a protein complex becomes available, the previous recognition of the DNA-binding ability of a protein may be changed. Therefore, I searched the Pfam database for 99 insulator-associated DNA-binding proteins identified in this study. I found that 97 are registered as DNA-binding proteins and/or have a known DNA-binding domain, and EP300 and SIN3A do not directory bind to DNA, which was also checked by Google search. I have added the following explanation in line 257 to indicate direct and indirect DNA-binding proteins: Among 99 insulator-associated DBPs, EP300 and SIN3A do not directory interact with DNA, and thus 97 insulator-associated DBPs directory bind to DNA. I have updated the sentence in line 20 of the Abstract as follows: We discovered 97 directional and minor nondirectional motifs in human fibroblast cells that corresponded to 23 DBPs related to insulator function, CTCF, and/or other types of chromosomal transcriptional regulation reported in previous studies.

      2. I am not sure if I understood correctly, by why do the authors consider enhancers spanning 2Mb (200 bins of 10Kb around eSNPs)? This seems wrong. Enhancers are relatively small regions (100bp to 1Kb) and only a very small subset form super enhancers.

      As the reviewer mentioned, I recognize enhancers are relatively small regions. In the paper, I intended to examine further upstream and downstream of promoter regions where enhancers are found. Therefore, I have modified the sentence in lines 929 - 931 of the Fig. 2 legend as follows: Enhancer-gene regulatory interaction regions consist of 200 bins of 10 kbp between -1 Mbp and 1 Mbp region from TSS, not including promoter.

      3. I think the H3K27me3 analysis was very good, but I would have liked to see also constitutive heterochromatin as well, so maybe repeat the analysis for H3K9me3.

      Following the reviewer's advice, I have added the ChIP-seq data of H3K9me3 as a truck of the UCSC Genome Browser. The distribution of H3K9me3 signal was different from that of H3K27me3 in some regions. I also found the insulator-associated DNA-binding sites close to the edges of H3K9me3 regions and took some screenshots of the UCSC Genome Browser of the regions around the sites in Supplementary Fig. 3b. I have modified the following sentence on lines 974 - 976 in the legend of Fig. 4: a Distribution of histone modification marks H3K27me3 (green color) and H3K9me3 (turquoise color) and transcript levels (pink color) in upstream and downstream regions of a potential insulator site (light orange color). I have also added the following result on lines 356 - 360: The same analysis was performed using H3K9me3 marks, instead of H3K27me3 (Fig. S3b). We found that the distribution of H3K9me3 signal was different from that of H3K27me3 in some regions, and discovered the insulator-associated DNA-binding sites close to the edges of H3K9me3 regions (Fig. S3b).

      4. I was not sure I understood the analysis in Figure 6. The binding site is with 500bp of the interaction site, but micro-C interactions are at best at 1Kb resolution. They say they chose the centre of the interaction site, but we don't know exactly where there is the actual interaction. Also, it is not clear what they measure. Is it the number of binding sites of a specific or multiple DBP insulator proteins at a specific distance from this midpoint that they recover in all chromatin loops? Maybe I am missing something. This analysis was not very clear.

      The resolution of the Micro-C assay is considered to be 100 bp and above, as the human nucleome core particle contains 145 bp (and 193 bp with linker) of DNA. However, internucleosomal DNA is cleaved by endonuclease into fragments of multiples of 10 nucleotides (Pospelov VA et al. Nucleic Acids Research 1979). Highly nested focal interactions were observed (Goel VY et al. Nature Genetics 2023). Base pair resolution was reported using Micro Capture-C (Hua P et al. Nature 2021). Sub-kilobase (20 bp resolution) chromatin topology was reported using an MNase-based chromosome conformation capture (3C) approach (Aljahani A et al. Nature Communications 2022). On the other hand, Hi-C data was analyzed at 1 kb resolution. (Gu H et al. bioRxiv 2021). If the resolution of Micro-C interactions is at best at 1 kb, the binding sites of a DNA-binding protein will not show a peak around the center of the genomic locations of interaction edges. Each panel shows the number of binding sites of a specific DNA-binding protein at a specific distance from the midpoint of all chromatin interaction edges. I have modified and added the following sentences in lines 593-597: High-resolution chromatin interaction data from a Micro-C assay indicated that most of the predicted insulator-associated DBPs showed DNA-binding-site distribution peaks around chromatin interaction sites, suggesting that these DBPs are involved in chromatin interactions and that the chromatin interaction data has a high degree of resolution. Base pair resolution was reported using Micro Capture-C.

      Minor Comments:

      1. PIQ does not consider TF concentration. Other methods do that and show that TF concentration improves predictions (e.g., ____https://www.biorxiv.org/content/10.1101/2023.07.15.549134v2____or ____https://pubmed.ncbi.nlm.nih.gov/37486787____/). The authors should discuss how that would impact their results.

      The directional bias of CTCF binding sites was identified by ChIA-pet interactions of CTCF binding sites. The analysis of the contribution scores of DNA-binding sites of proteins considering the binding sites of CTCF as an insulator showed the same tendency of directional bias of CTCF binding sites. In the analysis, to remove the false-positive prediction of DNA-binding sites, I used the binding sites that overlapped with a ChIP-seq peak of the DNA-binding protein. This result suggests that the DNA-binding sites of CTCF obtained by the current analysis have sufficient quality. Therefore, if the accuracy of prediction of DNA-binding sites is improved, although the number of DNA-binding sites may be different, the overall tendency of the directionality of DNA-binding sites will not change and the results of this study will not change significantly.

       As for the first reference in the reviewer's comment, chromatin interaction data from Micro-C assay does not include all chromatin interactions in a cell or tissue, because it is expensive to cover all interactions. Therefore, it would be difficult to predict all chromatin interactions based on machine learning. As for the second reference in the reviewer's comment, pioneer factors such as FOXA are known to bind to closed chromatin regions, but transcription factors and DNA-binding proteins involved in chromatin interactions and insulators generally bind to open chromatin regions. The search for the DNA-binding motifs is not required in closed chromatin regions.
      

      2. DeepLIFT is a good approach to interpret complex structures of CNN, but is not truly explainable AI. I think the authors should acknowledge this.

      In the DeepLIFT paper, the authors explain that DeepLIFT is a method for decomposing the output prediction of a neural network on a specific input by backpropagating the contributions of all neurons in the network to every feature of the input (Shrikumar A et al. ICML 2017). DeepLIFT compares the activation of each neuron to its 'reference activation' and assigns contribution scores according to the difference. DeepLIFT calculates a metric to measure the difference between an input and the reference of the input.

       Truly explainable AI would be able to find cause and reason, and to make choices and decisions like humans. DeepLIFT does not perform causal inferences. I did not use the term "Explainable AI" in our manuscript, but I briefly explained it in Discussion. I have added the following explanation in lines 623-628: AI (Artificial Intelligence) is considered as a black box, since the reason and cause of prediction are difficult to know. To solve this issue, tools and methods have been developed to know the reason and cause. These technologies are called Explainable AI. DeepLIFT is considered to be a tool for Explainable AI. However, DeepLIFT does not answer the reason and cause for a prediction. It calculates scores representing the contribution of the input data to the prediction.
      
       Furthermore, to improve the readability of the manuscript, I have included the following explanation in lines 159-165: we computed DeepLIFT scores of the input data (i.e., each binding site of the ChIP-seq data of DNA-binding proteins) in the deep leaning analysis on gene expression levels. DeepLIFT compares the importance of each input for predicting gene expression levels to its 'reference or background level' and assigns contribution scores according to the difference. DeepLIFT calculates a metric to measure the difference between an input and the reference of the input.
      
    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      Osato and Hamada propose a systematic approach to identify DNA binding proteins that display directional binding. They used a modified Deep Learning method (DEcode) to investigate binding profiles of 1356 DBP from GTRD database at promoters (30 of 100bp bins around TSS) and enhancers (200 bins of 10Kb around eSNPs) and use this to predict expression of 25,071 genes in Fibroblasts, Monocytes, HMEC and NPC. This method achieves a good prediction power (Spearman correlation between predicted and actual expression of 0.74). They then use PIQ, and overlap predicted binding sites with actual ChIP-seq data to investigate the motifs of TFs that are controlling gene expression. They find 99 insulator proteins showing either a specific directional bias or minor non-directional bias, corresponding to 23 DBP previously reported to have insulator function. Of the 23 proteins they identify as regulating enhancer promoter interactions, 13 are associated with CTCF. They also show that there are significantly more insulator proteins binding sites at borders of polycomb domains, transcriptionally active or boundary regions based on chromatin interactions than other proteins.

      Major Comments:

      1. Some of these TFs do not have specific direct binding to DNA (P300, Cohesin). Since the authors are using binding motifs in their analysis workflow, I would remove those from the analysis.
      2. I am not sure if I understood correctly, by why do the authors consider enhancers spanning 2Mb (200 bins of 10Kb around eSNPs)? This seems wrong. Enhancers are relatively small regions (100bp to 1Kb) and only a very small subset form super enhancers.
      3. I think the H3K27me3 analysis was very good, but I would have liked to see also constitutive heterochromatin as well, so maybe repeat the analysis for H3K9me3.
      4. I was not sure I understood the analysis in Figure 6. The binding site is with 500bp of the interaction site, but micro-C interactions are at best at 1Kb resolution. They say they chose the centre of the interaction site, but we don't know exactly where there is the actual interaction. Also, it is not clear what they measure. Is it the number of binding sites of a specific or multiple DBP insulator proteins at a specific distance from this midpoint that they recover in all chromatin loops? Maybe I am missing something. This analysis was not very clear.

      Minor comments:

      1. PIQ does not consider TF concentration. Other methods do that and show that TF concentration improves predictions (e.g., https://www.biorxiv.org/content/10.1101/2023.07.15.549134v2 or https://pubmed.ncbi.nlm.nih.gov/37486787/). The authors should discuss how that would impact their results.
      2. DeepLIFT is a good approach to interpret complex structures of CNN, but is not truly explainable AI. I think the authors should acknowledge this.

      Referee Cross-Commenting

      I would like to mention that I agree with the comments of reviewers 1 and 2.

      Significance

      General assessment:

      This is the first study to my knowledge that attempts to use Deep Learning to identify insulators and directional biases in binding. One of the limitations is that no additional methods were used to show that these DBP have directional binding bias. It is not necessarily to employ additional methods, but it would definitely strengthen the paper.

      Advancements:

      This is a useful catalogue of potential DNA binding proteins of interest, beyond just CTCF. Some known TFs are there, but also new ones are found.

      Audience:

      Basic research mainly, with particular focus on chromatin conformation and TF binding fields.

      My expertise:

      ML/AI methods in genomics, TF binding models, epigenetics and 3D chromatin interactions.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this work, the authors describe a deep learning computational tool to identity binding motifs of DNA binding proteins associated to insulators that led to the discovery of 99 motifs related to insulation. This is in turn related to chromatin architecture and highlight the importance of directional bias in order to form chromatin loops.

      In general, there are some aspects to be clarified and better explored to make stronger conclusions. In particular, there are some aspects to clarify in the text about the Machine Learning procedure (see my points below). In addition, I have some general questions about the biological implications of the discussed findings, listed in detail in the following list.

      Also, I encourage the authors to integrate the current presentation of the data with other (published) data about chromatin architecture, to make more robust the claims and go deeper into the biological implications of the current work. Se my list below.

      It follows a specific list of relevant points to be addressed:

      Specific points:

      1. Introduction, line 95: CTCF appears two times, it seems redundant;
      2. Introduction, lines 99-103: Please stress better the novelty of the work. What is the main focus? The new identified DPBs or their binding sites? What are the "novel structural and functional roles of DBPs" mentioned?
      3. Results, line 111: How do the SNPs come into the procedure? From the figures it seems the input is ChIP-seq peaks of DNBPs around the TSS;
      4. Again, are those SNPs coming from the different cell lines? Or are they from individuals w.r.t some reference genome? I suggest a general restructuring of this part to let the reader understand more easily. One option could be simplifying the details here or alternatively including all the necessary details;
      5. Figure 1: panel a and b are misleading. Is the matrix in panel a equivalent to the matrix in panel b? If not please clarify why. Maybe in b it is included the info about the SNPs? And if yes, again, what is then difference with a.
      6. Line 386-388: could the author investigate in more detail this observation? Does it mean that loops driven by other DBPs independent of the known CTCF/Cohesin? Could the author provide examples of chromatin structural data e.g. MicroC?
      7. In general, how the presented results are related to some models of chromatin architecture, e.g. loop extrusion, in which it is integrated convergent CTCF binding sites?
      8. Do the authors think that the identified DBPs could work in that way as well?
      9. Also, can the authors comment about the mechanisms those newly identified DBPs mediate contacts by active processes or equilibrium processes?
      10. Can the author provide some real examples along with published structural data (e.g. the mentioned micro-C data) to show the link between protein co-presence, directional bias and contact formation?

      Significance

      In this work, the authors describe a deep learning computational tool to identity binding motifs of DNA binding proteins associated to insulators that led to the discovery of 99 motifs related to insulation. This is in turn related to chromatin architecture and highlight the importance of directional bias in order to form chromatin loops.

      In general, chromatin organization is an important topic in the context of a constantly expanding research field. Therefore, the work is timely and could be useful for the community. The paper appears overall well written and the figures look clear and of good quality. Nevertheless, there are some aspects to be clarified and better explored to make stronger conclusions. In particular, there are some aspects to clarify in the text about the Machine Learning procedure (see list of specific points). In addition, I have some general questions about the biological implications of the discussed findings, listed in detail in the above reported points.

    1. Reviewer #1 (Public review):

      Summary:

      This study presents findings on dual TCR regulatory T cells (Tregs) using previously published single-cell RNA and TCR sequencing datasets. The authors aimed to quantify dual TCR Tregs in different tissues and analyze their characteristics. Rather than perform the difficult experiments needed to ascertain the functional role of dual receptors, this study relies entirely on scRNA-VDJ-seq data published by two other groups. The findings primarily confirm prior work rather than provide new insights, and the methodology has significant weaknesses that limit the study's impact. We have concerns about the scientific integrity of this work.

      Strengths:

      (1) The use of single-cell RNA and TCR sequencing is appropriate for addressing potential relationships between gene expression and dual TCR.

      (2) The data confirm the presence of dual TCR Tregs in various tissues, with proportions ranging from 10.1% to 21.4%, aligning with earlier observations in αβ T cells.

      (3) Tissue-specific patterns of TCR gene usage are reported, which could be of interest to researchers studying T cell adaptation, although these were more rigorously analyzed in the original works.

      Weaknesses

      (1) Lack of Novelty: The primary findings do not substantially advance our understanding of dual TCR expression, as similar results have been reported previously in other contexts.

      (2) Incomplete Evidence: The claims about tissue-specific differences lack sufficient controls (e.g., comparison with conventional T cells) and functional validation (e.g., cell surface expression of dual TCRs).

      (3) Methodological Weaknesses: The diversity analysis does not account for sample size differences, and the clonal analysis conflates counts and clonotypes, leading to potential misinterpretation.

      (4) Insufficient Transparency: The sequence analysis pipeline is inadequately described, and the study lacks reproducibility features such as shared code and data.

      (5) Weak Gene Expression Analysis: No statistical validation is provided for differential gene expression, and the UMAP plots fail to reveal meaningful clustering patterns.

      (6) A quick online search reveals that the same authors have repeated their approach of reanalysing other scientists' publicly available scRNA-VDJ-seq data in six other publications:

      (1) Peng, Q., Xu, Y. & Yao, X. scRNA+ TCR-seq revealed dual TCR T cells antitumor response in the TME of NSCLC. J Immunother Cancer 12 (2024). https://doi.org:10.1136/jitc-2024-009376

      (2) Wang, H., Li, J., Xu, Y. & Yao, X. scRNA + BCR-seq identifies proportions and characteristics of dual BCR B cells in the peritoneal cavity of mice and peripheral blood of healthy human donors across different ages. Immun Ageing 21, 90 (2024). https://doi.org:10.1186/s12979-024-00493-6

      (3) Xu, Y. et al. scRNA+TCR-seq reveals the pivotal role of dual receptor T lymphocytes in the pathogenesis of Kawasaki disease and during IVIG treatment. Front Immunol 15, 1457687 (2024). https://doi.org:10.3389/fimmu.2024.1457687

      (4) Yuanyuanxu, Qipeng, Qingqingma & Yao, X. scRNA + TCR-seq revealed the dual TCR pTh17 and Treg T cells involvement in autoimmune response in ankylosing spondylitis. Int Immunopharmacol 135, 112279 (2024). https://doi.org:10.1016/j.intimp.2024.112279

      (5) Zhu, L. et al. scRNA-seq revealed the special TCR beta & alpha V(D)J allelic inclusion rearrangement and the high proportion dual (or more) TCR-expressing cells. Cell Death Dis 14, 487 (2023). https://doi.org:10.1038/s41419-023-06004-7

      (6) Zhu, L., Peng, Q., Wu, Y. & Yao, X. scBCR-seq revealed a special and novel IG H&L V(D)J allelic inclusion rearrangement and the high proportion dual BCR expressing B cells. Cell Mol Life Sci 80, 319 (2023). https://doi.org:10.1007/s00018-023-04973-8

      In other words, the approach used here seems to be focused on quick re-analyses of publicly available data without further validation and/or exploration

      Appraisal of the Study's Aims and Conclusions:

      The authors set out to analyze dual TCR Tregs across tissues, but the lack of robust controls, incomplete analyses, and insufficient novelty limit the study's ability to achieve its aims. The results confirm prior findings but do not provide compelling evidence to support the broader claims about the characteristics or significance of dual TCR Tregs.

      Impact and Utility:

      While the study provides a descriptive analysis of dual TCR Tregs, its limited novelty and methodological weaknesses reduce its likely impact on the field. The methods and data could have utility for researchers interested in tissue-specific TCR gene usage, but additional rigor is required to make the findings broadly applicable.

    2. Reviewer #3 (Public review):

      Summary:

      This study addressed the TCR pairing types and CDR3 characteristics of Treg cells. By analyzing scRNA and TCR-seq data, it claims that 10-20% of dual TCR Treg cells exist in mouse lymphoid and non-lymphoid tissues and suggests that dual TCR Treg cells in different tissues may play complex biological functions.

      Strengths:

      The study addresses an interesting question of how dual-TCR-expressing Treg cells play roles in tissues.

      Weaknesses:

      This study is inadequate, particularly regarding data interpretation, statistical rigor, and the discussion of the functional significance of Dual TCR Tregs.

      Major Comments:

      (1) Definition of Dual TCR and Validity of Doublet Removal<br /> This study analyzes Treg cells with Dual TCR, but it is not clearly stated how the possibility of doublet cells was eliminated. The authors mention using DoubletFinder for detecting doublets in scRNA-seq data, but is this method alone sufficient?<br /> We strongly recommend reporting the details of doublet removal and data quality assessment in the Supplementary Data.

      (2) Inconsistency in the Proportion of Dual TCR T Cells in the Skin Across Figures<br /> In Figure 3D, the proportion of Dual TCR T cells (A1+A2+B1+B2) in the skin is reported to be very high compared to other tissues. However, in Figure 4C, the proportion appears lower than in other tissues, which may be due to contamination by non-Tregs. The authors should clarify why it was necessary to include non-Tregs as a target for analysis in this study. Additionally, the sensitivity of scRNA-seq and TCR-seq may vary between tissues and may also be affected by RNA quality and sequencing depth in skin samples, so the impact of measurement bias should be assessed.

      (3) Issue of Cell Contamination<br /> In Figure 2A, the data suggest a high overlap between blood, kidney, and liver samples, likely due to contamination. Can the authors effectively remove this effect? If the dataset allows, distinguishing between blood-derived and tissue-resident Tregs would significantly enhance the reliability of the findings. Otherwise, it would be difficult to separate biological signals from contamination noise, making interpretation challenging.

      (4) Inconsistency Between CDR3 Overlap and TCR Diversity<br /> The manuscript states that Single TCR Tregs have a higher CDR3 overlap, but this contradicts the reported data that Dual TCR Tregs exhibit lower TCR diversity (higher 1/DS score). Typically, when TCR diversity is low (i.e., specific clones are concentrated), CDR3 overlap is expected to increase. The authors should carefully address this discrepancy and discuss possible explanations.

      (5) Functional Evaluation of Dual TCR Tregs<br /> This study indicates gene expression differences among tissue-resident Dual TCR T cells, but there is no experimental validation of their functional significance. Including functional assays, such as suppression assays or cytokine secretion analysis, would greatly enhance the study's impact.

      (6) Appropriateness of Statistical Analysis<br /> When discussing increases or decreases in gene expression and cell proportions (e.g., Figure 2D), the statistical methods used (e.g., t-test, Wilcoxon, FDR correction) should be explicitly described. They should provide detailed information on the statistical tests applied to each analysis.

    1. Reviewer #2 (Public review):

      Summary:

      In this paper, Andriani and colleagues are examining the potential role of Zn flux in sperm and its effect on Slo3 channels. This is an interesting question that is likely critical to how sperm function properly and Slo3 channels are a possible candidate for a downstream molecule that is impacted by Zn. In this paper, the authors use Zn imaging, sperm motility assays, and electrophysiology to show that Zn flux impacts sperm function. They then go on to look at the impact Zn has on Slo3 current and propose a binding site based on MD simulations. While the ideas are interesting, the experiments are not well described in many places making understanding the results very difficult. In addition, critical controls are missing throughout the paper.

      Strengths:

      The question of how Zn flux impacts membrane potential and sperm motility is an important one. Moreover, Slo3 presents an interesting candidate or the target of Zn regulation. The combination of methods used here also has the potential to uncover mechanisms of Zn regulation of Slo3.

      Weaknesses:

      Much of the paper lacks experimental description which makes interpretation quite difficult, or a detailed discussion is missing. Examples include:

      (1) Figure 1, particularly the Zn imaging, is not sufficiently described. How is the fluorescence intensity measured? A representative ROI? The whole tail and head? Are the sperm immobile? If not, there is evidence that motion artifacts can significantly distort these sorts of measures from Calcium measurements in Cilia. Were there controls done? Is the small amount of Zn seen in the tail above the background?

      (2) The second half of Figure 1 is also not well described. What is the extracellular solution in the recordings? When you apply the Zn ionophore, do you expect influx or efflux? I assume efflux is based on the conclusions but this should be discussed explicitly.

      (3) Figure 2H labels the Y axis, "normalized current". Normalized to what? Why do neither of the curves end at 1? A better description of what this figure represents is needed.

      (4) The alpha fold simulations are not well described. How many Zn binding sites were found? Are all of the histidine mutations in Figure 4 Supplement 1 the ones that were found?

      (5) There is no discussion of physiological intracellular Zn concentration. How much Zn is inside the sperm? How much if likely Free vs buffered? Is 100uM a reasonable physiological concentration?

      There are a number of areas where the interpretation is not well supported by the data including:

      (6) You say in the Figure 4 supplement, that "we did not observe any significant decrease in the percentage of current inhibition." But that is a pretty misleading statement. There are large changes (increases) in the amount of zinc inhibition. These might be allosteric changes but I don't think you can safely eliminate these as relevant Zn binding sites. Also, some of these mutations appear to allow at least some unbinding of Zn.

      (7) Following up on the above point, it seems unfair to conclude that the D162S, E169A, and E205 mutants are part of the inhibitory binding site for Zn when the mutation has no effect on inhibition and only an effect on the washout. The mutations on the intracellular side also had an impact on the washout so it seems equally likely that they are the critical residues based on your data.

      (8) Nowhere in the paper do you make the specific link between Zn flux and membrane hyperpolariation via Slo3. You show that Zn flux changes the ability of the sperm to hyperpolarize and you show that Slo3 is inhibited by Zn but the connection between the two is not demonstrated. There appears to be a specific Slo3 blocker. If you use this in sperm, do you no longer see the Zn effect?

      (9) In the second half of Figure 1, the authors suggest that there is "no hyperpolization in 100uM Zn. That is not really true. It is reduced but not absent.

      (10) The claim that Lrcc52 with Slo3 shows a higher current inhibition at pH 7.5 than pH 8 is not well supported because there are only 3 replicates in the 7.5 case. In addition, the claim is made in the test that 100uM ZnCl2 "already inhibited mSlo3+Lrcc52 at pH7.5", contrasted with mSlo3 alone, is not tested statistically.

      In a number of places, better controls are needed.

      (11) How specific is this effect for Zn? Mg2+, for instance, is also a divalent cation that is in the hundreds of uM range inside the cell. Does it exert the same effect? Each ion certainly has unique preferred coordination geometries, does your predicted binding with MD show what you might expect for tetrahedral coordination with Zn? Did you test other divalent cations functionally or in silicon?

      (12) For the VCF experiments, a significantly higher concentration of Zn was used (10mM). What is the reason for this? There is no discussion of how much a "puff" is. Assuming you are using the RNA injector it is probably on the order of 50nL or less. Assuming the volume of an oocyte is 1uL that would argue that the final concentration is 500uM or higher. But this is also complicated by potential local effects of high Zn at the injection site, artifacts of injecting that much metal, and the fact that a great deal of the Zn will likely be bound to other things inside the cell. Better controls are needed for this experiment.

    2. Reviewer #3 (Public review):

      Summary:

      The study titled "Zinc is a Key Regulator of the Sperm-Specific K+ Channel (Slo3) Function" aims to investigate the role of intracellular zinc in sperm capacitation and its regulation of the sperm-specific Slo3 potassium channel. Capacitation is a crucial physiological process that enables sperm to fertilize an egg, and membrane hyperpolarization through Slo3 activation is a well-established event in this process. The authors propose that intracellular zinc dynamically decreases during capacitation and inhibits Slo3-mediated K⁺ currents, thereby playing a regulatory role in sperm function.

      Strengths:

      (1) Novel Contribution to Sperm Physiology.

      The study provides new insights into how zinc dynamics contribute to sperm capacitation, specifically through its direct inhibition of Slo3 activity.<br /> Previous research has focused primarily on extracellular zinc's effect on sperm function; this work expands the discussion to intracellular zinc regulation, an area with limited prior investigation.

      (2) Strong Electrophysiological Evidence.

      The study employs inside-out patch-clamp recordings in Xenopus oocytes to demonstrate zinc's direct inhibition of Slo3 currents.<br /> The observed slow dissociation of zinc from Slo3 suggests a long-lasting regulatory effect, adding to the understanding of ion channel modulation in sperm cells.

      (3) Molecular Mechanistic Insights

      Using Molecular Dynamics (MD) simulations and mutagenesis, the authors identify potential zinc-binding sites within Slo3's voltage-sensing domain (VSD), particularly E169 and E205.

      These computational predictions are supported by electrophysiological recordings, strengthening the argument that zinc directly binds and inhibits Slo3.

      (4) Physiological Relevance and Functional Implications

      The study suggests that zinc inhibition of Slo3 could contribute to sperm motility regulation during capacitation.

      The authors provide sperm motility assays as supporting evidence, showing that zinc chelation affects motility only after capacitation has begun, suggesting a dynamic role of intracellular zinc in the capacitation process.

      Weaknesses:

      While the study presents compelling electrophysiological data and molecular insights, there are several critical gaps that must be addressed before fully supporting the physiological relevance of the findings.

      (1) The authors should measure the effects in sperm cells using the patch-clamp technique to directly record Slo3 currents. By normalizing Slo3 currents to cell capacitance at different intracellular zinc concentrations, the authors can quantitatively assess the extent of Slo3 inhibition by zinc and strengthen the physiological relevance of their findings.

      (2) Lack of Controls in Non-Capacitated Sperm

      The claim that zinc is exported from sperm during capacitation needs stronger experimental validation.

      The authors did not include a control group of non-capacitated sperm in key fluorescence imaging experiments, making it difficult to confirm that the observed zinc decrease is capacitation-specific rather than a general zinc redistribution process.

      To strengthen this conclusion, experiments should be performed in non-capacitating conditions to determine whether intracellular zinc levels remain unchanged.

      (3) Unclear Role of Zinc in Physiological Capacitation

      The study clearly demonstrates zinc inhibition of Slo3 but does not sufficiently establish how this affects capacitation at a functional level.

      Additional motility and capacitation markers should be analyzed to confirm that zinc influences sperm behavior beyond Slo3 inhibition.

      (4) Insufficient Data on Zinc-Slo3 Specificity

      The authors should consider using quinidine, a known washable Slo3 inhibitor, to confirm that zinc acts specifically on Slo3 channels rather than other endogenous ion channels.

      The study would benefit from including washout controls in the inside-out patch-clamp recordings, as seen in Figure 3-Supplement 1, to confirm that zinc inhibition is reversible or long-lasting.

      (5) Missing Discussion of Zinc's Role in CatSper Regulation

      The study focuses solely on Slo3 but does not mention CatSper, the principal Ca²⁺ channel essential for sperm capacitation.

      Zinc has been reported to inhibit CatSper activity, which could significantly impact sperm function.

      The discussion should address whether zinc's effect on Slo3 represents a broader regulatory mechanism influencing multiple ion channels during capacitation.

      Final Assessment

      This work presents important findings on zinc regulation of Slo3 channels, supported by strong electrophysiological and molecular analyses. However, the physiological relevance of these findings remains unclear due to missing controls, and needs additional functional assays. Addressing these issues would significantly enhance the manuscript's scientific rigor and impact.

    1. Reviewer #2 (Public review):

      Summary:

      The paper introduces the IBEX Knowledge-Base (KB), a shared online resource designed to help scientists working with immunofluorescence imaging. It acts as a central hub where researchers can find and share information about reagents, protocols, and imaging methods. The KB is not static like traditional publications; instead, it evolves as researchers contribute new findings and refinements. A key highlight is that it includes results of both successful and unsuccessful experiments, helping scientists avoid repeating failed experiments and saving time and resources. The platform is built on open-access tools ensuring that the information remains available to everyone. Overall, the KB aims to collaboratively accelerate research, improve reproducibility, and reduce wasted effort in imaging experiments.

      Strengths:

      (1) The IBEX KB is built entirely on open-source tools, ensuring accessibility and long-term sustainability. This approach aligns with FAIR data principles and ensures that the KB remains adaptable to future advancements.

      (2) The KB also follows strict data organization standards, ensuring that all information about reagents and protocols is clearly documented and easy to find with little ambiguity.

      (3) The KB allows scientists to report both positive and negative results, reducing duplication of effort and speeding up the research process.

      (4) The KB is helpful for all researchers, but even more so for scientists in resource-limited settings. It provides guidance on finding affordable alternatives to expensive or discontinued reagents, making it easier for researchers with fewer resources to perform high-quality experiments.

      (5) The KB includes a community discussion forum where scientists can ask for advice, share troubleshooting tips, and collaborate with others facing similar challenges.

      Weaknesses:

      (1) The potential impact of IBEX KB is very clear. However, the paper would benefit by also discussing more on KB maintenance and outreach, and how higher participation could be incentivized.

      (2) Use of resources like GitHub may limit engagement from non-coding members of the scientific community. Will there be alternative options like a user-friendly web interface to contribute more easily?

    1. 里查德·伯恩哈特(Richard Barnhart)

      根据您提供的知识库信息和上下文,关于 里查德·伯恩哈特(Richard Barnhart) 的关键点如下:


      1. 身份与领域

      • 身份:美国艺术史学者,专注于中国宋代山水画研究。
      • 研究方向
      • 宋代山水画的图像学分析,尤其是“潇湘八景”主题的视觉表达与文化意义。
      • 探讨艺术与政治、文学的关联,如潇湘八景中隐含的文人政治隐喻。

      2. 主要贡献

      • 对潇湘八景的研究
      • 视觉文本基础:在20世纪60年代开始研究宋代山水画,90年代系统梳理潇湘八景的图像意象,为后续研究奠定视觉分析框架。
      • 跨学科视角:结合艺术史、文学与政治史,揭示潇湘八景不仅是自然风景的描绘,更是文人借山水抒发政治情怀的载体。

      • 代表成果

      • 著作与论文中分析潇湘八景的构图、符号与历史背景,例如宋迪、牧溪等画家作品中的隐喻表达。

      3. 学术背景与影响

      • 学术地位
      • 被视为研究潇湘八景的先驱学者之一,其工作为后续研究者(如阿尔弗雷达·莫克)提供了关键基础。
      • 与美国其他学者(如瓦勒瑞·沃迪兹、莫克)共同推动了中国艺术史的国际研究,尤其关注宋代艺术的政治与文化语境。

      • 方法论

      • 采用图像学方法,从画面细节(如构图、笔墨、符号)解读艺术家的意图,结合历史文献(如范仲淹《岳阳楼记》)深化分析。

      4. 知识库关联信息

      • 潇湘八景的关联
      • 在用户提供的潇湘八景分析中提到,伯恩哈特的研究聚焦于宋代山水画的视觉表达,尤其是潇湘八景的图像意象,强调其“诗画结合”的传统与政治隐喻。
      • 他通过研究揭示潇湘题材如何成为文人表达“忧愤与不平”的艺术媒介,呼应唐宋文人的流放文化与政治困境。

      5. 可能的混淆说明

      • 注意
      • 您提供的知识库中另有一位名为 “伯恩哈特” 的巴西足球运动员(全名 Bernhardt,非 Barnhart),以及法国女演员 萨拉·伯恩哈特(Sarah Bernhardt)(与艺术家慕夏合作),但均与 Richard Barnhart 无关。
      • Richard Barnhart 的研究领域为中国艺术史,而非体育或表演艺术。

      6. 研究局限性

      • 知识库限制:<br /> 根据您提供的资料,关于 Richard Barnhart 的具体生平(如出生日期、教育背景、其他著作)未被详细提及,仅提及他对潇湘八景的研究贡献。如需更深入的信息,需参考其专著或学术论文。

      总结

      里查德·伯恩哈特(Richard Barnhart) 是美国研究中国宋代山水画的重要学者,尤其以对“潇湘八景”的图像学分析著称。他的工作通过视觉与文本的结合,揭示了宋代文人借山水画表达政治与文化隐喻的深层逻辑,为理解中国古典艺术提供了跨学科视角。若需进一步信息,建议查阅其关于宋代艺术的专著或学术文献。

    1. The Kirschner and Karpinski (2010) paper examined differences between Facebook users and non-users but did not analyze actual time spent on Facebook (though their methods suggest this information was collected). It should also be noted the Kirschner and Karpinski (2010) study was conducted with a relatively small (N = 219) sample of mostly upper-level undergraduate (junior and senior) and graduate students. Lastly, the study by Kolek and Saunders (2008) examined student Facebook profiles and found there were no differences in overall GPA between users and non-users. It is important to note that “non-users” in the Kolek and Saunders (2008) study were those students from a randomly selected sample of the university’s student database that researchers could not find on Facebook. These students could have either been non-users or had privacy settings enabled to disallow searching, confounding the results of these findings.Previous studies of Facebook use and academic performance have also been limited in how student grades were measured. Until now, all studies have used self-reported, non-continuous measures. Students in the first sample in the Pasek et al. (2009) study were asked to report grades on an 8-point Likert scale with anchors for “Mostly F’s” to “Mostly A’s” and categories in between such as “A’s and B’s.” For the second and third samples, GPA was coded on a four-point scale from “D or less” to “A.” Kirschner and Karpinski (2010) used a five-point Likert scale (where 0 was 1.99 and below, 1 was 2.0–2.49, 2 was 2.5–2.99, 3 was 3.0–3.49, and 4 was 3.5–4.0). Kirschner and Karpinski (2010) also coded time spent studying on a 5-point Likert scale (where 0 was less than 1 h per week, 1 was 1–5 h, 2 was 6–10 h, 3 was 11–15 h, and 4 was 16 or more hours).

      This study improves upon past research by using a large sample (N = 1839) and university-reported GPA. It includes multiple measures of Facebook usage and controls for variables like high school GPA, socioeconomic status, and ethnicity.

      Key Strength: The study controls for pre-existing academic ability (high school GPA) to isolate the effects of Facebook use.

      Limitation: Although the study is more rigorous, it cannot establish causation—only correlation.

    2. In 1987, Chickering and Gamson proposed seven principles for good practice in undergraduate education, all of which were related to student engagement. They are: (1) student/faculty contact; (2) cooperation among students; (3) active learning; (4) prompt feedback; (5) emphasizing time on task; (6) communicating high expectations; and (7) respecting diversity. The fifth principle, “emphasizing time on task,” is important in conceptualizing the relationship between Facebook use and academic success. Specifically, Chickering and Gamson (1987) state that “time plus energy equals learning,” and there can be no substitute for time students spend engaged in academic work (p. 6). If indeed students are spending their time engaging with a technology such as Facebook, their available time to spend on academic tasks might be limited. Furthermore, Astin (1984) states that students reach academic goals because of the time and effort spent on activities designed to reach those goals. For instance, if getting good grades is an important goal, the amount of time students spend studying is a critical factor in reaching that goal. If student time is spent elsewhere—on Facebook, for instance—less time will be allotted to focusing on academic activities and grades may suffer as a result.Most of the research on the academic outcomes of technology use has been conducted at the primary or secondary school level and has found disparate results. As with the research on student engagement, a comprehensive review is beyond the scope of this paper; however, the following section highlights a number of studies illustrative of the mixed effects of technology use on academic outcomes. For instance, some studies have found a positive relationship between using technology with elementary and/or high school students and academic outcomes like standardized test scores and course grades (Shapley et al., 2010, Suhr et al., 2010, Tienken and Wilson, 2007). On the other hand, some studies have found a negative relationship between technology use and academic outcomes in these settings (Aypay et al., 2007, Waight and Abd-El-Khalick, 2007). Still other studies have found contradictory positive and negative results. In a review of the literature on interactive white boards (IWB), DiGregorio and Sobel-Lojeski (2010) found both positive and negative relationships between IWB use and academic outcomes such as learning measures, student attitudes, and motivation. The conflicting results found in these studies are quite possibly due to differences in the technologies being measured (for instance, laptops vs. IWBs) and to differences in how the technologies are used (for instance, using laptops for subject area practice vs. using them for the creation of content).While studies at the university level have also found contradictory results, they serve to further clarify how specific uses of technology may be related to academic outcomes. For instance, Bliuc, Ellis, Goodyear, and Piggott (2010) discovered that students who reported using discussion boards as tools to find answers—rather than improve their understanding of a topic—were utilizing that technology in ways that encouraged reproduction and task completion rather than deep learning; those students also had lower final course grades. Rizzuto, LeDoux, and Hatala (2009) found that using a course management system for required actions (like posting comments) was positively related to course test scores. Sapp and Simon (2005) found that more students in online courses failed to complete the course and had lower grades than students in matched face-to-face courses. In related research, Weatherly, Grabe, and Arthur (2003) found that students in a section of a psychology course who were provided lecture slides via a course management system had lower exam scores than a control section, presumably because of decreased attendance. Research on college students has largely focused on institutionally supported technologies like course management systems, while very few studies have examined the effects of technologies more commonly used by students, such as Facebook.

      Annotation: The paper reviews previous studies that examined the link between Facebook use and academic performance, highlighting conflicting results.

      Key Finding: Some studies (Kirschner & Karpinski, 2010) found lower GPAs among Facebook users, while others (Pasek et al., 2009) found no significant difference.

      Methodological Limitations: Prior studies used self-reported GPA data, binary measures of Facebook use (yes/no), and small sample sizes, limiting the reliability of findings.

    3. Even though there is professional and popular interest in how Facebook use affects student learning, not much research has been conducted on the topic (Abramson, 2011, Kamenetz, 2011, Pychl, 2008). Some studies have examined the relationship between Facebook use and psychosocial outcomes. For instance, Ellison, Steinfield, and Lampe (2007) found that there is a positive relationship between using Facebook and forming and maintaining social capital. Valenzuela, Park, and Kee (2009) found that civic participation, life satisfaction, and social trust were related to the intensity of college students’ Facebook use. In newer research, Ellison, Steinfield, and Lampe (2011) extended their previous study and found that engaging in social information-seeking behaviors on Facebook (to learn more about a person with whom the user already has a connection) was related to increased social capital, while using Facebook to maintain relationships and meet strangers was not.The construct of student engagement, defined as “the amount of physical and psychological energy that the student devotes to the academic experience,” was originally proposed by Astin in 1984 (p. 297). Astin’s (1984) theory of student engagement was based on five tenets: (1) Engagement refers to the investment of physical and psychological energy; (2) Engagement occurs along a continuum (some students are more engaged than others and individual students are engaged in different activities at differing levels); (3) Engagement has both quantitative and qualitative features; (4) The amount of student learning and development associated with an educational program is directly related to the quality and quantity of student engagement in that program; and (5) The effectiveness of any educational practice is directly related to the ability of that practice to increase student engagement.

      The author discusses theories of student engagement, particularly Astin’s (1984) theory, which states that academic success depends on time and energy devoted to educational activities. Facebook, as a platform for engagement, could theoretically have positive or negative effects.

      Key Finding: Some studies suggest that students who spend more time on social media are also more engaged in co-curricular activities, but excessive usage may take time away from studying.

    1. 美国正义会社(JSA)

      美国正义会社(Justice Society of America, JSA)介绍


      1. 起源与背景

      • 首次登场:<br /> 美国正义会社(JSA)是漫画史上第一个超级英雄团队,首次出现在1940年《全明星漫画》(All Star Comics)第3期,由作家加德纳·福克斯和编辑谢尔顿·梅耶共同创作。其诞生背景为二战时期,具有明显的政治意味,角色常与反法西斯斗争相关联。
      • 黄金时代代表:<br /> 作为DC漫画“黄金时代”(1930-1950年代)的标志性团队,JSA是当时超级英雄团队的先驱,比正义联盟(JLA)和漫威的复仇者联盟早数十年出现。

      2. 初代成员与核心角色

      • 创始成员(1940年):
      • 闪电侠(杰森·加里克):首位极速者,通过吸入化学气体获得超速能力。
      • 绿灯侠(阿兰·斯科特):首位绿灯侠,凭借神秘绿灯提灯获得能量戒指,能力受限于木质物质。
      • 命运博士(肯特·奈尔森):通过神秘仪式获得通灵能力,后成为团队的精神领袖。
      • 其他成员:鹰侠(卡特·霍尔,转世埃及王子)、幽灵(西西里岛伯爵)、睡魔(保罗·萨维奇)、原子侠(阿尔·斯蒂尔)、石匣(多拉·黛安)。
      • 三巨头:<br /> 绿灯侠、闪电侠和命运博士因核心地位被称为“三巨头”。

      3. 发展历程

      • 黄金时代(1940-1950年代)
      • JSA在《全明星漫画》中活跃,以二战为背景,对抗纳粹等现实威胁。
      • 1950年代漫画产业衰退,JSA暂停连载,超级英雄题材进入低谷。
      • 白银时代重启(1960年代)
      • 随着二代闪电侠(巴里·艾伦)登场,DC引入平行宇宙概念,将JSA置于“地球-2”(Earth-2),与地球-1的正义联盟(JLA)并存。
      • 两代团队在《闪电侠》等漫画中跨宇宙合作,奠定多元宇宙基础。
      • 青铜时代回归(1970-1980年代)
      • 1970年代《全明星队》(All-Star Squadron)重启JSA,聚焦二战时期的任务。
      • 1985年《无限地球危机》事件后,多元宇宙合并为“新地球”,JSA与JLA处于同一世界。
      • 现代演进(1990年代至今)
      • 1991年推出独立刊物《美国正义会社》,2000年后更名为《JSA》,探索老英雄传承与新成员加入。
      • 新52重启(2011年):JSA以“地球-2”版本出现在《地球2》系列中,成为平行世界的统治者,风格更黑暗。
      • DC多元宇宙扩展:在《明日传奇》电视剧(2016年起)中,JSA成员如鹰侠、原子侠等活跃于跨时空冒险。

      4. 核心特点与地位

      • 历史意义
      • 首个超级英雄团队,启发了后续所有超级英雄联盟(如正义联盟、复仇者联盟)。
      • 跨公司合作:部分成员来自DC的子公司(如全美漫画),体现早期跨品牌联动。
      • 风格差异
      • JSA:二战背景,强调集体主义与爱国精神,反派多为现实威胁(如纳粹)。
      • JLA:现代设定,成员更具个人主义,对抗超自然或宇宙级威胁。
      • 多元宇宙角色
      • JSA常作为“前辈团队”,与正义联盟并肩作战,如对抗灭霸(《无限战争》)或守护时间线。
      • 命运博士的神秘能力(如操控命运沙漏)成为DC宇宙的关键元素。

      5. 重要事件与挑战

      • 《最后的日子》(The Final Crisis, 2008)
      • JSA成员为保护时间线集体牺牲,命运博士与幽灵等核心成员陨落,团队濒临解散。
      • 重启与回归
      • 通过时间旅行或多元宇宙分支,部分成员在新故事线中复活或以新身份回归(如“新52”中的地球-2 JSA)。
      • 影视表现
      • 《明日传奇》(2016-2020):JSA成员(如鹰侠、原子侠、午夜神医)作为重要角色登场,参与跨时空任务。
      • 《黑亚当》电影(2022):JSA与黑亚当的对抗被提及,命运博士与绿灯侠短暂亮相。

      6. 主要盟友与敌人

      • 盟友
      • 正义联盟(JLA)、少年泰坦、全明星中队(All-Star Squadron)、无限公司。
      • 主要敌人
      • 不义会社(Injustice Society, I Society):JSA的宿敌,成员包括反物质版本的JSA角色(如反闪电侠)。
      • 其他反派:纳粹组织、时间旅行者、宇宙级威胁(如星神、黑暗君主)。

      7. 文化影响与遗产

      • 美漫里程碑:<br /> JSA奠定了超级英雄团队协作、跨作品联动的模式,影响了后续所有超级英雄宇宙的构建。
      • 角色传承
      • 许多JSA成员的后代或继承人加入新团队(如正义协会二代、少年泰坦)。
      • 鹰侠绿灯侠等角色成为DC经典符号,持续出现在漫画、电影和游戏中。

      总结

      美国正义会社(JSA)不仅是DC漫画的传奇团队,更是整个超级英雄文化发展的基石。从二战时期的正义象征到多元宇宙中的跨时空守护者,JSA通过数十年的演变,始终代表着超级英雄团队的凝聚力与历史深度。其在漫画、影视和流行文化中的持续影响力,使其成为美漫史上不可替代的传奇。

    1. pg 2 - Elijah Cummings passed away at 68 - 72 is average lifespan age of a Black man in U.S. which is a nation's lowest life expectancy and 4 years less than a white man (7 years less than a black women and 9 less than a white women) - Black women 3-4 times more likely to die during pregnancy than a white women

    Annotators

    1. Author response:

      The following is the authors’ response to the original reviews

      Public Reviews:

      Reviewer #1 (Public review):

      The paper by Fournier et al. investigates the sensitivity of neural circuits to changes in intrinsic and synaptic conductances. The authors use models of the stomatogastric ganglion (STG) to compare how perturbations to intrinsic and synaptic parameters impact network robustness. Their main finding is that changes to intrinsic conductances tend to have a larger impact on network function than changes to synaptic conductances, suggesting that intrinsic parameters are more critical for maintaining circuit function.

      The paper is well-written and the results are compelling, but I have several concerns that need to be addressed to strengthen the manuscript. Specifically, I have two main concerns:

      (1) It is not clear from the paper what the mechanism is that leads to the importance of intrinsic parameters over synaptic parameters.

      (2) It is not clear how general the result is, both within the framework of the STG network and its function, and across other functions and networks. This is crucial, as the title of the paper appears very general.

      I believe these two elements are missing in the current manuscript, and addressing them would significantly strengthen the conclusions. Without a clear understanding of the mechanism, it is difficult to determine whether the results are merely anecdotal or if they depend on specific details such as how the network is trained, the particular function being studied, or the circuit itself. Additionally, understanding how general the findings are is vital, especially since the authors claim in the title that "Circuit function is more robust to changes in synaptic than intrinsic conductances," which suggests a broad applicability.

      I do not wish to discourage the authors from their interesting result, but the more we understand the mechanism and the generality of the findings, the more insightful the result will be for the neuroscience community.

      Major comments

      (1) Mechanism

      While the authors did a nice job of describing their results, they did not provide any mechanism for why synaptic parameters are more resilient to changes than intrinsic parameters. For example, from Figure 5, it seems that there is mainly a shift in the sensitivity curves. What is the source of this shift? Can something be changed in the network, the training, or the function to control it? This is just one possible way to investigate the mechanism, which is lacking in the paper.

      (2) Generality of the results within the framework of the STG circuit

      (a) The authors did show that their results extend to multiple networks with different parameters (the 100 networks). However, I am still concerned about the generality of the results with respect to the way the models were trained. Could it be that something in the training procedure makes the synaptic parameters more robust than intrinsic parameters? For example, the fact that duty cycle error is weighted as it is in the cost function (large beta) could potentially affect the parameters that are more important for yielding low error on the duty cycle.

      (b) Related to (a), I can think of a training scheme that could potentially improve the resilience of the network to perturbations in the intrinsic parameters rather than the synaptic parameters. For example, in machine learning, methods like dropout can be used to make the network find solutions that are robust to changes in parameters. Thus, in principle, the results could change if the training procedure for fitting the models were different, or by using a different optimization algorithm. It would be helpful to at least mention this limitation in the discussion.

      (3) Generality of the function

      The authors test their hypothesis based on the specific function of the STG. It would be valuable to see if their results generalize to other functions as well. For example, the authors could generate non-oscillatory activity in the STG circuit, or choose a different, artificial function, maybe with different duty cycles or network cycles. It could be that this is beyond the scope of this paper, but it would be very interesting to characterize which functions are more resilient to changes in synapses, rather than intrinsic parameters. In other words, the authors might consider testing their hypothesis on at least another 'function' and also discussing the generality of their results to other functions in the discussion.

      (4) Generality of the circuit

      The authors have studied the STG for many years and are pioneers in their approach, demonstrating that there is redundancy even in this simple circuit. This approach is insightful, but it is important to show that similar conclusions also hold for more general network architectures, and if not, why. In other words, it is not clear if their claim generalizes to other network architectures, particularly larger networks. For example, one might expect that the number of parameters (synaptic vs intrinsic) might play a role in how resilient the function is with respect to changes in the two sets of parameters. In larger models, the number of synaptic parameters grows as the square of the number of neurons, while the number of intrinsic parameters increases only linearly with the number of neurons. Could that affect the authors' conclusions when we examine larger models?

      In addition, how do the authors' conclusions depend on the "complexity" of the non-linear equations governing the intrinsic parameters? Would the same conclusions hold if the intrinsic parameters only consisted of fewer intrinsic parameters or simplified ion channels? All of these are interesting questions that the authors should at least address in the discussion.

      We thank Reviewer #1 for their valuable input. We agree with the reviewer that generality of the results may have been overstated. To address this we changed the title of the manuscript to make it more specific to rhythmic circuits and we included a sentence to this effect in the discussion. 

      (1) We were more interested in knowing which set of conductances is more robust in a population of models, rather than a mechanism. If such a mechanism exists it will be the subject of a different study.

      (2) (a) It is impossible to explore the whole parameter space of these models. Our method to find circuits will leave subsets of circuits out of the study. Our sole goal in constructing the model database was that the activities were similar but the conductances were different.  (b) Of course one could devise a cost function targeting circuits that are more or less robust to changes in one parameter. Whether those exist is a different matter. This is not what we intended to do.

      (3) For this we would need a different circuit that produces non-oscillatory activity. A normal pyloric rhythm circuit always produces oscillatory activity unless it is “crashed"either by temperature or perturbations, but even in this case because we don’t have a proper “control” activity (circuits crash in different ways) we would not be able to utilize the same approach.

      We think it is a valuable idea to perform a similar study in another small circuit with nonoscillatory (or rhythmic) activities. 

      (4) We did not explore the issue of how our results generalize to larger networks as it would be pure speculation. It could be potentially interesting to do a similar sensitivity analysis with a large network trained to perform a simple task. Our understanding is that many large trained networks are extremely sensitive to perturbations in synaptic weights, at the same time that the intrinsic properties of neurons in ANN are typically oversimplified and identical across units. 

      Reviewer #2 (Public review):

      Summary:

      This manuscript presents an important exploration of how intrinsic and synaptic conductances affect the robustness of neural circuits. This is a well-deserved question, and overall, the manuscript is written well and has a logical progression.

      The focus on intrinsic plasticity as a potentially overlooked factor in network dynamics is valuable. However, while the stomatogastric ganglion (STG) serves as a well-characterized and valuable model for studying network dynamics, its simplified structure and specific dynamics limit the generalizability of these findings to more complex systems, such as mammalian cortical microcircuits.

      Strengths:

      Clean and simple model. Simulations are carefully carried out and parameter space is searched exhaustively.

      Weaknesses:

      (1) Scope and Generalizability:

      The study's emphasis on intrinsic conductance is timely, but with its minimalistic and unique dynamics, the STG model poses challenges when attempting to generalize findings to other neural systems. This raises questions regarding the applicability of the results to more complex circuits, especially those found in mammalian brains and those where the dynamics are not necessarily oscillating. This is even more so (as the authors mention) because synaptic conductances in this study are inhibitory, and changes to their synaptic conductances are limited (as the driving force for the current is relatively low).

      (2) Challenges in Comparison:

      A significant challenge in the study is the comparison method used to evaluate the robustness of intrinsic versus synaptic perturbations. Perturbations to intrinsic conductances often drastically affect individual neurons' dynamics, as seen in Figure 1, where such changes result in single spikes or even the absence of spikes instead of the expected bursting behavior. This affects the input to downstream neurons, leading to circuit breakdowns. For a fair comparison, it would be essential to constrain the intrinsic perturbations so that each neuron remains within a particular functional range (e.g., maintaining a set number of spikes). This could be done by setting minimal behavioral criteria for neurons and testing how different perturbation limits impact circuit function.

      (3) Comparative Metrics for Perturbation:

      Another notable issue lies in the evaluation metrics for intrinsic and synaptic perturbations. Synaptic perturbations are straightforward to quantify in terms of conductance, but intrinsic perturbations involve more complexity, as changes in maximal conductance result in variable, nonlinear effects depending on the gating states of ion channels. Furthermore, synaptic perturbations focus on individual conductances, while intrinsic perturbations involve multiple conductance changes simultaneously. To improve fairness in comparison, the authors could, for example, adjust the x-axis to reflect actual changes in conductance or scale the data post hoc based on the real impact of each perturbation on conductance. For example, in Figure 6, the scale of the panels of the intrinsic (e.g., g_na-bar) is x500 larger than the synaptic conductance (a row below), but the maximal conductance for sodium hits maybe for a brief moment during every spike and than most of the time it is close to null. Moreover, changing the sodium conductance over the range of 0-250 for such a nonlinear current is, in many ways, unthinkable, did you ever measure two neurons with such a difference in the sodium conductance? So, how can we tell that the ranges of the perturbations make a meaningful comparison?

      We thank Reviewer #2 for their comments. We agree with both reviewers about scope and generalizability. We changed the title of the manuscript and included a sentence in the discussion to address this. 

      Recommendations for the authors:

      Reviewer #1 (Recommendations for the authors):

      (1) Line 63: Tau_b is tau in Fig 1B? What is the 'network period' tau_n? Both are defined in the methods, but it would be good to clarify here and also in the figure.

      This was fixed. Tau_b is the  bursting period and we indicated it in the figure. Network period means the period of the network activity. This was rewritten.  

      (2) Line 74: "maximal conductances g_i." What is i? I can imagine what you meant, but it would be good to clarify the notation.

      There are multiple different currents. Letter ‘i' is an index over the different types. It now reads as follows,

      "The activity of the network depends on the values of the maximal conductances g ̄ i, where i is an index corresponding to the different current types (Na,CaS,CaT,Kd,KCa,A,H,Leak IMI)"

      (3) Line 78: "conductances are changed by a random amount." How much is the "random amount"? In percentages? 

      We fixed this sentence. This is how it reads now, 

      "The blue trace in Figure 1C corresponds to the activity of the same model when each  of the intrinsic conductances is changed by a random amount within a range between 0  (completely removing the conductance) and twice its starting value, 2×gi, or equivalently, an increment of 100%."

      Similarly, in Line 87: "by a similar percent." Can you provide Figures 1E-F in percentages? Are the percentages the same?

      The phrase "by a similar percent.” Is misleading and unimportant. Thank you, we removed it. 

      (4) Line 113: Why did you add I_MI? Is it important for the results or for the conclusions?

      I_MI was added because the current is known to be there and it is not more or less important for the results or conclusions than any other current. 

      (5) Line 117: "We used a genetic algorithm to generate a database." Confusing. I guess you meant that you used genetic algorithms to optimize the cost function.

      Thank you for this comment. We fixed this sentence, see below. 

      “We used a genetic algorithm to optimize the cost function, and in this way generated a database of N = 100 models with different values of maximal conductances (Holland 88)."

      (6) Line 136: "The models in the database were constrained to produce solutions whose features were similar to the experimental measurements." Why are there differences in the features? Is this an optimization issue? I thought you wanted to claim that there are degenerate solutions, that is, solutions where the parameters are different, but the output is identical. Please clarify.

      The concept of degenerate solutions does not imply that the solutions are mathematically identical. In biology this means that they provide very similar functions, but do so with different underlying parameters (in this case, maximal conductances). The activity of the pyloric network is slightly different across animals, and it also changes over time within the same individual. Variation across models reflects individual variation in the biological circuit, and it is strength of our modeling approach. The function of the circuits are equally good because they produce biologically realistic patterns, although the details of the activity patterns show differences. 

      (7) Line 139: "distributed (p > 0.05)." What test did you use? N? Similarly, at Lines 218, 241, 239, etc. Please be more rigorous when reporting statistical tests.

      Thank you. We now specify the test we utilized every time we report a p value. 

      (8) Line 143: "In this case, it is not possible to identify clusters, suggesting that there are no underlying relationships between the features in the model database." The 2D plot is misleading, as the features are in 11 dimensions. Claims should be about the 11D space, not projections onto 2D. In fact, I don't think you can rule out correlations between the features based on the 2D plots. For example, shouldn't there be correlations between the on and off phases and the burst durations?

      Thank you. These sentences were confusing and were removed. We added the following sentence to the end of that paragraph.

      "Because the feature vectors are similar, their t-SNE projections do not form groups or clusters."

      (9) Related to this, I don't understand this sentence: "Even though the conductances are broadly distributed over many-fold ranges, the output of the circuits results in tight yet uncorrelated distributions.”

      This sentence is confusing and was removed. 

      (10) Line 158: Repetition of Line 152: Figure 3 shows the currentscapes of each cell in two model networks.

      We removed the second instance of the repeated sentences. 

      (11) Line 160: "yet the activity of the networks is similar." Well, they are similar, but not identical. I can also say that the current scapes are 'similar'. This should be better quantified and not left as a qualitative description.

      While this is an interesting point it will not change the results and conclusions of the present study. The network models are different since the values of their maximal conductances are distributed over wide ranges.  

      (12) Line 218: midpoint parameter? Is that b - the sharpness? Please be consistent. Regarding the mechanism (see above) - any ideas what leads to this shift in the sensitivity curves between the two types of parameters?

      Yes, we made a mistake. ‘b’ is the midpoint parameter. This was fixed in the text, thank you.

      (13) Figure 6 illustrates why synaptic parameters are more robust, but it is not quantified. Why not provide a quantitative measure for this claim? For example, calculate the colored area within the white square for each pair, for each cell, and for each model. Show that these measures can predict improved robustness for one model over another and for synaptic vs. intrinsic parameters.

      The ratio of areas of the colored and non-colored regions in the whole hyperboxes (for intrinsic and synaptic conductances) is the number reported in the y-axis of the sensitivity curves when we include all conductances (and not just a pair). 

      We computed the ratios of the colored/noncolored areas in all panels in figure 6 and now report these quantities as follows, 

      "We computed the proportions of areas of the white boxes that correspond to pyloric activity. These values for the intrinsic conductances panels are PD = 0.58, LP = 0.50, PY = 0.49, and the proportions for the synaptic conductances panels are PDPY = 0.62, P DLP = 0.87, and LPPD = 0.94. The occupied areas for synaptic conductances are larger than in the intrinsic conductances panels, consistent with our finding that the circuits’ activities are more robust to changes in synaptic conductances versus changes in intrinsic conductances."

      "As before, we computed the proportion of areas of pyloric activity within the white boxes: PD = 0.61, LP = 0.55, PY = 0.52, and the proportions for the synaptic conductances panels are PDPY = 0.88, PDLP = 0.87, and LPP D = 0.83. These results provide an intuition of the complexities of GP . Not only are these regions hard-to-impossible to characterize in one circuit, but they are also different across circuits.” 

      (14) Does the sign of the synaptic weights affect the conclusions?

      We did not explore this issue because all chemical synapses in this network are inhibitory.

      (15) Line 492: typo: deltai.

      We fixed this.

      Reviewer #2 (Recommendations for the authors):

      (1) Line 301 - you can also add Williams and Fletcher 2019 Neuron.

      We added the reference. Thank you. 

      (2) Line 316 - this is a strange comment as these exact regions that were shown intrinsic plasticity (e.g., Losonczy, Attila, Judit K. Makara, and Jeffrey C. Magee. "Compartmentalized dendritic plasticity and input feature storage in neurons." Nature 452.7186 (2008): 436-441).

      We did not understand this comment. 

      (3) I found only one citation for the work of Turrigiano, the most relevant of which is only mentioned in the Method section. This is odd, as her work directly relates how synaptic conductance perturbation results in changes in intrinsic conductance.

      We included more references to the work of Turrigiano to provide more context. 

      "Desai, Niraj S., Lana C. Rutherford, and Gina G. Turrigiano. "Plasticity in the intrinsic excitability of cortical pyramidal neurons." Nature neuroscience 2, no. 6 (1999): 515-520.” "Desai, Niraj S., Sacha B. Nelson, and Gina G. Turrigiano. "Activity-dependent regulation of excitability in rat visual cortical neurons." Neurocomputing 26 (1999): 101-106.”

      (4) Line 329 - The list of citations is very limited regarding studies of ext/int balance which started really way before 2009. Please give some of the credit to the classics.

      We included the following additional references.

      Van Vreeswijk, Carl, and Haim Sompolinsky. "Chaos in neuronal networks with balanced excitatory and inhibitory activity." Science 274, no. 5293 (1996): 1724-1726.

      Rubin, Ran, L. F. Abbott, and Haim Sompolinsky. "Balanced excitation and inhibition are required for high-capacity, noise-robust neuronal selectivity." Proceedings of the National Academy of Sciences 114, no. 44 (2017): E9366-E9375.

      Wang, Xiao-Jing. "Macroscopic gradients of synaptic excitation and inhibition in the neocortex." Nature reviews neuroscience 21, no. 3 (2020): 169-178.

      Lo, Chung-Chuan, Cheng-Te Wang, and Xiao-Jing Wang. "Speed-accuracy tradeoff by a control signal with balanced excitation and inhibition." Journal of Neurophysiology 114, no. 1 (2015): 650-661.

      (5) In Figure 1B, why does it say 'OFF' when the neuron is spiking?

      The label indicates the interval of time elapsed between the first spike in the PD neuron (taken as a reference), and the last spike in the burst (PD off). 

      Summary of changes to figures:

      Figure 1:

      Fixed labels indicating bursting period and burst duration.

      Figure 5:

      Added labels in panels C and D specifying the symbol corresponding to the sigmoidal parameter.

      Additional changes

      We changed the title of the manuscript as follows:

      "Rhythmic circuit function is more robust to changes in  synaptic than intrinsic conductances." We included the following sentence at the end of the Discussion Section. 

      "We believe our results will hold for other rhythmic circuits and will be relevant for similar studies in other circuits with more complex functions.”

      We realized we made a mistake with the units for maximal conductances. They were incorrectly expressed in nS (nano Siemens) in the figure labels, and correctly expressed in micro Siemens in the methods section. This was fixed and now conductances are expressed in micro Siemens consistently in the manuscript.

    1. Reviewer #1 (Public review):

      Summary:

      This study takes a detailed approach to understand the effect of menopausal hormone therapy (MHT) in brain aging of females. Neuroimaging data from the UK Biobank is used to explore brain aging and shows an unexpected effect of current MHT use and poorer brain health outcomes relative to never users. There is considerable debate about the benefits of MHT and estrogens in particular for brain health, and this analysis illustrates thta the effects are certainly not straight forward and require greater considerations.

      Strengths:

      (1) The detailed approach to obtain important information about MHT use from primary care records. Prior studies have suggested that factors such as estrogen/progestin type, route of administration, duration, and timing of use relative to menopause onset can contribute to whether MHT benefits brain health.<br /> (2) Consideration of type of menopause (spontaneous, or surgical) in the analysis, as well as sensitivity diagnoses to rule out the effect being driven by those with clinical conditions<br /> (3) The incorporation of the brain age estimate along with hippocampal volume to address brain health<br /> (4) The complex data are also well explained and interpretations are reasonable.<br /> (5) Limitations of the UKbiobank data are acknowledged

      Weaknesses:

      These have since been addressed by the authors in the revision.

    1. Author response:

      The following is the authors’ response to the original reviews

      Reviewer #1 (Public review): 

      Summary:

      The authors examine the role of the medial prefrontal cortex (mPFC) in cognitive control, i.e. the ability to use task-relevant information and ignore irrelevant information, in the rat. According to the central-computation hypothesis, cognitive control in the brain is centralized in the mPFC and according to the local hypothesis, cognitive control is performed in task-related local neural circuits. Using the place avoidance task which involves cognitive control, it is predicted that if mPFC lesions affect learning, this would support the central computation hypothesis whereas no effect of lesions would rather support the local hypothesis. The authors thus examine the effect of mPFC lesions in learning and retention of the place avoidance task. They also look at functional interconnectivity within a large network of areas that could be activated during the task by using cytochrome oxidase, a metabolic marker. In addition, electrophysiological unit recordings of CA1 hippocampal cells are made in a subset of (lesioned or intact) animals to evaluate overdispersion, a firing property that reflects cognitive control in the hippocampus. The results indicate that mPFC lesions do not impair place avoidance learning and retention (though flexibility is altered during conflict training), do not affect cognitive control seen in hippocampal place cell activity (alternation of frame-specific firing), a measure of location-specific firing variability, in pretraining. It nevertheless has some effect on functional interconnections. The results overall support the local hypothesis. 

      Strengths:

      Straightforward hypothesis: clarification of the involvement of the mPFC in the brain is expected and achieved. Appropriate use of fully mastered methods (behavioral task, electrophysiological recordings, measure of metabolic marker cytochrome oxidase) and rigorous analysis of the data. The conclusion is strongly supported by the data. 

      Weaknesses:

      No notable weaknesses in the conception, making of the study, and data analysis. The introduction does not mention important aspects of the work, i.e. cytochrome oxidase measure and electrophysiological recordings. The study is actually richer than expected from the introduction. 

      The revised Introduction now includes:

      “We used cytochrome oxidase, a metabolic marker of baseline neuronal activity, to confirm the mPFC lesions were effective and that there are non-local network consequences despite the local lesion. We first evaluated cytochrome oxidase activity in regions known to be associated with performance in the active place avoidance task, or regions with known connectivity to the mPFC. We then evaluated covariance of activity amongst the regions in an effort to detect network consequences of the lesion.”

      Reviewer #2 (Public review): 

      Park et al. set out to test two competing hypotheses about the role of the medial prefrontal cortex (PFC) in cognitive control, the ability to use task-relevant cues and ignore taskirrelevant cues to guide behavior. The "central computation" hypothesis assumes that cognitive control relies on computations performed by the PFC, which then interacts with other brain regions to accomplish the task. Alternatively, the "local computation" hypothesis suggests that computations necessary for cognitive control are carried out by other brain regions that have been shown to be essential for cognitive control tasks, such as the dorsal hippocampus and the thalamus. If the central computation hypothesis is correct, PFC lesions should disrupt cognitive control. Alternatively, if the local computation hypothesis is correct, cognitive control would be spared after PFC lesions. The task used to assess cognitive control is the active place avoidance task in which rats must avoid a section of a rotating arena using the stationary room cues and ignoring the local olfactory cues on the rotating platform. Performance on this task has previously been shown to be disrupted by hippocampal lesions and hippocampal ensembles dynamically represent the room and arena depending on the animal's proximity to the shock zone. They found no group (lesion vs. sham) differences in the three behavioral parameters tested: distance traveled, latency to enter the shock zone, and number of shock zone entries for both the standard task and the "conflict" task in which the shock zone was rotated by 180 degrees. The only significant difference was the savings index; the lesion group entered the new shock zone more often than the sham group during the first 5 minutes of the second conflict session. This deficit was interpreted as a cognitive flexibility deficit rather than a cognitive control failure. Next, the authors compared cytochrome oxidase activity between sham and lesion groups in 14 brain regions and found that only the amygdala showed significant elevation in the lesion vs. sham group. Pairwise correlation analysis revealed a striking difference between groups, with many correlations between regions lost in the lesion group (between reuniens and hippocampus, reuniens and amygdala and a correlation between dorsal CA1 and central amygdala that appeared in the lesion group and were absent in the sham group. Finally, the authors assessed dorsal hippocampal representations of the spatial frame (arena vs. room) and found no differences between lesion and sham groups. The only difference in hippocampal activity was reduced overdispersion in the lesion group compared to the sham group on the pretraining session only and this difference disappeared after the task began. Collectively, the authors interpret their findings as supporting the local computation hypothesis; computations necessary for cognitive control occur in brain regions other than the PFC. 

      Strengths:

      (1) The data were collected in a rigorous way with experimental blinding and appropriate statistical analyses. 

      (2) Multiple approaches were used to assess differences between lesion and sham groups, including behavior, metabolic activity in multiple brain regions, and hippocampal singleunit recording. 

      Weaknesses:

      (1) Only male rats were used with no justification provided for excluding females from the sample.

      This is a weakness we acknowledge. The experiments were performed at a time when we did not have female rats in the lab.

      (2) The conceptual framework used to interpret the findings was to present two competing hypotheses with mutually exclusive predictions about the impact of PFC lesions on cognitive control. The authors then use mainly null findings as evidence in support of the local computation hypothesis. They acknowledge that some people may question the notion that the active place avoidance task indeed requires cognitive control, but then call the argument "circular" because PFC has to be involved in cognitive control. This assertion does not address the possibility that the active place avoidance task simply does not require cognitive control. 

      We beg to differ that the possibility was not addressed. Prior to making the assertion, the manuscript describes the evidence that the active place avoidance task requires cognitive control. The evidence is multifold, and includes task design, behavior, and electrophysiology; we argue that this is more evidence than has been provided for other tasks that are asserted to require cognitive control. Specifically line 417 states:

      “We have previously demonstrated cognitive control in the active place avoidance task variant we used (Fig. 1) because the rats must ignore local rotating place cues to avoid the stationary shock zone. Even when the arena does not rotate, rats distinctly learn to avoid the location of shock according to distal visual room cues and local olfactory arena cues, such that the distinct place memories can be independently manipulated using probe trials [49, 50]. When the arena rotates as in the present studies, neural manipulations that impair the place avoidance are no longer impairing when the irrelevant arena cues are hidden by shallow water [14, 15, 51, 52]. Furthermore, persistent hippocampal neural circuit changes caused by active place avoidance training are not detected when shallow water hides the irrelevant arena cues to reduce the cognitive control demand [10, 31, 33]. While these findings unequivocally demonstrate the salience of relevant stationary room cues to use for avoiding shock and irrelevant arena cues to ignore during active place avoidance, the most compelling evidence of cognitive control comes from recording hippocampal ensemble discharge. Hippocampal ensemble discharge purposefully represents current position using stationary room information when the subject is close to the stationary shock zone and alternatively represents rotating arena information when the mouse is far from the stationary shock zone [Fig. 4; 10].”

      Line 436, however, acknowledges a fact that will always be true: no matter what anyone opines - until there are universally agreed upon objective criteria, it is logically possible that active place avoidance does not require cognitive control. The revision states: Despite this evidence from task design, behavioral observations, and direct electrophysiological representational switching as required to directly demonstrate cognitive control, one might still argue that it is logically possible that the active place avoidance task does not require cognitive control and this is why the mPFC lesion did not impair place avoidance of the initial shock zone. We consider such reasoning to be unproductive because it presumes that only tasks that require an intact mPFC can be cognitive control tasks. We nonetheless acknowledge that for some, we have not provided sufficient evidence that the active place avoidance requires cognitive control.

      “We assert the evidence is compelling, and together these findings require rejecting the central-computation hypothesis that the mPFC is essential for the neural computations that are necessary for all cognitive control tasks.”

      (3) The authors did not link the CO activity with the behavioral parameters even though the CO imaging was done on a subset of the animals that ran the behavioral task nor did they make any attempt to interpret these findings in light of the two competing hypotheses posed in the introduction. Moreover, the discussion lacks any mechanistic interpretations of the findings. For example, there are no attempts to explain why amygdala activity and its correlation with dCA1 activity might be higher in the PFC lesioned group. 

      The CO study was performed to assess the effects of the lesion, as stated on line 262 “Cytochrome oxidase (CO), a sensitive metabolic marker for neuronal function [27], was used to evaluate whether lesion effects were restricted to the mPFC.” Furthermore, as a matter of fact, line 411 states “Thus, CO imaging and electrophysiological evidence identify changes in the brain beyond the directly damaged mPFC area. In particular, the dorsal hippocampus loses the inhibitory input from mPFC [45, 46] and loses the metabolic correlation with the nucleus reuniens, which is thought to be a relay between the mPFC and the dorsal hippocampus [47, 48].”

      These CO measures assess baseline metabolic function and so it would be inappropriate to correlate them with the measures of behavior. Because the lesion and control groups do not differ on most measures of behavior, a relationship to CO measures is not expected. Importantly, even if there were differences in correlations between CO activity and behavioral measures, what could they mean? The study was designed to distinguish between two hypotheses, not to determine what CO differences could mean for behavior. As such, it is not at all clear how metabolic consequences of the lesion relate to the two hypotheses being evaluated, and so we consider it inappropriate to speculate. We did examine, and now include, the correlation between lesion size and conflict behavior. The Fig. 1 legend states “Savings was not related to lesion size r = 0.009, p = 0.98. *p < 0.05.”

      (4) Publishing null results is important to avoid wasting animals, time, and money. This study's results will have a significant impact on how the field views the role of the PFC in cognitive control. Whether or not some people reject the notion that the active place avoidance task measures cognitive control, the findings are solid and can serve as a starting point for generating hypotheses about how brain networks change when deprived of PFC input. 

      We thank the reviewer for the acknowledgement.

      Reviewer #3 (Public review): 

      Summary:

      This study by Park and colleagues investigated how the medial prefrontal cortex (mPFC) influences behavior and hippocampal place cell activity during a two-frame active place avoidance task in rats. Rats learned to avoid the location of mild shock within a rotating arena, with the shock zone being defined relative to distal cues in the room. Permanent chemical lesions of the mPFC did not impair the ability to avoid the shock zone by using distal cues and ignoring proximal cues in the arena. In parallel, hippocampal place cells alternated between two spatial tuning patterns, one anchored to the distal cues and the other to the proximal cues, and this alteration was not affected by the mPFC lesion. Based on these findings, the authors argue that the mPFC is not essential for differentiating between task-relevant and irrelevant information. 

      Strengths:

      This study was built on substantial work by the Fenton lab that validated their two-frame active place avoidance task and provided sound theoretical and analytical foundations. Additionally, the effectiveness of mPFC lesions was validated by several measures, enabling the authors to base their argument on the lack of lesion effects on behavior and place cell dynamics. 

      Weaknesses:

      The authors define cognitive control as "the ability to judiciously use task-relevant information while ignoring salient concurrent information that is currently irrelevant for the task." (Lines 77-78). This definition is much simpler than the one by Miller and Cohen: "the ability to orchestrate thought and action in accordance with internal goals (Ref. 1)" and by Robbins: "processes necessary for optimal scheduling of complex sequence of behaviour." (Dalley et al., 2004, PMID: 15555683). Differentiating between task-relevant and irrelevant information is required in various behavioral tasks, such as differential learning, reversal learning, and set-shifting tasks. Previous rodent behavioral studies have shown that the integrity of the mPFC is necessary for set-shifting but not for differential or reversal learning (e.g., Enomoto et al., 2011, PMID: 21146155; Cho et al., 2015, PMID: 25754826). In the present task design, the initial training is a form of differential learning between proximal and distal cues, and the conflict training is akin to reversal learning. Therefore, the lack of lesion effects is somewhat expected. It would be interesting to test whether mPFC lesions impair set-shifting in their paradigm (e.g., the shock zone initially defined by distal cues and later by proximal cues). If the mPFC lesions do not impair this ability and associated hippocampal place dynamics, it will provide strong support for the authors' local computation hypothesis.

      Thank you for these comments. In addressing them we have provided a significant revision to the manuscript’s Introduction. While authors like those cited by the reviewer have defined cognitive control, those definitions are difficult to test rigorously, as it is almost a matter of opinion whether a subject is displaying “the ability to orchestrate thought and action in accordance with internal goals" or whether they are using "processes necessary for optimal scheduling of complex sequence of behaviour." What would such definitions of cognitive control predict about neuronal activity? We have deliberately used a simple, operational definition of cognitive control because it is physiologically testable. In the revision, starting at line 93, we have provided an excerpt from Miller and Cohen (2001) with discussion. The importance of that work is that it provides explicit neuronal criteria and a means to operationally define cognitive control. As stated on Line 118 “Accordingly, cognitive control would be at work when there is sustained neuronal network representations of task-relevant information that suppresses or gates representations of salient task-irrelevant information in accord with purposeful judicious behavior.”

      We used a R+A- task variant in which there is a stationary room-frame shock zone and task irrelevant arena-frame information. A strict correspondence to shift-shifting task design cannot be accomplished with active place avoidance because an A+R- task that requires avoiding an arena-frame shock zone in the absence of a room-frame shock zone can be accomplished trivially if the subject chooses to not move when it is in a place with no shock. However, the R+A+ task variant is readily learned, in which there is both a room-frame and an arena-frame shock zone (see cited work below). This task variant requires the subject to judiciously shift between avoiding the room-frame shock zone using stationary room information and avoiding the arena-frame shock zone using rotating arena information. This R+A+ task variant might meet the reviewer’s criteria for cognitive control. We have recorded hippocampal and entorhinal ensemble activity during the R+A+ task variant and it is very similar to the activity during the R+A- task we used. Nonetheless, future work will investigate the efect of mPFC lesion on the R+A+ task variant.

      Cited work:

      Fenton AA, Wesierska M, Kaminsky Y, Bures J (1998), Both here and there: simultaneous expression of autonomous spatial memories in rats. Proc Natl Acad Sci U S A 95:11493-11498. Kelemen E, Fenton AA (2010), Dynamic grouping of hippocampal neural activity during cognitive control of two spatial frames. PLoS Biol 8:e1000403.

      Burghardt NS, Park EH, Hen R, Fenton AA (2012), Adult-born hippocampal neurons promote cognitive flexibility in mice. Hippocampus 22:1795-1808.

      Park EH, Keeley S, Savin C, Ranck JB, Jr., Fenton AA (2019), How the Internally Organized Direction Sense Is Used to Navigate. Neuron 101:1-9.

      Recommendations for the authors:  

      Reviewer #1 (Recommendations for the authors): 

      (1) Incorporate the cytochrome oxidase and hippocampal recordings (rationale and hypothesis) in the introduction, explaining how these aspects are relevant to the general question. 

      We have done this as requested. See lines 159-173 of the revised introduction.

      (2) Figure 1C. On Day 4-5 (conflict training) in which the shock zone was relocated 180 deg from the initial location, the behavioral tracks did not show any presence of the rat in this sector (in particular for the lesion example). Figure 4 nevertheless indicates that entrances have been made (which was expected since rats have to know that the shock zone was relocated).

      Thanks for pointing this out. The tracks are from the end of the sessions. The labels have been changed to specify which trials the tracks are from.

      (3) Figure 1C. The caption is huge as it contains the statistical analyses details. I would prefer to have these details in the text and keep the caption at a "reasonable" length. At the end of the caption (l. 190-191), it would be less confusing the keep the numbering of the training days: replace D1T1 with D2T1 and D2T9 with D3T9).

      The statistical details have been relocated to the main text and the numbering updated, as suggested, thank you.

      (4) It was not inconsiderable to show that mPFC lesion had some effects in the present task if it were only to validate the effectiveness of the lesion. This brain area has been shown to be important for planning, cognitive flexibility, etc. Indeed the authors found that the saving index was greater in sham than in mPFC rats (overdispersion in hippocampal firing was also reduced in pretraining) and interpreted this result as impaired flexibility. Would an alternative explanation be a memory deficit? I nevertheless expected that impaired flexibility in mPFC rats would be expressed in conflict trials in the form of more entrances in the zone that was initially not associated with shock (at least in the first trials of Day 4). But it appears to not be the case.

      A memory deficit is unlikely to explain the difference between the groups on the first trial of Day 5. Memory in the lesion rats was tested multiple times, specifically at the start of each trial (time to first entrance), including on the 24-h retention test, and no deficits were observed. Performance on Day 9 trial 1 is worse in the lesion group than in the controls, but it is not parsimonious to attribute this to a simple memory deficit since 24-h memory was good and similar between lesion and control rats on days 3 and 4, and memory on Day 5 was equally poor in both the lesion and control rats, as measured by time to first entrance.  

      (5) Material and methods. The injected volume of ibotenic acid should be mentioned. 

      The volume 0.2 µl was added. See line 531.

      (6) The rationale for doing the conflict training session should be indicated somewhere. 

      The rationale was provided. See lines 204-208.

      Reviewer #2 (Recommendations for the authors): 

      (1) Line 132: The text states that all sham rats improved and only 6/10 lesion rats improved is followed by a t-test, which tests the difference between means; it does not compare proportions. Also, what criterion was used to determine if an improvement was seen or not? 

      The statistical comparison is provided (now lines 230: test of proportions z = 2.3, p = 0.03). Improvement was simply numerically fewer entrances.

      (2) Line 138: This is a very long and confusing sentence. Consider revising for clarity. 

      The sentence (now line 234) was revised.

      (3) Figure 1B only includes data from 3 animals. Most published studies show the whole dataset by presenting the largest and smallest lesions. 

      Supplemental Figure S2 was added with all the lesions depicted and quantified.

      (4) Figure 1C suggestion to make the schematic shock zone line up with the shock zone shown for the tracking data. 

      Graphically, it looks better as drawn as it uses to perspective to depict a three-dimensional structure.

      (5) Methods: Clarify if the shock zone location was the same across all rats. 

      Line 570 states that the shock zone was the same for all rats.

      (6) Line 158: "Behavioral tracks" is not clear. Suggest more precise wording.

      Reworded to “Tracked room-frame positions” (now line 249)

      (7) Line 166: "effect of trial" - should this be the main effect of trial?; "interaction" - should this be "group x trial" interaction? 

      Reworded (now line 181).

      (8) Line 167: "or their interaction" is awkward in the context of the sentence. 

      Reworded (now line 182).

      (9) Line 182: Avoid talking about "trends" as if they are almost significant unless the authors suspect that they did not have sufficient statistical power to detect differences. In that case, a power analysis should be provided. 

      Removed.

      (10) Line 190: "left:...right..." is hard to follow, especially with acronyms like D1T1. Consider revising for clarity. 

      Revised (now lines 246-248).

      (11) Line 195: "effectiveness of the PFC to impair" is unnecessarily verbose. 

      Reworded (now lines 255-257).

      (12) Savings results: There is a lot of variability in the lesion group. It would be interesting to know if the extent of the lesion correlates with savings.

      Savings was not related to lesion. See line 259.

      (13) Line 300: The thalamic recording results are not reported in the results section (other than appearing in the table). Moreover, there is no detail about which thalamic nucleus these recordings are from.

      Lines 411 and 614 provides these details.  

      (14) Line 312: "no longer impair" contains a grammatical error. 

      Corrected (now line 422)

      (15) Line 325: "was not impairing" contains a grammatical error. 

      Corrected (now line 437).

      (16) Line 327: The sentence ending with "...opinion of others" seems unnecessarily confrontational. 

      Previous reviewers at other journals have maintained this position, we therefore included such a strong statement in our initial submission. However, we now revised this statement to avoid appearing confrontational.

      (17) Line 329: Sentence is awkward. Consider revising. 

      Revised (now line 443).

      (18) Line 384: The authors should disclose if there was an objective metric for determining the adequacy of the lesion. 

      The lesion assessment and quantification is better explained in the Methods under “Cytochrome oxidase activity and Nissl staining,” (lines 708-714).

      (19) Line 385: The authors should clarify how they got from 15 rats (Line 376) to 10. 

      This information is provided in the methods.

      (20) Line 390: It is not clear why skin irritation in the cage mate would prevent the rat from being tested. 

      This has been explained in the Methods under “Behavioral analysis followed by cytochrome oxidase activity” (lines 515-518).

      (21) Methods section: The authors should describe how the tracking data were acquired. Overhead camera? Tracker based on luminance or body position? What software program was used? What was the sampling rate? 

      This is now better explained in the Methods under “Active place avoidance task) (lines 538551).

      (22) Methods section: Include how fast the arena was rotating and other details about the task such as where rats were placed during the ITI. 

      Better explained in the Methods under “Active place avoidance task”.

      (23) Line 439: The recording system used (hardware & software) should be stated. 

      This is now included in the Methods (line 538).

      (24) Line 435: Though overdispersion calculation is described thoroughly, there is nothing in the paper that tells me what overdispersion means. 

      What the measure means is now described in the Methods under “Electrophysiology data analysis” (lines 646-650).

      (25) Line 561: The test used to assess effect sizes should be stated. 

      Effect sizes corresponding to the statistical tests are provided.

      Reviewer #3 (Recommendations for the authors): 

      (1) At the end of the conflict training, rats with mPFC lesions learned to avoid the new shock zone (Figure 1F, Block 16), but their place cells did not show room-preferring activity near the shock zone (Figure 4B). This observation questions whether spatial frame-specific representation is relevant for active avoidance. Can the authors clarify this point?

      This is a dynamic behavior and the hippocampal dynamics match, changing with a dynamic that is a few seconds, as we have shown in several published papers. The lack of a preference averaged over 20 minutes when the rats are avoiding both the current and former shock zones during the conflict session is pretty much what would be expected from such a coarse measurement. The important measure is the spatially-resolved measure of room versus arena preference. Figure 4B shows that in the lesion rats there is less of a frame preference during conflict, generally (consistent with poorer flexibility). However, Figure 4D quantifies the frame preference near and far from the shock zone and accordingly, there is no difference between the groups.

      (2) Related to the point above, the author might consider including panels in Figures 4C and D to show the neural activity during the pretraining and conflict training retention period. I assume p(room) will be comparable between the Near and Far segment in both sessions, but the p(room) may be higher in the Conflict training session than the Pretraining session. This would show that the mPFC lesion impairs suppressing the place cell activity encoding the old shock location. 

      Thanks for the suggestion. While we don’t think we can draw any strong conclusions from this analysis we are fine to show it. The issue is that during conflict, the rats have two perfectly reasonable representations of where there was shock, the initial location that was turned off to make the conflict, and the most recent conflict location of shock. Importantly, these recordings are during conflict retention after we turned off the shock for the retention recording (for the second time in the rat’s experience). Turning off the shock allows us to exactly match the physical conditions of pretraining, initial retention and conflict retention, which was the experimental design’s goal. However, the experiential history of the rats prior to initial retention and conflict retention cannot match, because during initial retention the rats had never experienced a changed shock zone whereas, by conflict retention, they had experienced multiple changes. Importantly, we have previously shown that mouse hippocampal ensembles represent both initial and conflict shock locations, as the animals consider their options during conflict trials (see Dvorak et al 2018, PLoS Biol 16:e2003354). Consequently, we cannot make any strong predictions about whether or not hippocampal activity during conflict retention should be room-frame preferring selectively in the vicinity of the current shock zone. As I am sure the reviewer appreciates from their own introspection, mental representations are mercifully not obliged to dictate behavior. In fact, that is what is interesting and controversial about cognitive control – it is a dynamic internal process and the innovation of our work lies in demonstrating that one cannot only rely on behavior to assess this process. Nonetheless, we did this analysis and now present it in the revised Fig. 4. During pretraining both lesion and sham groups express no particular spatially-modulated preference for either the room or the arena frame, as expected. During initial training both groups express a room-frame preference in the vicinity of the shock zone, as we initially reported. By inspection, during conflict, the sham rats express a preference for room-frame activity in the vicinity of the most recent shock zone location; this preference is weaker than what is expressed during initial retention. The lesion rats do not show this preference. These impressions are quantified in revised Fig. 4D; the comparisons within the conflict retention sessions did not reach statistical significance. We leave it to the reader to interpret what that means. Thanks for the nudge.

      (3) The significant group difference in place cell overdispersion during the pretraining phase (Figure 3C) is interesting, but some readers would appreciate additional sentences on its functional implication. Does it mean the spatial tuning of place cells was disrupted by the mPFC lesion?

      Only the reliability of spatial firing was altered, not the spatial tuning.

      (4) Although the method section described how to calculate overdispersion and SFEP, some concise, intuitive descriptions of these measures in the result section would help readers understand these results.

      Overdispersion is better explained. See lines 646-650.

      (5) I recommend adding a figure of the task performance of the rats used in the electrophysiological recording experiment and a table summarizing the number of cells recorded per animal. 

      We have included Table S2 with the cell counts and a summary of the performance for each of the rat in the electrophysiological recording experiment.

      (6) Readers would appreciate additional information on task apparatus, such as the size, appearance, and rotating speed of the arena, as well as stationary cues available in the room. 

      This is now provided in the Methods under “Active place avoidance task”.

      (7) Lines 425-416: "On the fourth day of the behavioral training, the rats had a single trial with the shock on to test retention of the training." Shouldn't it be "shock off"? 

      No the shock was on to prevent extinction learning and to increase the challenge for conflict learning.

    1. Author response:

      The following is the authors’ response to the original reviews

      Reviewer #1:

      Major Concerns/Public Review

      Comment 1: There is a mild disconnect between behavioral readout (reflexive pain) and neural circuits of interest (emotional). Considering that this circuit is likely engaged in the aversiveness of pain, it would have been interesting to see how carrageenan and/or AIE impacted non-reflexive pain measures. Perhaps this would reveal a potentiated or dysregulated phenotype that matches the neurophysiological changes reported. However, this critique does not take away from the value of the paper or its conclusions.

      We agree that including measures of non-reflexive pain would enhance future studies and potentially reveal a phenotype that is closely related to the observed changes in neurophysiology.

      Minor Concerns/Recommendations

      Comment 1: There are a few minor grammatical errors in the text, mostly in the captions. A close read should be able to identify these errors.

      We have fixed what grammatical errors we found.

      Reviewer #2:

      Major Concerns/Public Review

      No major concerns.

      Minor Concerns/Recommendations

      Comment 1: If pain sensitivity was assessed at 3 time points post carrageenan administration, why were these data averaged? Were there no differences between the time points? The data from the 3 time points should be presented, either in a figure, table, or supplementary materials.

      We averaged the pain sensitivity data across the 3 time points following carrageenan administration because we were trying to present this data in a more concise manner. Pain sensitivity did change over time following carrageenan administration. We have now included the unaveraged data in figure 2 (panels D, F, H, and J).

      Comment 2: For the optically-evoked EPSCs and IPSCs, were the peak amplitudes the max responses that could be obtained? If not, how were levels of ChR2 expression or light intensity controlled for?

      The peak amplitudes for EPSCs and IPSCs were half the maximal response that could be evoked by optical stimulation. The AMPA and NMDA currents were maximal responses as prior literature indicated some PVINs have small NMDA currents, and we wanted to ensure these currents would be detected reliably. We updated our methods section to include this information in the voltage clamp recordings section.

      Comment 3: In the example traces for the aEPSC experiment, the figure legend states that the "+" symbol indicates an asynchronous event. However, there are several "|" or "-" symbols in the figure. Perhaps this is an issue with the resolution of the figure and those are supposed to be "+"s.

      We have increased the resolution of the figures to ensure that the markings of the asynchronous events display properly. We apologize for not noticing that these symbols were not displayed correctly in the original figures included in the manuscript.

      Comment 4: For the von Frey and the Hargreaves test, were animals acclimated to the apparatus in the days leading up to the first test, or was the 5-minute pre-test the only acclimation that was done? This information needs to be provided. If the latter, there is concern that the animals did not fully acclimate to the apparatus and handling prior to testing, which should be taken into consideration in the interpretation of the behavioral analyses.

      The rats underwent handling once a day for three days prior to the first von Frey and Hargreaves tests. On the day prior to the first test, rats were acclimated to the von Frey and Hargreaves apparatuses. The acclimation period consisted of a 15-min exposure to the von Frey apparatus and a 30-min exposure to the Hargreaves apparatus for each animal. This information has been added to the revised methods section under the assessment of mechanical and thermal sensitivity heading.

      Reviewer #3:

      Major Concerns/Public Review

      Comment 1: There is incomplete evidence supporting some of the conclusions drawn in this manuscript. The authors claim that the changes in feedforward inhibition onto pyramidal cells are due to the changes in parvalbumin interneurons, but evidence is not provided to support that idea. PV cells do not spontaneously fire action potentials spontaneously in slices (nor do they receive high levels of BLA activity while at rest in slices). It is possible that spontaneous GABA release from PV cells is increased after AIE but the authors did not report sIPSC frequency. Second, the authors did not determine that PV cells mediate the feedforward BLA op-IPSCs and changes following AIE (this would require manipulation to reduce/block PV-IN activity). This limitation in results and interpretation is important because prior work shows BLA-PFC feedforward IPSCs can be driven by somatostatin cells. Cholecystokinin cells are also abundant basket cells in PFC and have been recently shown to mediate feedforward inhibition from the thalamus and ventral hippocampus, so it's also possible that CCK cells are involved in the effects observed here.

      The hypothesis that adolescent alcohol exposure could change spontaneous GABA release from PVINs is an interesting one that merits future exploration. Unfortunately, as the focus of this manuscript was on circuit-specific alterations in synaptic function, this experiment is somewhat outside the scope of the paper as sIPSCs and mIPSCs are not circuit specific measures of GABA activity and would not reflect spontaneous release from only GABA interneurons receiving input from the BLA. Despite this, a future study investigating spontaneous GABA release from PVINs in the PrL would be a valuable complement to the present study.

      While we did not directly manipulate PVINs to demonstrate that decreased oIPSC amplitude at PrL<sup>PAG</sup> neurons following AIE is due solely to changes in PVINs, it is notable that both the intrinsic excitability of PVINs and the BLA-driven E/I balance at PVINs were reduced following AIE. These changes would be consistent with decreased PVIN output onto PrL<sup>PAG</sup> neurons. However, we agree that this does not preclude the possibility that changes in SST or CCK interneurons contribute to the observed decrease in BLA-driven inhibition at PrL<sup>PAG</sup> neurons following AIE. As such, we have altered the wording in the discussion to indicate that reduced BLA-driven feedforward inhibition of PrL<sup>PAG</sup> neurons may be related, at least in part, to the observed changes in PVINs.

      Comment 2: The authors conclude that the changes in this circuit likely mediate long-lasting hyperalgesia, but this is not addressed experimentally. In some ways, the focused nature of the study is a benefit in this regard, as there is extensive prior literature linking this circuit with pain behaviors in alternative models (e.g., SNI), but it should be noted that these studies have not assessed hyperalgesia stemming from prior alcohol exposure. While the current studies do not include a causative behavioral manipulation, the strength of the association between BLA-PL-PAG function and hyperalgesia could be bolstered by current data if there were relationships detected between electrophysiological properties and hyperalgesia. Have the authors assessed this? In addition, this study is limited by not addressing the specificity of synaptic adaptations to the BLA-PL-PAG circuit. For instance, PL neurons send reciprocal projections to BLA and send direct projections to the locus coeruleus (which the authors note is an important downstream node of the PAG for regulating pain).

      We have not assessed correlations between the electrophysiological properties and hyperalgesia. We feel that future studies using DREADDs to perform cell-type and circuit-specific manipulations can better address the involvement of this circuitry in long-lasting hyperalgesia following AIE. With respect to the circuit specificity of the observed changes, we have previously evaluated the effects of AIE on pyramidal neurons projecting from the PrL to the BLA (PrL<sup>BLA</sup>). We found that following AIE exposure there was no change in the intrinsic excitability of these neurons. In addition, the amplitude and frequency of sEPSCs and sIPSCs onto PrL<sup>BLA</sup> neurons was unchanged. While these results did not assess whether the BLA-PrL-BLA circuit undergoes synaptic adaptations similar to those observed in the BLA-PrL-vlPAG circuit, it is notable that the intrinsic excitability of PrL<sup>BLA</sup> neurons was unchanged following AIE exposure. This indicates that the effects of AIE on the intrinsic excitability of pyramidal neurons in the PrL may be circuit specific. We agree that it would be interesting to study the effect of AIE on PrL neurons that project to the locus coeruleus, however due to the well-defined role of the BLA-PrL-vlPAG circuit in pain we chose to evaluate this circuit first.

      Comment 3: I have some concerns about methodology. First, 5-ms is a long light pulse for optogenetics and might induce action-potential independent release. Does TTX alone block op-EPSCs under these conditions? Second, PV cells express a high degree of calcium-permeable AMPA receptors, which display inward rectification at positive holding potentials due to blockade from intracellular polyamines. Typically, this is controlled/promoted by including spermine in the internal solution, but I do not believe the authors did that. Nonetheless, the relatively low A/N ratios for this cell type suggest that CP-AMPA receptors were not sampled with the +40/+40 design of this experiment, raising concerns that the majority of AMPA receptors in these cells were not sampled during this experiment. Finally, it should be noted that asEPSC frequency can also reflect changes in a number of functional/detectable synapses. This measurement is also fairly susceptible to differences in inter-animal differences in ChR2 expression. There are other techniques for assessing presynaptic release probability (e.g., PPR, MK-801 sensitivity) that would improve the interpretation of these studies if that is intended to be a point of emphasis.

      When we included TTX but not 4-AP we did not observe any optically evoked responses, so we don’t believe that the 5-ms pulse induced action-potential independent release in these experiments. With respect to the second point, we did not include spermine in the internal solution for the AMPA/NMDA recordings in PVINs, and it is possible that endogenous polyamines interfered with recording CP-AMPA receptors in the +40/+40 design. To address this concern, we recalculated the AMPA/NMDA ratio for PVINs using data from an optically evoked AMPA current that was collected while holding the cell at -70 mV. This data was collected at the end of the +40/+40 recording protocol as we were interested in assessing whether there would be any difference in the ratio of the +40/-70 AMPA current across treatment conditions. As there were no observed difference in the +40/-70 AMPA current ratio across treatment groups, we had originally used the +40 AMPA current for calculating the AMPA/NMDA ratio for PVINs to make the methods for calculating this ratio uniform for both PVINs and PrL<sup>PAG</sup> neurons. The methods, results, and Fig. 10 have been updated to reflect the recalculated AMPA/NMDA ratio for PVINs. Notably, only the significance of the AIE x carrageenan interaction was altered by the change in the way the AMPA/NMDA ratio was calculated. Originally, this interaction displayed a trend toward significance (p = 0.0501), however when the recalculated AMPA/NMDA ratio was analyzed this interaction term became significant (p = 0.0131). We have also added the +40/-70 AMPA ratio to figure 10 as it might be of interest.

      Finally, the point regarding aEPSC frequency reflecting not only release probability but also the number of functional/detectable synapses is an important consideration. For this manuscript, we intentionally selected aEPSC frequency for this reason. As the BLA to PrL projection continues to mature during adolescence, the number of BLA contacts onto GABA neurons in the PrL increases. Thus, we thought that it was possible that AIE would alter the number of detectable BLA inputs onto PVINs. We acknowledge that as this measure is sensitive to differences in ChR2 expression between animals/slices it can be difficult to interpret. We also agree that in the future it would be beneficial to include either PPR or MK-801 sensitivity to improve interpretability.

      Comment 4: In a few places in the manuscript, results following voluntary drinking experiments (especially Salling et al. and Sicher et al.) are discussed without clear distinction from prior work in vapor models of dependence.

      We have altered the manuscript to specifically note where voluntary drinking was used rather than vapor models.

      Comment 5: Discussion (lines 416-420). The authors describe some differing results with the literature and mention that the maximum current injection might be a factor. To me, this does not seem like the most important factor and potentially undercuts the relevance of the findings. Are the cells undergoing a depolarization block? Did the authors observe any changes in the rheobase or AP threshold? On the other hand, a more likely difference between this and previous work is that the proportion of PAG-projecting cells is relatively low, so previous work in L5 likely sampled many types of pyramidal cells that project to other areas. This is a key example where additional studies by the current group assessing a distinct or parallel set of pyramidal cells would aid in the interpretation of these results and help to place them within the existing literature. Along these lines, PAG-projecting neurons are Type A cells with significant hyperpolarization sag. Previous studies showed that adolescent binge drinking stunts the development of HCN channel function and ensuing hyperpolarization sag. Have the authors observed this in PAG-projecting cells? Another interesting membrane property worth exploring with the existing data set is the afterhyperpolarization / SK channel function.

      In discussing the maximum current injection as a factor in differing results on intrinsic excitability, we were principally considering how the additional data points increase the power of the analysis and thus the likelihood of detecting an effect. In focusing on this, however, we ignored other relevant and interesting factors that we should also have discussed. Additional analyses examining HCN and SK channel function have now been added to the manuscript and incorporated into the results section under the heading Adolescent Intermittent Ethanol Exposure and Carrageenan Enhanced the Intrinsic Excitability of Prelimbic Neurons Projecting to the Ventrolateral Periaqueductal Gray. We have also modified the third paragraph in the discussion to add additional context. Additional information on the biophysical properties of the neurons has been added to Figure 4.

      Minor Concerns/Recommendations

      Comment 1: Subheadings are vague. "Analysis of..." Should be rephrased to use active voice to describe key findings.

      The subheadings have been rephrased to describe key findings.

      Comment 2: Consider altering or consolidating the figure layout for clarity. For instance, it would be helpful for aEPSCs to be near the AMPA and NMDA experiments. The feedforward IPSCs could also be with the PV-IN recordings. This would be helpful in developing a cohesive picture of key findings. To that end, a working model or graphical abstract would be helpful.

      It doesn’t appear that this journal allows graphical abstracts, but we have added a model that summarizes the principal findings in the discussion.

      Comment 3: There are a lot of statistics punctuating the text in the Results. It can be hard to parse at times.

      We considered moving the statistics to tables, but this became unwieldy.

      Comment 4: The Discussion is quite long (10 paragraphs). Suggest consolidating to 3-4 most salient points.

      We appreciate this comment and have made some edits to the discussion, albeit without consolidating it to only 3-4 points.

    1. Reviewer #1 (Public review):

      This computational study builds on a previous study (Liu et al) from the Marder lab from 1998, where a model was proposed that demonstrated activity-dependent homeostatic recovery of activity in individual bursting neurons, based on three "sensors" of intrinsic calcium concentration. The original model modified levels of ion channel conductances. The current model builds on that and adds activity-dependent modifications of the voltage-dependence of these ionic currents, implemented to happen concurrently with maximum conductance levels, but at a different timescale. The faster timescale change in voltage dependence is justified by the assumption that such changes can occur by neuromodulatory chemicals or similar second messenger-based mechanisms that presumably act at a faster rate than the regulation of channel densities. The main finding is that the difference in timescales between the two homeostatic mechanisms (channel density vs. voltage dependence) could result in distinct subsets of parameters, depending on how fast the second messenger mechanisms operate.

      This study is an interesting and noteworthy extension of the theoretical ideas proposed by the classic study of Liu et al, 1998. It addresses a very important question: How do two known mechanisms of modifications of neuronal activity that occur at different timescales interact within an activity-dependent homeostatic framework? However, the study and its presentation have some major shortcomings that should be addressed to strengthen the claim.

      Major comments:

      (1) The main issue that I have with this study is the lack of exploration of "why" the model produces the results it does. Considering this is a model, it should be possible to find out why the three timescales of half-act/inact parameter modifications lead to different sets of results. Without this, it is simply an exploratory exercise. (The model does this, but we do not know the mechanism.) Perhaps this is enough as an interesting finding, but it remains unconvincing and (clearly) does not have the impact of describing a potential mechanism that could be potentially explored experimentally.

      (2) A related issue is the use of bootstrapping to do statistics for a family of models, especially when the question is in fact the width of the distribution of output attributes. I don't buy this. One can run enough models to find say N number of models within a tight range (say 2% cycle period) and the same N number within a loose range (say 20%) and compare the statistics within the two groups with the same N.

      (3) The third issue is that many of the results that are presented (but not the main one) are completely expected. If one starts with gmax values that would never work (say all of them 0), then it doesn't matter how much one moves the act/inact curves one probably won't get the desired activity. Alternately, if one starts with gmax values that are known to work and randomizes the act/inact midpoints, then the expectation would be that it converges to something that works. This is Figure 1 B and C, no surprise. But it should work the other way around too. If one starts with random act/inact curves that would never work and fixes those, then why would one expect any set of gmax values would produce the desired response? I can easily imagine setting the half-act/inact values to values that never produce any activity with any gmax.

      (4) A potential response to my previous criticism would be that you put reasonable constraints on gmax's or half-act/inact values or tie the half-act to half-inact. But that is simply arbitrary ad hoc decisions made to make the model work, much like the L8-norm used to amplify some errors. There is absolutely no reason to believe this is tied to the biology of the system.

      (5) The discussion of this manuscript is at once too long and not adequate. It goes into excruciating detail about things that are simply not explored in this study, such as phosphorylation mechanisms, justification of model assumptions of how these alterations occur, or even the biological relevance. (The whole model is an oversimplification - lack of anatomical structure, three calcium sensors, arbitrary assumptions, and how parameter bounds are implemented.) Lengthy justifications for why channel density & half-act/inact of all currents are obeying the same time constant are answering a question that no one asked. It is a simplified model to make an important point. The authors should make these parts concise and to the point. More importantly, the authors should discuss the mechanism through which these differences may arise. Even if it is not clear, they should speculate.

      (6) There should be some justification or discussion of the arbitrary assumptions made in the model/methods. I understand some of this is to resolve issues that had come up in previous iterations of this approach and in fact the Alonso et al, 2023 paper was mainly to deal with these issues. However, some level of explanation is needed, especially when assumptions are made simply because of the intuition of the modeler rather than the existence of a biological constraint or any other objective measure.

    2. Reviewer #3 (Public review):

      Mondal et al. use computational modeling to investigate how activity-dependent shifts in voltage-dependent (in)activation curves can complement activity-dependent changes in ion channel conductance to support homeostatic plasticity. While changes in the voltage-dependent properties of ion channels are known to modulate neuronal excitability, their role as a homeostatic plasticity mechanism interacting with channel conductance has been largely unexplored. The results presented here demonstrate that activity-dependent regulation of voltage-dependent properties can interact with plasticity in channel conductance to allow neurons to attain and maintain target activity patterns, in this case, intrinsic bursting. These results also show that the rate of channel voltage-dependent shifts can influence steady-state parameters reached as the model stabilizes into a stable intrinsic bursting state. That is, the rate of these modifications shapes the range of channel conductances and half-(in)activation parameters as well as activity characteristics such as burst period and duration. A major conclusion of the study is that altering the timescale of channel voltage dependence can seamlessly shift a neuron's activity characteristics, a mechanism that the authors argue may be employed by neurons to adapt to perturbations. While the study's conclusions are mostly well-supported, additional analyses, and simulations are needed.

      (1) A main conclusion of this study is that the speed at which (in)activation dynamics change determines the range of possible electrical patterns. The authors propose that neurons may dynamically regulate the timescale of these changes (a) to achieve alterations in electrical activity patterns, for example, to preserve the relative phase of neuronal firing in a rhythmic network, and (b) to adapt to perturbations. The results presented in Figure 4 clearly demonstrate that the timescale of (in)activation modifications impacts the range of activity patterns generated by the model as it transitions from an initial state of no activity to a final steady-state intrinsic burster. This may have important implications for neuronal development, as discussed by the authors.

      However, the authors also argue that the model neuron's dynamics - such as period, and burst duration, etc - could be dynamically modified by altering the timescale of (in)activation changes (Figure 6 and related text). The simulations presented here, however, do not test whether modifications in this timescale can shift the model's activity features once it reaches steady state. In fact, it is unlikely that this would be the case since, at steady-state, calcium targets are already satisfied. It is likely, however, as the authors suggest, that the rate at which (in)activation dynamics change may be important for neuronal adaptation to perturbations, such as changes in temperature or extracellular potassium. Yet, the results presented here do not examine how modifying this timescale influences the model's response to perturbations. Adding simulations to characterize how alterations in the rate of (in)activation dynamics affect the model's response to perturbations-such as transiently elevated extracellular potassium (Figure 5) - would strengthen this conclusion.

      (2) Another key argument in this study is that small, coordinated changes in channel (in)activation contribute to shaping neuronal activity patterns, but that, these subtle effects may be obscured when averaging across a population of neurons. This may be the case; however, the results presented don't clearly demonstrate this point. This point would be strengthened by identifying correlations, if they exist, between (in)activation curves, conductance, and the resulting bursting patterns of the models for the simulations presented in Figure 2 and Figure 4, for example. Alternatively, or additionally, relationships between (in)activation curves could be probed by perturbing individual (in)activation curves and quantifying how the other model parameters compensate, which could clearly illustrate this point.

    3. Author response:

      We thank the reviewers for their detailed and constructive comments on our manuscript entitled “Activity-Dependent Changes in Ion Channel Voltage-Dependence Influence the Activity Patterns Targeted by Neurons.” We appreciate the time and effort the reviewers invested in critiquing our work and are grateful for the opportunity to clarify and improve our manuscript.

      As noted by the reviewers, the main message of the manuscript is that the intrinsic properties and activity characteristics of targeted bursters depend on the timescale of half-(in)activation alterations in the homeostatic mechanism. However, the concerns of the reviewers reveal that the manuscript is organized in ways that detract from this message. Below we respond to the points the reviewers raise and close by outlining the changes that we will make to the manuscript as a result. Our goal will be to streamline the message of the paper while addressing the concerns of the reviewers.

      Response to Reviewer #1:

      Point 1: We interpret the reviewer’s question about “mechanism” to be: why do half-(in)activation alterations redirect degenerate bursters to different parameter regions? (A separate aspect of “mechanism,” namely how these alterations might be biologically implemented, is already addressed in the paper.)

      We speculate that Figure 3 illustrates this process. As conductance densities slowly evolve, rapid half-(in)activation changes cause the sensor variable (α) to jump abruptly as it searches for a voltage-dependence configuration that meets calcium targets (Figure 3A). The channel densities are slightly altered and this process continues again. Slowing the half-(in)activations alterations reduces these abrupt fluctuations (Figure 3B). Making the alterations infinitely slow effectively removes half-(in)activation changes altogether, leaving the system reliant solely on slower alterations in maximal conductances (Figure 3C). Because each timescale of half-(in)activation produces a different channel repertoire at each time step, the neuron follows distinct trajectories through the space of activity characteristics and intrinsic properties over the long term.

      Point 2: We appreciate the reviewer’s skepticism regarding our statistical approach with the “Group of 5” and “Group of 20.” These groups arose from historical aspects of our analysis and this analysis does not directly advance the main point—that changes in the timescale of channel voltage-dependence alterations impact the properties of bursters to which the homeostatic mechanism converges. Therefore, we plan to remove the references to the Group of 5 and focus on how the Group of 20 responds to variations in the timescale of voltage-dependent alterations.

      Point 3: Our paper claims that the half-(in)activation mechanism is subordinate to the maximal conductance mechanism. We agree with the reviewer that making this claim requires more care. The simulations we run are controls in the spirit described below.

      The reviewer notes that in our simulations, half-(in)activations are already near the range required for bursting, which forces maximal conductances to undergo larger changes and thus appear more critical. We however note that the opposite can also occur: if half-(in)activation values were already positioned in ranges required for bursting, an arrangement of small maximal conductances may potentially produce bursting. The latter might give the impression that maximal conductance alterations and half-(in)activation alterations are equally important. The simulations we ran are simply suggested this wasn’t true for these models.

      Points 4 - 6: In Point 4, the reviewer highlights model choices (e.g., constraints on maximal conductance and half-(in)activation, use of the L8 norm) are not clearly justified. In Point 5, the reviewer suggests that the paper provides excessive detail about other model choices. Point 6 appears to reiterate concerns about insufficient justification for some modeling decisions.

      Our intent was to acknowledge every caveat, which led us to include long section on Model Assumptions in the Discussion. However, as Point 5 notes, this makes the Discussion cumbersome. The Discussion should focus on remarks regarding the impact that timescale of half-(in)activation alterations have on the family of bursters targeted by the homeostatic mechanism. Consequently, we will relocate the extended discussion of model assumptions from the Discussion to the Methods section. This section already touches on how the constraints on half-(in)activation alterations compare to earlier versions of the model (noted in Point 6) and will be expanded to further explain our choice of the L8 norm (Point 4).

      Response to Reviewer #2:

      Weakness 1: The reviewer notes that the writing is “rather confusing.” This likely arises from the fact that we did not consistently emphasize the core message: the timescale of half-(in)activation alterations influences the intrinsic properties and activity characteristics of bursters targeted by the homeostatic mechanism. We will address this by reorganizing the manuscript to make that focus clearer, and we outline these planned revisions at the end of these responses.

      The reviewer specifically points out that the state-of-the-art is not clearly articulated. We will reorganize the Introduction to highlight this. Briefly, work on activity-dependent homeostasis has historically focused on changes in channel density. This is supported by experiment and has been modelled theoretically. In comparison, changes in channel voltage-dependence, while documented, are less explored due to the challenges of measuring them. In this work, we attempt to study the impact that alterations in channel voltage-dependence have on activity-dependent homeostasis. To do this, we extend existing computational models of activity-dependent homeostasis—models that have hitherto only altered channel density—by incorporating a mechanism that also adjusts channel voltage-dependence.

      Weakness 2: The Discussion highlights two potential implications of our findings—one for neuronal development and another for activity recovery following perturbations. However, they were outlined after the Model Assumptions section which, as Reviewer 1 points out, is quite detailed and cumbersome.

      Another aspect that may contribute to the challenge in interpreting our results may be our conceptual approach to neuronal excitability, which relies on a computational model of activity-dependent homeostasis that abstracts much of the underlying biochemistry. Our message is general: the timescale of half-(in)activation alterations influences the intrinsic properties and activity characteristics of bursters targeted by a homeostatic mechanism. As such, the implications are general. Their value lies in circumscribing a conceptual framework from which experimentalists may devise and test new hypotheses. We do not aim to predict or explain any specific phenomenon in this work. To address this concern however, we will expand our discussion of how these findings may guide experimental considerations, particularly regarding neuronal development and activity recovery during perturbations, to better illustrate the practical utility of our results.

      Response to Reviewer #3:

      Point 1: This reviewer suggests that our core message—namely, that the timescale of half-(in)activation alterations affects the intrinsic properties and activity patterns targeted by a homeostatic mechanism—should also apply during perturbations. We plan to address this by extending our analysis on the Group of 20 models. We will perturb activity by increasing extracellular potassium concentration and change the timescale of half-(in)activation alterations during the perturbation. This should underscore how the neuron’s stabilized activity pattern depends on this timescale, reinforcing our central message.

      Point 2: In this part of the Discussion, we noted that multiple half-activation shifts collectively shape the neuron’s global properties, and that averaging might obscure these effects. However, in light of the reviewers’ comments, we recognize that this observation alone does not directly advance the paper’s main message. To make it relevant, we would need to (1) identify correlations between intrinsic parameters (i.e., half-(in)activation and maximal conductance) and the resulting activity patterns, and (2) examine how these correlations shift under different timescales half-(in)activation alterations. Since we have not performed that analysis, we will revise this part of the Discussion to clarify its connection to the paper’s principal focus by noting that a deeper exploration of this notion using correlations will be the topic of future work.

      Conclusion: We outline updates we will make to the paper here.

      Introduction: In response to Reviewer 2, we will provide a clearer explanation of the state-of-the-art in activity-dependent homeostasis and highlight our specific contribution. We will emphasize that our conclusions, while generic, are relevant in experimental contexts.

      Results: We will reorganize this section to underscore the main point: the timescale of half-(in)activation alterations affects the intrinsic properties and activity characteristics of bursters in the homeostatic mechanism. Figures 1 will remain as it is. It shows assembly from random initial conditions and explain that for these simulations we must always consider the half-(in)activation mechanism with a mechanism that alters maximal conductances as the half-(in)activation alterations alone cannot form bursters. Figure 2 will remain as is, but we will remove any discussion of the “Group of 5,” addressing Reviewer 1’s feedback. What is presently Figure 4 will then follow, illustrating how timescale differences shape the properties of 20 degenerate solutions. We then present Figure 3 to address Reviewer 1’s critique on mechanism. Here we will explain how different timescales of half-(in)activation alteration cause the homeostatic mechanism to update channel properties differently, leading to distinct trajectories through the space of intrinsic properties and activity characteristics (as described in the response of Point 1 of Reviewer 1’s feedback). Finally, following Point 1 of Reviewer 3, we will add a new figure highlighting the role of half-(in)activation timescale during perturbation.

      Discussion: To streamline the Discussion, the “Model Assumptions” section will be moved to Methods. In line with Point 2 of Reviewer 3, we will clarify how the concept of "small half-(in)activation shifts lead to global changes in neuronal properties" aligns with our core message. Additionally, following Reviewer 2’s comments, we will expand our discussion of implications by including how experimentalists might use our findings to inform studies on perturbations and development.

      Methods: We will expand “Model Assumptions” to explain in more detail why we chose the L8 norm.

    1. Reviewer #3 (Public review):

      A recent bioRxiv paper from Craig Hunter's lab (Gainey et al. 2024) puts into question several manuscripts that report that pathogen avoidance by the nematode C. elegans to the pathogenic bacteria, Pseudomonas aeruginosa, for several generations after initial exposure is not robust nor repeatable. From the Hunter lab publication, the authors tried to eliminate genetic drift of the pathogenic bacterial strains and C. elegans, as well as several experimental conditions, including assay temperature conditions and the effect of light.

      The papers (Moore et al. 2019, Kaletsky et al. 2020, Moore et al. 2021 and Sengupta et al. 2024) that the Gainey et al. manuscript brings into question discovered that Pseudomonas aeruginosa can produce a small RNA (sRNA), P11, that is necessary and sufficient for pathogen avoidance of the future generation of C. elegans (up to F4 generation). The Gainey et al. manuscript does not assess the status of P11 production in their work.

      Here, the Murphy group has made several new discoveries that highlight the differences with the work performed in the Hunter lab. One, the assay used to test attraction and avoidance of C. elegans for pathogenic bacteria differs amongst the two groups. In the Murphy lab papers, and many others in this field, the assay is established whereby worms can decide between spots of non-pathogenic bacteria (E. coli) or pathogenic (P. aeruginosa) on a single plate separated by a few centimeters. Also included in each spot is an aliquot of NaN3 to freeze the animals upon entry into their first bacterial choice. C. elegans will initially choose the pathogenic bacteria as its first choice and then learn to avoid the pathogenic spot thereafter. Therefore, establishing this first baseline attraction point is essential for determining future avoidance events. The Hunter lab did not use NaN3 and instead relied upon moving plates to 4°C to slow the worm's movements to count the population. Furthermore, the Hunter lab allowed the "choice" to proceed for an hour before moving to 4°C, making capture of the initial attraction phase of the choice assay difficult to discern since the worms could move freely from their initial choice due to the lack of the paralyzing NaN3.

      The second major advance that the Murphy group has found is that the growth of P. aeruginosa prior to being used for the choice assay is critical. Growth on plates at 25°C, but not 20°C on plates or in liquid at 37°C, can produce the transgenerational inheritance of pathogen avoidance. Interestingly, P11 is only produced by P. aeruginosa at 25°C grown on plates. The Hunter group grew the Pseudomonas bacteria at 37°C in liquid with gentle shaking and then spotted onto assay plates followed by growth for 2 days at 25°C and then equilibrated to room temperature before the choice assay. The Hunter lab did not check the status of P11 production in any of their experiments.

      The results from the Murphy group are solid and they go on to find genetic requirements in C. elegans required for the transgenerational response to P. aeruginosa and P11. Furthermore, they repeat their results with additional members of the Pseudomonas clade and find the same transgenerational avoidance response and new sRNAs responsible for the avoidance response to the newly tested Pseudomonas members.

      Overall, the discrepancies between the Hunter work and the numerous papers for the Murphy group would tend to complicate this area of research. However, this eLife paper plainly illustrates the straightforward nature of the experimental setup and reconfirms the necessary and sufficient nature of P11 in orchestrating the multigenerational response to pathogenic Pseudomonas. It appears that ensuring the production of P11 from the Pseudomonas culture and ensuring that the assay captures the initial bacterial choice are essential to observe the transgenerational inheritance of the avoidance phenotype.

    1. Reviewer #1 (Public review):

      Summary:

      This study presents a system for delivering precisely controlled cutaneous stimuli to freely moving mice by coupling markerless real-time tracking to transdermal optogenetic stimulation, using the tracking signal to direct a laser via galvanometer mirrors. The principal claims are that the system achieves sub-mm targeting accuracy with a latency of <100 ms. The nature of mouse gait enables accurate targeting of forepaws even when mice are moving.

      Strengths:

      The study is of high quality and the evidence for the claims is convincing. There is increasing focus in neurobiology in studying neural function in freely moving animals, engaged in natural behaviour. However, a substantial challenge is how to deliver controlled stimuli to sense organs under such conditions. The system presented here constitutes notable progress towards such experiments in the somatosensory system and is, in my view, a highly significant development that will be of interest to a broad readership.

      Weaknesses:

      (1) "laser spot size was set to 2.00 } 0.08 mm2 diameter (coefficient of variation = 3.85)" is unclear. Is the 0.08 SD or SEM? (not stated). Also, is this systematic variation across the arena (or something else)? Readers will want to know how much the spot size varies across the arena - ie SD. CV=4 implies that SD~7 mm. ie non-trivial variation in spot size, implying substantial differences in power delivery (and hence stimulus intensity) when the mouse is in different locations. If I misunderstood, perhaps this helps the authors to clarify. Similarly, it would be informative to have mean & SD (or mean & CV) for power and power density. In future refinements of the system, would it be possible/useful to vary laser power according to arena location?

      (2) "The video resolution (1920 x 1200) required a processing time higher than the frame interval (33.33 ms), resulting in real-time pose estimation on a sub-sample of all frames recorded". Given this, how was it possible to achieve 84 ms latency? An important issue for closed-loop research will relate to such delays. Therefore please explain in more depth and (in Discussion) comment on how the latency of the current system might be improved/generalised. For example, although the current system works well for paws it would seem to be less suited to body parts such as the snout that do not naturally have a stationary period during the gait cycle.

    2. Reviewer #2 (Public review):

      Parkes et al. combined real-time keypoint tracking with transdermal activation of sensory neurons to examine the effects of recruitment of sensory neurons in freely moving mice. This builds on the authors' previous investigations involving transdermal stimulation of sensory neurons in stationary mice. They illustrate multiple scenarios in which their engineering improvements enable more sophisticated behavioral assessments, including (1) stimulation of animals in multiple states in large arenas, (2) multi-animal nociceptive behavior screening through thermal and optogenetic activation, and (3) stimulation of animals running through maze corridors. Overall, the experiments and the methodology, in particular, are written clearly. However, there are multiple concerns and opportunities to fully describe their newfound capabilities that, if addressed, would make it more likely for the community to adopt this methodology:

      The characterization of laser spot size and power density is reported as a coefficient of variation, in which a value of ~3 is interpreted as uniform. My interpretation would differ - data spread so that the standard deviation is three times larger than the mean indicates there is substantial variability in the data. The 2D polynomial fit is shown in Figure 2 - Figure Supplement 1A and, if the fit is good, this does support the uniformity claim (range of spot size is 1.97 to 2.08 mm2 and range of power densities is 66.60 to 73.80 mW). The inclusion of the raw data for these measurements and an estimate of the goodness of fit to the polynomials would better help the reader evaluate whether these parameters are uniform across space and how stable the power density is across repeated stimulations of the same location. Even more helpful would be an estimate of whether the variation in the power density is expected to meaningfully affect the responses of ChR2-expressing sensory neurons.

      While the error between the keypoint and laser spot error was reported as ~0.7 to 0.8 mm MAE in Figure 2L, in the methods, the authors report that there is an additional error between predicted keypoints and ground-truth labeling of 1.36 mm MAE during real-time tracking. This suggests that the overall error is not submillimeter, as claimed by the authors, but rather on the order of 1.5 - 2.5 mm, which is considerable given the width of a hind paw is ~5-6 mm and fore paws are even smaller. In my opinion, the claim for submillimeter precision should be softened and the authors should consider that the area of the paw stimulated may differ from trial to trial if, for example, the error is substantial enough that the spot overlaps with the edge of the paw.

      As the major advance of this paper is the ability to stimulate animals during ongoing movement, it seems that the Figure 3 experiment misses an opportunity to evaluate state-dependent whole-body reactions to nociceptor activation. How does the behavioral response relate to the animal's activity just prior to stimulation?

      Given the characterization of full-body responses to activation of TrpV1 sensory neurons in Figure 4 and in the authors' previous work, stimulation of TrpV1 sensory neurons has surprisingly subtle effects as the mice run through the alternating T maze. The authors indicate that the mice are moving quickly and thus that precise targeting is required, but no evidence is shared about the precision of targeting in this context beyond images of four trials. From the characterization in Figure 2, at max speed (reported at 241 +/- 53 mm/s, which is faster than the high speeds in Figure 2), successful targeting occurs less than 50% of the time. Is the initial characterization consistent with the accuracy in this context? To what extent does inaccuracy in targeting contribute to the subtlety of affecting trajectory coherence and speed? Is there a relationship between animal speed and disruption of the trajectory?

    1. Absenteeism occurswhen an employee begins to chronically miss work,and can refer to a pattern of missing work [2, 3]. TheWorld Health Organization (WHO) defines chronicconditions as noncommunicable diseases (NCDs)that last for a long duration with a generally slowprogression, occurring as result of a combinationof genetic, physiological, environmental, and behav-ioral factors [4].Globally, the prevalence of chronic conditions isincreasing, becoming more common within the work-force as the number of people living with a chroniccondition increases. Moreover, appropriate recogni-tion and interventions by organizations to care fortheir employees with chronic conditions in their workenvironment are lacking, which may affect the qualityISSN 1051-9815/$35.00 © 2023 – IOS Press. All rights reserved.

      Discussing employee absenteeism makes it likely that the intended audience would be for employers, adiministrative departments, employees and general readers who would like to know more information. I didn't mean to highlight all of this just the first sentence. Now that I'm continuing my reading it also mentions the work force which also shows the audience could be for employers or employees.

    Annotators

    1. Vadim FetisovVadim Fetisov SciProfiles Scilit Preprints.org Google Scholar 1,*, Adam M. GonopolskyAdam M. Gonopolsky SciProfiles Scilit Preprints.org Google Scholar 2, Maria Yu. ZemenkovaMaria Yu. Zemenkova SciProfiles Scilit Preprints.org Google Scholar 3, Schipachev AndreySchipachev Andrey SciProfiles Scilit Preprints.org Google Scholar 1, Hadi DavardoostHadi Davardoost SciProfiles Scilit Preprints.org Google Scholar 4, Amir H. MohammadiAmir H. Mohammadi SciProfiles Scilit Preprints.org Google Scholar 5 and Masoud RiaziMasoud Riazi SciProfiles Scilit Preprints.org Google Scholar 6

      Through connection to college, led by Vadim Fetisov, a member of the Petroleum Engineering Department at Saint Petersburg Mining University establishes major subject creates credibility. Their detailed explanation of refinery operations demonstrates their knowledge in petroleum engineering. The primary author's position at a top mining university and the team's talks about peer reviewed literature to ensure their skill in carbon capture research, even though the study itself makes no mention of awards.

    1. .mw-parser-output cite.citation{font-style:inherit;word-wrap:break-word}.mw-parser-output .citation q{quotes:"\"""\"""'""'"}.mw-parser-output .citation:target{background-color:rgba(0,127,255,0.133)}.mw-parser-output .id-lock-free.id-lock-free a{background:url("//upload.wikimedia.org/wikipedia/commons/6/65/Lock-green.svg")right 0.1em center/9px no-repeat}.mw-parser-output .id-lock-limited.id-lock-limited a,.mw-parser-output .id-lock-registration.id-lock-registration a{background:url("//upload.wikimedia.org/wikipedia/commons/d/d6/Lock-gray-alt-2.svg")right 0.1em center/9px no-repeat}.mw-parser-output .id-lock-subscription.id-lock-subscription a{background:url("//upload.wikimedia.org/wikipedia/commons/a/aa/Lock-red-alt-2.svg")right 0.1em center/9px no-repeat}.mw-parser-output .cs1-ws-icon a{background:url("//upload.wikimedia.org/wikipedia/commons/4/4c/Wikisource-logo.svg")right 0.1em center/12px no-repeat}body:not(.skin-timeless):not(.skin-minerva) .mw-parser-output .id-lock-free a,body:not(.skin-timeless):not(.skin-minerva) .mw-parser-output .id-lock-limited a,body:not(.skin-timeless):not(.skin-minerva) .mw-parser-output .id-lock-registration a,body:not(.skin-timeless):not(.skin-minerva) .mw-parser-output .id-lock-subscription a,body:not(.skin-timeless):not(.skin-minerva) .mw-parser-output .cs1-ws-icon a{background-size:contain;padding:0 1em 0 0}.mw-parser-output .cs1-code{color:inherit;background:inherit;border:none;padding:inherit}.mw-parser-output .cs1-hidden-error{display:none;color:var(--color-error,#d33)}.mw-parser-output .cs1-visible-error{color:var(--color-error,#d33)}.mw-parser-output .cs1-maint{display:none;color:#085;margin-left:0.3em}.mw-parser-output .cs1-kern-left{padding-left:0.2em}.mw-parser-output .cs1-kern-right{padding-right:0.2em}.mw-parser-output .citation .mw-selflink{font-weight:inherit}@media screen{.mw-parser-output .cs1-format{font-size:95%}html.skin-theme-clientpref-night .mw-parser-output .cs1-maint{color:#18911f}}@media screen and (prefers-color-scheme:dark){html.skin-theme-clientpref-os .mw-parser-output .cs1-maint{color:#18911f}}"Excerpts from The IEEE LEADERSHIP WIRE" (PDF), IEEE Delhi News, 04 (1): 6, January 2004 ^ Roden, M S (Aug 2006). "IEEE Xplore 2.1". Choice. 43 (WEB X): 154. IEEE Xplore is essential for technical libraries serving electrical engineering students, faculty, and professionals. A comprehensive database of more than a million full-text documents with more than 25,000 pages added each month, it contains all IEEE journals, transactions, letters, magazines, conference proceedings, and standards beginning with 1988 and selected documents as early as 1952....With more and more libraries substituting electronic information for paper copies, this online resource has become a must for electrical engineering reference. ^ Wilde, Michelle (1 April 2016). "IEEE Xplore Digital Library". The Charleston Advisor. 17 (4): 24–30. doi:10.5260/chara.17.4.24. S2CID 64102919. The content of IEEE Xplore is technical in nature and will appeal to researchers working in technical fields, particularly electrical engineering, electronics, and computer science. ^ "IEEE Xplore - Overview". IEEE. Retrieved 25 July 2020. ^ Scardiui, Brandi (April 2015). "IEEE Xplore". Information Today. 32 (3): 23. IEEE adds about 25,000 new documents to Xplore each month. ^ Griffin, Luke (2002-01-01). "IEEE Xplore". Reference Reviews. 16 (4): 27–28. doi:10.1108/rr.2002.16.4.27.198. ISSN 0950-4125. ^ Griffin, Luke (August 2002). "IEEE Xplore. Version 1.3". Online Information Review. 26 (4): 285. doi:10.1108/oir.2002.26.4.285.12. ^ "IEEE Xplore Digital Library Subscriptions". IEEE. Retrieved 2020-09-04.

      The sources are cited using a bulleted format and not in a somewhat MLA style.

    1. Detailed Summary

      1. You own your data, in spite of the cloud. <br /> Section summary: <br /> Local-fist software tries to solve the problem of ownership, agency and data lock-in present in cloud-based software, without compromising cross-collaboration and improving user control.

      Section breakdown<br /> §1: SaaS<br /> Pros: Easy sync across devices, real-time collab Cons: loss of ownership and agency; loss of data is software is lost.

      §2: Local-fist software<br /> - Enables collaboration & ownership - Offline cross-collaboration - Improved security, privacy, long-term preservation & user control of data

      §3 & §4: Article Methodology<br /> - Survey of existing storage & sharing approaches and their trade-offs - Conflict-free Replicated Data Types (CRDTs), natively multi-user - Analysis of challenges of the data model as implemented at Ink & Switch - Analysis of CRDT viability, UI - Suggestion of next steps

      2. Motivation: collaboration and ownership<br /> Section summary: <br /> The argument for cross-device, real-time collab PLUS personal ownership

      Section breakdown<br /> §1: Examples of online collabs<br /> §2: SaaS increasingly critical, data increasingly valuable<br /> §3: There are cons<br /> §4: Deep emotional attachment to your data brings feeling of ownership, especially for creative expression<br /> §5: SaaS require access to 3rd party server, limitation on what can be done. Cloud provider owns the data.<br /> §6: SaaS: no service, no data. If service is shut down, you might manage to export data, but you may not be able to run your copy of the software.<br /> §7: Old-fashioned apps were local-disk based (IDEs, git, CAD). You can archive, backup, access or do whatever with the data without 3rd party approval.<br /> §8: Can we have collaboration AND ownership?<br /> §9: Desire: cross-device, real-time collab PLUS personal ownership

      3. Seven ideals for local-first software<br /> Section breakdown<br /> §1: Belief: data ownership & real-time collab are compatible<br /> §2: Local-first software local storage & local networks are primary, server secondary<br /> §3: SaaS: In the server, or it didn't happen. Local-first: local is authoritative, servers are for cross-device.

      3.1.1 No spinners<br /> SaaS feels slower because if requires round-trip to a server for data modification and some lookups. Lo-Fi doesn't have dependency on server, data sync happens on the background. This is no guarantee of fast software, but there's a potential for near-instant response.<br /> 3.1.2 Data not trapped on one device <br /> Data sync will be discussed in another section. Server works as off-site backup. The issue of conflict will also be discussed later.<br /> 3.1.3 The network is optional<br /> It's difficult to retrofit offline support to SaaS. Lo-Fi allows CRUD offline and data sync might not require the Internet: Bluetooth/local Wi-fi could be enough.<br /> 3.1.4 Seamless collabs<br /> Conflicts can be tricky for complex file formats. Google Docs became de facto standard. This is the biggest challenge for Lo-Fi, but is believed to be possible. It's also expected that Lo-Fi supports multiple collab.

      TBC

  3. Mar 2025
    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      __Reviewer #1 __

      (Evidence, reproducibility and clarity (Required)):

      The manuscript identified a novel role of Intraflagellar Transport Protein 20 (IFT20) in the function and homeostasis of lymphatic endothelial junctions. The authors showed that IFT20 regulates VE-cadherin localization at adherens junctions in lymphatic endothelial cells. The authors performed impressive in vivo work that shows the requirement for IFT20 for the homeostasis of intercellular junctions, lymphangiogenesis, and drainage function of lymphatic vessels. In contrast, the cell biology part of the paper was underwhelming and will need significant revisions to support the proposed model. In the result section, several conclusions have to be toned down to match the actual results. The study employs in vivo mouse models, immunofluorescence, biochemical assays, and loss-of-function experiments to support their conclusions.

      Major comments - The authors present disrupted localization of VE-cadherin. Is this a mislocalization and/or protein stability issue in IFT20 KD cells? A western blot can help assess protein levels, and a phase-chase endocytosis assay of VE-cadherin can strengthen evidence. The authors did not confirm the permeability phenotype seen in vivo.

      We thank the reviewer for this helpful suggestion.

      Planned 1: Western blot to assess total VE-cadherin protein levels in IFT20 WT and KD cells.

      Planned 2: Immunofluorescence staining for cell-surface VE-cadherin using permeabilized and non-permeabilized IFT20 WT and KD cells during VEGF-C stimulation and washout.

      Together, these two experiments will assess VE-cadherin stability and more directly test the hypothesis that VE-cadherin does not recycle effectively back to the cell surface in the absence of IFT20.

      • While the authors focused on IFT20 and rab5, we do not have a clear idea about the vesicular dynamics as well as the status of early, late, and recycling endosomes in IFT20 KD cells. Is IFT 20 localized to non-rab5+ endosomes, and if yes, what are the species? A more general endosomal profiling would help strengthen the authors' message. For example, in Fig. 4-5, the authors will have to stain for other early endosomal markers as well as late, and recycling endosomal markers in control and IFT20 KD cells.

      Thank you for this helpful suggestion.

      Planned 3: Immunofluorescence staining for EEA1 (early endosome), RAB7 (late endosome), RAB4 (fast recycling), RAB11 (recycling endosome) along with IFT20 to determine its localization pattern.

      This experiment will determine the localization of IFT20 relative to various endosomal compartments.

      • In fig. 6C, a majority of VE-cadherin is not associated with Rab5. Staining with additional endosomal markers might help identify other endosomal species colocalizing with VE-cadherin. It will be critical to add to Fig. 6c the intensity profiles depicting colocalizations. The authors can also live image a fluorescently (f)-tagged VE-cadherin (maybe with another f-tagged rab5) and assess their association dynamics in IFT20 KD cells (similar to fig6C).

      Thank you for this helpful suggestion.

      Planned 4: Immunofluorescence staining for EEA1 (early endosome), RAB7 (late endosome), RAB4 (fast recycling), RAB11 (recycling endosome) along with VE-cadherin in IFT20 WT and KD cells to determine its localization pattern.

      Planned 5: Additional colocalization analysis such as adding intensity profiles and possibly proximity ligation assay.

      Beyond the scope of this manuscript 1: While we agree that imaging the dynamics of FP-tagged VE-cadherin in live cells would provide more detail about its localization, we feel that this is beyond the scope of the current manuscript.

      These experiments will determine the localization of VE-cadherin across various endosomal compartments and strengthen the current colocalization data.

      • Primary cilia do seem to regulate vascular plexus in the mouse retina as well as endothelial permeability through mediating subcellular localization of junction proteins. The authors do not clearly exclude the ciliary function of IFT20 in mediating lymphatic endothelial cell-cell junctions. A rescue experiment can help settle this question by targeting IFT20 exclusively to cilia (or not) and assessing, for example, VE-cadherin localization. The following is optional: It is also unclear whether the described regulation is specific to IFT20 or can be phenocopied by the ablation of another IFT subunit and/or cilia ablation through the depletion of a non-IFT cilia assembly regulator.

      Thank you for this helpful suggestion. We propose an alternative strategy.

      Planned 6: To determine the role of ciliary vs. nonciliary functions, we will knockdown IFT74, an IFT protein in the same IFT complex B as IFT20 that is required for cilia assembly and function but is not known to participate in vesicular trafficking. We will assess VE-cadherin localization in IFT74 WT and KD cells by immunofluorescence.

      Beyond the scope of the manuscript 2: We have not optimized reagents for targeting IFT20 to the cilium (e.g. ciliary targeting sequence) and believe that assessing the effects of a protein from the same IFT complex (IFT74) without known nonciliary functions will alleviate the reviewer’s concern.

      • Figs. 7A and B do not seem very convincing. The control vs. IFT20 KD western blot levels look mostly similar between the two conditions. The result section does not translate the actual data in Fig. 7A and B. Additionally, there are no statistical comparisons between control and KD conditions in the graphs. Except for a potential pVEGFR-3 increase at 30 min VEGF-C in IFT20 KD cells, but after washout the level is similar to control. This figure does not support well the model presented in fig. 8. The conclusion in lines 456-459 has to be toned down.

      Thank you for this helpful suggestion.

      Planned 7: We will remove these western blot data with the exception of pVEGFR-3 and add phospho-tyrosine immunofluorescence. We will use immunofluorescence to quantify phosphorylated tyrosine levels and repeat western blots for pVEGFR-3 at different concentrations and time points of VEGF-C stimulation in IFT20 WT and KD cells. We will remove the other western blot data and revise the text accordingly. We will also attempt to pull down total VEGFR-3 and then blot for pVEGFR-3 to improve sensitivity of this assay.

      These experiments will focus our analysis on the activation of VEGFR-3.

      • The authors were not able to stain for pVEGFR3. It would still be helpful to see a colocalization between total VEGFR3, IFT20, and VE-cadherin in control cells and IFT20 KD cells (VEGFR3 and VE-cadherin).

      Thank you for this helpful suggestion.

      Planned 8: We will perform immunofluorescence for VEGFR-3, IFT20, and VECAD and assess their localization.

      Minor comments - The control used in Figures 1 and 2 does not seem ideal. The proper control would be IFT20fl/fl cre neg. Is there a reason why the authors excluded a lox allele in control? Also, the authors have to provide the mice age used in these figures and when the Cre kicks in in the result section.

      Thank you for this helpful suggestion.

      Planned 9: We will clarify the use of control genotypes, and add mouse ages and Cre details to results/methods. This is a constitutive LYVE-1 Cre.

      • Please describe the overall mouse phenotype(s) of the LYVE1 CRE-IFT20 flox.

      Thank you for pointing out this oversight.

      Planned 10: We will include a description of the overall phenotypes of LYVE1 Cre IFT20 KOs in the text. One notable phenotype is abdominal ascites.

      • Line 109:'By expression, the authors probably mean immunostained.

      Thank you for pointing out this oversight.

      Planned 11: We will change to “immunostaining for”.

      • Many graphs exhibit undefined Y-axis labels and units. Please clarify these as well as the way they were quantified. Include such information in figure legends and/or in the materials and methods section. The figures in question are fig1C, E and F, fig2E, fig3E, fig4b and D, fig6b and D, fig7B and C.

      Thank you for this helpful suggestion.

      Planned 12: We will clarify the quantification strategies and units in the text and figure legends and make sure the axes are clearly labeled.

      • Line 295:'homeostasis"-the authors probably mean in a serum-rich condition.

      Thank you for this helpful suggestion.

      Planned 13: That is indeed what we meant. We will merge this sentence and the next sentence to be clearer.

      • Fig4C specifically the lower two images on the right side: the images do not seem to represent the corresponding graphs.

      Thank you for this helpful suggestion.

      Planned 14: We will double check these images and adjust if necessary.

      • Please add the statistical tests used to evaluate significance in all figure legends.

      Thank you for pointing out this oversight.

      Planned 15: We will be sure statistical tests are named in all figure legends.

      Reviewer #1 (Significance (Required)):

      This study provides novel insights into IFT20's role in VE-cadherin trafficking and endothelial junction stability, with its strongest aspect being the in vivo data in Figures 1 and 2, demonstrating lymphatic defects upon IFT20 loss. This represents a conceptual advance by extending IFT protein function beyond cilia (if one of the major comments is addressed) to vascular integrity. However, mechanistic depth is lacking, and ciliary role was not tested-additional rescue and colocalization experiments are needed to confirm the model. The study will interest vascular and lymphatic biologists, as well as cell biologists studying intracellular trafficking and cilia.

      Expertise: cilia and mouse genetics

      __Reviewer #2 __

      (Evidence, reproducibility and clarity (Required)):

      Paulson et al. use an in vivo model of IFT20 deletion (Lyve1-Cre) and primary lymphatic endothelial cell (LEC) cultures to investigate the role of IFT20 in controlling LEC-LEC junction dynamics. The key findings/suggestions include: i) Authors show alterations in the VE-cadherin (or ZO-1) staining at the LEC junctions upon IFT20 deletion or silencing. ii) They also show evidence of the IFT20 localization to RAB5 endosomes and alteration of RAB5 endosome dynamics upon IFT20 silencing.

      In the current manuscript, some of the key data are not convincing. Further experimentation and analysis (also of the existing data) are needed to solidify the authors' statements as detailed below. I expect that the suggested experiments can be executed in 3-to-6 months and require, at least, antibodies, which have not been used in the current manuscript.

      Major comments

      1. The data information, presented in the figure legends, is difficult to understand. The authors should always indicate how many biological replicates and independent experiments the data is derived from. This holds also for the representative images. Now, it seems that some of the quantified data are derived from only 1 experiment (see, for example, rows 423-425: "Graphs show one representative biological replicate of two, each comprising two technical replicates with 100+ cells per condition"). The quantifications should be based on data from at least three independent experiments.

      Often data points represent the field of views from a single sample, thus, biasing the statistical testing. The data points should represent biological replicates or independent experiments to allow the reader to make conclusions, about whether the findings are statistically significant and can be repeated.

      Thank you for this helpful critique.

      Planned 16: We will be sure to indicate biological and technical replicates and ensure that quantifications are representative of at least three independent experiments. We will also ensure that quantifications are statistically robust.

      The Lyve1-Cre is not specific for lymphatic vasculature (for example https://www.jax.org/strain/012601# and Lee LK et al. 2020, Cell Reports), as also stated by the authors (row 112). However, this is not shown in the data and complicates the interpretation of the data. Here, authors can stain the IFT20 with their existing mouse IFT20-specific antibody to show the loss in the lymphatic and/or blood vasculature. If IFT20 is lost in both vasculature types, it is not possible to say "lymphatic specific" (for example, row 143) and draw conclusions that the observed phenotypes would be primary to IFT20 loss in the lymphatic vasculature.

      Thank you for this helpful suggestion.

      Planned 17: We will assess IFT20 KO in blood vasculature and tone down lymphatic-specific language in the text.

      The authors write (rows 164-168) "Lymphatic vessels in the IFT20 KO or VE-cadherin KO embryonic dorsal skin exhibited increased and variable lumen size and excessive branching, suggesting that impaired lymphatic organization and function contributed to the fluid homeostasis defect. Here, immunofluorescence staining for LYVE-1 in the ear skin revealed similar patterning defects in adult IFT20 KO lymphatic vessels (Figure 2A), that have also been described in VE-cadherin KO mice (Hägerling et al., 2018)." However, based on Figure 2A, it is not obvious that there would be excessive lymphatic vessel branching, impaired organization or similarities to VE-cadherin deleted lymphatic vessels. To justify their statement, the authors should provide quantification of the branching (at least 3 mice/genotype).

      Thank you for this helpful critique.

      Planned 18: Based on the suggestion from Reviewer 3, we will remove these morphological and skin drainage data.

      IFT20 deletion or silencing causes alterations in the cell junction pattern/VE-cadherin intensity. The authors' interpretation that IFT20 deletion/silencing would cause discontinuous or "button-like" junctions is not supported by the provided images (Figures 1E, 3F, 6A, 6C). Rather, it seems that the levels of VE-cadherin in vivo are decreased, whereas the "continuity" of the junction is not altered. In cell culture, IFT20 silencing seems to cause wider and, to some extent, overlapping VE-cadherin junctions and not "discontinuous". These junctions may represent a more immature state. The authors should change the nomenclature accordingly or provide additional data. Using the existing cell culture experiment images, it would be more appropriate to analyze the width of the VE-cadherin junctions, instead of the "granularity".

      Thank you for this helpful suggestion.

      To assess VE-cadherin levels in vitro, we will perform western blots as described in Planned 1 above.

      Planned 19: We will measure widths of junctions from IFT20 KD and WT images and adjust the language in the text.

      Paulson et al. show images of IFT20 and RAB5 double-stained samples. The co-localization seems to happen mostly at the weakly IFT20 positive puncta (Figure 3A-B). Authors should show the disappearance of the signal in the siIFT20 treated samples (in comparison to siControl samples) to highlight the specificity of the weak signal.

      Thank you for this helpful suggestion.

      Planned 20: We will add data showing the IFT20 KD more clearly at high magnification.

      1. The Authors analyze the co-localization of VE-cadherin and RAB5 as co-localization area (Figure 6C-D). The images show that the co-localization is stated to happen at LEC periphery/junctions. LEC periphery is notoriously thin and microscope Z-resolution does not allow distinction of truly co-localizing or "on top of each other" signal. Based on row 607 co-localization would be expected to happen at least in EEA1+ vesicles, which are located perinuclearly (not at the junctions) in LECs (Korhonen et al. 2022, JCI). Authors could use EEA1, RAB5, and VE-cadherin triple staining for the quantification.

      Thank you for this helpful suggestion.

      Please see Planned 3 and Planned 4 above where we propose experiments to address this concern.

      In the current experiments, authors cannot conclude whether the VE-cadherin signal is at the cell junction (non-internalized), in endosomes (internalized during the experiment), or newly produced VE-cadherin on its way to the plasma membrane. To allow conclusions about the internalized VE-cadherin, and its localization in RAB5 vesicles, authors should conduct, for example, a classical endocytosis assay: incubation of live cells with non-blocking anti-VE-cadherin antibody, followed by acid wash to remove the non-internalized antibody, fixation and staining for RAB5. Also, shorter VEGF-C treatment would allow conclusions about the VE-cadherin dynamics.

      Thank you for this helpful suggestion.

      In Planned 2 above, we will perform immunofluorescence staining for cell-surface VE-cadherin using permeabilized and non-permeabilized IFT20 WT and KD cells during VEGF-C stimulation at various timepoints and washout to address this concern.

      siRNAs can have off-target effects and, thus, the use of at least two independent methods/oligos for silencing is needed. Paulson et al. use a pool of 4 oligos for silencing. They should rather test the efficacy of the single oligos and then use the two best oligos (1/sample) to show and quantify the same phenotype. This is needed at least for the key experiments shown in Figures 4C-D, Figure 6A-B (see also comment #3), Figure 7A-B

      Thank you for this helpful suggestion. We chose these reagents based on pooled siRNAs at low concentration minimizing off-target effects while still achieving strong KD vs. single siRNAs at higher concentration. Please see this technical note for further information about minimizing off-target effects by the use of pooled siRNAs vs. single siRNAs: https://horizondiscovery.com/-/media/Files/Horizon/resources/Application-notes/off-target-tech-review-technote.pdf?sc_lang=en

      1. “SMARTpool siRNA reagents pool four highly functional SMARTselection designed siRNAs targeting the same gene. Studies show that strong on-target gene knockdown can be achieved with minimal off-target effects if a pool consisting of highly functional multiple siRNA is subsituted for individual duplexes. This finding is in contrast to speculation that mixtrues of siRNAs can compound off-target effects. … [Their data show that] while individual duplexes delivered at 100 nM can induce varying numbers of off-targeted genes, transfection of the corresponding SMARTpool siRNA (100 nM total concentration) induces only a fraction of the total off-target profile.”
      2. “Our scientists have identified a unique combination of [chemical] modifications that eliminate as much as 80% of off-target effects.”
      3. “The ON-TARGETplus product line is comprised of four individual siRNAs, and SMARTpool reagents which are chemically modified and rationally designed to minimize off-target effects.”

        OPTIONAL: Paulson et al stated in the first article (2021, Front. Cell Dev. Biol.) that IFT20 deletion/silencing causes lymphatic endothelial phenotypes due to its role in primary cilia, whereas here the authors conclude that IFT20 controls VE-cadherin dynamics at the RAB5 vesicles. However, the current experiments cannot dissect the role of IFT20 in these two distinct locations. For this, authors could delete/silence another gene required for primary cilia or RAB5 endosomes and then analyze, which IFT20 phenotypes are recapitulated.

      Thank you for this helpful suggestion. Please see Planned 6 above where we propose to determine the role of ciliary vs. nonciliary IFT functions by knocking down IFT74, an IFT protein in the same IFT complex B as IFT20 that is required for cilia assembly and function but is not known to participate in vesicular trafficking. We will assess VE-cadherin localization in IFT74 WT and KD cells by immunofluorescence.

      The data shown in Figure 2 B-E (Lymphatic drainage) is not necessary for the current manuscript ("IFT20 regulates VE-cadherin traffic in LECs") and can be removed. As the authors state in the manuscript, the drainage phenotype may be due to lymphatic vessel valve defects (rows 584-585) rather than primary for LEC-LEC junction defects. The data does not justify the abstract sentence "and lymph transport is impaired by intracellular sequestration of VE-cadherin" (row 42).

      Thank you for this helpful suggestion. Please see Planned 18 above, where we propose to remove these data.

      Minor comments

      1. For some of the images, the signal should be enhanced to allow visual inspection also in the paper version (Figures 5A-B and 6C, magenta).

      Thank you for this helpful suggestion.

      Planned 21: We will enhance the signal in the indicated figures.

      Authors show representative Western Blots and quantification of several biological replicates/sample types to investigate signaling responses upon VEGF-C treatment of control and siIFT20 cells. The authors state that the P-levels of VEGFR3, ERK, VE-cadherin, and AKT have different dynamics in control and IFT20-silenced cells. To justify this conclusion, authors should test the statistical significance between the siControl and siIFT20 samples at each time point. The current quantification (Figure 7B) shows that there is, at least, a trend of increased p-VEGFR3, p-VE-cadherin, p-ERK, and p-AKT in IFT20 silenced cells. However, the representative Western Blot image does not display a clear difference (Figure 7A). Authors should include the original western blots, used for quantification, as supplements.

      Thank you for this helpful suggestion. Please see Planned 7 above where we propose to remove these data with the exception of pVEGFR-3 and add corresponding immunofluorescence data. We will ensure blots are included as supplemental figures.

      The authors use western blot quantification to show that the altered LEC junctions affect VEGFR3 signaling. They further hypothesize that the increased VEGFR3 signaling may be a consequence of VEGFR3 localization in endosomes. The authors did not detect any signal using the phospho-specific VEGFR3 antibody (rows 441-442). To analyze the location of VEGFR3 upon VEGF-C treatment in siControl and siIFT20 LECs, the authors should use anti-VEGFR3 (total) antibodies that have been shown to detect VEGFR3 in similar assays.

      Thank you for this helpful suggestion.

      Please see Planned 8 above where we will perform immunofluorescence for VEGFR-3, IFT20, and VECAD and assess their localization.

      The normality of the data should be tested before the selection of the statistical test. If this has been done, please, indicate it in the materials and methods or re-run the statistical analysis, if some of the data is not normally distributed.

      Thank you for this helpful suggestion.

      Planned 22: We will double check the statistics and normality for all quantifications.

      The authors should use arrows, arrowheads, etc. to highlight examples of relevant features in the images. For example, in Figure 3C, the increased stress fiber formation is not obvious to the reader.

      Thank you for this helpful suggestion.

      Planned 23: We will add arrows etc. where appropriate.

      Reviewer #2 (Significance (Required)):

      Lymphatics are essential for fluid, leukocyte, and lipid trafficking to lymph nodes and/or systemic circulation. Recent findings have promoted lymphatics as a potential target to control the level of adaptive immunity in inflammation-associated diseases, including tumorigenesis (for example Song et al 2020, Nature). Early work on lymphatic endothelium in vivo, highlighted the dynamics of lymphatic endothelial junction, which, reversibly, can alter between continuous and discontinuous ("button-like") states (Baluk 2007, Am. Jour. Pathol.; Yao 2012, Am J. Pathol.). These changes may have an effect on fluid drainage capacity, lymphatic vessel growth, and prevention of pathogen dissemination to the systemic circulation. Recently, lymphatic junctions have been shown to present hubs of VEGFR3 signaling, VEGFR3 and VE-cadherin dynamics, and leukocyte transmigration (Sung et al. 2022, Nat. Cardiovasc. Res.; Hagerling et al. 2018, EMBO J.; Liaqat et al. 2024, EMBO J.). Thus, the manuscript by Paulson et al. investigates a topical subject.

      The authors suggest a role for IFT20 in the control of VE-cadherin dynamics. Based on my expertise in lymphatic endothelial biology, I envision that the manuscript can potentially increase knowledge on the regulators of the lymphatic endothelial junctions, which might have physiological, and in the long term, translational significance. However, in the current manuscript, the exact mechanisms of how IFT20 controls lymphatic endothelial junctions are left open. In addition to the lymphatic research field, the study is, potentially of interest to researchers working on blood vasculature or, even, epithelium, i.e. tissues where junctional dynamics play a major role in health and disease.

      Furter controls, analysis, and experimentation are needed to warrant the authors' statements. In their future work, the authors should also consider means to rigorously dissect the IFT20 functions in primary cilia and endosomes.

      __Reviewer #3 __

      (Evidence, reproducibility and clarity (Required)):

      In this manuscript, the group of Fink and coworkers investigates mechanistic aspects of the intraflagellar transport protein 20 (IFT20) function in lymphatic endothelial cells (LECs). In a previous study, this group had demonstrated the presence of primary cilia on LECs and shown that loss of IFT20 during development resulted in edema, lymphatic vessel dilation and altered branching. Lymphatic-specific deletion of IFT20 cell-autonomously exacerbated acute lymphangiogenesis after corneal suture. In this manuscript, Paulson et al. recapitulate the suture-induced hyper-lymphangiogenesis after lymphatic-specific IFT20 KO using a LYVE1-Cre delete strain and demonstrate a reduced, more discontinuous VE-Cadherin (VECad) staining in newly formed lymphatic vessels (LVs). Prompted by distended and hyperbranching dermal vessels, the performed functional tracer injection experiments and demonstrate increased lymphatic backflow and leakage into the interstitium. To gain further mechanistic insights the authors turned to reductionist cell culture models, starting with a mouse LEC line, in which IFT20 had been deleted using CRISPR/Cas9 resulting in loss of primary cilia, increased stress fibre formation and impaired junctional integrity. More importantly, similar effects were detected in human dermal (HD)LECs after IFT20 KD. Further IFT20 KD HDLECs showed accumulation of RAB5+ vesicles indicating defective endosome maturation. Indistinguishable formation of RAB5+ endosomes after VEGF-C stimulation in HDLECs and IFT20KD HDLECs indicated that endocytosis and formation of early endosomes occur independent if IFT20. Through starvation, stimulation and wash-out experiments the authors provide colocalization data suggesting that after VEGF-C stimulation IFT20 is recruited to endosomes where it contributes to VECad recycling. Finally, the authors addressed if the increase in RAB5+ endosomes following VEGF-C stimulation resulted in prolonged retention of signaling-active VEGFR-3 in endosomes. Western blotting for phosphorylation of VEGFR-3 and its downstream signaling components after activation of starved HDLECs or IFT20KD HDLECs and subsequent factor wash out provided evidence towards this model.

      Subsequently open question and potential suggestions for improvement are listed: The authors describe a slight leakiness of the LYVE1-Cre deleter strain to result in massive hemangiogenesis (line112). How extensive is the resulting deletion in blood endothelial cells? What are the consequences for VECad distribution in BEC junctions i.e. for blood vessels and vascular permeability? Are the defects described specific for LECs or are the manifestation of generic defects in LECs?

      Thank you for these helpful suggestions.

      Please see Planned 17 above where we will assess IFT20 KO in blood vasculature and tone down lymphatic-specific language in the text.

      Fig. 1 E, what is the distribution of LYVE1 in IFT20 KO LECs at higher magnification, is LYVE1 excluded from the VECad expression domain?

      Thank you for this helpful suggestion.

      Planned 24: We will review our corneal confocal data to address this question.

      Fig.1 F, what does VECad-positive LV (%) area (line 154 - 155) refer to, given that all LECs are VECad+ but the junctional distribution of the protein is distinctly different?

      Thank you for pointing out our need to clarify. This quantification measures the overlap of VE-cadherin with LYVE-1 as a way to measure the area covered by adherens junctions between lymphatic endothelial cells. Where junctions are punctate, they have smaller area vs. long continuous junctions.

      Planned 25: We will update the text to clarify this measurement.

      In the discussion, the authors speculate that the development of valves could be potentially impaired in IFT20 LEC KO mice. Ear skin would be an excellent tissue to stain the valves and analyse their structure in collecting LVs. Of particular interest in this context are Int a9, VECad, FOXC2 and PROX1 expression. The later two are required for valve formation and upregulated in valve forming areas in response to oscillatory shear stress (Sabine A et al. (2012) Dev Cell 22 (2):430-445. doi:10.1016/j.devcel.2011.12.020).

      Thank you for this helpful suggestion. Based on the suggestion from Reviewer 2, we will remove the ear lymph drainage data and focus on the cell biology in this manuscript. Our current experiments focus more on lymphatic valve formation in this context and these data can be moved to a separate manuscript.

      Planned 26: We will revise the text to remove speculation about valve development in this model and address this in a later manuscript.

      Does IFT20 KO and loss of the primary cilium impair OSS sensing and result in a failure to express sufficient levels of PROX1 for valve formation (Fig. 7 C).

      Thank you for this helpful comment. We will address the role of cilia in OSS sensing and valve formation in a forthcoming manuscript.

      A larger area view including pre-collectors and collectors would be informative and reveal changes in the overall structure of the lymphatic vessel bed in absence of IFT20.

      Based on the suggestion from Reviewer 2, we will remove these data.

      Fig. 2 A, (line 187 - 190) please indicate the age of analysed animals.

      Planned 27: We will add the ages of mice used.

      With respect to Fig.1, LVs in the area are mainly capillaries, what is the distribution of VECad? Are the LVs comprised of oak-leave shaped LECs, higher magn. pictures would be required.

      Thank you for this helpful suggestion.

      Planned 28: We will include higher magnification images of capillaries.

      Fig. 2 (C - E) Line 201 - 203 the description of retrograde flow using a clock terminology is unusual and not clear to the reader. Is this meant relative to the point of injection with 12 being at the top or relative to the injection axis (i.e. forward / backward direction)? It would seem that indication of the angle in combination with a sketch of the analysis would help the reader to interpret these data.

      Thank you for this helpful critique. We will remove these data based on this suggestion and that of Reviewer 2.

      The application of cell culture models is appropriate, however, the value of the mLEC model is questionable given that VECad is not detectable in these cells and PROX1 and VEGFR-3 staining are not shown. Therefore, the HDLEC model bears significantly more relevance. In Fig. 3D, were mLECS mitotically arrested during the 24hrs transwell migration, to exclude division and crowding effects during the observation time?

      Thank you for this helpful critique.

      Planned 29: We will clarify the methods for this experiment in the text.

      Fig. 6 It is commendable that the authors report their lack of success to directly visualize VEGFR-3 endocytosis by IF and attempt a WB analysis instead. However given the spread of the results normalization to ß-actin as a loading control appears inappropriate. Phosphorylated forms of VEGFR-3 and VECad should be normalized to the expression of the total protein as measured with a non-phospospecific antibody, exactly the way done here for ERK1/2 and AKT. Generally, IP-WB experiments provide superior data in this type of setting.

      Thank you for this helpful suggestion. Based on suggestions from the other reviewers, we will remove these WB data with the exception of pVEGFR-3 and add corresponding immunofluorescence. We will include additional time points and include blots used for quantifications as supplements.

      Line 597 - 599: "VEGFR-3 signaling is required for the establishment of VE-cadherin button junctions as lymphatic collecting vessels mature but is not required for their maintenance (Jannaway et al., 2023)." Collecting LVs are characterized by zipper junctions, but not button junctions. Therefore, this sentence needs clarification.

      Thank you for this helpful suggestion.

      Planned 30: We will clarify this text.

      Reviewer #3 (Significance (Required)):

      The role of IFT20 in formation of the primary cilium and endocytic vesicle transport warrants its investigation in lymphatic endothelial cells. Therefore, this study addresses relevant questions and provides important first insights into the cell biological function of IFT20 in this cell type. IFT20 has so far not been implicated in endocytosis and recycling of VECad and VEGFR-3 and the model suggested by the authors is compelling and adds to the mechanistic understanding of previous studies on the role of VECad in LECs. In particular, it could be of relevance for the enigmatic formation of button junction in lymphatic capillaries and the mechano-response of LECS underlying valve formation. At this point, the picture obtained from the endocytosis assays is more conclusive compared to the analysis of the impact of IFT20 loss on button junction formation. Clearly the study is of interest for a general cell biological audience as well as vascular biologists.

      • *
    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Paulson et al. use an in vivo model of IFT20 deletion (Lyve1-Cre) and primary lymphatic endothelial cell (LEC) cultures to investigate the role of IFT20 in controlling LEC-LEC junction dynamics. The key findings/suggestions include: i) Authors show alterations in the VE-cadherin (or ZO-1) staining at the LEC junctions upon IFT20 deletion or silencing. ii) They also show evidence of the IFT20 localization to RAB5 endosomes and alteration of RAB5 endosome dynamics upon IFT20 silencing.

      In the current manuscript, some of the key data are not convincing. Further experimentation and analysis (also of the existing data) are needed to solidify the authors' statements as detailed below. I expect that the suggested experiments can be executed in 3-to-6 months and require, at least, antibodies, which have not been used in the current manuscript.

      Major comments

      1. The data information, presented in the figure legends, is difficult to understand. The authors should always indicate how many biological replicates and independent experiments the data is derived from. This holds also for the representative images. Now, it seems that some of the quantified data are derived from only 1 experiment (see, for example, rows 423-425: "Graphs show one representative biological replicate of two, each comprising two technical replicates with 100+ cells per condition"). The quantifications should be based on data from at least three independent experiments.

      Often data points represent the field of views from a single sample, thus, biasing the statistical testing. The data points should represent biological replicates or independent experiments to allow the reader to make conclusions, about whether the findings are statistically significant and can be repeated. 2. The Lyve1-Cre is not specific for lymphatic vasculature (for example https://www.jax.org/strain/012601# and Lee LK et al. 2020, Cell Reports), as also stated by the authors (row 112). However, this is not shown in the data and complicates the interpretation of the data. Here, authors can stain the IFT20 with their existing mouse IFT20-specific antibody to show the loss in the lymphatic and/or blood vasculature. If IFT20 is lost in both vasculature types, it is not possible to say "lymphatic specific" (for example, row 143) and draw conclusions that the observed phenotypes would be primary to IFT20 loss in the lymphatic vasculature. 3. The authors write (rows 164-168) "Lymphatic vessels in the IFT20 KO or VE-cadherin KO embryonic dorsal skin exhibited increased and variable lumen size and excessive branching, suggesting that impaired lymphatic organization and function contributed to the fluid homeostasis defect. Here, immunofluorescence staining for LYVE-1 in the ear skin revealed similar patterning defects in adult IFT20 KO lymphatic vessels (Figure 2A), that have also been described in VE-cadherin KO mice (Hägerling et al., 2018)." However, based on Figure 2A, it is not obvious that there would be excessive lymphatic vessel branching, impaired organization or similarities to VE-cadherin deleted lymphatic vessels. To justify their statement, the authors should provide quantification of the branching (at least 3 mice/genotype). 4. IFT20 deletion or silencing causes alterations in the cell junction pattern/VE-cadherin intensity. The authors' interpretation that IFT20 deletion/silencing would cause discontinuous or "button-like" junctions is not supported by the provided images (Figures 1E, 3F, 6A, 6C). Rather, it seems that the levels of VE-cadherin in vivo are decreased, whereas the "continuity" of the junction is not altered. In cell culture, IFT20 silencing seems to cause wider and, to some extent, overlapping VE-cadherin junctions and not "discontinuous". These junctions may represent a more immature state. The authors should change the nomenclature accordingly or provide additional data. Using the existing cell culture experiment images, it would be more appropriate to analyze the width of the VE-cadherin junctions, instead of the "granularity". 5. Paulson et al. show images of IFT20 and RAB5 double-stained samples. The co-localization seems to happen mostly at the weakly IFT20 positive puncta (Figure 3A-B). Authors should show the disappearance of the signal in the siIFT20 treated samples (in comparison to siControl samples) to highlight the specificity of the weak signal. 6. The Authors analyze the co-localization of VE-cadherin and RAB5 as co-localization area (Figure 6C-D). The images show that the co-localization is stated to happen at LEC periphery/junctions. LEC periphery is notoriously thin and microscope Z-resolution does not allow distinction of truly co-localizing or "on top of each other" signal. Based on row 607 co-localization would be expected to happen at least in EEA1+ vesicles, which are located perinuclearly (not at the junctions) in LECs (Korhonen et al. 2022, JCI). Authors could use EEA1, RAB5, and VE-cadherin triple staining for the quantification.

      In the current experiments, authors cannot conclude whether the VE-cadherin signal is at the cell junction (non-internalized), in endosomes (internalized during the experiment), or newly produced VE-cadherin on its way to the plasma membrane. To allow conclusions about the internalized VE-cadherin, and its localization in RAB5 vesicles, authors should conduct, for example, a classical endocytosis assay: incubation of live cells with non-blocking anti-VE-cadherin antibody, followed by acid wash to remove the non-internalized antibody, fixation and staining for RAB5. Also, shorter VEGF-C treatment would allow conclusions about the VE-cadherin dynamics. 7. siRNAs can have off-target effects and, thus, the use of at least two independent methods/oligos for silencing is needed. Paulson et al. use a pool of 4 oligos for silencing. They should rather test the efficacy of the single oligos and then use the two best oligos (1/sample) to show and quantify the same phenotype. This is needed at least for the key experiments shown in Figures 4C-D, Figure 6A-B (see also comment #3), Figure 7A-B 8. OPTIONAL: Paulson et al stated in the first article (2021, Front. Cell Dev. Biol.) that IFT20 deletion/silencing causes lymphatic endothelial phenotypes due to its role in primary cilia, whereas here the authors conclude that IFT20 controls VE-cadherin dynamics at the RAB5 vesicles. However, the current experiments cannot dissect the role of IFT20 in these two distinct locations. For this, authors could delete/silence another gene required for primary cilia or RAB5 endosomes and then analyze, which IFT20 phenotypes are recapitulated. 9. The data shown in Figure 2 B-E (Lymphatic drainage) is not necessary for the current manuscript ("IFT20 regulates VE-cadherin traffic in LECs") and can be removed. As the authors state in the manuscript, the drainage phenotype may be due to lymphatic vessel valve defects (rows 584-585) rather than primary for LEC-LEC junction defects. The data does not justify the abstract sentence "and lymph transport is impaired by intracellular sequestration of VE-cadherin" (row 42).

      Minor comments

      1. For some of the images, the signal should be enhanced to allow visual inspection also in the paper version (Figures 5A-B and 6C, magenta).
      2. Authors show representative Western Blots and quantification of several biological replicates/sample types to investigate signaling responses upon VEGF-C treatment of control and siIFT20 cells. The authors state that the P-levels of VEGFR3, ERK, VE-cadherin, and AKT have different dynamics in control and IFT20-silenced cells. To justify this conclusion, authors should test the statistical significance between the siControl and siIFT20 samples at each time point.

      The current quantification (Figure 7B) shows that there is, at least, a trend of increased p-VEGFR3, p-VE-cadherin, p-ERK, and p-AKT in IFT20 silenced cells. However, the representative Western Blot image does not display a clear difference (Figure 7A). Authors should include the original western blots, used for quantification, as supplements. 12. The authors use western blot quantification to show that the altered LEC junctions affect VEGFR3 signaling. They further hypothesize that the increased VEGFR3 signaling may be a consequence of VEGFR3 localization in endosomes. The authors did not detect any signal using the phospho-specific VEGFR3 antibody (rows 441-442). To analyze the location of VEGFR3 upon VEGF-C treatment in siControl and siIFT20 LECs, the authors should use anti-VEGFR3 (total) antibodies that have been shown to detect VEGFR3 in similar assays. 13. The normality of the data should be tested before the selection of the statistical test. If this has been done, please, indicate it in the materials and methods or re-run the statistical analysis, if some of the data is not normally distributed. 14. The authors should use arrows, arrowheads, etc. to highlight examples of relevant features in the images. For example, in Figure 3C, the increased stress fiber formation is not obvious to the reader.

      Significance

      Lymphatics are essential for fluid, leukocyte, and lipid trafficking to lymph nodes and/or systemic circulation. Recent findings have promoted lymphatics as a potential target to control the level of adaptive immunity in inflammation-associated diseases, including tumorigenesis (for example Song et al 2020, Nature). Early work on lymphatic endothelium in vivo, highlighted the dynamics of lymphatic endothelial junction, which, reversibly, can alter between continuous and discontinuous ("button-like") states (Baluk 2007, Am. Jour. Pathol.; Yao 2012, Am J. Pathol.). These changes may have an effect on fluid drainage capacity, lymphatic vessel growth, and prevention of pathogen dissemination to the systemic circulation. Recently, lymphatic junctions have been shown to present hubs of VEGFR3 signaling, VEGFR3 and VE-cadherin dynamics, and leukocyte transmigration (Sung et al. 2022, Nat. Cardiovasc. Res.; Hagerling et al. 2018, EMBO J.; Liaqat et al. 2024, EMBO J.). Thus, the manuscript by Paulson et al. investigates a topical subject.

      The authors suggest a role for IFT20 in the control of VE-cadherin dynamics. Based on my expertise in lymphatic endothelial biology, I envision that the manuscript can potentially increase knowledge on the regulators of the lymphatic endothelial junctions, which might have physiological, and in the long term, translational significance. However, in the current manuscript, the exact mechanisms of how IFT20 controls lymphatic endothelial junctions are left open. In addition to the lymphatic research field, the study is, potentially of interest to researchers working on blood vasculature or, even, epithelium, i.e. tissues where junctional dynamics play a major role in health and disease.

      Furter controls, analysis, and experimentation are needed to warrant the authors' statements. In their future work, the authors should also consider means to rigorously dissect the IFT20 functions in primary cilia and endosomes.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The manuscript identified a novel role of Intraflagellar Transport Protein 20 (IFT20) in the function and homeostasis of lymphatic endothelial junctions. The authors showed that IFT20 regulates VE-cadherin localization at adherens junctions in lymphatic endothelial cells. The authors performed impressive in vivo work that shows the requirement for IFT20 for the homeostasis of intercellular junctions, lymphangiogenesis, and drainage function of lymphatic vessels. In contrast, the cell biology part of the paper was underwhelming and will need significant revisions to support the proposed model. In the result section, several conclusions have to be toned down to match the actual results. The study employs in vivo mouse models, immunofluorescence, biochemical assays, and loss-of-function experiments to support their conclusions.

      Major comments

      • The authors present disrupted localization of VE-cadherin. Is this a mislocalization and/or protein stability issue in IFT20 KD cells? A western blot can help assess protein levels, and a phase-chase endocytosis assay of VE-cadherin can strengthen evidence. The authors did not confirm the permeability phenotype seen in vivo.
      • While the authors focused on IFT20 and rab5, we do not have a clear idea about the vesicular dynamics as well as the status of early, late, and recycling endosomes in IFT20 KD cells. Is IFT 20 localized to non-rab5+ endosomes, and if yes, what are the species? A more general endosomal profiling would help strengthen the authors' message. For example, in Fig. 4-5, the authors will have to stain for other early endosomal markers as well as late, and recycling endosomal markers in control and IFT20 KD cells.
      • In fig. 6C, a majority of VE-cadherin is not associated with Rab5. Staining with additional endosomal markers might help identify other endosomal species colocalizing with VE-cadherin. It will be critical to add to Fig. 6c the intensity profiles depicting colocalizations. The authors can also live image a fluorescently (f)-tagged VE-cadherin (maybe with another f-tagged rab5) and assess their association dynamics in IFT20 KD cells (similar to fig6C).
      • Primary cilia do seem to regulate vascular plexus in the mouse retina as well as endothelial permeability through mediating subcellular localization of junction proteins. The authors do not clearly exclude the ciliary function of IFT20 in mediating lymphatic endothelial cell-cell junctions. A rescue experiment can help settle this question by targeting IFT20 exclusively to cilia (or not) and assessing, for example, VE-cadherin localization. The following is optional: It is also unclear whether the described regulation is specific to IFT20 or can be phenocopied by the ablation of another IFT subunit and/or cilia ablation through the depletion of a non-IFT cilia assembly regulator.
      • Figs. 7A and B do not seem very convincing. The control vs. IFT20 KD western blot levels look mostly similar between the two conditions. The result section does not translate the actual data in Fig. 7A and B. Additionally, there are no statistical comparisons between control and KD conditions in the graphs. Except for a potential pVEGFR-3 increase at 30 min VEGF-C in IFT20 KD cells, but after washout the level is similar to control. This figure does not support well the model presented in fig. 8. The conclusion in lines 456-459 has to be toned down.
      • The authors were not able to stain for pVEGFR3. It would still be helpful to see a colocalization between total VEGFR3, IFT20, and VE-cadherin in control cells and IFT20 KD cells (VEGFR3 and VE-cadherin).

      Minor comments

      • The control used in Figures 1 and 2 does not seem ideal. The proper control would be IFT20fl/fl cre neg. Is there a reason why the authors excluded a lox allele in control? Also, the authors have to provide the mice age used in these figures and when the Cre kicks in in the result section.
      • Please describe the overall mouse phenotype(s) of the LYVE1 CRE-IFT20 flox.
      • Line 109:'By expression, the authors probably mean immunostained.
      • Many graphs exhibit undefined Y-axis labels and units. Please clarify these as well as the way they were quantified. Include such information in figure legends and/or in the materials and methods section. The figures in question are fig1C, E and F, fig2E, fig3E, fig4b and D, fig6b and D, fig7B and C.
      • Line 295:'homeostasis"-the authors probably mean in a serum-rich condition.
      • Fig4C specifically the lower two images on the right side: the images do not seem to represent the corresponding graphs.
      • Please add the statistical tests used to evaluate significance in all figure legends.

      Significance

      This study provides novel insights into IFT20's role in VE-cadherin trafficking and endothelial junction stability, with its strongest aspect being the in vivo data in Figures 1 and 2, demonstrating lymphatic defects upon IFT20 loss. This represents a conceptual advance by extending IFT protein function beyond cilia (if one of the major comments is addressed) to vascular integrity. However, mechanistic depth is lacking, and ciliary role was not tested-additional rescue and colocalization experiments are needed to confirm the model. The study will interest vascular and lymphatic biologists, as well as cell biologists studying intracellular trafficking and cilia.

      Expertise: cilia and mouse genetics

    1. 1. Lucy and Ricky’s position as the first interracial couple on television; 2. cinematographer Karl Freund’s use of a multi-camera system to record the series on film in front of a live audience; 3. the announced arrival of Little Ricky; 4. making its stars, Lucille Ball and Desi Arnaz, the first television millionaires; and 5. being one of the series that convinced networks and studios that telefilm production in Hollywood would become the future of the industry.

      All of these things in tandem is very wild! It is almost impossible to visualize a hit this large in today's media. Now, there are not even shows that come close to being this unifying within culture.

    1. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Kemp et al. aimed to explore the transcriptional cell cycle regulation of replication-dependent (RD) histone genes at histone locus body (HLB) in Drosophila. They evaluate the accumulation of RNA pol II and RD histone transcripts at HLB during the cell cycle using live and fixed imaging of Drosophila tissues at different stages of development. They find that RNA pol II is enriched at HLB, not only during S phase when RD histone genes are transcribed but throughout the cell cycle. Outside of S phase, they detect short but not full-length RD histone transcripts suggesting a mechanism of RNA pol II pausing. Full length RD transcripts are only produced upon cyclin E/Cdk2 activation when cells enter S phase, arguing that Cyclin E/Cdk2 can activate transcription elongation. They propose that the elongation release triggered by Cyclin E/Cdk2 is the critical step linking RD histone gene expression and cell cycle progression rather than the recruitment of RNA pol II to HLB. The data are interesting and robust, however, additional experiments could reinforce the findings and the proposed model.

      Specific comments/concerns are listed below.

      1. In Figure 3, quantifications of the fluorescence at HLBs for mCherry-RBP1 and MXC-mScarlet should be provided.
      2. In Figure 5C, both 5' and 3' transcripts are observed in G214 cells. However, their accumulation in the cytoplasm is not visible. How do the authors explain this result? What happens in S14 cells?
      3. In Figure 6, the authors observed RD histone 3' transcripts only in replicating cells (EdU positive) while they detected 5' transcripts in both replicating and non-replicating cells. They argue that the appearance of 3' transcripts is due to the release from transcriptional pausing. To further support particular states in the transcriptional arrest, data by immunofluorescence using specific antibodies recognizing either RNA pol II ser5P or ser2P would determine whether the presence of 3' transcripts is associated with the accumulation at HLB of RNA pol II ser2P (elongating polymerase). Moreover, is there a correlation between P-MXC and RNA pol II ser2P?
      4. In Figure 7 panels C and D, the 5' transcripts should be shown. Although RD histone 3' transcripts accumulate in CyE+ embryonic cells, unfortunately, their presence at HLBs (pointed by arrows) is not visible in the image of panel E. To firm up conclusions quantifications of the 3' and 5' transcripts should be provided for CycE+ and CycEnull cells. In Hur et al., 2020, the authors looked at RD histone transcripts in WT embryo and CycE+/-/Cdk2+/- mutant. They found that the amount of H3 transcripts using a probe corresponding to the coding sequence is not changed in the mutant as compared to the WT. In contrast, they found that there is an increase of transcripts that are not correctly processed using probes downstream the stem-loop region. This seems inconsistent with the results presented here where a decrease of 3' transcripts is observed. This needs an explanation/discussion. Are such incorrectly processed transcripts observed in CycEnull mutant?
      5. The authors suggest that active Cyclin E/Cdk2 triggers the release of RNA pol II promoter-proximal pausing and therefore induces transcriptional elongation at RD histone genes when cells enter S phase. To further support this hypothesis, determining whether there is an enrichment of the elongation factor p-TEFb at HLB when Cyclin E/Cdk2 is active would help.
      6. Instead of using cycling E mutants, to determine whether it is the phosphorylation of MXC which directly impacts the elongation of RD histone genes, it would be interesting to generate phospho-null or phospho-mimetic mutant of MXC.
      7. In Suzuki et al., 2022, the authors described 3' RNA pol II pausing at RD histone genes. Although this study used human cells, it would be interesting to discuss that in addition to a promoter-proximal pausing that regulates transcription elongation, a 3' pausing could also regulate the transcription termination and 3' processing.
      8. In the discussion, the authors should point out some limitations of their studies linked to the method and could propose for the future that a more precise and molecular view of the pausing mechanism could be carried out using sequencing methods such as ChIP-seq of various isoforms of the RNA pol II (total, ser2P, ser5P) and elongation regulators (p-TEFb.....) and PRO-seq.

      Minor points:

      1. In Figure 1, for panels B and D as well as for panels C and E, to falicitate comparison of the localization of the different proteins, it would help to show the same developmental stages and the same image scales.
      2. In Figures 3 and 7 (C-F), the developmental stages should be indicated on the images, as it is done in the other figures.
      3. In the legend of Figure 7, it is indicated (D) and (E) instead of (C) and (D) in the sentence: "Endocycling midgut cells in (D) contain cytoplasmic histone mRNA which is absent in (E) (boxed regions)."

      Significance

      Kemp et al. aimed to explore the transcriptional cell cycle regulation of replication-dependent (RD) histone genes at histone locus body (HLB) in Drosophila. They evaluate the accumulation of RNA pol II and RD histone transcripts at HLB during the cell cycle using live and fixed imaging of Drosophila tissues at different stages of development. They find that RNA pol II is enriched at HLB, not only during S phase when RD histone genes are transcribed but throughout the cell cycle. Outside of S phase, they detect short but not full-length RD histone transcripts suggesting a mechanism of RNA pol II pausing. Full length RD transcripts are only produced upon cyclin E/Cdk2 activation when cells enter S phase, arguing that Cyclin E/Cdk2 can activate transcription elongation. They propose that the elongation release triggered by Cyclin E/Cdk2 is the critical step linking RD histone gene expression and cell cycle progression rather than the recruitment of RNA pol II to HLB.

      The data are interesting and robust, however, additional experiments could reinforce the findings and the proposed model.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1

      __Evidence, reproducibility and clarity __

      This is a well-written manuscript that describes a thorough study of the functionality of individual residues of a central component of the ESX-3 type VII secretion system of Mycobacterium smegmatis, EccD3, in the essential role of this protein transport system in iron acquisition. Using the powerful and unbiased approach of deep mutational scanning (DMS), the authors assessed the impact of different mutations on a large number of residues of this component. This carefully executed research highlights the importance of hydrophobic residues at the center the ubiquitin-like domain, specific residues of the linker domain that connects this domain with the transmembrane domains and specific residues that connect EccD3 with the MycP3 component.

      Major comments

      Since the LOF effects in the iron-sufficient and iron-deficient condition differ less than expected, the differences of the DMS results between these two conditions should be better presented, explained and discussed: 1. The authors discuss: "Of the 270 LOF mutations seen in the iron-deficient condition, 37 (13.7%) were tolerant in the iron sufficient condition, and 39 (14.44%) had strong LOF effects but weak LOF effects in the iron sufficient condition." Do the authors mean that 39 (14.44%) had strong LOF effects in the iron-deficient condition, but weak LOF effects in the iron-sufficient condition. In turn, does this mean that the remaining mutants (71.9%) had similar LOF effects in the two conditions?

      We thank this reviewer for their comment and for highlighting a lack of clarity. We have updated the main text to more effectively communicate our point - that 270 mutants had LOF effects in the iron-deficient media. 37 of these 270 mutants were tolerant in the iron-sufficient media. 39 of these 270 mutants had strong LOF effects in iron-deficient media, but were weak LOF in iron-sufficient media. The remaining 124/270 mutants had weak LOF effects in both conditions. The larger point is that removing iron leads to stronger selection - tolerant mutants become LOF, weak LOF become strong LOF. Removing iron pushes mutants at the bounds over the limit.

      __ The diagonal shape of the scatter plot in Fig. 2C, which shows the correlation of the Enrich2 scores of all mutants in the two conditions, indicates that the growth of most mutants is affected similarly in these conditions, but in Fig. 2D lower graph, which shows only the Enrich2 scores of missense mutants, there are clear differences between the two conditions. How can this be explained?__

      We apologize for any confusion created by this presentation of our data. We hoped to highlight that while effects are largely similar across conditions, there are some differences. As communicated in our first response, 270 out of our ~2700 missense mutations had LOF effects in the iron-deficient condition. 37 of these 270 mutants were tolerant in the iron-sufficient media. 39 of these 270 mutants had strong LOF effects in iron-deficient media, but were weak LOF in iron-sufficient media. The remaining 124 mutations had weak LOF effects in both conditions.

      While Figure 2C shows this difference, it is hard to see by nature of using a scatter plot. We have added contours to highlight how our data is distributed. Our density plots in Figure 2D are meant to try to highlight these differences, where the top plot is showing the effects of all missense mutations. Negatively scored mutations represent LOF effects, mutations with scores around 0 are considered tolerant, and the extremely rare scores with positive scores have GOF effects. Our bottom plot specifically zooms into the negatively scored mutations, to show the 270 LOF mutants we discussed. Specifically, we were hoping to highlight the 39 mutations that have strong LOF effects in iron-deficient media (so the purple line scores are more negative), but weak LOF effects in iron-sufficient media (the green line scores are less negative).

      __ Regarding the authors' explanation for the observed LOF effects in the permissive condition, "This speaks to the sensitivity of next-generation sequencing compared to the strong differences observed between conditions in phenotypic growth curves." But this sensitivity does not explain the observed large LOF effects but no growth difference in the permissive condition, unless the analysis is less quantitative than expected? Could it be that there is local iron depletion in this mixed culture, causing selection pressure even in the iron-sufficient condition? Moreover, the severity of the growth defect at the time of sampling, i.e., after 24 hours of growth, is unclear. Indeed, the growth curve in Fig. 1 shows that the growth of the double mutant in iron-deficient conditions is significantly impaired at that timepoint. In the growth curve in Fig. 2B (and also slightly in Fig. 2F), however, the growth defect is less pronounced: the double mutant has a similar OD600 as the WT strain, although the error bar is larger. Is this variability between replicates also seen in the DMS analysis? In general, no statistics are shown for the DMS analysis and there is no information on the significance of the observed LOF effects. In addition, the legend should explain how many replicates the DMS data are based on.__

      We thank this reviewer for their comment and for highlighting a point of confusion. In addition to increased sensitivity in next generation sequencing compared to our growth curve experiments, our data analysis and variant scoring was performed by comparing growth rates of our mutant strains to our wild type strain. So, any effect on viability or growth rates seen by expression mutant variants will be more notable in our DMS scoring, as they are relative to wild type. In contrast, our growth curves are plotted as the raw OD600 values of each strain. We believe this difference underlies the difference seen in our heatmaps and growth rates.

      It is also a relevant and important point that our libraries are grown as mixed cultures, where there is competition over the limited iron in their growth media, as we highlight in our discussion.

      While the double mutant does show a stark growth defect at 24 hours in Figure 1 compared to the WT and complement, it grows just as well as those strains in Figure 2B. The growth defect becomes notable after 24 hours. Within this experiment, we observed variability in growth at the 24hr timepoint for the negative control strain, but also selection when compared to the positive control and library growth at later time points. We analyzed our DMS data in accordance with typical methods used in the field (see: https://doi.org/10.1186/s13059-017-1272-5). We include statistics for the DMS analysis as supplemental Figure 1. We apologize for any confusion regarding the figure caption, however in our manuscript we do point out that our library growth in Figure 2B was repeated in triplicate in the figure caption, and the samples collected during that experiment were the ones used to generate the DMS data.

      Minor comments

      1. Line and page numbering should be added to the manuscript to facilitate the reviewing process.

      We have updated our manuscript to include line and page numbering.

      __ "Knockout of the entire ESX-3 operon leads to inhibited M. smegmatis growth in a low-iron environment. When individual components of the ESX-3 system are deleted, growth is only available under impaired if the additional siderophore exochelin formyltransferase fxbA is also knocked out20." First, a reference should be added to the first sentence. Second, Siegrist et al. did not exactly show this. They showed that the fxbA/eccC3 double mutant grows slower that the fxbA single mutant. To my knowledge there is no publication showing that single esx-3 component mutants grow as WT in iron-deficient conditions. Do the authors have data demonstrating this? If true, it is surprising that mutating EccD3 has a milder phenotype compared the complete region deletion, as it is a crucial ESX-3 component.__

      We apologize for any confusion. We had the relevant reference two lines prior, and have since added it to that sentence as well.

      The reviewer is correct that Siegrest et al did not show the effects of just ESX-3 component single deletions. However, Siegrest et al. 2009 demonstrated that deleting the entire ESX-3 operon results in growth similar to the wild type strain in low-iron media. In contrast, the fxbA single knockout exhibits a notable growth defect, and the fxbA/ESX-3 double knockout has an even more severe growth defect. Following the logic that a double knockout is needed to observe a growth defect in low-iron media, Siegrest et al. 2014 demonstrated this also extends to single ESX-3 component knockouts, such as the fxbA/eccD3 double knockout strain. To ensure clarity and accuracy, I will edit the sentence to say "When individual components of the ESX-3 system are deleted, growth is significantly impaired when the additional siderophore exochelin formyltransferase fxbA is also knocked out."

      __ Reference to Table 1, should be a reference to Table S1.__

      We have updated our manuscript to correct this reference.

      __ "Our heatmaps surprisingly reveal residues where substitutions are deleterious specifically in the iron-sufficient condition" Refer here to Fig. S2.__

      We have updated our manuscript to include this reference.

      __ "In the iron-deficient condition, 6/551 (1.08%) missense mutations have a weak LOF effect, and 0 have strong effects." More clearly explain this refers to the residues of the transmembrane region.__

      We have updated our manuscript to provide more clarity.

      __ "The MycP transmembrane helix has been hypothesized to be required for ESX complex specificity, targeting MycP to associate with the correct ESX homologue." I miss a reference here. And I thought that the transmembrane domain of MycP was required for complex stability not for specificity?__

      We thank the reviewer for pointing out our missing citation, and asking us to clarify our point. I believe the literature suggests that both the protease and transmembrane domains of MycP are required for both complex stability and specificity. van Winden et al. 2016 https://doi.org/10.1128/mbio.01471-16 show that MycP5 needs to be present for secretion. The protease activity can be abolished and the ESX-5 complex can still secrete and be pulled down, as seen by BN-PAGE. van Winden et al. 2019 https://doi.org/10.1074/jbc.RA118.007090 show that truncated mutants missing either the protease domain or the transmembrane domain cannot rescue ESX-5 secretion or complex stability in a MycP knockout strain. More relevant, they attempted to rescue MycP1 and MycP5 mutants by creating chimeric proteins that either had the MycP1 protease domain and MycP5 transmembrane domain, or the MycP5 protease domain and MycP1 transmembrane domain. If the protease and transmembrane domains were required for complex stability and NOT specificity, we would see MycP5 rescue ESX-1 secretion in the MycP1 mutant strains and vice versa. We would also see the chimera proteins rescue both ESX-1 and ESX-5 secretion and complex stability. Instead, we see that neither chimera rescued ESX-1 nor ESX-5 secretion or complex stability, implying that both MycP domains are necessary.

      We will amend our paper text to reference MycP's role in complex stability instead of specificity, and soften the language: "The MycP transmembrane helix has been shown to be required for ESX complex stability, as MycP knockouts and truncated mutants abolish ESX secretion and pulldowns of the entire complex."

      __ "....role in ESX function relating to EccB3 and EccC3. In the transmembrane, ..... we" Insert "region" after "transmembrane"__

      We have updated our manuscript to include this update.

      Significance

      The study provides insight into individual residues of a central component of the ESX-3 type VII secretion system for functionality, which is useful for those studying the functioning of mycobacterial type VII secretion systems. Moreover, because this system is essential for the growth of the important pathogen M. tuberculosis, this knowledge can be used to design new anti-tuberculosis compounds that block the ESX-3 system. Although the results mainly confirm previous observations (highlighting specific residues important for the stability of ubiquitin and residues of other parts of EccD important for protein-protein interactions within the ESX-3/ESX-5 membrane complex), to my knowledge this is the first time DMS has been applied to mycobacteria. This study is therefore of interest to mycobacteriologists.


      Reviewer #2

      __Evidence, reproducibility and clarity __

      This work provides valuable insights into EccD3 function, a core component of the ESX-3 secretion system. The strength of this study lies in the development of a robust functional assay for the systematic mapping of functionally relevant amino acids in EccD3. The approach could potentially be expanded to analyze other ESX-3 components but remains limited to the ESX-3 secretion system. 1. The authors engineered an M. smegmatis knockout strain with deletions of fxbA and eccD3. Deletion of fxbA renders the exocholin iron uptake system non-functional, forcing the bacteria to rely on siderophore-mediated iron uptake under iron-limiting conditions. This process, in turn, depends on ESX-3 secretion activity, as PPE4, a known ESX-3 substrate, has been previously implicated in iron utilization in M. tuberculosis (Tufariello et al., 2016). This experimental setup links EccD3 function to a growth phenotype under iron-limiting conditions, as mutations impairing ESX-3 secretion disrupt iron utilization and mycobacterial growth. 2. By complementing the knockout strain with a library of EccD3 mutant variants, the authors systematically identify residues essential for protein-protein interactions within the ESX-3 core complex. Structural analysis corroborates the functional relevance of these residues, specifically those mediating interactions between EccD3 and other ESX-3 components, or those disrupting the hydrophobic core of the EccD3 ubiquitin-like (Ubl) domain. 3. Structural comparisons with the MycP5-bound ESX-5 complex allow the authors to predict residues within EccD3 that may interact with MycP3 during ESX-3 core complex assembly. Furthermore, comparisons with the ESX-5 hexamer suggest residues that may stabilize or drive oligomerization of the ESX-3 dimer into its putative hexameric state. These insights are significant and provide testable hypotheses for future studies. 4. The methodology is limited to ESX-3. The authors exploit the essentiality of ESX-3 for siderophore-dependent growth under iron-limiting conditions. However, this functional readout cannot be directly transferred to other ESX systems (ESX-1, ESX-2, ESX-4, ESX-5), which have distinct substrates, biological roles, and regulatory mechanisms.

      Significance

      This work provides valuable insights into EccD3 function, a core component of the ESX-3 secretion system. The strength of this study lies in the development of a robust functional assay for the systematic mapping of functionally relevant amino acids in EccD3. The approach could potentially be expanded to analyze other ESX-3 components but remains limited to the ESX-3 secretion system.

      Thank you for your thoughtful and supportive feedback. We appreciate your time and effort in reviewing our study.


      Reviewer #3

      __Evidence, reproducibility and clarity __

      The manuscript by Trinidad et al. provides a deep mutational scanning (DMS) analysis to investigate the functional roles of residues from the EccD3 subunit of the Type VII ESX-3 secretion apparatus from M. smegmatis. A previously published structure of ESX-3 from M. smegmatis by the Rosenberg group (Oren Rosenberg is also an author of this paper) is used as basis for structural interpretation of the DMS data presented in this contribution. A shortcoming of the previous structure, despite being very rich in terms of structural details, was in the lack of hexameric pore formation, which has been established more recently by structures of the related ESX-5 system.

      Technically, DMS is state-of-the art and a powerful approach to systematically scan residues of potential functional interest. Therefore, the data presented here, provide a remarkable repository for further interpretation in this contribution and by other future investigations. The experimental data have been deposited in Github enabling access by others in the future.

      Overall, the paper would benefit from an improved overall organisation. I found in part hard to extract some of the main points from the way the data are presented. In essence, two separate screens were performed, the first one focusing on the EccD3 Ubl domain and adjacent linker regions and a second one on the EccD3 TM region. I think the paper could be better structured accordingly. Tables of residues with strong effects in iron-deficient and iron-sufficient media, together with their structural annotation, would facilitate extracting main messages from this manuscript. Without going too much in detail, there is also scope for improvement of most of the structural figures. More consistency in terms of color coding with the previous paper by Powileit et al. (2019) would also help navigation.

      A potential weakness of the paper is in the limited scope of interpretation of the data in the context of the dimeric ESX-3 assembly, which is actually acknowledged by the authors. Computational AI-based methods should allow generating a complete pore model of ESX-3, which would allow interpretation of some of the data in a more functional relevant context. This would enhance the validity of the current interpretations presented.

      We acknowledge the lack of a hexameric ESX-3 structure, and would love to base our analysis on such a structure. Unfortunately, experimentally purifying and determining such a structure is beyond the scope of this manuscript. While AI-based methods are certainly exciting and helpful to make sense of mutational data, they are not able to computationally predict such large structures. The AlphaFold3 server website is commonly used for these purposes and allows predictions of up to 5000 tokens (or amino acids). An ESX-3 hexamer would be composed of 6x EccB proteins (519 AA each), 6x EccC proteins (1326 AA each), 12x EccD proteins (476 AA each), and 6x EccE proteins (310 AA each). Together, this complex would be made up of 18,642 amino acids.

      We tried using alphafold to predict an ESX-5 dimer complex, as well as reproduce the ESX-3 dimer complex, and were unable to produce these structures. Each ESX protomer is assembled correctly, as each protein within the complex makes appropriate contacts with each other. We see the EccD-dimers still form the membrane vestibule within each ESX complex. The issue is the ESX dimer complex has not assembled correctly: the EccC transmembrane helix 1 of a protomer should interact with the EccB transmembrane helix of the neighboring protomer; and, the N-terminus of EccB in one protomer should interact with the loop between the EccD transmembrane helices 10 and 11 in the neighboring protomer. Instead, Alphafold creates contacts along the EccD proteins from both complexes. We have included a "top-down" view of the ESX-5 dimer, where the periplasmic domains of EccB have been cleaved off for clarity.

      A side view:

      Here we have the ESX-3 dimer structure published by Poweleit et al. side-by-side with the ESX-3 dimer predicted by alphafold, visualized in Pmyol. The alphafold structure largely has each proteins' domains and folds properly predicted, including even the EccD3 dimer found in each ESX protomer. However, the protomers are not assembled into a dimer properly as compared to the purified ESX-3 dimer from PDB: 6umm. We included a "front" and "side view", as well as a "top down" view where the cytoplasmic domains have been hidden for visual clarity.

      The use of full names and acronyms needs to be more consistent. As an example, the terms "ubiquitin-like" and ubiquitin-like (Ubl) and UBl are used in parallel throughout the manuscript. The percentages given in various places of the paper could be reduced to integers, as they generally relate to relatively small data sets. Please express numbers with a precision, reasonable matching expected statistical significance.

      We apologize for the lack of consistency in how we referred to the ubiquitin-like domain. I originally wrote "ubiquitin-like (Ubl)" once per section (intro, results, discussion). I have edited these all to just "Ubl" after the introduction, except for figure and section titles. We have also reduced our percentages to integers.

      Some of the DMS experiments have been repeated three-fold, which should be a minimal number to allow extracting statistical significance, other experiments have only been repeated two-fold. Could this be clarified, please?

      We apologize for this oversight, and thank the reviewer for pointing this out. All experiments were done in triplicate, the exception being the site-directed mutant growth curves, which were performed in duplicate. We have repeated this experiment in triplicate in response to this point. As we repeated this experiment, mutant R134A dropped out due to technical reasons, and so we did not include it in the updated growth curves.

      Specific comments on text and figures:

      Figure 1: The EM densities shown considerably deviate from those that were shown in the original publication by Poweleit et al (2019). If there is an aim is to reinterpret the data this needs to be described in sufficient technical detail. There may be a case for this, in light of recent advances in computational AI-vased structural biology.

      We acknowledge this may be confusing and we apologize for that, as the EM density I have shown in this manuscript uses the same map we used to create the one seen in the original publication Poweleit et al 2019. There are existing crystal structures of EccB1 and the ATPase domains of EccC1 that we used to create homology models of EccB3 and EccC3 using the structure-prediction software RaptorX for the 2019 publication. These homology models were then combined with a low resolution EM density to create the model seen in the 2019 eLife paper. I did not include those homology models in this manuscript, as I did not believe those predictions were relevant to this study. I wanted to include the highest resolution and thus most accurate depiction of our ESX-3 structure.

      Introduction, statement "We made comparisons to a prior DMS on ubiquitin to increase signal-to-noise in our interpretation of the Ubl domain mutagenesis data." Could this be further explained please? I could not find anything in addition in the Methods section and elsewhere.

      __ __We apologize for the confusion!

      EccD3 Ubl domain and ubiquitin DMS dataset comparisons

      To compare the DMS data of EccD3 Ubl with that of ubiquitin, we first identified homologous residues in each structure. This was achieved by aligning the EccD3 Ubl domain with ubiquitin (PDB: 1ubq) using PyMOL and assessing the positional correspondence of side chains (e.g., ubiquitin residue I3 aligned with EccD3 residue V12). Next, we referenced missense mutation datasets to calculate the average DMS score for each residue position in both proteins. We then generated a scatter plot to compare the average missense scores for ubiquitin and EccD3 Ubl using ggplot2. Data points were color-coded according to the functional roles assigned to ubiquitin, with residues forming the hydrophobic patch and core highlighted, while all other residues were represented in grey.

      Description of "vestibule" as a core feature of the ESX-3 structure. As mentioned above, this is very much a result of the presented dimeric arrangement. In the context of a complete pore model, these features may change or even disappear.

      While we would certainly welcome an ESX-3 hexamer model to definitively determine whether this feature persists, such a model is not currently available. However, the highly homologous ESX-5 complex retains these EccD vestibules, and there is no reason to believe these features would change or disappear. Therefore, based on our interpretation of the ESX-3 dimer and ESX-5 hexamer we believe that the EccD membrane vestibule is not just an artifact of the ESX-3 dimer complex.

      It is possible that the reviewer misunderstood what we were referring to as the vestibule. We updated the language in the text to improve clarity. However the vestibule is not a consequence of ESX-3 complex dimer formation. It is an inherent feature of the ESX monomer complexes, where two EccD proteins dimerize to form said vestibule. Furthermore, there is no evidence to suggest that this feature would be lost in a hexameric state.

      Structurally, the ESX-3 dimer consists of two ESX-3 monomer complexes, each containing one EccB, one EccC, one EccE, and two EccD proteins. Therefore, each ESX-3 monomer inherently includes an EccD dimer. The presence of the EccD dimer is not exclusive to the ESX-3 dimer but is a fundamental component of each ESX-3 complex. Similarly, the ESX-5 hexamer retains the EccD dimer within each ESX-5 complex, further supporting the idea that this structural feature is conserved.

      Figure 2, panel B: Isn't right that "positive" and "negative" need to exchanged? Perhaps, there is something I misunderstood.

      We apologize for the confusion, and appreciate the reviewer pointing out this inconsistency. We have updated the manuscript to correct this.

      Figure 2, panel F: it is hard to extract the assignments from the overlaid curves.

      We apologize for a lack of clarity in how this growth curve was presented. We have included labels at the end point to show where each sample is.

      Figure 3, caption "from low (red) to white (tolerant)": for the sake of consistency, please either put the color in parentheses, or functional description. Does this statement relate to panel A or B? "All other residues are colored white". I can't see this.

      We apologize for the inconsistency, and have updated this label. We hope we have clarified the fact that the entire structure is white except for the residues we colored red.

      Results text "In contrast to ubiquitin, all hydrophobic core residues in the EccD3 Ubl domain are equally intolerant to charged residue swaps. Unsurprisingly, residues important for ubiquitin's specific degradation interactions are not sensitive to substitutions in the EccD3 Ubl domain." Does this mean that proper folding of Ubl is less critical for ESX_3 function? Please elaborate on this further.

      We apologize for any confusion. Our data shows that residues which side chains extend into the hydrophobic core of the Ubl domain are intolerant to swaps to charge residues. We hypothesize these missense mutations disrupt this hydrophobic core, and lead to destabilization of this domain. These intolerant missense mutations each have negative Enrich2 scores, implying a loss of ESX-3 function, and that proper folding of the Ubl is critical for ESX-3 function. We have updated our text to clarify this point:

      Unsurprisingly, residues important for ubiquitin function's specific interactions are not sensitive to substitutions in the EccD3 Ubl domain. There is no simple discernable preference within the Ubl domain to any side that maintains protein-protein interactions, implying that the scores are dominated by stability effects and that the Ubl domain must maintain a stable β-grasp fold for ESX-3 function.

      Figure 4, panel C: the surface does not provide residue-specific information, hence this panel is not very informative.

      We agree with the reviewer that Figure 4 panel C was not very informative, and so we have removed it from Figure 4 for the sake of brevity.

      Results text "T148 extends out from transmembrane helix 1 into a hydrophobic pocket between transmembrane helices 1, 2, and 3." Could this please be illustrated in one of the structural presentations?

      We have updated figure 5 to include a snapshot of this residue and the hydrophobic pocket it extends into, as panel E.

      Results text, last paragraph, Figure 5C-D: interpretation of the experimental ESX-3 data based on ESX-5 models is problematic, without showing proof of conservation of relevant sequence/structural features. As mentioned above, I would encourage the authors to establish a hexameric ESX-3 model and interpret the data starting from there. Extrapolation of the interpretation of data to other ESX systems, including ESX-5, would expand the scope by generalization, which however would open another chapter. The ESX-5 structure does not explain e.g. why W227 when mutated is less sensitive to iron depletion as opposed to iron being present.

      We do not believe we can use AI to predict a hexameric ESX-3 model. We will update our supplement to include a figure showing proof of conservation between the EccD3 and EccD5 sequences. We can superpose the ESX-3 dimer structure onto the ESX-5 hexamer structure, and see that this dimeric complex overlays quite well on top of an ESX-5 subcomplex. We can imagine this hexamer as a trimer of dimers, where three copies of this dimeric complex interact to form the hexamer. The superposition is not perfect and there are slight rearrangements to different helices to allow for hexamer formation, but these do not imply we cannot compare these two homologous structures.

      We have included a new structure snapshot in Figure 5, where panel D is the ESX-3 dimer (PDB: 6umm) shown as a side and top-down view. This allows for a comparison with panel C, the snapshot of the ESX-5 complex (PDB: 7np7) where in two protomers the EccB, EccC, and EccD proteins are colored the same way as ESX-3, and the other ESX-5 protomers are colored white. Note that in this hexamer, EccE is missing. We see the EccD membrane vestibule is conserved in both structures.

      Significance

      Strength and Limitations: already assessed under "Evidence, reproducibility and clarity".

      There is scope for further interpretation using experimental structural and modeling data. There is also scope for applying complementary assays for selected mutants, most likely within a lower throughput format.

      Advance: The contribution demonstrates well the power of DMS for systematic screening, in the context of Type VII secretion. The main advance is in the raw data generated and deposited.

      Audience: microbiology with a specific interest in secretion, structural biology

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The manuscript by Trinidad et al. provides a deep mutational scanning (DMS) analysis to investigate the functional roles of residues from the EccD3 subunit of the Type VII ESX-3 secretion apparatus from M. smegmatis. A previously published structure of ESX-3 from M. smegmatis by the Rosenberg group (Oren Rosenberg is also an author of this paper) is used as basis for structural interpretation of the DMS data presented in this contribution. A shortcoming of the previous structure, despite being very rich in terms of structural details, was in the lack of hexameric pore formation, which has been established more recently by structures of the related ESX-5 system.

      Technically, DMS is state-of-the art and a powerful approach to systematically scan residues of potential functional interest. Therefore, the data presented here, provide a remarkable repository for further interpretation in this contribution and by other future investigations. The experimental data have been deposited in Github enabling access by others in the future.

      Overall, the paper would benefit from an improved overall organisation. I found in part hard to extract some of the main points from the way the data are presented. In essence, two separate screens were performed, the first one focusing on the EccD3 Ubl domain and adjacent linker regions and a second one on the EccD3 TM region. I think the paper could be better structured accordingly. Tables of residues with strong effects in iron-deficient and iron-sufficient media, together with their structural annotation, would facilitate extracting main messages from this manuscript. Without going too much in detail, there is also scope for improvement of most of the structural figures. More consistency in terms of color coding with the previous paper by Powileit et al. (2019) would also help navigation.

      A potential weakness of the paper is in the limited scope of interpretation of the data in the context of the dimeric ESX-3 assembly, which is actually acknowledged by the authors. Computational AI-based methods should allow generating a complete pore model of ESX-3, which would allow interpretation of some of the data in a more functional relevant context. This would enhance the validity of the current interpretations presented.

      The use of full names and acronyms needs to be more consistent. As an example, the terms "ubiquitin-like" and ubiquitin-like (Ubl) and UBl are used in parallel throughout the manuscript. The percentages given in various places of the paper could be reduced to integers, as they generally relate to relatively small data sets. Please express numbers with a precision, reasonable matching expected statistical significance.

      Some of the DMS experiments have been repeated three-fold, which should be a minimal number to allow extracting statistical significance, other experiments have only been repeated two-fold. Could this be clarified, please?

      Specific comments on text and figures:

      Figure 1: The EM densities shown considerably deviate from those that were shown in the original publication by Poweleit et al (2019). If there is an aim is to reinterpret the data this needs to be described in sufficient technical detail. There may be a case for this, in light of recent advances in computational AI-vased structural biology.

      Introduction, statement "We made comparisons to a prior DMS on ubiquitin to increase signal-to-noise in our interpretation of the Ubl domain mutagenesis data." Could this be further explained please? I could not find anything in addition in the Methods section and elsewhere.

      Description of "vestibule" as a core feature of the ESX-3 structure. As mentioned above, this is very much a result of the presented dimeric arrangement. In the context of a complete pore model, these features may change or even disappear.

      Figure 2, panel B: Isn't right that "positive" and "negative" need to exchanged? Perhaps, there is something I misunderstood.

      Figure 2, panel F: it is hard to extract the assignments from the overlaid curves.

      Figure 3, caption "from low (red) to white (tolerant)": for the sake of consistency, please either put the color in parentheses, or functional description. Does this statement relate to panel A or B? "All other residues are colored white". I can't see this.

      Results text "In contrast to ubiquitin, all hydrophobic core residues in the EccD3 Ubl domain are equally intolerant to charged residue swaps. Unsurprisingly, residues important for ubiquitin's specific degradation interactions are not sensitive to substitutions in the EccD3 Ubl domain." Does this mean that proper folding of Ubl is less critical for ESX_3 function? Please elaborate on this further.

      Figure 4, panel C: the surface does not provide residue-specific information, hence this panel is not very informative.

      Results text "T148 extends out from transmembrane helix 1 into a hydrophobic pocket between transmembrane helices 1, 2, and 3." Could this please be illustrated in one of the structural presentations?

      Results text, last paragraph, Figure 5C-D: interpretation of the experimental ESX-3 data based on ESX-5 models is problematic, without showing proof of conservation of relevant sequence/structural features. As mentioned above, I would encourage the authors to establish a hexameric ESX-3 model and interpret the data starting from there. Extrapolation of the interpretation of data to other ESX systems, including ESX-5, would expand the scope by generalization, which however would open another chapter. The ESX-5 structure does not explain e.g. why W227 when mutated is less sensitive to iron depletion as opposed to iron being present.

      Referee cross-commenting

      I especially second the comments of referee #1, major comments, point 3 (statistical significance of the data). Addressing this point is crucial for the paper. Referee #2, significance section "The approach could potentially be expanded to analyze other ESX-3 components but remains limited to the ESX-3 secretion system." I was considering making the same point but did not at the end. Of course, ultimately, it would be great if all components of ESX-3 could be analyzed they way it was done for the EccD3 component. However, I am afraid such exercise could become quite open ended. Already by now, there is some compromise on the depth of mechanistic interpretation in light of a large data set generated.

      Significance

      Strength and Limitations: already assessed under "Evidence, reproducibility and clarity".

      There is scope for further interpretation using experimental structural and modeling data. There is also scope for applying complementary assays for selected mutants, most likely within a lower throughput format.

      Advance: The contribution demonstrates well the power of DMS for systematic screening, in the context of Type VII secretion. The main advance is in the raw data generated and deposited.

      Audience: microbiology with a specific interest in secretion, structural biology

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This work provides valuable insights into EccD3 function, a core component of the ESX-3 secretion system. The strength of this study lies in the development of a robust functional assay for the systematic mapping of functionally relevant amino acids in EccD3. The approach could potentially be expanded to analyze other ESX-3 components but remains limited to the ESX-3 secretion system.

      1. The authors engineered an M. smegmatis knockout strain with deletions of fxbA and eccD3. Deletion of fxbA renders the exocholin iron uptake system non-functional, forcing the bacteria to rely on siderophore-mediated iron uptake under iron-limiting conditions. This process, in turn, depends on ESX-3 secretion activity, as PPE4, a known ESX-3 substrate, has been previously implicated in iron utilization in M. tuberculosis (Tufariello et al., 2016). This experimental setup links EccD3 function to a growth phenotype under iron-limiting conditions, as mutations impairing ESX-3 secretion disrupt iron utilization and mycobacterial growth.
      2. By complementing the knockout strain with a library of EccD3 mutant variants, the authors systematically identify residues essential for protein-protein interactions within the ESX-3 core complex. Structural analysis corroborates the functional relevance of these residues, specifically those mediating interactions between EccD3 and other ESX-3 components, or those disrupting the hydrophobic core of the EccD3 ubiquitin-like (Ubl) domain.
      3. Structural comparisons with the MycP5-bound ESX-5 complex allow the authors to predict residues within EccD3 that may interact with MycP3 during ESX-3 core complex assembly. Furthermore, comparisons with the ESX-5 hexamer suggest residues that may stabilize or drive oligomerization of the ESX-3 dimer into its putative hexameric state. These insights are significant and provide testable hypotheses for future studies.
      4. The methodology is limited to ESX-3. The authors exploit the essentiality of ESX-3 for siderophore-dependent growth under iron-limiting conditions. However, this functional readout cannot be directly transferred to other ESX systems (ESX-1, ESX-2, ESX-4, ESX-5), which have distinct substrates, biological roles, and regulatory mechanisms.

      Significance

      This work provides valuable insights into EccD3 function, a core component of the ESX-3 secretion system. The strength of this study lies in the development of a robust functional assay for the systematic mapping of functionally relevant amino acids in EccD3. The approach could potentially be expanded to analyze other ESX-3 components but remains limited to the ESX-3 secretion system.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This is a well-written manuscript that describes a thorough study of the functionality of individual residues of a central component of the ESX-3 type VII secretion system of Mycobacterium smegmatis, EccD3, in the essential role of this protein transport system in iron acquisition. Using the powerful and unbiased approach of deep mutational scanning (DMS), the authors assessed the impact of different mutations on a large number of residues of this component. This carefully executed research highlights the importance of hydrophobic residues at the center the ubiquitin-like domain, specific residues of the linker domain that connects this domain with the transmembrane domains and specific residues that connect EccD3 with the MycP3 component.

      Major comments

      Since the LOF effects in the iron-sufficient and iron-deficient condition differ less than expected, the differences of the DMS results between these two conditions should be better presented, explained and discussed:

      1. The authors discuss: "Of the 270 LOF mutations seen in the iron-deficient condition, 37 (13.7%) were tolerant in the iron sufficient condition, and 39 (14.44%) had strong LOF effects but weak LOF effects in the iron sufficient condition." Do the authors mean that 39 (14.44%) had strong LOF effects in the iron-deficient condition, but weak LOF effects in the iron-sufficient condition. In turn, does this mean that the remaining mutants (71.9%) had similar LOF effects in the two conditions?
      2. The diagonal shape of the scatter plot in Fig. 2C, which shows the correlation of the Enrich2 scores of all mutants in the two conditions, indicates that the growth of most mutants is affected similarly in these conditions, but in Fig. 2D lower graph, which shows only the Enrich2 scores of missense mutants, there are clear differences between the two conditions. How can this be explained?
      3. Regarding the authors' explanation for the observed LOF effects in the permissive condition, "This speaks to the sensitivity of next-generation sequencing compared to the strong differences observed between conditions in phenotypic growth curves." But this sensitivity does not explain the observed large LOF effects but no growth difference in the permissive condition, unless the analysis is less quantitative than expected? Could it be that there is local iron depletion in this mixed culture, causing selection pressure even in the iron-sufficient condition? Moreover, the severity of the growth defect at the time of sampling, i.e., after 24 hours of growth, is unclear. Indeed, the growth curve in Fig. 1 shows that the growth of the double mutant in iron-deficient conditions is significantly impaired at that timepoint. In the growth curve in Fig. 2B (and also slightly in Fig. 2F), however, the growth defect is less pronounced: the double mutant has a similar OD600 as the WT strain, although the error bar is larger. Is this variability between replicates also seen in the DMS analysis? In general, no statistics are shown for the DMS analysis and there is no information on the significance of the observed LOF effects. In addition, the legend should explain how many replicates the DMS data are based on.

      Minor comments

      1. Line and page numbering should be added to the manuscript to facilitate the reviewing process.
      2. "Knockout of the entire ESX-3 operon leads to inhibited M. smegmatis growth in a low-iron environment. When individual components of the ESX-3 system are deleted, growth is only available under impaired if the additional siderophore exochelin formyltransferase fxbA is also knocked out20." First, a reference should be added to the first sentence. Second, Siegrist et al. did not exactly show this. They showed that the fxbA/eccC3 double mutant grows slower that the fxbA single mutant. To my knowledge there is no publication showing that single esx-3 component mutants grow as WT in iron-deficient conditions. Do the authors have data demonstrating this? If true, it is surprising that mutating EccD3 has a milder phenotype compared the complete region deletion, as it is a crucial ESX-3 component.
      3. Reference to Table 1, should be a reference to Table S1.
      4. "Our heatmaps surprisingly reveal residues where substitutions are deleterious specifically in the iron-sufficient condition" Refer here to Fig. S2.
      5. "In the iron-deficient condition, 6/551 (1.08%) missense mutations have a weak LOF effect, and 0 have strong effects." More clearly explain this refers to the residues of the transmembrane region.
      6. "The MycP transmembrane helix has been hypothesized to be required for ESX complex specificity, targeting MycP to associate with the correct ESX homologue." I miss a reference here. And I thought that the transmembrane domain of MycP was required for complex stability not for specificity?
      7. "....role in ESX function relating to EccB3 and EccC3. In the transmembrane, ..... we" Insert "region" after "transmembrane"

      Significance

      The study provides insight into individual residues of a central component of the ESX-3 type VII secretion system for functionality, which is useful for those studying the functioning of mycobacterial type VII secretion systems. Moreover, because this system is essential for the growth of the important pathogen M. tuberculosis, this knowledge can be used to design new anti-tuberculosis compounds that block the ESX-3 system. Although the results mainly confirm previous observations (highlighting specific residues important for the stability of ubiquitin and residues of other parts of EccD important for protein-protein interactions within the ESX-3/ESX-5 membrane complex), to my knowledge this is the first time DMS has been applied to mycobacteria. This study is therefore of interest to mycobacteriologists.

    1. Reviewer #1 (Public review):

      Summary:

      Fluorescence imaging has become an increasingly popular technique for monitoring neuronal activity and neurotransmitter concentrations in the living brain. However, factors such as brain motion and changes in blood flow and oxygenation can introduce significant artifacts, particularly when activity-dependent signals are small. Yogesh et al. quantified these effects using GFP, an activity-independent marker, under two-photon and wide-field imaging conditions in awake behaving mice. They report significant GFP responses across various brain regions, layers, and behavioral contexts, with magnitudes comparable to those of commonly used activity sensors. These data highlight the need for robust control strategies and careful interpretation of fluorescence functional imaging data.

      Strengths:

      The effect of hemodynamic occlusion in two-photon imaging has been previously demonstrated in sparsely labeled neurons in V1 of anesthetized animals (see Shen and Kara et al., Nature Methods, 2012). The present study builds on these findings by imaging a substantially larger population of neurons in awake, behaving mice across multiple cortical regions, layers, and stimulus conditions. The experiments are extensive, the statistical analyses are rigorous, and the results convincingly demonstrate significant GFP responses that must be accounted for in functional imaging experiments.

      In the revised version, the authors have provided further methodological details that were lacking in the previous version, expanded discussions regarding alternative explanations of these GFP responses as well as potential mitigation strategies. They also added a quantification of brain motion (Fig. S5) and the fraction of responsive neurons when conducting the same experiment using GCaMP6f (Fig. 3D-3F), among other additional information.

      Weaknesses:

      (1) The authors have now included a detailed methodology for blood vessel area quantification, where they detect blood vessels as dark holes in GFP images and measure vessel area by counting pixels below a given intensity threshold (line 437-443). However, this approach has a critical caveat: any unspecific decrease in image fluorescence will increase the number of pixels below the threshold, leading to an apparent increase in blood vessel area, even when the actual vessel size remains unchanged. As a result, this method inherently introduces a positive correlation between fluorescence decrease and vessel dilation, regardless of whether such a relationship truly exists.

      To address this issue, I recommend labelling blood vessels with an independent marker, such as a red fluorescence dye injected into the bloodstream. This approach would allow vessel dilation to be assessed independently of GFP fluorescence -- dilation would cause opposite fluorescence changes in the green and red channels (i.e., a decrease in green due to hemodynamic occlusion and an increase in red due to the expanding vessel area). In my opinion, only when such ani-correlation is observed can one reliably infer a relationship between GFP signal changes and blood vessel dynamics.

      Because this relationship is central to the author's conclusion regarding the nature of the observed GFP signals, including this experiment would greatly strengthen the paper's conclusion.

      (2) Regarding mitigation strategy, the authors advocate repeating key functional imaging experiments using GFP, and state that their aim here is to provide a control for their 2012 study (Keller et al., Neuron). Given this goal, I find it important to discuss how these new findings impact the interpretation of their 2012 results, particularly given the large GFP responses observed.

      For example, Keller et al. (2012) concluded that visuomotor mismatch strongly drives V1 activity (Fig. 3A in that study). However, in the present study, mismatch fails to produce any hemodynamic/GFP response (Fig. 3A, 3B, rightmost bar), and the corresponding calcium response is also the weakest among the three tested conditions (Fig. 3D). How do these findings affect their 2012 conclusions?

      Similarly, the present study shows that GFP reveals twice as many responsive neurons as GCaMP during locomotion (Fig. 3A vs. Fig. 3D, "running"). Does this mean that their 2012 conclusions regarding locomotion-induced calcium activity need reconsideration? Given that more neurons responded with GFP than with GCaMP, the authors should clarify whether they still consider GCaMP a reliable tool for measuring brain activity during locomotion.

      (3) More generally, the author should discuss how functional imaging data should be interpreted going forward, given the large GFP responses reported here. Even when key experiments are repeated using GFP, it is not entirely clear how one could reliably estimate underlying neuronal activity from the observed GFP and GCaMP responses.

      For example, consider the results in Fig. 3A vs. 3D: how should one assess the relative strength of neuronal activity elicited by running, grating, or visuomotor mismatch? Does mismatch produce the strongest neuronal activity, since it is least affected by the hemodynamic/GFP confounds (Fig. 3A)? Or does mismatch actually produce the weakest neuronal activity, given that both its hemodynamic and calcium responses are the smallest?

      In my opinion, such uncertainty makes it difficult to robustly interpret functional imaging results. Simply repeating experiments with GFP does not fully resolve this issue, as it does not provide a clear framework for quantifying the underlying neuronal activity. Does this suggest a need for a better mitigation strategy? What could these strategies be?

      In my opinion, addressing these questions is critical not only for the authors' own work but also for the broader field to ensure a robust and reliable interpretation of functional imaging data.

      (4) The authors now discuss various alternative sources of the observed GFP signals. However, I feel that they often appear to dismiss these possibilities too quickly, rather than appreciating their true potential impacts (see below).

      For example, the authors argue that brain movement cannot explain their data, as movement should only result in a decrease in observed fluorescence. However, while this might hold for x-y motion, movement in the axial (z) direction can easily lead to both fluorescence increase and decrease. Neurons are not always precisely located at the focal plane -- some are slightly above or below. Axial movement in a given direction will bring some cells into focus while moving others out of focus, leading to fluorescence changes in both directions, exactly as observed in the data (see Fig. S2).

      Furthermore, the authors state that they discard data with 'visible' z-motion. However, subtle axial movements that escape visual detection could still cause fluorescence fluctuations on the order of a few percent, comparable to the reported signal amplitudes.

      Finally, the authors state that "brain movement kinematics are different in shape than the GFP responses we observe". However, this appears to contradict what they show in Fig. 2A. Specifically, the first example neuron exhibits fast GFP transients locked to running onset, with rapid kinematics closely matching the movement speed signals in Fig. S5A. These fast transients are incompatible with slower blood vessel area signals (Fig. 4), suggesting that alternative sources could contribute significantly.

      In sum, the possibility that alternative signal sources could significantly contribute should be taken seriously and more thoroughly discussed.

      (5) The authors added a quantification of brain movement (Fig. S5) and claim that they "only find detectable brain motion during locomotion onsets and not the other stimuli." However, Fig. S5 presents brain 'velocity' rather than 'displacement'. A constant (non-zero) velocity in Fig. S5 B-D indicates that the brain continues to move over time, potentially leading to significant displacement from its initial position across all conditions. While displacement in the x-y plane are corrected, similar displacement in the z direction likely occurs concurrently and cannot be easily accounted for. To assess this possibility, the authors should present absolute displacement relative to pre-stimulus frames, as displacement -- not velocity -- determines the size of movement-related fluorescence changes.

      (6) In line 132-133, the authors draw an analogy between the effect of hemodynamic occlusion and liquid crystal display (LCD) function. However, there are fundamental differences between the two. LCDs modulate light transmission by rotating the polarization of light, which then passes through a crossed polarizer. In contrast, hemodynamic occlusion alters light transmission by changing the number and absorbance properties of hemoglobin. Additionally, LCDs do not involve 'emission' light - back-illumination travels through the liquid crystal layer only once, whereas hemodynamic occlusion affects both incoming excitation light and the emitted fluorescence. Given these fundamental differences, the LCD analogy may not be entirely appropriate.

    2. Reviewer #2 (Public review):

      - Approach

      In this study, Yogesh et al. aimed at characterizing hemodynamic occlusion in two photon imaging, where its effects on signal fluctuations are underappreciated compared to that in wide field imaging and fiber photometry. The authors used activity-independent GFP fluorescence, GCaMP and GRAB sensors for various neuromodulators in two-photon and widefield imaging during a visuomotor context to evaluate the extent of hemodynamic occlusion in V1 and ACC. They found that the GFP responses were comparable in amplitude to smaller GCaMP responses, though exhibiting context-, cortical region-, and depth-specific effects. After quantifying blood vessel diameter change and surrounding GFP responses, they argued that GFP responses were highly correlated with changes in local blood vessel size. Furthermore, when imaging with GRAB sensors for different neuromodulators, they found that sensors with lower dynamic ranges such as GRAB-DA1m, GRAB-5HT1.0, and GRAB-NE1m exhibited responses most likely masked by the hemodynamic occlusion, while a sensor with larger SNR, GRAB-ACh3.0, showed much more distinguishable responses from blood vessel change. They thoroughly investigate other factors that could contribute to these signals and demonstrate hemodynamic occlusion is the primary cause.

      - Impact of revision

      This is an important update to the initial submission, adding much supplemental imaging and population data that provide greater detail to the analyses and increase the confidence in the authors conclusions.

      Specifically, inclusion of the supplemental figures 1 and 2 showing GFP expression across multiple regions and the fluorescence changes of thousands of individual neurons provides a clearer picture of how these effects are distributed across the population. Characterization of brain motion across stimulation conditions in supplemental figure 5 provides strong evidence that the fluorescence changes observed in many of the conditions are unlikely to be primarily due to brain motion associated imaging artifacts. The role of vascular area on fluorescence is further supported by addition of new analyses on vasoconstriction leading to increased fluorescence in Figures 4C1-4, complementing the prior analyses of vasodilation.

      The expansion of the discussion on other factors that could lead to these changes is thorough and welcome. The arguments against pH playing a factor in fluorescence changes of GFP, due to insensitivity to changes in the expected pH range are reasonable, as are the other discussed potential factors.

      With respect to the author's responses to prior critique, we agree that activity dependent hemodynamic occlusion is best investigated under awake conditions. Measurement of these dynamics under anesthesia could lead to an underestimation of their effects. Isoflurane anesthesia causes significant vasodilation and a large reduction in fluorescence intensity in non-functional mutant GRABs. This could saturate or occlude activity dependent effects.

      - Strengths

      This work is of broad interest to two photon imaging users and GRAB developers and users. It thoroughly quantifies the hemodynamic driven GFP response and compares it to previously published GCaMP data in a similar context, and illustrates the contribution of hemodynamic occlusion to GFP and GRAB responses by characterizing the local blood vessel diameter and fluorescence change. These findings provide important considerations for the imaging community and a sobering look at the utility of these sensors for cortical imaging.

      Importantly, they draw clear distinctions between the temporal dynamics and amplitude of hemodynamic artifacts across cortical regions and layers. Moreover, they show context dependent (Dark versus during visual stimuli) effects on locomotion and optogenetic light-triggered hemodynamic signals.

      The authors suggest that signal to noise ratio of an indicator likely affects the ability to separate hemodynamic response from the underlying fluorescence signal. With a new analysis (Supplemental Figure 4) They show that the relative degree of background fluorescence does not affect the size of the artifact.

      Most of the first generation neuromodulator GRAB sensors showed relatively small responses, comparable to blood vessel changes in two photon imaging, which emphasizes a need for improved the dynamic range and response magnitude for future sensors and encourages the sensor users to consider removing hemodynamic artifacts when analyzing GRAB imaging data.

      - Weaknesses

      The largest weakness of the paper remains that, while they convincingly quantify hemodynamic artifacts across a range of conditions, they provide limited means of correcting for them. However they now discuss the relative utility of some hemodynamic correction methods (e.g. from Ocana-Santero et al., 2024).

      The paper attributes the source of 'hemodynamic occlusion' primarily to blood vessel dilation, but leaves unanswered how much may be due to shifts in blood oxygenation. Figure 4 directly addresses the question of how much of the signal can be attributed to occlusion by measuring the blood vessel dilation, and has been improved by now showing positive fluorescence effects with vasoconstriction. They now also discuss the potential impact of oxygenation.

      Along these lines, the authors carefully quantified the correlation between local blood vessel diameter and GFP response (or neuropil fluorescence vs blood vessel fluorescence with GRAB sensors). We are left to wonder to what extent does this effect depend on proximity to the vessels? Do GFP/ GRAB responses decorrelate from blood vessel activity in neurons further from vessels (refer to Figure 5A and B in Neyhart et al., Cell Reports 2024)? The authors argue that the primary impact of occlusion is from blood vessels above the plane of imaging, but without a vascular reconstruction, their evidence for this is anecdotal.

      The choice of ACC as the frontal region provides a substantial contrast in location, brain movement, and vascular architecture as compared to V1. As the authors note, ACC is close to the superior sagittal sinus and thus is the region where the largest vascular effects are likely to occur. A less medial portion of M2 may have been a more appropriate comparison. The authors now include example imaging fields for ACC and interesting out-of-plane vascular examples in the supplementary figures that help assess these impacts.

      -Overall Assessment

      This paper is an important contribution to our understanding of how hemodynamic artifacts may corrupt GRAB and calcium imaging, even in two-photon imaging modes. While it would be wonderful if the authors were able to demonstrate a reliable way to correct for hemodynamic occlusion which did not rely on doing the experiments over with a non-functional sensor or fluorescent protein, the careful measurement and reporting of the effects here is, by itself, a substantial contribution to the field of neural activity imaging. It's results are of importance to anyone conducting two-photon or widefield imaging with calcium and GRAB sensors and deserves the attention of the broader neuroscience and in-vivo imaging community.

    1. Reviewer #2 (Public review):

      Summary:

      In this manuscript, the authors investigate the optical properties of brochosomes produced by leafhoppers. They hypothesize that brochosomes reduce light reflection on the leafhopper's body surface, aiding in predator avoidance. Their hypothesis is supported by experiments involving jumping spiders. Additionally, the authors employ a variety of techniques including micro-UV-Vis spectroscopy, electron microscopy, transcriptome and proteome analysis, and bioassays. This study is highly interesting, and the experimental data is well-organized and logically presented.

      Strengths:

      The use of brochosomes as a camouflage coating has been hypothesized since 1936 (R.B. Swain, Entomol. News 47, 264-266, 1936) with evidence demonstrated by similar synthetic brochosome systems in a number of recent studies (S. Yang, et al. Nat. Commun. 8:1285, 2017; L. Wang, et al., PNAS. 121: e2312700121, 2024). However, direct biological evidence or relevant field studies have been lacking to directly support the hypothesis that brochosomes are used for camouflage. This work provides the first biological evidence demonstrating that natural brochosomes can be used as a camouflage coating to reduce the leafhoppers' observability to their predators. The design of the experiments is novel.

      Weaknesses:

      (1) The observation that brochosome coatings become sparse after 25 days in both male and female leafhoppers, resulting in increased predation by jumping spiders, is intriguing. However, since leafhoppers consistently secrete and groom brochosomes, it would be beneficial to explore why brochosomes become significantly less dense after 25 days.

      (2) The authors demonstrate that brochosome coatings reduce UV (specular) reflection compared to surfaces without brochosomes, which can be attributed to the rough geometry of brochosomes as discussed in the literature. However, it would be valuable to investigate whether the proteins forming the brochosomes are also UV absorbing.

      (3) The experiments with jumping spiders show that brochosomes help leafhoppers avoid predators to some extent. It would be beneficial for the authors to elaborate on the exact mechanism behind this camouflage effect. Specifically, why does reduced UV reflection aid in predator avoidance? If predators are sensitive to UV light, how does the reduced UV reflectance specifically contribute to evasion?

      (4) An important reference regarding the moth-eye effect is missing. Please consider including the following paper: Clapham, P. B., and M. C. Hutley. "Reduction of lens reflection by the 'Moth Eye' principle." Nature 244: 281-282 (1973).

      (5) The introduction should be revised to accurately reflect the related contributions in literature. Specifically, the novelty of this work lies in the demonstration of the camouflage effect of brochosomes using jumping spiders, which is verified for the first time in leafhoppers. However, the proposed use of brochosome powder for camouflage was first described by R.B. Swain (R.B. Swain, Notes on the oviposition and life history of the leafhopper Oncometopta undata Fabr. (Homoptera: Cicadellidae), Entomol. News. 47: 264-266 (1936)). Recently, the antireflective and potential camouflage functions of brochosomes were further studied by Yang et al. based on synthetic brochosomes and simulated vision techniques (S. Yang, et al. "Ultra-antireflective synthetic brochosomes." Nature Communications 8: 1285 (2017)). Later, Lei et al. demonstrated the antireflective properties of natural brochosomes in 2020 (C.-W. Lei, et al., "Leafhopper wing-inspired broadband omnidirectional antireflective embroidered ball-like structure arrays using a nonlithography-based methodology." Langmuir 36: 5296-5302 (2020)). Very recently, Wang et al. successfully fabricated synthetic brochosomes with precise geometry akin to those natural ones, and further elucidated the antireflective mechanisms based on the brochosome geometry and their role in reducing the observability of leafhoppers to their predators (L. Wang et al. "Geometric design of antireflective leafhopper brochosomes." Proceedings of the National Academy of Sciences 121: e2312700121 (2024)).

      Comments on revisions:

      In this revision, the authors have addressed some of the key concerns I raised in our previous comments. However, a few issues remain unaddressed. Additionally, the new experimental data introduced in the manuscript require further clarification, which I outline below.

      (1) As I pointed out in my previous review comments, "The use of brochosomes as a camouflage coating has been hypothesized since 1936 (R.B. Swain, Entomol. News 47, 264-266, 1936) with evidence demonstrated by similar synthetic brochosome systems in a number of recent studies (S. Yang, et al. Nat. Commun. 8:1285, 2017; L. Wang, et al., PNAS. 121: e2312700121, 2024). However, direct biological evidence or relevant field studies have been lacking to directly support the hypothesis that brochosomes are used for camouflage." While the authors did cite the original hypothesis proposed by R.B. Swain (1936), they have omitted important references that provide evidence on the use of antireflective properties of brochosomes for camouflage in a synthetic setting (see for example, Fig. 5a of S. Yang, et al. Nat. Commun. 8:1285, 2017). The authors are recommended to revise the Abstract and Introduction accordingly to ensure a fair and accurate representation of the existing literature.

      (2) The antireflection mechanisms of brochosome structures have been discussed in detail, specifically, how their geometries (i.e., brochosome diameter and pore size) contribute to reducing UV reflectance (L. Wang, et al., PNAS. 121: e2312700121, 2024 and P. Banergee, et al., Advanced Photonics Research 4:2200343, 2023). The authors should incorporate these recent findings into their discussion (line 381 - line 383 of the manuscript).

      (3) The authors presented new data brochosomes deposited on a quartz slide and measured their reflectance across UV, visible light, and infrared wavelengths. Since reflectance is highly sensitive to the uniformity of brochosome coverage on the substrate, it is crucial to quantify this coverage across the measurement area for comparison. While the authors include SEM images to illustrate the packing of brochosomes on both the leafhopper wing and the quartz substrate (Fig. S7) at a microscopic scale (~10 um view), it would be beneficial to also provide SEM images at a larger scale (e.g., 100 um - 1 mm) and quantify the density of brochosomes per unit area for comparison.

      (4) For the negative control using acetone to remove the brochosomes the leafhopper wing, have the authors confirmed the absence of brochosomes after treatment? If so, the authors should explicitly indicate this for clarity.

    2. Author response:

      The following is the authors’ response to the original reviews

      Reviewer #1 (Public review):

      Summary:

      Evading predation is of utmost importance for most animals and camouflage is one of the predominant mechanisms. Wu et al. set out to test the hypothesis of a unique camouflage system in leafhoppers. These animals coat themselves with brochosomes, which are spherical nanostructures that are produced in the Malpighian tubules and are distributed on the cuticle after eclosion. Based on previous findings on the reflectivity properties of brochosomes, the authors provide very good evidence that these nanostructures indeed reduce the reflectivity of the animals thereby reducing predation by jumping spiders. Further, they identify four proteins, which are essential for the proper development and function of brochosomes. In RNAi experiments, the regular brochosome structure is lost, the reflectivity reduced and the respective animals are prone to increased predation. Finally, the authors provide some phylogenetic sequence analyses and speculate about the evolution of these essential genes.

      Strengths:

      The study is very comprehensive including careful optical measurements, EM and TM analysis of the nanoparticles and their production line in the malphigian tubules, in vivo predation tests, and knock-down experiments to identify essential proteins. Indeed, the results are very convincingly in line with the starting hypothesis such that the study robustly assigns a new biological function to the brochosome coating system.

      A key strength of the study is that the biological relevance of the brochosome coating is convincingly shown by an in vivo predation test using a known predator from the same habitat.

      Another major step forward is an RNAi screen, which identified four proteins, which are essential for the brochosome structure (BSMs). After respective RNAi knock-downs, the brochosomes show curious malformations that are interesting in terms of the self-assembly of these nanostructures. The optical and in vivo predation tests provide excellent support for the model that the RNAi knock-down leads to a change of brochosomes structure, which reduces reflectivity, which in turn leads to a decrease of the antipredatory effect.

      Thank you very much for your positive feedback and insightful comments on our manuscript. We are delighted that you acknowledge the efforts we have made in studying the components and functions of Brochosomal proteins. We have carefully considered your suggestions and have thoroughly revised the manuscript to address the shortcomings identified in our original submission. We hope that the revised version meets with your approval. Below, please find our detailed point-by-point responses.

      Weaknesses:

      The reduction of reflectivity by aberrant brochosomes or after ageing is only around 10%. This may seem little to have an effect in real life. On the other hand, the in vivo predation tests confirm an influence. Hence, this is not a real weakness of the study - just a note to reconsider the wording for describing the degree of reflectivity.

      Thank you for your valuable suggestions. Based on your recommendations, we have revised the manuscript accordingly. Although the absolute reduction in light reflection due to Brochosomal coverage is approximately 10%, the relative decrease in light reflection on the leafhopper's surface is nearly 30%. Specifically, in the ultraviolet region, the reflection is reduced from about 30% to 20%, and in the visible light region, it is reduced from 20% to 10%. For detailed revisions, please refer to lines 151-156 of the revised manuscript.

      The single gene knockdowns seemed to lead to a very low penetrance of malformed brochosomes (Figure Supplement 3). Judging from the overview slides, less than 1% of brochosomes may have been affected. A quantification of regular versus abnormal particles in both, wildtype and RNAi treatments would have helped to exclude that the shown aberrant brochosomes did not just reflect a putative level of "normal" background defects. Of note, the quadruple knock-down of all BSMs seemed to lead to a high penetrance (Figure 4), which was already reflected in the microtubule production line. While the data shown are convincing, a quantification might strengthen the argument.

      While the RNAi effects seemed to be very specific to brochosomes and therefore very likely specific, an off-target control for RNAi was still missing. Finding the same/similar phenotype with a non-overlapping dsRNA fragment in one off-target experiment is usually considered required and sufficient. Further, the details of the targeted sequence will help future workers on the topic.

      Thank you for your valuable suggestions. Based on your recommendations, we have synthesized dsRNA targeting two non-overlapping regions of the coding sequences for four Brochosomal structural protein genes. These dsRNAs were injected individually and in combination for each gene. Our RNAi experiments for each BSM gene demonstrated that both individual and combined injections significantly suppressed the expression of the target genes, with the combined injection yielding slightly better silencing efficiency. Statistical analysis of the SEM observations revealed that the combined injection of dsRNAs targeting two non-overlapping regions led to a 60-70% reduction in the surface area coverage of Brochosomes. Additionally, approximately 20% of the remaining Brochosomes exhibited significant morphological changes. For detailed revisions, please refer to lines 199-211 of the revised manuscript, as well as Figures 3A and 3C, and Supplementary Figures 4 and 5.

      The main weakness in the current manuscript may be the phylogenetic analysis and the model of how the genes evolved. Several aspects were not clearly or consistently stated such that I felt unsure about what the authors actually think. For instance: Are all the 4 BSMs related to each other or only BSM2 and 3? If so, not only BSM2 and 3 would be called "paralogs" but also the other BSMs. If they were all related, then a phylogenetic tree including all BSMs should be shown to visualize the relatedness (including the putative ancestral gene if that is the model of the authors). Actually, I was not sure about how the authors think about the emergence of the BSMs. Are they real orphan genes (i.e. not present outside the respective clade) or was there an ancestral gene that was duplicated and diverged to form the BSMs? Where in the phylogeny does the first of the BSMs or ancestral proteins emerge (is the gene found in Clastoptera arizonana the most ancestral one?)? Maybe, the evolution of the BSMs would have to be discussed individually for each gene as they show somewhat different patterns of emergence and loss (BSM4 present in all species, the others with different degrees of phylogenetic restriction).

      Thank you very much for your constructive feedback on our phylogenetic analysis and the modeling of gene evolution. We fully agree with your insights and acknowledge that the evolutionary analysis of BSM genes remains somewhat ambiguous. This ambiguity is primarily due to the limited research on the precise structural protein composition of Brochosomes. While proteomics studies have analyzed and discussed the structural proteins of Brochosomes, the accurate composition of these proteins is still poorly understood. In this study, we identified four BSM proteins, but given the intricate structure of Brochosomes as proteinaceous spheres, we believe there may be additional BSM genes that have not yet been identified. Moreover, despite the presence of over ten thousand species within the Cicadomorpha, only three species have genome sequences available, and fewer than a hundred species have transcriptome sequencing data. The scarcity of research on Brochosomes, as well as the limited availability of genomic and transcriptomic data, poses significant challenges for our phylogenetic analysis and understanding of BSM gene evolution.

      Based on your suggestions, we have revised the manuscript accordingly. Specifically, we have updated Figure 5C by including ten additional species from Cereopoidea, Cicadoidea, and Fulgoroidea to better illustrate that BSM genes are true orphan genes. We have also added a phylogenetic tree of BSM genes within Cicadidae in Supplementary Figure 3. Additionally, we have expanded the discussion of BSM gene evolution in the manuscript (lines 503-556). For detailed revisions, please refer to Figure 5C, Supplementary Figure 3, and lines 507-585 of the revised manuscript.

      Related to these questions I remained unsure about some details in Figure 5. On what kind of analysis is the phylogeny based? Why are some species not colored, although they are located on the same branch as colored ones? What is the measure for homology values - % identity/similarity? The homology labels for Nephotetix cincticeps and N. virescens seem to be flipped: the latter is displayed with 100% identity for all genes with all proteins while the former should actually show this. As a consequence of these uncertainties, I could not fully follow the respective discussion and model for gene evolution.

      Thank you very much for your insightful comments and suggestions. We have carefully considered your feedback and have thoroughly revised our manuscript accordingly. Specifically, we have enhanced the description of the phylogenetic analysis process to provide greater clarity and transparency, with the detailed methods now included in lines 789-798. Regarding Figure 5C, we appreciate your attention to the coloring scheme. We would like to clarify that the family Cicadellidae comprises 25 subfamilies, many of which are represented by only one species in our figure. To ensure clarity and meaningful representation, we have chosen to color only those subfamilies with more than three species, thereby avoiding visual clutter and emphasizing the most relevant taxonomic groups. Additionally, we have corrected the inverted homology labels for Nephotetix cincticeps and Nephotetix virescens to ensure the accuracy and consistency of our data presentation.

      Conclusion:

      The authors successfully tested their hypothesis in a multidisciplinary approach and convincingly assigned a new biological function to the brochosomes system. The results fully support their claims - only the quantification of the penetrance in the RNAi experiments would be helpful to strengthen the point. The author's analysis of the evolution of BSM genes remained a bit vague and I remained unsure about their respective conclusions.

      The work is a very interesting study case of the evolutionary emergence of a new system to evade predators. Based on this study, the function of the BSM genes could now be studied in other species to provide insights into putative ancestral functions. Further, studying the self-assembly of such highly regular complex nano-structures will be strongly fostered by the identification of the four key structural genes.

      Reviewer #1 (Recommendations for the authors):

      Main manuscript:

      Please consider the annotated pdf with suggestions for wording and comments at the authors' discretion:

      Thank you very much for your detailed suggestions and comments provided in the annotated PDF. We have carefully reviewed each of your points and have revised the manuscript accordingly. All changes have been highlighted in red text for your convenience. The revised manuscript with tracked changes is available for your review. We believe these revisions have improved the clarity and quality of our manuscript. Thank you again for your valuable feedback.

      Supplementary Figure 2 C:

      Y-axes:

      - label: "surface coverage in %"

      - there are different scale values for the different days (e.g. 80-105 for day 5 and 0-80 at day 25). As a comparison between days is interesting, it would help to have the same scale values for all. That would show the decrease more intuitively.

      Thank you very much for your suggestion regarding the Y-axis in Supplementary Figure 2C. We agree that using a consistent scale across all time points is essential for clear and intuitive comparison. In the revised manuscript, we have standardized the Y-axis scale for Supplementary Figure 2C to a uniform range of 0-100% for all days. This change allows for a more straightforward visualization of the decreasing trend in surface coverage over time.

      Reviewer #3 (Public review):

      Summary:

      In this manuscript, the authors investigate the optical properties of brochosomes produced by leafhoppers. They hypothesize that brochosomes reduce light reflection on the leafhopper's body surface, aiding in predator avoidance. Their hypothesis is supported by experiments involving jumping spiders. Additionally, the authors employ a variety of techniques including micro-UV-Vis spectroscopy, electron microscopy, transcriptome and proteome analysis, and bioassays. This study is highly interesting, and the experimental data is well-organized and logically presented.

      Strengths:

      The use of brochosomes as a camouflage coating has been hypothesized since 1936 (R.B. Swain, Entomol. News 47, 264-266, 1936) with evidence demonstrated by similar synthetic brochosome systems in a number of recent studies (S. Yang, et al. Nat. Commun. 8:1285, 2017; L. Wang, et al., PNAS. 121: e2312700121, 2024). However, direct biological evidence or relevant field studies have been lacking to directly support the hypothesis that brochosomes are used for camouflage. This work provides the first biological evidence demonstrating that natural brochosomes can be used as a camouflage coating to reduce the leafhoppers' observability of their predators. The design of the experiments is novel.

      We are extremely grateful for your positive feedback and insightful comments on our manuscript. We are delighted that you have recognized the efforts we have put into our research on how brochosomes serve as a camouflage coating to reduce the detectability of leafhoppers to their predators. We have carefully considered your suggestions and have thoroughly revised the manuscript to address the shortcomings of the original version. We hope that the revised version meets with your approval. Below, please find our detailed point-by-point responses.

      Weaknesses:

      (1) The observation that brochosome coatings become sparse after 25 days in both male and female leafhoppers, resulting in increased predation by jumping spiders, is intriguing. However, since leafhoppers consistently secrete and groom brochosomes, it would be beneficial to explore why brochosomes become significantly less dense after 25 days.

      Thank you very much for your valuable suggestions. We appreciate your interest in the reduction of brochosomal density on the surface of leafhoppers after 25 days.We believe that the primary reason for the decreased density of brochosomes on the leafhopper surface after 25 days is the reduced synthesis and secretion of brochosomes. The Malpighian tubules are the main sites for brochosome synthesis. As shown in Figure 2D and Supplementary Figure 1, the thick glandular segments of the Malpighian tubules in both male and female leafhoppers begin to atrophy 15 days after reaching adulthood. This indicates a gradual decline in brochosome synthesis and secretion after day 15 of adulthood. Following your suggestion, we have revised the discussion section of the manuscript to elaborate on this observation. The detailed changes can be found in lines 474-491 of the revised manuscript.

      (2) The authors demonstrate that brochosome coatings reduce UV (specular) reflection compared to surfaces without brochosomes, which can be attributed to the rough geometry of brochosomes as discussed in the literature. However, it would be valuable to investigate whether the proteins forming the brochosomes are also UV absorbing.

      Thank you very much for your valuable suggestions. Following your advice, we have successfully expressed four BSM genes in a prokaryotic system, purified the corresponding proteins, and applied them to quartz glass surfaces. We then measured the light reflectance of the quartz glass surfaces coated with these purified proteins. The results showed that the purified BSM proteins did not exhibit better antireflective properties compared to the control GST protein. For more details, please refer to Supplementary Figure 8 in the revised manuscript.  We believe that the excellent antireflective properties of brochosomes are fundamentally due to their unique geometric shapes. The hollow pores within the brochosomes, with diameters of approximately 100 nm, are significantly smaller than most wavelengths in the visible spectrum. When light passes through these tiny pores, diffraction occurs, while light passing through the ridges of the brochosomes causes scattering. The interference between the diffracted and scattered light from these pores and ridges results in the observed extinction characteristics of brochosomes. We have incorporated these insights into the discussion section of the revised manuscript (lines 416-425 and lines 432-442 of the revised manuscript).

      (3) The experiments with jumping spiders show that brochosomes help leafhoppers avoid predators to some extent. It would be beneficial for the authors to elaborate on the exact mechanism behind this camouflage effect. Specifically, why does reduced UV reflection aid in predator avoidance? If predators are sensitive to UV light, how does the reduced UV reflectance specifically contribute to evasion?

      Thank you very much for your valuable suggestions. Based on your advice, we have included a detailed discussion on how reducing ultraviolet (UV) reflection can help insects avoid predation. The revised content can be found in lines 445-460 of the revised manuscript.

      “UV light serves as a crucial visual cue for various insect predators, enhancing foraging, navigation, mating behavior, and prey identification (Cronin & Bok, 2016; Morehouse et al., 2017; Silberglied, 1979). Predators such as birds, reptiles, and predatory arthropods often rely on UV vision to detect prey (Church et al., 1998; Li & Lim, 2005; Zou et al., 2011). However, UV reflectance from insect cuticles can disrupt camouflage, increasing the risk of detection and predation, as natural backgrounds like leaves, bark, and soil typically reflect minimal UV light (Endler, 1997; Li & Lim, 2005; Tovee, 1995). To mitigate this risk, insects often possess anti-reflective cuticular structures that reduce UV and broad-spectrum light reflectance. This strategy is widespread among insects, including cicadas, dragonflies, and butterflies, and has been shown to decrease predator detection rates (Hooper et al., 2006; Siddique et al., 2015; Zhang et al., 2006). For example, the compound eyes of moths feature hexagonal protuberances that reduce UV reflectance, aiding nocturnal concealment (Blagodatski et al., 2015; Stavenga et al., 2005). In butterflies, UV reflectance from eyespots on wings can attract predators, but reducing UV reflectance or eyespot size can lower predation risk and enhance camouflage (Chan et al., 2019; Lyytinen et al., 2004). Hence, the reflection of ultraviolet light from the insect cuticle surface increases the risk of predation by disrupting camouflage (Tovee, 1995)”

      (4) An important reference regarding the moth-eye effect is missing. Please consider including the following paper: Clapham, P. B., and M. C. Hutley. "Reduction of lens reflection by the 'Moth Eye' principle." Nature 244: 281-282 (1973).

      Thank you very much for pointing out the omission of the important reference on the “moth eye” effect. We sincerely apologize for the oversight. Based on your suggestion, we have now included the seminal paper by Clapham and Hutley (1973) in the revised manuscript. The reference has been added to both the Introduction and Discussion sections to provide a more comprehensive context for our discussion on anti-reflective structures in insects.

      (5) The introduction should be revised to accurately reflect the related contributions in literature. Specifically, the novelty of this work lies in the demonstration of the camouflage effect of brochosomes using jumping spiders, which is verified for the first time in leafhoppers. However, the proposed use of brochosome powder for camouflage was first described by R.B. Swain (R.B. Swain, Notes on the oviposition and life history of the leafhopper Oncometopta undata Fabr. (Homoptera: Cicadellidae), Entomol. News. 47: 264-266 (1936)). Recently, the antireflective and potential camouflage functions of brochosomes were further studied by Yang et al. based on synthetic brochosomes and simulated vision techniques (S. Yang, et al. "Ultra-antireflective synthetic brochosomes." Nature Communications 8: 1285 (2017)). Later, Lei et al. demonstrated the antireflective properties of natural brochosomes in 2020 (C.-W. Lei, et al., "Leafhopper wing-inspired broadband omnidirectional antireflective embroidered ball-like structure arrays using a nonlithography-based methodology." Langmuir 36: 5296-5302 (2020)). Very recently, Wang et al. successfully fabricated synthetic brochosomes with precise geometry akin to those natural ones, and further elucidated the antireflective mechanisms based on the brochosome geometry and their role in reducing the observability of leafhoppers to their predators (L. Wang et al. "Geometric design of antireflective leafhopper brochosomes." Proceedings of the National Academy of Sciences 121: e2312700121 (2024)).

      Thank you very much for your valuable suggestions regarding the revision of the introduction to accurately reflect the relevant contributions in the literature. Based on your feedback, we have thoroughly revised the introduction and added the suggested references to provide a comprehensive context for our study. The details of these revisions can be found in lines 84-94 of the revised manuscript.

      Reviewer #3 (Recommendations for the authors):

      (1) In Figure 2E, the data for Male-5d appears to be missing. Please verify and ensure all relevant data is included.

      Thank you for pointing out the issue regarding the data presentation in Figure 2E.We apologize for any confusion caused by the overlapping data points and the less conspicuous color choice for Male-5d. We have carefully reviewed the data and confirmed that all relevant data points, including Male-5d, are indeed present in the dataset. In the revised manuscript, we have adjusted the color scheme for Male-5d and Female-5d in Figure 2E to ensure that both curves are clearly distinguishable, even in areas where they overlap. This adjustment should facilitate a more accurate and convenient observation of the data trends. We appreciate your attention to detail, and we believe these revisions have improved the clarity and readability of the figure.

      (2) In Figure 6, please clarify the reflectance data in the inset. Clearly explain what the blue and light blue curves represent.

      Thank you for your suggestion regarding Figure 6.We have revised the figure to improve clarity. The light blue curve now represents the reflectance measurements of leafhoppers with higher brochosome coverage, while the dark blue curve corresponds to those with lower coverage. These changes, along with updated labels in the figure legend, ensure that the data are clearly distinguishable and easy to interpret. We appreciate your feedback and believe these revisions have enhanced the overall clarity of the figure.

    1. Author response:

      The following is the authors’ response to the original reviews

      Reviewer #1 (Public review):

      Weaknesses (clarifications needed):

      (1) Experimental Design:

      The study does not mention whether the authors examined sex differences or any measures of attractiveness or hierarchy among participants (e.g., students vs. teachers). Including these variables could provide a more nuanced understanding of group dynamics.

      We are grateful to the reviewer for pointing out this valuable question. We have clarified that future studies should include sex differences or any measures of attractiveness or hierarchy among participants (e.g., students vs. teachers) (p. 27).

      “Finally, future research should investigate additional variables, including sex differences and measures of attractiveness or hierarchy among participants, such as students versus teachers.”  p. 27

      (2) fNIRS Data Acquisition:

      The authors' approach to addressing individual differences in anatomy is lacking in detail. Understanding how they identified the optimal channels for synchrony between participants would be beneficial. Was this done by averaging to find the location with the highest coherence?

      We apologize for missing some details here. We have included the following information in the fNIRS data acquisition and fNIRS data analyses to clarify the details (pp. 8 and 12).

      We employed the one-sample t-test method to assess the GNS disparity between the baseline and task sessions, identifying particular channels of interest. This analysis did not ascertain the maximum coherence level, but rather pinpointed the channel exhibiting significant divergence between the two sessions, which we designated as pertinent to the group decision-making task. Furthermore, we selected the PFC and left TPJ as our reference brain regions, guided by existing literature.

      “Two optode probe sets were used to cover each participant's prefrontal and left TPJ regions (Figure S1). The DLPFC plays a crucial role in group decision-making processes, with findings suggesting that individuals exhibiting reduced prefrontal activity were more prone to out-group exclusion and demonstrated stronger in-group preferences (Goupil et al., 2021; Jankovic, 2014; Yang et al., 2020). Similarly, the left TPJ has been previously reported to be associated with decision-making and information exchange (Freitas et al., 2019; Tindale et al., 2019).”  p. 8

      “Time-averaged GNS (also averaged across channels in each group) was compared between the baseline session (i.e., the resting phase) and the task session (from reading information to making decisions) using a series of one-sample t-tests. Here, p-values were thresholded by controlling for FDR (p < 0.05; Benjamini & Hochberg, 1995). When determining the frequency band of interest, the time-averaged GNS was also averaged across channels. After that, we analyzed the time-averaged GNS of each channel. Then, channels showing significant GNS were regarded as regions of interest and included in subsequent analyses.” p. 12

      (3) Behavioral Analysis:

      For group identification, the analysis currently uses a dichotomous approach. Introducing a regression model to capture the degree of identification could offer more granular insights into how varying levels of group identification affect collective behavior and performance.

      Thank you for your suggestion. As suggested, we have conducted the regression model to examine how varying levels of group identification affect collective performance, with the score of group identification being the independent variable and collective performance as the dependent variable (pp.9 and 15).

      “Moreover, we employed a regression model to examine how varying levels of group identification affect collective performance, using group identification scores as the independent variable and collective performance as the dependent variable.”  p.9

      “The results from the regression model highlighted a significant association between the degree of group identification and collective performance (β \= 0.45, t = 4.56, p \= 0.019).”  p.15

      (4) Single Brain Activation Analysis:

      The application of the General Linear Model (GLM) is unclear, particularly given the long block durations and absence of multiple trials. Further explanation is needed on how the GLM was implemented under these conditions.

      Thank you for your suggestion, we have added more details in this section (p.11).

      “In the GLM model analysis, HbO was the dependent variable, and the regression amount was set for different task stages (a. Reading information, b. Sharing private information, c. Discussing information, d. Decision). After that, we convolved the regression factor with the Hemodynamic Response Function (HRF) and obtained the brain activation β value of each participant in each channel at different task stages through regression analysis.’  p.11

      (5) Within-group neural Synchrony (GNS) Calculation:

      The method for calculating GNS could be improved by using mutual information instead of pairwise summation, as suggested by Xie et al. (2020) in their study on fMRI triadic hyperscanning. Additionally, the explanation of GNS calculation is inconsistent. At one point, it is mentioned that GNS was averaged across time and channels, while elsewhere, it is stated that channels with the highest GNS were selected. Clarification on this point is essential.

      We appreciate the reviewer for highlighting this inquiry. We utilized a conventional GNS calculation approach, as detailed in Line 296 of the manuscript, where the GNS was determined in pairs after the WTC computation, and then averaged. Further details regarding the second question have been provided in the article (p.12).

      (6) Placement of fNIRS Probes:

      The probes were only placed in the frontal regions, despite literature suggesting that the superior temporal sulcus (STS) and temporoparietal junction (TPJ) regions are crucial for triadic team performance. A justification for this choice or inclusion of these regions in future studies would be beneficial.

      The original manuscript clearly stated the use of two optode probe sets to encompass the prefrontal and left TPJ regions of each participant (see Figure S1, p. 8).

      (7) Interpretation of fNIRS Data:

      Given that fNIRS signals are slow, similar to BOLD signals in fMRI, the interpretation of Figure 6 raises concerns. It suggests that it takes several minutes (on the order of 4-5 minutes) for people to collaborate, which seems implausible. More context or re-evaluation of this interpretation is needed.

      The question you have pointed out is very pertinent, and we have added more explanation for this result (pp. 25-26).

      As previous studies have shown, the BOLD signal collected by fNIRS is slowly increasing compared to neuronal activity, which means that it has hysteresis (Turner et al., 1998). In social interactions such as group decision-making, the time of neural synchronization is delayed because people need to spend time increasing the number of dialogues to improve collaboration efficiency and form the same preference (Zhang et al., 2019). For example, the study of group consensus found that participants would show significant neural alignment after completing a period of dialogue (Sievers et al., 2024). In the task of cooperation, with the improvement of tacit understanding between two participants, the higher degree of neural synchronization (Cui et al., 2012). Therefore, the generation of neural synchronization depends on the interaction over a period of time. Therefore, we believe that the 4-5 minutes of collaboration time shown in Figure 6 may be related to establishing consensus and the same preference of team members, which is reflected in the dynamic time change of neural synchronization.

      Moreover, previous studies on neural synchronization during social interaction and group decision-making revealed that substantial neural synchronization occurred around 50-55 seconds into a teaching task involving prior knowledge (Liu et al., 2019) and persisted approximately 6 minutes into the discussion period (Xie et al., 2023). These results collectively validate the suitability of utilizing fNIRS signal response time in our study (pp. 25-26).

      “Our study also has demonstrated significant increases in single-brain activation, DLPFC-OFC functional connectivity, and GNS at 7, 12, and 17 minutes, respectively, following task initiation. The significant increase in these neural activities together constructs the two-in-one neural model that explains how group identification influences the collective performance we proposed. As previous studies have shown, the BOLD signal collected by fNIRS is slowly increasing compared to neuronal activity, which means that it has hysteresis (Turner et al., 1998). In social interactions such as group decision-making, the time of neural synchronization is delayed because people need to spend time increasing the number of dialogues to improve collaboration efficiency and form the same preference (Zhang et al., 2019). For example, participants would exhibit significant neural alignment, but only after they had completed a period of dialogue (Sievers et al., 2024). In the task of cooperation, with the improvement of cooperation efficiency between two participants, the higher degree of neural synchronization (Cui et al., 2012). Therefore, the generation of neural synchronization depends on the interaction over a period of time, which can affect the estimation of collaboration time. Prior research has shown that when the teaching task with prior knowledge began 50-55 seconds, significant neural synchronization could be generated between teacher and students, which meant that students and teacher achieved the same goal of learning knowledge (Liu et al., 2019). Moreover, a noteworthy increase in GNS was observed approximately 6 minutes into the group discussion period for better discussing and solving the problem (Xie et al., 2023). These findings are similar to ours. Therefore, the time points we found could reflect the dynamic time change of the neural process of team collaboration.’ pp.25-26

      Reviewer #2 (Public review):

      Weaknesses:

      The authors need to clearly articulate their hypothesis regarding why neural synchronization occurs during social interaction. For example, in line 284, it is stated that "It is plausible that neural synchronization is closely associated with group identification and collective performance...", but this is far from self-evident. Neural synchronization can occur even when people are merely watching a movie (Hasson et al., 2004), and movie-watchers are not engaged in collective behavior. There is no direct link between the IBS and collective behavior. The authors should explain why they believe inter-brain synchronization occurs in interactive settings and why they think it is related to collective behavior/performance.

      Thank you for bringing these points to our attention, we have clarified the relationship between neural synchronization and collective behavior in the Introduction section. (p.4). Moreover, in order to investigate whether neural synchronization stems from a common task or environment, we pseudo-randomized all pairs of subjects and created a null distribution consisting of 1,000 pseudo-groups, as described in Lines 311-315. This approach enabled us to eliminate neural synchronization resulting from factors other than social interaction, allowing us to identify neural patterns associated with collective performance (p.12).

      “Moreover, Ni et al. (2024) indicated that neural synchronization was linked to the strength of social-emotional communication and connections between individuals. An increase in neural synchronization has also been shown to predict the coordination and cooperation abilities of group members (Lu et al., 2023). Therefore, we hypothesize that neural synchronization may be related to group performance.” p.4

      “After that, the nonparametric permutation test was conducted on the observed interaction effects on GNS of the real group against the 1,000 permutation samples. By pseudo-randomizing the data of all participants, a null distribution of 1000 pseudo-groups was generated (e.g., time series from member 1 in group 1 were grouped with member 2 in group 2 & member 3 in group 3). The GNS of 1,000 reshuffled pseudo-groups was computed, and the GNS of the real groups was assessed by comparing it with the values generated by 1000 reshuffled pseudo-groups.” p.12

      The authors state that "GNS in the OFC was a reliable neuromarker, indicating the influence of group identification on collective performance," but this claim is too strong. Please refer to Figure 4B. Do the authors really believe that collective performance can be predicted given the correlation with the large variance shown? There is a significant discrepancy between observing a correlation between two variables and asserting that one variable is a predictive biomarker for the other.

      Thank you for your suggestion, we have revised the relevant statement (p.18).

      “Through correlation and regression model analysis, we found that in group decision-making, the increase in group identity would affect group performance by improving GNS in the OFC brain region.”  p.18

      Why are the individual answers being analyzed as collective performance (See, L-184)? Although these are performances that emerge after the group discussion, they seem to be individual performances rather than collective ones. Typically, wouldn't the result of a consensus be considered a collective performance? The authors should clarify why the individual's answer is being treated as the measure of collective performance.

      We appreciate the insightful comment provided by the reviewer. The decision to utilize individual responses as a metric of overall performance is based on several key considerations. Previous studies on various hidden profile tasks have utilized averaged individual scores to represent collective performance (e.g., Stasser et al., 1995; Wittenbaum et al., 1996; Brockner et al., 2022). Secondly, while consensus outcomes are typically regarded as collective expressions, we argue that in the context of this study, individual responses are not independent entities but rather extensions of the group decision-making process. The collective deliberation process significantly influenced individual thinking and decision-making in this study. Through group discussions, members shared perspectives, adjusted their stances, and formulated their responses based on collective insights. The responses provided by participants in this study were molded by the dynamics of group conversations, serving as an indirect measure of group performance and potentially indicating the efficacy of collective deliberations.

      Performing SPM-based mapping followed by conducting a t-test on the channels within statistically significant regions constitutes double dipping, which is not an acceptable method (Kriegeskorte et al., 2011). This issue is evident in, for example, Figures 3A and 4A.

      Please refer to the following source: https://www.nature.com/articles/nn.2303

      We have carefully reviewed the articles provided by the reviewer, and we acknowledge the concerns regarding selective analysis and double dipping in our statistical approach. To address this, we believe it is important to clarify this issue further in the Discussion section (pp.26-27).

      Our study introduces a novel perspective while utilizing conventional fNIRS-based hyperscanning analyses (Liu et al., 2019; Pärnamets et al., 2020; Reinero et al., 2021; Számadó et al., 2021; Solansky, 2011), methods that are widely endorsed within the field. In our analysis, significant channels were first identified using a one-sample t-test, followed by additional analyses including ANOVA, independent samples t-tests, and other procedures. We would like to emphasize that the statistical assumptions underlying the one-sample t-test and paired-sample t-test in our study maintain a level of independence. Moreover, to further mitigate concerns about the potential for double dipping, we employed permutation testing to validate the robustness of our results and ensure that our findings are not influenced by biases inherent in the selection of significant regions.

      We recognize the importance of rigorous statistical practices and are committed to upholding the highest standards of analysis. As such, we have revisited our methodology and included a more detailed explanation of the steps taken to avoid double dipping and ensure the integrity of our analyses in the revised manuscript.

      “Although our study has found a new perspective, the analysis method still refers to and uses the traditional fNIR-based hyperscanning analyses (Liu et al., 2019; P¨arnamets et al., 2020; Reinero et al., 2021; Számadó et al., 2021; Solansky, 2011), which is generally accepted by the majority of fNIR-based hyperscanning researchers. For example, we would first identify significant channels through a one-sample t-test and then conduct further analyses, such as ANOVA or independent samples t-tests. Selective analysis is a powerful tool and is perfectly justified whenever the results are statistically independent of the selection criterion under the null hypothesis (Kriegeskorte et al., 2019). However, it may lead to double dipping and missing information. In this study, the absence of statistically significant TPJ activation in the analyzed data led to the TPJ being ignored. In the future, it should be made explicit in the analysis, and the reliability of the results should be ensured by appropriate statistical methods (e.g., cross-validation, independent data sets, or techniques to control for selective bias).” p.26-27

      In several key analyses within this study (e.g., single-brain activation in the paragraph starting from L398, neural synchronization in the paragraph starting from L393), the TPJ is mentioned alongside the DLPFC. However, in subsequent detailed analyses, the TPJ is entirely ignored.

      We thank the reviewer for your careful review and valuable comment. TPJ is referenced in certain analyses within this paper (as detailed in paragraphs L414 and L440); however, its role remains inadequately investigated and expounded upon in subsequent more intricate analyses. This is due to the absence of statistically significant TPJ activation in the analyzed data. As pointed out by the reviewer, limitations may exist in pursuing further analyses through ROIs, a point we also have addressed in the Discussion section (p.27).

      The method for analyzing single-brain activation is unclear. Although it is mentioned that GLM (generalized linear model) was used, it is not specified what regressors were prepared, nor which regressor's β-values are reported as brain activity. Without this information, it is difficult to assess the validity of the reported results.

      We have revised the relevant description to clarify the analyses of single-brain activation (p. 11)

      While the model illustrated in Figure 7 seems to be interesting, for me, it seems not to be based on the results of this study. This is because the study did not investigate the causal relationships among the three metrics. I guess, Figure 5D might be intended to explain this, but the details of the analysis are not provided, making it unclear what is being presented.

      We regret the confusion that has arisen. Firstly, as highlighted by the reviewer, the model depicted in Figure 7 is not directly derived from the causal analysis conducted in this study. Our investigation did not directly explore the causal relationships among the three indicators; instead, we constructed a model based on correlations and potential mechanisms. In the revised manuscript, we have explicitly stated that Figure 7 represents a descriptive model (p.22).

      Regarding Figure 5D, the reviewer noted that while it may offer some explanatory value, it lacks the necessary analytical detail to elucidate the chart's significance clearly. We have clarified the details of the analysis in Figure 5 (pp.13-14). The model in Figure 5D suggested that the connection between the similarity in individual-collective performance and the correlation of brain activation, as well as whether the impact of each individual’s single-brain activation on the corresponding group’s GNS was regulated by their brain activation connectivity.

      “Finally, we employed correlation and mediation analyses to assess if brain activation connectivity could explain the connection between individuals’ single-brain activation and the related group’s GNS. We examined the connection between the similarity in individual-collective performance and the correlation of brain activation, as well as whether the impact of each individual’s single-brain activation on the corresponding group’s GNS was regulated by their brain activation connectivity. We utilized the PROCESS tool in SPSS to investigate the proposed moderation effect. Specifically, we applied Model 1 with 5000 bootstrap resamples to examine the interaction between the independent variable (i.e., single-brain activation) and the moderator (i.e., brain activation connectivity) in predicting the dependent variable (i.e., GNS). It is noteworthy that prior to analysis, all variables in the moderation model were mean-centered to reduce multicollinearity and improve the interpretability of interaction terms.”  p.13-14

      “Building on the above results, we have developed a two-in-one neural model that explains how group identification influences collective performance. This descriptive model aims to illustrate the potential interrelationships among these indicators and establish a conceptual framework to inspire forthcoming research endeavors.”  p.21

      The details of the experiment are not described at all. While I can somewhat grasp what was done abstractly, the lack of specific information makes it impossible to replicate the study.

      As suggested, we have clarified the details of the experiment in the manuscript.

      (1) As stated in the public review, the details of the experiment are not described at all and while I can somewhat grasp what was done abstractly, the lack of specific information makes it impossible to replicate the study. In points a-e below, I list the aspects that I could not fully understand, but I am not asking for direct answers to these points. Instead, please provide a detailed description of the experiment so that it can be replicated.

      Thank you for your suggestion; we have responded to each question sequentially and elaborated on the experiment specifics to ensure replicability.

      (a) Please provide more detailed information about the Group Identification Task. How much did each participant speak (was there any asymmetry in the amount of speaking, and was there any possibility that the asymmetry influenced the identification rating)? Did the three participants interact in person, or online? Are they isolated from experimenters? How was the rating conducted, what I mean is that it's a PC-based rating?

      We apologize for the lack of detail in our description of the procedures for the experiment.

      For the first question, we draw upon previous studies concerning the manipulation of group identity while controlling the content of pre-task conversations. Specifically, the high-identity group engaged in self-introductions and identified similarities among the three members, whereas the low-identity group discussed topics related to the current semester's classes (Xie et al., 2023; Yang et al., 2020). Both discussions were conducted for the same duration of three minutes, ensuring that the number of exchanges between the two groups remained comparable. There was almost no asymmetry in the amount of speaking. We also conducted a manipulation check, which confirmed the effectiveness of our identity manipulation(pp.5-6).

      Xie, E., Li, K., Gu, R., Zhang, D., & Li, X. (2023). Verbal information exchange enhances collective performance through increasing group identification. NeuroImage, 279, 120339.

      Yang, J., Zhang, H., Ni, J., De Dreu, C. K., & Ma, Y. (2020). Within-group synchronization in the prefrontal cortex associates with intergroup conflict. Nature neuroscience, 23(6), 754-760.

      “Both discussions were conducted for the same duration of three minutes, ensuring that the number of exchanges between the two groups remained comparable.”  p.5-6

      For the second question,the three participants interacted offline in a face-to-face setting, while the experimenter remained outside the laboratory (p.6).

      “The three participants conducted face-to-face offline interaction throughout the manipulation process.” p.6

      For the third question, at the beginning of the experimental task, participants were isolated from the experimenters (p.6).

      “In addition to explaining the next phase of the task and controlling the timer, experimenters would be isolated from participants.” p.6

      For the last question, the rating of group identification was conducted through a questionnaire presented on participants’ phones (p.6).

      “The questionnaire was presented on participants’ phones.” p.6

      (b) The procedures of the Main Task are also unclear. For the Reading Information (5 min): How was the information presented? PC-based or paper-based? How were the participants seated? Did they read it independently?

      We apologize for the missing details. We have included the following information in the article.

      For the first and last question, each participant would get a piece of paper, which presents the common information and private information. They read independently. (p.6)

      “Each participant would get a piece of paper, which presented the information. Participants could read independently.” p.6

      About how the participants sat, the three participants sat around a table without partitions between each other. Only in the discussion stage, they could communicate face-to-face (p.6).

      “They sat around a table without partitions between each other.” p.6

      “In this process of discussion, the participants were able to communicate face-to-face and verbally.” p.6

      (c) For Sharing Private Information: The authors stated they share text messages using Tencent Meeting. If so, how and with what devices? How was the information displayed on the screen? Were the participants even in the same room?

      Thank you for your reminder. We have added more details now (p.6). Firstly, the experimenter sent the Tencent Meeting link to the participants. After the participants entered the meeting through their mobile phones, they could text the information they wanted to share in the chat box of the meeting. They were in the same room, with Tencent Meeting recording shared information, the participants could view them at any time.

      “During the group sharing, participants entered Tencent Meeting via their mobile phones and were able to text their private information in the chat box to their group members for 5 minutes.” p.6

      (d) For Discussing Information: It's a verbal interaction. How did they interact with others? What is the distance between them? I found a very small picture in Figure 8, but that is all information about experiment settings, that is provided by the authors.

      We are sorry about the missing details. As we have explained in the article it’s a verbal communication, so participants could talk face to face in one room. We have included the following information in the article (p.6).

      “Participants were sitting and communicating around a table. The distance between adjacent participants was about 15 cm, and the distance between face-to-face participants was about 40 cm. In this process of discussion, the participants were able to communicate face-to-face and verbally.” p.6

      (e) For the Decision Process (5 min): How did they answer (What I mean is verbally, writing, or computer-based input), and how did the experimenters record these answers?

      The questions were presented on paper, so the participants could write down their answers and experimenters could count the answers on paper. We have included the following information in the article(p.7).

      “After discussion, all triads were given 5 minutes to answer the following questions (i) the probability of three suspects, 0%-100% for each suspect; (ii) the motivation and tool of crime; and (iii) deduced the entire process of crime. The three questions were presented on paper, allowing participants to write their answers directly on the same sheet. Subsequently, three independent raters used these paper questionnaires to record and calculate the scores for each group.” p.7

      (2) I find the model presented in Figure 7 to be intriguing. Understanding why inter-brain synchronization occurs and how it is supported by specific single-brain activations or intra-brain functional connectivity is indeed a critical area for researchers conducting hyperscanning studies to explore. However, the content depicted in this model is not based on the results of this study. This is because the study did not investigate the causal relationships among the three metrics. I guess, Figure 5D might be intended to explain this, but the details of the analysis are not provided, making it unclear what is being presented. Please include a detailed explanation.

      The specific answers are available on page 5 of our response letter.

      (3) The analysis of single-brain activation analysis (and probably other analyses) focuses on the period from reading to making decisions (L237). Why was this entire interval chosen for analysis? Reading does not involve social interaction. As mentioned in a previous comment, the details of the tasks are unclear, so it's difficult to understand what was actually done in the reading period. Anyway, why were these different phases combined as the focus of analysis? Please clarify the reasoning behind this choice.

      Thank you for your feedback. The decision to analyze the entire interval, spanning from reading to decision-making, was primarily made to grasp the continuum of information processing comprehensively. While reading itself lacks social interaction, it serves as the foundation for subsequent decision-making, during which participants' cognitive states and affective responses gradually evolve. Therefore, examining these two phases collectively enables a more thorough investigation into how information influences decision-making. Furthermore, considering the task details remain ambiguous, we aim to uncover the underlying cognitive and affective mechanisms through a holistic analysis.

      (4) The method for analyzing single-brain activation is unclear. Please provide a detailed description of the analysis methods.

      Thank you for your suggestion, we have added more details in the Method section (p.11).

      “In the GLM model analysis, HbO was the dependent variable, and the regression amount was set to different task stages (a. Reading information, b. Sharing private information, c. Discussion information, d. Decision). After that, we convolved the regression factor with the Hemodynamic Response Function (HRF), and obtained the brain activation β value of each participant in each channel at different task stages through regression analysis.”  p.11

      (5) In the periods of Reading Information and Sharing Private Information, there appears to be no social interaction between participants (Figure1D). However, Figure 6 shows an increase in brain activity correlation even during the first 10 minutes (it corresponds to the Reading and Sharing period). Why does inter-brain correlation (GNS, in this study) increase even though there is no interaction between participants? Please provide an explanation.

      Sharing private information fosters interactive engagement, necessitating its exchange during Tencent Meetings to facilitate sharing. Previous research suggests that heightened correlations in brain activity can be attributed to (1) intrinsic cognitive processes, wherein participants display similar cognitive and emotional responses, fostering shared cognitive processing and brain activity synchronization despite limited external interaction; (2) emotional connections, as divulging private information elicits emotional responses that can be neurally correlated among individuals; and (3) environmental influences, where shared environments and contexts prompt neural interaction among participants even in the absence of direct social engagement. These factors collectively contribute to increased brain activity correlations without active interaction. Our primary focus, however, lies in the phase characterized by significant synchronized brain activity.

      Minor Comments:

      (6) Equation 1 Explanation: There is no explanation of Equation 1. It mentions Yi as the collective score, but what constitutes the collective score Yi is not defined in the manuscript. Additionally, while "i" is referred to as an item (in Line 196), the meaning of "item" is not clear. Therefore, the meaning of this equation is not understood.

      We apologize for this confusion. We have added a description in the manuscript (p.9).

      “In Eq.1, x is the individual score, y is the collective score (y is calculated from the three per capita scores), and i stands for the group number for the item. So, x_i means the individual score of participants in the _i group, and y_i means the collective score of the _i group. _d (x, y) r_epresents the distance from the individual to the collective score.”  p.9

      (7) Equation 2 Explanation: There is no explanation for Equation 2. Please provide descriptions for all variables such as S, t, and w.

      We have clearly stated the meaning of s, t, and w in the first edition of the manuscript article (p.12).

      As shown in L291-293: Here, t denotes the time, s denotes the wavelet scale, 〈⋅〉 represents a smoothing operation in time, and W is the continuous wavelet transform (Grinsted, Moore, & Jevrejeva, 2004).

      (8) Acronyms: Please define all acronyms upon their first appearance (e.g., CFI, TLI, RMSEA in L380).

      We apologize for these mistakes, and we have added full explanations for abbreviations upon their first use (p.16).

      “The mediation model demonstrated a satisfactory fit (CFI = 0.93, TLI = 0.93, RMSEA = 0.04) (CFI-Comparative Fit Index; TLI-Tucker-Lewis index; RMSEA-Root-Mean-Square Error of Approximation), suggesting that the perceived group identification of each individual affected the alterations in single-brain activations in the DLPFC, consequently leading to variations in their performance (β<sub>a</sub> = 0.16, t = 2.20, p = 0.030; β<sub>b</sub> = 0.26, t = 3.56, p < 0.001; β<sub>c</sub> = 0.18, t = 2.34, p = 0.020) (Figure 3C).”  p.16

      (9) Hyperscanning fMRI Studies: Since there are hyperscanning fMRI studies analyzing communication among three people (e.g., Xie et al., 2020, PNAS), it would be beneficial to cite this research. pnas.org/doi/pdf/10.1073/pnas.1917407117.

      As suggested, we have cited this paper. (p.4)

      (10) Line 272; Line 275: Should these references be to Benjamini & Hochberg (1995)?

      As suggested, we have revised our citation.

      (11) Research Objectives: The authors' aim seems to be understanding the relationship between Group Identification Level (High or Low), collective performance, and inter-brain synchronization (GNS). If so, shouldn't the results shown in Figure 6 illustrate how these differ between High and Low groups?

      We are grateful to the reviewer for your insightful comment. This study aimed to investigate the impact of group identity levels on collective performance and interbrain synchronization. Our analysis primarily focused on inter-group disparities to elucidate the potential influence of varying levels of group identification on collective behavior and neural synchrony, as highlighted by the reviewer. It is important to note that the relationship between group identification levels and collective performance, as well as neural synchronization, may represent a continuous or correlational process, rather than a binary comparison between two distinct groups. Notably, we treated group identification as a continuous variable and, consequently, Figure 6 was designed to illustrate trends in the association between group identification levels and both collective performance and neural synchronization, without conducting significance tests between groups. We are confident that the depiction in Figure 6 effectively captures the evolving dynamics between group identification levels and both collective performance and neural synchronization.

      (12) Figure 6 Star-Marker: What is the star marker shown in Figure 6? Please provide an explanation.

      We apologize for this confusion. We have added this explanation to the article. (p.21)

      “The red star sign indicates that at this time point, the neural signal began to increase significantly.” p.21

      (13) Pearson's Correlation: Use "Pearson's correlation" instead of "Pearson correlation."

      Thanks for your comments, we've changed Pearson correlation to Pearson's Correlation for a total of 10 places in the original text (pp. 9,11,13, 15,16, 19,23).

      “Moreover, the Pearson’s correlation was used to examine the relationship between group identification_2 and collective performance.” p.9

      “Subsequently, we used Pearson’s correlation analyses to investigate the relationship between single-brain activation and individual performance.” p.11

      “Second, the Pearson’s correlation between GNS and collective performance was performed.” p.13

      “Following that, we analyzed Pearson’s correlations between the original HbO data in the region related to individual and collective performance, denoted as brain activation connectivity (Lu et al., 2010).” p.13

      “Subsequently, the Pearson’s correlation between the quality of information exchange and collective performance was assessed.” p.15

      “Furthermore, the results of the Pearson’s correlation indicated that groups with higher group identification were more likely to exhibit better collective performance (r \= 0.38, p \= 0.003) (Figure 2B).” p.15

      “The Pearson’s correlation and its associated analyses were based on the data from group identification_2. *p < 0.05.” p.16

      “We first extracted the HbO brain activities related to individual performance (e.g., DLPFC, CH4) and collective performance (e.g., OFC, CH21) of each group member and conducted a Pearson’s correlation between the two.” p.19

      “Subsequently, Pearson’s correlation was used to test whether individual differences in the similarity in individual-collective performance were reflected by DLPFC-OFC connectivity.” p.19

      “Pearson’s correlation showed that the higher quality of information exchange, the better collective performance (r \= 0.36, p \= 0.007) (Figure 8C).” p.23

      (14) MNI Coordinates: The MNI coordinates for each channel are listed in the supporting information. How were these coordinates measured? Were they consistent for all participants? Was MRI conducted for each participant to obtain these coordinates?

      Thank you for your reminder, we have included the necessary instructions in the revised version. First, we need to clarify that we referred to previous literature to determine the placement of the optical probe plates. Following the completion of data collection, we utilized the Vpen positioning system to accurately locate the detection light poles, ultimately obtaining the MNI positioning coordinates. These coordinates were basically consistent for each participant. (p.8)

      “For each participant, one 3 × 5 optode probe set (8 emitters and 7 detectors forming 22 measurement points with 3 cm optode separation, see Table S1 for detailed MNI coordinates) was placed over the prefrontal cortex (reference optode is placed at Fpz, following the international 10-20 system for positioning). The other 2 × 4 probe set (4 emitters and 4 detectors forming 10 measurement points with 3 cm optode separation, see Table S2 for detailed MNI coordinates) was placed over the left TPJ (reference optode is placed at T3, following the international 10-20 system for positioning). The probe sets were examined and adjusted to ensure consistency of the positions across the participants. After the completion of data collection, we utilized the Vpen positioning system to accurately locate the detection light poles, ultimately obtaining the MNI positioning coordinates.”  p.8

    1. Author response:

      The following is the authors’ response to the previous reviews

      Joint Public Reviews:

      Summary

      This manuscript explores the transcriptomic identities of olfactory ensheathing cells (OECs), glial cells that support life-long axonal growth in olfactory neurons, as they relate to spinal cord injury repair. The authors show that transplantation of cultured, immunopurified rodent OECs at a spinal cord injury site can promote injury-bridging axonal regrowth. They then characterize these OECs using single-cell RNA sequencing, identifying five subtypes and proposing functional roles that include regeneration, wound healing, and cell-cell communication. They identify one progenitor OEC subpopulation and also report several other functionally relevant findings, notably, that OEC marker genes contain mixtures of other glial cell type markers (such as for Schwann cells and astrocytes), and that these cultured OECs produce and secrete Reelin, a regrowth-promoting protein that has been disputed as a gene product of OECs.

      Strengths

      This manuscript offers an extensive, cell-level characterization of OECs, supporting their potential therapeutic value for spinal cord injury and suggesting potential underlying repair mechanisms. The authors use various approaches to validate their findings, providing interesting images that show the overlap between sprouting axons and transplanted OECs, and showing that OEC marker genes identified using single-cell RNA sequencing are present in vivo, in both olfactory bulb tissue and spinal cord after OEC transplantation.

      Challenges

      Despite the breadth of information presented, and although many of the suggestions in the initial review were addressed well, some points related to quantification and discussion of sex differences are not fully addressed in this revision.

      (1) The request for quantification of OEC bridges is not fully addressed. We note that this revision includes the following statement (page 6): "We note, however, that such bridge formation is rare following a severe spinal cord injury in adult mammals." However, the title of the paper states that olfactory ensheathing cells promote neural repair and the abstract states that "OECs transplanted near the injury site modify the inhibitory glial scar and facilitate axon regeneration past the scar border and into the lesion." Statements such as these make it more crucial to include quantification of OEC bridges, because if single images are shown of remarkable, unusual bridges, but only one sentence acknowledges the low frequency of this occurrence, then this information taken together might present the wrong takeaway to readers.

      Including some sort of quantification of bridging, whether it be the number of rats exhibiting bridges, the percentage area of OECs near a lesion site, or some other meaningful analysis, would add rigor and clarity to the manuscript.

      The short answer to the OEC bridges quantification is that in our last 2 studies combined, we observed bridges in 3/13 OB-OEC-transplanted rats versus 0/16 control rats (p=0.042 by two-sample proportion test; Thornton et al., 2018, Dixie, 2019). In addition to the new data on bridge formation shown in the current manuscript, our previous and most impressive data of serotonergic axons (5-HT-labeled, red) that crossed the entire lesion site is shown below (from Thornton et al., 2018). The image together with Supplemental video 1 (https://ars.els-cdn.com/content/image/1-s2.0-S0014488618302632-mmc1.mp4) show a reconstruction of multiple sections containing serotonergic axons that bridge the injury site in one OEC-transplanted, completely transected rat (1/5 OEC vs. 0/5 fibroblast-transplanted rat). The video also shows retrogradely-labeled Pseudo-rabies virus taken up by a few scattered neurons (green dots) within and above the lesion site, additional evidence suggesting axonal regeneration.

      In addition to adding bridge quantification in the Results section, we now discuss quantified results on physiological and anatomical evidence of axon regeneration across the injury site from five of the six large spinal cord injury (SCI) studies conducted by the Phelps and Edgerton laboratories. Our studies used the most difficult SCI model, a complete, thoracic spinal cord transection in adult rats, followed by OB-OEC transplantation. This is the only model in which axon regeneration can be differentiated from axon sparing found in incomplete SCIs. An introductory paragraph now summarizes and references data generated from these studies that specifically addresses questions about how OECs modify the injury site and facilitate axonal outgrowth into and across into the lesion core. While relatively few axons cross the entire injury site to reach the caudal spinal cord, many more axons project into the injury site of OEC-transplanted rats compared to those in control rats. Quantification of axonal outgrowth into the lesion site of completely transected, OEC-transplanted rats from three previous long-term studies is now discussed in the Introduction. Based on both physiological and anatomical evidence reviewed from our previous work, we hope the editors and Reviewer agree that our previous studies have shown that OECs promote axonal outgrowth and modify the injury site.

      Page 5, Introduction:

      “Together with collaborators, we conducted six spinal cord injury studies in adult rats with a completely transected, thoracic spinal cord model followed by OB-OEC transplantation (Kubasak et al., 2008; Takeoka et al., 2011; Ziegler et al., 2011; Khankan et al., 2016; Thornton et al., 2018; Dixie, 2019). Results from five of our six studies showed physiological and anatomical evidence of axonal regeneration into and occasionally across the injury site. In 6-8-month-long studies, Takeoka et al. (2011) and Ziegler et al. (2011) reported physiological evidence of motor connectivity across the transection in OEC- but not media-transplanted rats. These experiments used transcranial electric stimulation of the motor cortex or brainstem to detect motor-evoked potentials (MEPs) with EMG electrodes in hindlimb muscles at 4- and 7-months post-transection. After 7 months, 70% of OEC-treated rats responded to stimulation with hindlimb MEPs (motor cortex, 5/20; brainstem 12/20; Takeoka et al, 2011). A complete re-transection above the original transection was carried out one month later and all MEPs in OEC-injected rats were eliminated. These results provide physiological evidence of axon conductivity across the injury site in OEC-treated rats. Additionally, three of our long-term studies evaluated anatomical axonal outgrowth of the descending serotonergic Raphespinal pathway into and through the injury site. Significantly more serotonergic-labeled axons crossed the rostral inhibitory scar border (Takeoka et al., 2011) or occupied a larger area within the injury site core (Thornton et al., 2018, Dixie, 2019) in OEC-transplanted rats than in fibroblast or media controls. In addition, significantly more neurofilament-labeled axons were found within the lesion core of OEC-transplanted versus control rats (Thornton et al., 2018, Dixie, 2019).”

      Page 7, Results: We revised the sentence below and added additional information.

      “We note, however, that such bridge formation is rare following severe spinal cord injury in adult mammals and was detected in 2 out of 8 OEC-transplanted rats and 0/11 media or fibroblast-transplanted controls in this study (Dixie, 2019). Combined with the 1/5 OEC-transplanted rats with axons crossing the injury and 0/5 fibroblast controls in our previous study (Thornton, 2018), we observed bridges in 3/13 OEC-transplanted rats vs 0/16 controls (p=0.042, two-sample proportion test). Bridge formation, in conjunction with the additional physiological and anatomical evidence of axonal connections across the injury site presented in our previous studies, strongly supports the capacity of OECs in neural repair.”

      Page 46, Figure legend 1: We added statistical data to the legend

      “Bridge formation across the injury site was observed in 2 of 8 OEC-transplanted and 0 of 11fibroblast- or media-transplanted spinal cord transected rats. Combined with the 1/5 OEC-transplanted rats with axons crossing the injury and 0/5 fibroblast controls in our previous study (Thornton, 2018), we observed bridges in 3/13 OEC-transplanted rats vs 0/16 controls (p=0.042, two-sample proportion test).”

      (2) The additional discussion of sex differences in OEC bridging elaborates on the choice to study female rats, citing bladder challenges in male rats, but does not note salient clinical implications of this choice. Men account for ~80% of spinal cord injuries and likely also have worsened urinary tract issues, so it would be important to acknowledge this clinical fact and consider including males in future studies.

      Response: We agree that studying SCI repair in male rodents is very important as most people with these injuries are male. We did find one publication by Walker et al. (2019, Journal of Neurotrauma 36:1974-1984) that looked at sex differences in aged-matched male and female rats after a moderate contusion SCI. They examined a number of histological and functional features, and did not find many differences between the genders. Compared to studies of moderate SCI, studies using a completely transected spinal cord model must carry out manual bladder expressions a minimum of twice a day throughout the entire 5 to 7-month study in order to maintain kidney health. Because male urethras are much longer than those of females, males are much more likely that females to die from kidney disease during a complicated, long-term studies such as ours. Fortunately, most SCIs in humans are contusions rather than complete transections so an incomplete contusion model is most appropriate for studying sex differences. We modified the previous statement in our Discussion section as below.

      Page 25, Discussion

      “We acknowledge that in humans, males account for ~80% of spinal cord injuries (National Spinal Cord Injury Statistical Center, 2024) and sustain more serious urinary tract issues than females. We examined females in the current study due to practical experimental considerations, but it is necessary to examine males in future studies.”

      Recommendations for the authors:

      Reviewer #2 (Recommendations for the authors):

      (1) It is strongly recommended that some sort of quantification of bridging be included in the figures or in a table, whether this is the number of rats showing bridges, the percent area of OECs near the lesion site, or some other meaningful analysis.

      As discussed in the response in Challenge section (1) above, we observed bridges in 3/13 OEC transplanted rats vs 0/16 controls across our two most recent studies. In addition, we added evidence of physiological and anatomical axonal connections across the injury site from our previous studies. We have added the additional information in the Introduction, Results, and Figure legend 1.

      (2) It is recommended that clinical sex differences in spinal cord injury (with ~80% occurring in men) be acknowledged in the Discussion. This clinical fact could be directly mentioned without much justification.

      See Challenge (2) above and addition to the Discussion on page 25.

      (3) Figs. 1, 5, 6: There is still no quantification included for these figures, which detracts from the ability of readers to understand the context and importance of these results. It is recommended to include quantification for these figures.

      Response regarding quantification associated with Figures 1, 5 and 6:

      Regarding Figure 1: We have discussed the additions to the text of the Introduction, Results and the legend of Figure 1 in detail on pages 2-3 of this response. These are important new additions to our paper.

      Regarding Figure 5: We added quantitative information regarding the analysis of Connective Tissue Growth Factor (Ctgf) expression in the injury site.

      Page 10-11, Results:

      “We found high levels of Ctgf expression in GFP-OECs (n=4 rats) that bridged much of the injury site and also detected Ctgf on near-by cells (Figure 5d, d1-2). GFP-labeled fibroblast transplantations (n=3 rats) served as controls and also expressed Ctgf.”

      Page 36, Methods:

      “To examine Ctgf expression in the spinal cord lesion site, we processed 1 slide per animal with ~6 equally-spaced sagittal sections throughout spinal cord from the Khankan et al. (2016) study. Our aim was to assess if transplanted OECs (n=4 rats) and transplanted fibroblasts (n=3 rats) express CTGF in the injury site.”

      Regarding Figure 6: The statistics for Figure 6 are found on page 13 of the Results section and page 38 of the Methods section. We now added the statistics to the Figure 6 legend on page 49.

      Page 13, Results:

      “To determine if the proliferative OECs differ in appearance from adult OECs, and whether there is concordance between our OEC subtypes based on gene expression markers and previously described morphology-based OEC subtyping (Franceschini & Barnett, 1996), we analyzed OECs identified with the anti-Ki67 nuclear marker and anti- Ngfr<sup>p75</sup> (Figure 6g-h). Of the Ki67-positive OECs in our cultures, 24% ± 8% were strongly Ngfr<sup>p75</sup>-positive and spindle-shaped, whereas 76% ± 8% were flat and weakly Ngfr<sup>p75</sup>-labeled (n=4 cultures, p\= 0.023). Here we show that a large percentage (~3/4<sup>ths</sup>) of proliferative OECs are characterized by large, flat morphology and weak Ngfr<sup>p75</sup> expression resembling the previously described morphology-based astrocyte-like subtype. Our results indicate the two types of OEC classifications share certain degrees of overlap, indicating similarities but also differences between the two classification methods.”

      Page 38, Methods: Morphological analyses of Ki67 OEC subtypes

      “To determine if OEC progenitor cells marked with Ki67 immunoreactivity have a distinctive morphology, purified and fixed OEC cultures from 4 rats were processed with anti- Ngfr<sup>p75</sup>, anti-Ki67 and counterstained with Hoechst (Bis-benzimide, 1:500, Sigma-Aldrich, #B2261). Images were acquired from 7-10 randomly selected fields/sample using an Olympus AX70 microscope and Zen image processing and analysis software (Carl Zeiss). We distinguished the larger, flat ‘astrocyte-like’ OECs from the smaller, fusiform ‘Schwann cell-like’ OECs, and recorded their expression of Ngfr<sup>p75</sup> and Ki67. Cell counts from each field were averaged per rat and then averaged into a group mean ± SEM. A Student t-test was conducted to compare the effect of Ngfr<sup>p75</sup>-labeled cell morphology and the proliferative marker Ki67. Statistical significance was determined by p < 0.05.”

      Page 49, Figure 6 legend:

      “Of the OEC progenitors that express Ki67, 76% ± 8 of them display low levels of Ngfr<sup>p75</sup> immunoreactivity and a “flat” morphology (g2, h2; green nuclei, arrowheads). The remainder of Ki67-expressing OECs express high levels of Ngfr<sup>p75</sup> and are fusiform in shape (24% ± 8%, n=4 cultures, Student-t test, p= 0.023).”

      (4) Fig. 9: Quantification is still not included in the figure for these Western blots, although it is appreciated that the authors included some quantification in their response letter. Including this in the figure would provide clarification for the reader.

      Thank you for your suggestion. We now add the quantification to figure 9, together with the methods used for western blot quantification and the figure legend.

      Page 32, Methods:

      “For quantification, ImageJ software (NIH) was used to analyze the densitometric data. Western blot images at 400, 300, and 150 kDa resolution were converted to grayscale followed by manually defining a Region of Interest (ROI) frame that captured the entire band in each lane using the "Rectangular" tool. The area of each selected band was measured by employing the same ROI frame around the band to record the integrated density, “Grey Mean Value”. Background measurements were similarly quantified, and background subtraction was performed by deducting the inverted background from the inverted band value. For relative quantification, target protein bands were normalized to the corresponding loading control (GAPDH) to derive normalized protein expression (fold change). Band intensities were quantified in triplicate for each sample. Data were analyzed with the Mann-Whitney U test to compare normalized protein expression between the Reln<sup>-/-</sup> group and the other groups. A one-sided p-value was calculated to test the hypothesis that protein expression levels in the other groups are greater than those in the Reln<sup>-/-</sup> group (negative control). Statistical significance was determined at p < 0.05. Analysis was performed using GraphPad Prism (version 9).”

      Page 52, Figure legend 9f:

      “(f) Quantitation of multiple isoforms of Reelin from 4-15% gradient gels. Positive and negative controls are Reln<sup>+/+</sup> and Reln<sup>-/-</sup> mouse cortices. Both rat tissue from the ONL (n=3) and CM (n=9) contain more 400 and 300 kDa Reelin compared to the Reln<sup>-/-</sup> mouse. Bars represent the standard deviation of the mean. One-sided Mann-Whitney U test was used to test that protein expression levels in the other groups are greater than those in the Reln<sup>-/-</sup> group, indicative of significant expression of Reln in the test groups. *p < 0.05.”

    1. Author response:

      The following is the authors’ response to the original reviews

      Reviewer #1 (Public Review):

      Summary.

      In this meticulously conducted study, the authors show that Drosophila epidermal cells can modulate escape responses to noxious mechanical stimuli. First, they show that activation of epidermal cells evokes many types of behaviors including escape responses. Subsequently, they demonstrate that most somatosensory neurons are activated by activation of epidermal cells, and that this activation has a prolonged effect on escape behavior. In vivo analyses indicate that epidermal cells are mechanosensitive and require stored-operated calcium channel Orai. Altogether, the authors conclude that epidermal cells are essential for nociceptive sensitivity and sensitization, serving as primary sensory noxious stimuli.

      Strengths.

      The manuscript is clearly written. The experiments are logical and complementary. They support the authors' main claim that epidermal cells are mechanosensitive and that epidermal mechanically evoked calcium responses require the stored-operated calcium channel Orai. Epidermal cells activate nociceptive sensory neurons as well as other somatosensory neurons in Drosophila larvae, and thereby prolong escape rolling evoked by mechanical noxious stimulation.

      Weaknesses.

      Core details are missing in the protocols, including the level of LED intensity used, which are necessary for other researchers to reproduce the experiments. For most experiments, the epidermal cells are activated for 60 s, which is long when considering that nocifensive rolling occurs on a timescale of milliseconds. It would be informative to know the shortest duration of epidermal cell activation that is sufficient for observing the behavioral phenotype (prolongation of escape behavior) and activation of sensory neurons.

      (1) We agree with the reviewer that the LED intensity is an important detail of the experimental paradigm. We updated the methods to include intensity measurements for the stimuli used throughout the manuscript.

      (2) The Reviewer asks about the shortest duration of epidermal cell activation sufficient for observing the behavior phenotype. We note in the manuscript that behavioral responses to optogenetic epidermal stimulation are apparent within 2 seconds of stimulus (see Figure 2F); this is consistent with our calcium imaging data in which C4da response reaches its maximum within 2-3 sec of stimulation.

      Reviewer #1 (Recommendations):

      (1) The epidermal cells in this study are activated for 60 s. In the real world, the nociceptive stimulation (a poke, such as penetration by the ovipositor of a parasitic wasp) that evokes escape rolling is short. Does optogenetic activation of 1 s or less still evoke rolling? For example, it is unclear in Figure 4K how long the epidermal cells need to be activated before the poke stimulus prolongs rolling. Is it possible to test behavior and GCaMP activity in sensory neurons when epidermal cells are briefly (1 second) activated?

      As described above, behavioral responses to optogenetic epidermal stimulation are apparent within 2 seconds of stimulus (see Figure 2F); this is consistent with our calcium imaging data in which C4da response reaches its maximum within 2-3 sec of stimulation. The kinetics are consistent with a role for epidermal cells in modulating neuronal responses to nocifensive stimuli, and similar to the response kinetics observed in mammalian epidermal cells that modulate neuronal touch and pain responses  (Maksimovic et al., 2014; Woo et al., 2014; Mikesell et al., 2022).

      (2) The protocol for optogenetic screening states that the authors used a 488-nm LED. Why was a 488-nm LED used instead of the 610-nm LED for Chrimson activation? No information (except figure 4K) about the light intensity is provided in the figure legend or the protocol section. Please state the LED intensity used for all optogenetic experiments (GCaMP imaging, behavioral experiments, etc.).

      We used 488 nm light for the initial screen for technical reasons. The screen was conducted by students at the MBL Neurobiology course (hence the affiliation; student authors are included in the manuscript), and the only LED available to us at that time delivered insufficient illumination at longer wavelenths to be useful. We chose to include the student’s data because (1) we found that the 488 nm light alone did not induce rolling in our setup, (2) we repeated and extended the studies with the epidermal drivers using a higher resolution imaging platform and longer wavelength stimulation (all studies other than Fig. 1), and (3) we observed qualitatively similar results when we repeated stimulation with all drivers using 561 nm light.

      We agree that the LED intensity is an important detail of the experimental paradigm. We updated the methods to include intensity measurements for the stimuli used throughout the manuscript. We also include the intensities here:

      - 30 μW/mm^2 for calcium imaging experiments Fig 3B-E, Fig 4A, Fig 3S1A-D, Fig 4S1A

      - 300 μW/mm^2 for behavior studies in Fig 2B-E, Fig 1S6, Fig 2S1, Fig 3E-F, Fig 3S2A-C

      - 25 μW/mm^2 for behavior studies in Fig 4E-J

      - 1.16 μW/mm^2 for behavior studies in Fig 4K

      (3) Lines 150 - 152: Although the authors refer to "a stereotyped behavior sequence" in Fig 2D, there are no data supporting this claim in Fig 2. Rather, the data appear to represent proportions of different types of behavior at each time point, rather than behavior sequences. If the authors wish to claim that the data show stereotyped behavior sequences, they should analyze the data using a different method (e.g., Markov models).

      We agree that in the absence of additional analysis we should avoid commenting on stereotypy of behavior sequences; we therefore adjusted the text to reflect the tendency of nociceptive behaviors to precede non-nociceptive behaviors. The raster plots shown in Supplemental Fig. 2A illustrate this point: in larvae exhibiting nociceptive behaviors, these behaviors appear first, followed by backing and frequently freezing. As one quantitative readout of this sequence we show that the latency of rolling (nociceptive) is shorter compared with backing or freezing (non-nociceptive) (Fig. 2F, Fig. S2G).

      (4) Figure 3A-E: a cursory glance at the data suggests that the most responsive sensory neurons are C1da, with all sensory neurons activated. However, at the behavioral level, only some sensory neurons are activated. If all sensory were activated by Chrimson, what behavioral phenotypes would the authors expect to see? Would it be the same as epidermal activation?

      The Reviewer raises an interesting question, but we intentionally avoid comparing the response properties among sensory neurons because of differences in driver strength. Likewise, extrapolating “activation” at the behavioral level is exceedingly difficult if/when multiple sensory neurons are simultaneously activated. In response to the Reviewer’s specific question, when all da neurons are activated simultaneously, larvae largely exhibited hunching rather than rolling (Hwang et al., 2007). We find that epidermal stimulation rarely elicits hunching; instead, epidermal stimulation generally triggers nocifensive behaviors followed by non-nocifensive behaviors such as backing and freezing, suggesting an order or priority in neurons activated by epidermal cells (or different response times). Defining the mechanisms by which epidermal cells communicate with different types of sensory neurons is therefore a top priority for future studies.

      (5) Figure 3S2; The behavior phenotypes between Fig. 3E, F and Fig 3S2 seems a slightly different. I suggest adding some comments in different behavior phenotype depending on the different GAL4. Specifically, is there increased freezing in some genotypes (e.g., ppk-LexA or NompC-lexA)? Can you show this without TNT data? Is this a background effect or specific GAL4 phenotype?

      We currently do not have the driver-only control for this experiment, but our effector-only control experiment (see Fig. 3S2A) suggests that larvae carrying the AOP-TNT insertion exhibit enhanced nociceptive behavioral responses. This point is addressed in our manuscript by the following (copied from the figure legend):

      “We note that although baseline rolling probability is elevated in all genetic backgrounds containing the AOP-LexA-TnT insertion, silencing C4da and C3da neurons significantly attenuates responses to epidermal stimulation.”

      (6) Calcium-free solution is used in Figure 3. Why do the authors still observe calcium influx? Does this mean that internal calcium stores are released? If so, does the calcium influx represent an action potential? How do the authors focus their LED stimulation to activate epidermal cells and avoid activation of the imaging laser?

      The specimens were imaged in calcium-free solution to minimize movement artifacts. However, the CNS is wrapped by glial cells and over short timescales such as those used for the imaging we speculate that extracellular calcium persists in the CNS.

      (7) It is unclear when animals begin to crawl after the epidermal cells are mechanically stimulated. How do the authors distinguish between peristaltic crawling and a poke by Orai receptors? Although the in vitro experiments beautifully show radial tensions, it is unclear to what extent A-P axis tension (peristaltic crawling) and radial tension (poke) differ. It might be helpful to explain in the discussion section how epidermal cells are selectively activated.

      The Reviewer raises an interesting question about the types and thresholds of forces required to elicit epidermal responses. We cannot eliminate the possibility that peristaltic crawling (or crawling through a 3D substrate) stimulates epidermal cells to a certain degree. Indeed, our results demonstrate a dose-dependent response of Drosophila epidermal cells and human keratinocytes to radial stretch. However, we do not have any information about selectivity in response to different stimuli, though we agree that this is an intriguing avenue for future studies. For example, we don't know whether stretch-responsive cells are more or less responsive to poke. But, a salient feature of our studies is the recruitment of greater numbers of responders with increasing stimulus intensity, therefore we added the following statement to the discussion to clarify our model:

      “Finally, we find that epidermal cells exhibit a dose-dependent response to radial stretch; we therefore anticipate that the output of epidermal cells is likewise dependent on the stimulus intensity.  Hence, rather than a fixed threshold beyond which epidermal cells are selectively activated, we hypothesize that increasing stimulus intensities drive increasing signal outputs to neurons.”

      (8) Some Protocols are missing. For example, in Figure 4, many stimulus combinations were used to test behavior. How were stimuli of different modalities applied to the animals? Further details need to be provided in the protocols.

      We thank the Reviewer for identifying this oversight. The methods section of our original submission detailed most of the stimulus combinations but omitted the opto + mechano combination (4F). We updated our methods to correct these omissions.

      (9) It might be helpful if the authors could provide a sample video for each behavior to clarify how they were each defined.

      Our manuscript includes a table with a detailed description of the behaviors (Table S2), and we added two annotated videos that show representative behavioral responses to optogenetic nociceptor or epidermis stimulation.

      (10) A supplementary summary table of genotypes might be helpful for the reader.

      Experimental genotypes are provided in the figure legends, and a detailed list of all alleles used in the study as well as their source is provided in supplemental table S1.

      Reviewer #2 (Public Review):

      Summary.

      The authors provide compelling evidence that stimulation of epidermal cells in Drosophila larvae results in the stimulation of sensory neurons that evoke a variety of behavioral responses. Further, the authors demonstrate that epidermal cells are inherently mechanoresponsive and implicate a role for store-operated calcium entry (mediated by Stim and Orai) in the communication to sensory neurons.

      Strengths.

      The study represents a significant advance in our understanding of mechanosensation. Multiple strengths are noted. First, the genetic analyses presented in the paper are thorough with appropriate consideration to potential confounds. Second, behavioral studies are complemented by sophisticated optogenetics and imaging studies. Third, identification of roles for store-operated calcium entry is intriguing. Lastly, conservation of these pathways in vertebrates raise the possibility that the described axis is also functional in vertebrates.

      Weaknesses.

      The study has a few conceptual weaknesses that are arguably minor. The involvement of store-operated calcium entry implicates ER calcium store release. Whether mechanical stimulation evokes ER calcium release in epidermal cells and how this might come about (e.g., which ER calcium channels, roles for calcium-induced calcium release etc.) remains unaddressed. On a related note, the kinetics of store-operated calcium entry is very distinct from that required for SV release. The link between SOC and epidermal cells-neuron transmission is not reconciled. Finally, it is not clear how optogenetic stimulation of epidermal cells results in the activation of SOC.

      (1) The involvement of store-operated calcium entry implicates ER calcium store release. Whether mechanical stimulation evokes ER calcium release in epidermal cells and how this might come about (e.g., which ER calcium channels, roles for calcium-induced calcium release etc.) remains unaddressed.

      Our studies suggest that mechanically evoked responses in epidermal cells involve both ER calcium release and store-operated calcium entry. Notably, we show that depletion of ER calcium stores before mechanical stimulation, by treating with thapsigargin, reduces (but does not eliminate) mechanically evoked calcium responses in fly epidermal cells (Fig. 6C-6F). Likewise, fly epidermal cells and human keratinocytes both exhibit mechanically evoked calcium responses in the absence of extracellular calcium (10mM EGTA to chelate all free calcium ions). These data support a model whereby mechanical stimuli trigger calcium release from ER stores and influx. Indeed, several cell types have been shown to display mechanically evoked release of calcium from stores. For example, mechanical stimulation of enteroendocrine cells of the gut epithelium results in both calcium release from ER stores and calcium influx across the plasma membrane (Knutson et al., 2023). Similar to our findings, Knutson et al found that depleting stores decreased mechanically evoked calcium signals by over 70% in these gut epithelial stores. In our revised manuscript we have more clearly emphasized these points.

      We agree with the reviewer that deciphering the mechanisms by which mechanical stimuli promote ER calcium release and subsequent store-operated calcium entry is an exciting topic to explore. One potential mechanism is the activation of a mechanosensitive receptor that promotes calcium release from the ER via calcium-induced calcium release or IP3 production, as has been proposed for enteroendocrine cells. A recent paper demonstrated that the ER itself is mechanosensitive and that mechanical stimuli promotes calcium release via the opening of calcium-permeable ion channels in the ER membrane (Song et al., 2024). Determining the relative contributions of store-operated calcium entry and ER calcium release and deciphering their underlying mechanisms will require a thorough investigation of ER calcium channels and receptors, thus we believe this would be beyond the scope of the present manuscript and merits publication on its own. However, we now include this in our discussion as an exciting new direction we aim to pursue.

      (2) The kinetics of store-operated calcium entry is very distinct from that required for SV release. The link between SOC and epidermal cells-neuron transmission is not reconciled.

      The Reviewer raises an interesting point regarding the mode of epidermal cell-neuronal communication. We demonstrated a requirement for dynamin-dependent vesicle release from epidermal cells in mechanical sensitization. However, the nature of the vesicular pool, the mode and kinetics of release, and the type of neuromodulator released remain to be characterized. Hence, it’s not clear that kinetics of synaptic vesicle release is an appropriate comparison. Our studies do demonstrate that behavioral responses to optogenetic epidermal stimulation are relatively slow – on the order of seconds – which is not incompatible with the kinetics of store-operated calcium entry. Furthermore, the primary functional output we define for epidermal mechanosensory responses, mechanical nociceptive sensitization, is apparent 10 sec following the stimulus and persists for minutes in our behavior assays. Consistent with this model, studies of the mammalian touch dome have shown that touch-sensitive Merkel cells secrete neurotransmitters to modulate neurons and promote sustained action potential firing on a similar timescale. Likewise, mechanically evoked ER calcium-release promotes sustained secretion of serotonin from enterochromaffin cells.

      (3) It is not clear how optogenetic stimulation of epidermal cells results in the activation of SOC.

      We appreciate the opportunity to clarify our results. We demonstrate that optogenetic epidermal stimulation elicits behavioral responses in larvae and calcium responses in somatosensory neurons, but we do not claim that optogenetic epidermal stimulation elicits SOC. Our optogenetic studies demonstrate the capacity for epidermal stimulation to modulate somatosensory function, but we characterize contributions of SOC only to mechanical stimuli which are more physiologically relevant. However, it is worth noting that CsChrimson is a calcium-permeable channel, suggesting that an increase in intracellular calcium may trigger epidermal-evoked neuronal responses and behaviors during optogenetic stimulation.

      References

      Hwang, RY, Zhong, L, Xu, Y, Johnson, T, Zhang, F, Deisseroth, K, and Tracey, WD (2007). Nociceptive neurons protect Drosophila larvae from parasitoid wasps. Curr Biol 17, 2105–2116.

      Knutson, KR, Whiteman, ST, Alcaino, C, Mercado-Perez, A, Finholm, I, Serlin, HK, Bellampalli, SS, Linden, DR, Farrugia, G, and Beyder, A (2023). Intestinal enteroendocrine cells rely on ryanodine and IP3 calcium store receptors for mechanotransduction. J Physiol 601, 287–305.

      Maksimovic, S, Nakatani, M, Baba, Y, Nelson, AM, Marshall, KL, Wellnitz, SA, Firozi, P, Woo, S-H, Ranade, S, Patapoutian, A, et al. (2014). Epidermal Merkel cells are mechanosensory cells that tune mammalian touch receptors. Nature 509, 617–621.

      Mikesell, AR, Isaeva, O, Moehring, F, Sadler, KE, Menzel, AD, and Stucky, CL (2022). Keratinocyte PIEZO1 modulates cutaneous mechanosensation. Elife 11, e65987.

      Song, Y, Zhao, Z, Xu, L, Huang, P, Gao, J, Li, J, Wang, X, Zhou, Y, Wang, J, Zhao, W, et al. (2024). Using an ER-specific optogenetic mechanostimulator to understand the mechanosensitivity of the endoplasmic reticulum. Dev Cell 59, 1396-1409.e5.

      Woo, S-H, Ranade, S, Weyer, AD, Dubin, AE, Baba, Y, Qiu, Z, Petrus, M, Miyamoto, T, Reddy, K, Lumpkin, EA, et al. (2014). Piezo2 is required for Merkel-cell mechanotransduction. Nature 509, 622–626.

    1. Reviewer #1 (Public review):

      Summary:

      The authors show certain memory deficits in a mouse knock-in model of Alzheimer's Disease (AD). They show that the observed memory deficits can be explained by a computational model, the latent cause model of associative memory. The memory tasks used include the fear memory task (CFC) and the 'reverse' Barnes maze. Research on AD is important given its known huge societal burden. Likewise, better characterization of the behavioral phenotypes of genetic mouse models of AD is also imperative to advance our understanding of the disease using these models. In this light, I applaud the authors' efforts.

      Strengths:

      (1) Combining computational modelling with animal behavior in genetic knock-in mouse lines is a promising approach, which will be beneficial to the field and potentially explain any discrepancies in results across studies as well as provide new predictions for future work.

      (2) The authors' usage of multiple tasks and multiple ages is also important to ensure generalization across memory tasks and 'modelling' of the progression of the disease.

      Weaknesses:

      (1) I have some concerns regarding the interpretation of the behavioral results. Since the computational model then rests on the authors' interpretation of the behavioral results, it, in turn, makes judging the model's explanatory power difficult as well. For the CFC data, why do knock-in mice have stronger memory in test 1 (Figure 2C)? Does this mean the knock-in mice have better memory at this time point? Is this explained by the latent cause model? Are there some compensatory changes in these mice leading to better memory? The authors use a discrimination index across tests to infer a deficit in re-instatement, but this indicates a relative deficit in re-instatement from memory strength in test 1. The interpretation of these differential DIs is not straightforward. This is evident when test 1 is compared with test 2, i.e., the time point after extinction, which also shows a significant difference across groups, Figure 2F, in the same direction as the re-instatement. A clarification of all these points will help strengthen the authors' case

      (2) I have some concerns regarding the interpretation of the Barnes maze data as well, where there already seems to be a deficit in the memory at probe test 1 (Figure 6C). Given that there is already a deficit in memory, would not a more parsimonious explanation of the data be that general memory function in this task is impacted in these mice, rather than the authors' preferred interpretation? How does this memory weakening fit with the CFC data showing stronger memories at test 1? While I applaud the authors for using multiple memory tasks, I am left wondering if the authors tried fitting the latent cause model to the Barnes maze data as well.

      (3) Since the authors use the behavioral data for each animal to fit the model, it is important to validate that the fits for the control vs. experimental groups are similar to the model (i.e., no significant differences in residuals). If that is the case, one can compare the differences in model results across groups (Figures 4 and 5). Some further estimates of the performance of the model across groups would help.

      (4) Is there an alternative model the authors considered, which was outweighed in terms of prediction by this model? One concern here is also parameter overfitting. Did the authors try leaving out some data (trials/mice) and predicting their responses based on the fit derived from the training data?

    2. Reviewer #2 (Public review):

      Summary:

      This manuscript proposes that the use of a latent cause model for the assessment of memory-based tasks may provide improved early detection of Alzheimer's Disease as well as more differentiated mapping of behavior to underlying causes. To test the validity of this model, the authors use a previously described knock-in mouse model of AD and subject the mice to several behaviors to determine whether the latent cause model may provide informative predictions regarding changes in the observed behaviors. They include a well-established fear learning paradigm in which distinct memories are believed to compete for control of behavior. More specifically, it's been observed that animals undergoing fear learning and subsequent fear extinction develop two separate memories for the acquisition phase and the extinction phase, such that the extinction does not simply 'erase' the previously acquired memory. Many models of learning require the addition of a separate context or state to be added during the extinction phase and are typically modeled by assuming the existence of a new state at the time of extinction. The Niv research group, Gershman et al. 2017, have shown that the use of a latent cause model applied to this behavior can elegantly predict the formation of latent states based on a Bayesian approach, and that these latent states can facilitate the persistence of the acquisition and extinction memory independently. The authors of this manuscript leverage this approach to test whether deficits in the production of the internal states, or the inference and learning of those states, may be disrupted in knock-in mice that show both a build-up of amyloid-beta plaques and a deterioration in memory as the mice age.

      Strengths:

      I think the authors' proposal to leverage the latent cause model and test whether it can lead to improved assessments in an animal model of AD is a promising approach for bridging the gap between clinical and basic research. The authors use a promising mouse model and apply this to a paradigm in which the behavior and neurobiology are relatively well understood - an ideal situation for assessing how a disease state may impact both the neurobiology and behavior. The latent cause model has the potential to better connect observed behavior to underlying causes and may pave a road for improved mapping of changes in behavior to neurobiological mechanisms in diseases such as AD.

      Weaknesses:

      I have several substantial concerns which I've detailed below. These include important details on how the behavior was analyzed, how the model was used to assess the behavior, and the interpretations that have been made based on the model.

      (1) There is substantial data to suggest that during fear learning in mice separate memories develop for the acquisition and extinction phases, with the acquisition memory becoming more strongly retrieved during spontaneous recovery and reinstatement. The Gershman paper, cited by the authors, shows how the latent causal model can predict this shift in latent states by allowing for the priors to decay over time, thereby increasing the posterior of the acquisition memory at the time of spontaneous recovery. In this manuscript, the authors suggest a similar mechanism of action for reinstatement, yet the model does not appear to return to the acquisition memory state after reinstatement, at least based on the examples shown in Figures 1 and 3. Rather, the model appears to mainly modify the weights in the most recent state, putatively the 'extinction state', during reinstatement. Of course, the authors must rely on how the model fits the data, but this seems problematic based on prior research indicating that reinstatement is most likely due to the reactivation of the acquisition memory. This may call into question whether the model is successfully modeling the underlying processes or states that lead to behavior and whether this is a valid approach for AD.

      (2) As stated by the authors in the introduction, the advantage of the fear learning approach is that the memory is modified across the acquisition-extinction-reinstatement phases. Although perhaps not explicitly stated by the authors, the post-reinstatement test (test 3) is the crucial test for whether there is reactivation of a previously stored memory, with the general argument being that the reinvigorated response to the CS can't simply be explained by relearning the CS-US pairing, because re-exposure the US alone leads to increase response to the CS at test. Of course there are several explanations for why this may occur, particularly when also considering the context as a stimulus. This is what I understood to be the justification for the use of a model, such as the latent cause model, that may better capture and compare these possibilities within a single framework. As such, it is critical to look at the level of responding to both the context alone and to the CS. It appears that the authors only look at the percent freezing during the CS, and it is not clear whether this is due to the contextual US learning during the US re-exposure or to increased response to the CS - presumably caused by reactivation of the acquisition memory. For example, the instance of the model shown in Figure 1 indicates that the 'extinction state', or state z6, develops a strong weight for the context during the reinstatement phase of presenting the shock alone. This state then leads to increased freezing during the final CS probe test as shown in the figure. By not comparing the difference in the evoked freezing CR at the test (ITI vs CS period), the purpose of the reinstatement test is lost in the sense of whether a previous memory was reactivated - was the response to the CS restored above and beyond the freezing to the context? I think the authors must somehow incorporate these different phases (CS vs ITI) into their model, particularly since this type of memory retrieval that depends on assessing latent states is specifically why the authors justified using the latent causal model.

      (3) This is related to the second point above. If the question is about the memory processes underlying memory retrieval at the test following reinstatement, then I would argue that the model parameters that are not involved in testing this hypothesis be fixed prior to the test. Unlike the Gershman paper that the authors cited, the authors fit all parameters for each animal. Perhaps the authors should fit certain parameters on the acquisition and extinction phase, and then leave those parameters fixed for the reinstatement phase. To give a more concrete example, if the hypothesis is that AD mice have deficits in differentiating or retrieving latent states during reinstatement which results in the low response to the CS following reinstatement, then perhaps parameters such as the learning rate should be fixed at this point. The authors state that the 12-month-old AD mice have substantially lower learning rate measures (almost a 20-fold reduction!), which can be clearly seen in the very low weights attributed to the AD mouse in Figure 3D. Based on the example in Figure 3D, it seems that the reduced learning rate in these mice is most likely caused by the failure to respond at test. This is based on comparing the behavior in Figures 3C to 3D. The acquisition and extinction curves appear extremely similar across the two groups. It seems that this lower learning rate may indirectly be causing most of the other effects that the authors highlight, such as the low σx, and the changes to the parameters for the CR. It may even explain the extremely high K. Because the weights are so low, this would presumably lead to extremely low likelihoods in the posterior estimation, which I guess would lead to more latent states being considered as the posterior would be more influenced by the prior.

      (4) Why didn't the authors use the latent causal model on the Barnes maze task? The authors mention in the discussion that different cognitive processes may be at play across the two tasks, yet reversal tasks have been suggested to be solved using latent states to be able to flip between the two different task states. In this way, it seems very fitting to use the latent cause model. Indeed, it may even be a better way to assess changes in σx as there are presumably 12 observable stimuli/locations.

    1. Reviewer #1 (Public review):

      Summary:

      In the current study, Huang et al. examined ACC response during a novel discrimination-avoid task. The authors concluded that ACC neurons primarily encode post-action variables over extended periods, reflecting the animal's preceding actions rather than the outcomes or values of those actions. Specifically, they identified two subgroups of ACC neurons that responded to different aspects of the actions. This work represents admirable efforts to investigate the role of ACC in task-performing mice. However, in my opinion, alternative explanations of the data were not sufficiently explored, and some key findings were not well supported.

      Strengths:

      The development of the new discrimination-avoid task is applauded. Single-unit electrophysiology in task-performing animals represents admirable efforts and the datasets are valuable. The identification of different groups of encoding neurons in ACC can be potentially important.

      Weaknesses:

      One major conclusion is that ACC primarily encodes the so-called post-action variables (specifically shuttle crossing). However, only a single example session was included in Figure 2, while in Supplementary Figure 2 a considerable fraction of ACC neurons appears to respond to either the onset of movement or ramp up their activity prior to movement onset. How did the authors reach the conclusion that ACC preferentially respond to shuttle crossing?

      In Figure 4, it was concluded that ACC neurons respond to action independent of outcome. Since these neurons are active on both correct and incorrect shuttle but not stay trials, they seem to primarily respond to overt movement. If so, the rationale for linking ACC activity and adaptive behavior/associative learning is not very clear to me. Further analyses are needed to test whether their firing rates correlated with locomotion speed or acceleration/deceleration. On a similar note, to what extent are the action state neurons actually responding to locomotion-related signals? And can ACC activity actually differentiate correct vs. incorrect stays?

      Given that a considerable amount of ACC neurons encode 'action content', it is not surprising that by including all neurons the model is able to make accurate predictions in Figure 6. How would the model performance change by removing the content neurons?

      Moving on to Figure 7. Since Figure 4 showed that ACC neurons respond to movement regardless of outcome, it is somewhat puzzling how ACC activity can be linked to future performance.

      Two mice contributed about 50% of all the recorded cells. How robust are the results when analyzing mouse by mouse?

      Lastly, the development of the new discrimination-avoid task is applauded. However, a major missing piece here is to show the importance of ACC in this task and what aspects of this behavior require ACC.

    2. Reviewer #2 (Public review):

      Summary:

      The current dataset utilized a 2x2 factorial shuttle-escape task in combination with extracellular single-unit recording in the anterior cingulate cortex (ACC) of mice to determine ACC action coding. The contributions of neocortical signaling to action-outcome learning as assessed by behavioral tasks outside of the prototypical reward versus non-reward or punished vs non-punished is an important and relevant research topic, given that ACC plays a clear role in several human neurological and psychiatric conditions. The authors present useful findings regarding the role of ACC in action monitoring and learning. The core methods themselves - electrophysiology and behavior - are adequate; however, the analyses are incomplete since ruling out alternative explanations for neural activity, such as movement itself, requires substantial control analyses, and details on statistical methods are not clear.

      Strengths:

      (1) The factorial design nicely controls for sensory coding and value coding, since the same stimulus can signal different actions and values.

      (2) The figures are mostly well-presented, labeled, and easy to read.

      (3) Additional analyses, such as the 2.5/7.5s windows and place-field analysis, are nice to see and indicate that the authors were careful in their neural analyses.

      (4) The n-trial + 1 analysis where ACC activity was higher on trials that preceded correct responses is a nice addition, since it shows that ACC activity predicts future behavior, well before it happens.

      (5) The authors identified ACC neurons that fire to shuttle crossings in one direction or to crossings in both directions. This is very clear in the spike rasters and population-scaled color images. While other factors such as place fields, sensory input, and their integration can account for this activity, the authors discuss this and provide additional supplemental analyses.

      Weaknesses:

      (1) The behavioral data could use slightly more characterization, such as separating stay versus shuttle trials.

      (2) Some of the neural analyses could use the necessary and sufficient comparisons to strengthen the authors' claims.

      (3) Many of the neural analyses seem to utilize long time windows, not leveraging the very real strength of recording spike times. Specifics on the exact neural activity binning/averaging, tests, classifier validation, and methods for quantification are difficult to find.

      (4) The neural analyses seem to suggest that ACC neurons encode one variable or the other, but are there any that multiplex? Given the overwhelming evidence of multiplexing in the ACC a bit more discussion of its presence or absence is warranted.

    3. Reviewer #3 (Public review):

      Summary:

      The authors record from the ACC during a task in which animals must switch contexts to avoid shock as instructed by a cue. As expected, they find neurons that encode context, with some encoding of actions prior to the context, and encoding of neurons post-action. The primary novelty of the task seems to be dynamically encoding action-outcome in a discrimination-avoidance domain, while this is traditionally done using operant methods. While I'm not sure that this task is all that novel, I can't recall this being applied to the frontal cortex before, and this extends the well-known action/context/post-context encoding of ACC to the discrimination-avoidance domain.

      While the analysis is well done, there are several points that I believe should be elaborated upon. First, I had questions about several details (see point 3 below). Second, I wonder why the authors downplayed the clear action coding of ACC ensembles. Third, I wonder if the purported 'novelty' of the task (which I'm not sure of) and pseudo-debate on ACC's role undermines the real novelty - action/context/outcome encoding of ACC in discrimination-avoidance and early learning.

      Strengths:

      Recording frontal cortical ensembles during this task is particularly novel, and the analyses are sophisticated. The task has the potential to generate elegant comparisons of action and outcome, and the analyses are sophisticated.

      Weaknesses:

      I had some questions that might help me understand this work better.

      (1) I wonder if the field would agree that there is a true 'debate' and 'controversy' about the ACC and conflict monitoring, or if this is a pseudodebate (Line 34). They cite 2 very old papers to support this point. I might reframe this in terms of the frontal cortex studying action-outcome associations in discrimination-avoidance, as the bulk of evidence in rodents comes from overtrained operant behavior, and in humans comes from high-level tasks, and humans are unlikely to get aversive stimuli such as shocks.

      (2) Does the purported novelty of the task undermine the argument? While I don't have an exhaustive knowledge of this behavior, the novelty involves applying this ACC. There are many paradigms where a shock triggers some action that could be antecedents to this task.

      (3) The lack of details was confusing to me:

      a) How many total mice? Are the same mice in all analyses? Are the same neurons? Which training day? Is it 4 mice in Figure 3? Five mice in line 382? An accounting of mice should be in the methods. All data points and figures should have the number of neurons and mice clearly indicated, along with a table. Without these details, it is challenging to interpret the findings.

      b) How many neurons are from which stage of training? In some figures, I see 325, in some ~350, and in S5/S2B, 370. The number of neurons should be clearly indicated in each figure, and perhaps a table.

      c) Were the tetrodes driven deeper each day? The depth should be used as a regressor in all analyses?

      d) Was is really ACC (Figure 2A)? Some shanks are in M2? All electrodes from all mice need to be plotted as a main figure with the drive length indicated.

      e) It's not clear which sessions and how many go into which analysis

      f) How many correct and incorrect trials (<7?) are there per session?

      g) Why 'up to 10 shocks' on line 358? What amplitudes were tried? What does scrambled mean?

      (4) Why do the authors downplay pre-action encoding? It is clearly evident in the PETHs, and the classifiers are above chance. It's not surprising that post-shuttle classification is so high because the behavior has occurred. This is most evident in Figure S2B, which likely should be a main figure.

      (5) The statistics seem inappropriate. A linear mixed effects model accounting for between-mouse variance seems most appropriate. Statistical power or effect size is needed to interpret these results. This is important in analyses like Figure 7C or 6B.

      (6) Better behavioral details might help readers understand the task. These can be pulled from Figures S2 and S5. This is particularly important in a 'novel' task.

      (7) Can the authors put post-action encoding on the same classification accuracy axes as Figure 6B? It'd be useful to compare.

      (8) What limitations are there? I can think of several - number of animals, lack of causal manipulations, ACC in rodents and humans.

      Minor:

      (1) Each PCA analysis needs a scree plot to understand the variance explained.

      (2) Figure 4C - y and x-axes have the same label?

      (3) What bin size do the authors use for machine learning (Not clear from line 416)?

      (4) Why not just use PCA instead of 'dimension reduction' (of which there are many?)

      (5) Would a video enhance understanding of the behavior?

    1. Reviewer #2 (Public review):

      This study provides some interesting observations on how different flavors of e-cigarettes can affect lung immunology, however there are numerous flaws including a low number of replicates and a lack of effective validation methods which reduces the robustness and rigor of the findings.

      Strengths:

      The strength of the study is the successful scRNA-seq experiment which gives good preliminary data that can be used to create new hypotheses in this area.

      Weaknesses:

      The major weakness is the low number of replicates and the limited analysis methods. Two biological n per group is not acceptable to base any solid conclusions. Any validatory data was too little (only cell % data) and did not always support the findings (e.g. Figure 4D does not match 4C). Often n seems to be combined and only one data point is shown, it is not at all clear how the groups were analysed and how many cells in each group were compared.

      Other specific weaknesses were identified in addition to the ones above:

      (1) Only 71,725 cells means only 7,172 per group, which is 3,586 per animal - how many of these were neutrophils, T-cells, and macrophages? This was not shown and could be too low.

      (2) The dynamic range of RNA measurement using scRNAseq is known to be limited - how do we know whether genes are not expressed or just didn't hit detection? This links into the Ly6G negative neutrophil comment, but in general, the lack of gene expression in this kind of data should be viewed with caution, especially with a low n number and few cells.

      (3) There is no rigorous quantification of Ly6G+ and Ly6G- cells int he flow cytometry data.

      (4) Eosinophils are heavily involved in lung biology but are missing from the analysis.

      (5) The figures had no titles so were difficult to navigate.

      (6) PGVG is not defined and not introduced early enough.

      (7) Neutrophils are not well known to proliferate, so any claims about proliferation need to be accompanied by validation such as BrdU or other proliferation assays.

      (8) It was not clear how statistics were chosen and why Table S2 had a good comparison (two-way ANOVA with gender as a variable) but this was not used for other data particularly when looking at more functional RNA markers (Table S2 also lacks the interaction statistic which is most useful here).

      (9) Many statistics are only vs air control, but it would be more useful as a flavour comparison to see these vs PGVG. In some cases, the carrier PGVG looks worse than some of the flavours (which have nicotine).

      (10) The n number is a large issue, but in Figures such as 4, 6, and 7 it could be a bigger factor. The number of significant genes identified has been determined by chance rather than any real difference, e.g. Is Il1b not identified in Fruit flavour vs air because there wasn't enough n, while in Air vs Tobacco, it randomly hit the significance mark. This is but an example of the problems with the analysis and conclusions

      (11) The data in Figure 7A is confusing, if this is a comparison to air, then why does air vs air not equal 1? Even if this was the comparison to the average of air between males and females, then this doesn't explain why CCL12 is >1 in both. Is this z-score instead? Regardless the data is difficult to interpret in this format.

      (12) Individual n was not shown for almost all experiments - e.g. Figure 1D - what is this representative of? Figure 2D - is this bulk-grouped data for all cells and all mice? The heatmaps are also pooled from 2n and don't show the variability.

    2. Reviewer #3 (Public review):

      This work aims to establish cell-type specific changes in gene expression upon exposure to different flavors of commercial e-cigarette aerosols compared to control or vehicle. Kaur et al. conclude that immune cells are most affected, with the greatest dysregulation found in myeloid cells exposed to tobacco-flavored e-cigs and lymphoid cells exposed to fruit-flavored e-cigs. The up-and-down-regulated genes are heavily associated with innate immune response. The authors suggest that a Ly6G-deficient subset of neutrophils is found to be increased in abundance for the treatment groups, while gene expression remains consistent, which could indicate impaired function. Increased expression of CD4+ and CD8+ T cells along with their associated markers for proliferation and cytotoxicity is thought to be a result of activation following this decline in neutrophil-mediated immune response.

      Strengths:

      (1) Single-cell sequencing data can be very valuable in identifying potential health risks and clinical pathologies of lung conditions associated with e-cigarettes considering they are still relatively new.

      (2) Not many studies have been performed on cell-type specific differential gene expression following exposure to e-cig aerosols.

      (3) The assays performed address several factors of e-cig exposure such as metal concentration in the liquid and condensate, coil composition, cotinine/nicotine levels in serum and the product itself, cell types affected, which genes are up- or down-regulated and what pathways they control.

      (4) Considerations were made to ensure clinical relevance such as selecting mice whose ages corresponded with human adolescents so that the data collected was relevant.

      Weaknesses:

      (1) The exposure period of 1 hour a day for 5 days is not representative of chronic use and this time point may be too short to see a full response in all cell types. The experimental design is not well-supported based on the literature available for similar mouse models.

      (2) Several claims lack supporting evidence or use data that is not statistically significant. In particular, there were no statistical analyses to compare results across sex, so conclusions stating there is a sex bias for things like Ly6G+ neutrophil percentage by condition are observational.

      (3) Statistical analyses lack rigor and are not always displayed with the most appropriate graphical representation.

      (4) Overall, the paper and its discussion are relatively limited and do not delve into the significance of the findings or how they fit into the bigger picture of the field.

      (5) The manuscript lacks validation of findings in tissue by other methods such as staining.

      (6) This paper provides a foundation for follow-up experiments that take a closer look at the effects of e-cig exposure on innate immunity. There is still room to elaborate on the differential gene expression within and between various cell types.

    3. Author response:

      Public Reviews:

      Reviewer #1 (Public review):

      Summary:

      The authors tackled the public concern about E-cigarettes among young adults by examining the lung immune environment in mice using single-cell RNA sequencing, discovering a subset of Ly6G- neutrophils with reduced IL-1 activity and increased CD8 T cells following exposure to tobacco-flavored e-cigarettes. Preliminary serum cotinine (nicotine metabolite) measurements validated the effective exposure to fruit, menthol, and tobacco-flavored e-cigarettes with air and PG/VG serving as control groups. They also highlighted the significance of metal leaching, which fluctuated over different exposure durations to flavored e-cigarettes, underscoring the inherent risks posed by these products. The scRNAseq analysis of e-cig exposure to flavors and tobacco demonstrated the most notable differences in the myeloid and lymphoid immune cell populations. Differentially expressed genes (DEGs) were identified for each group and compared against the air control. Further sub-clustering revealed a flavor-specific rise in Ly6G- neutrophils and heightened activation of cytotoxic T cells in response to tobacco-flavored e-cigarettes. These effects varied by sex, indicating that immune changes linked to e-cig use are dependent on gender. By analyzing the expression of various genes and employing gene ontology and gene enrichment analysis, they identified key pathways involved in this immune dysregulation resulting from flavor exposure. Overall, this study affirmed that e-cigarette exposure can suppress the neutrophil-mediated immune response, subsequently enhancing T cell toxicity in the lung tissue of mice.

      Strengths:

      This study used single-cell RNA sequencing to comprehensively analyze the impact of e-cigarettes on the lung. The study pinpointed alterations in immune cell populations and identified differentially expressed genes and pathways that are disrupted following e-cigarette exposure. The manuscript is well written, the hypothesis is clear, the experiments are logically designed with proper control groups, and the data is thoroughly analyzed and presented in an easily interpretable manner. Overall, this study suggested novel mechanisms by which e-cigs impact lung immunity and created a dataset that could benefit the lung immunity field.

      We thank the reviewer for identifying the strengths of our work.

      Weaknesses:

      The authors included a valuable control group - the PG/VG group, since PG/VG is the foundation of the e-liquid formulation. However, most of the comparative analyses use the air group as the control. Further analysis comparing the air group to the PG/VG group, and the PG/VG group to the individual flavored e-cig groups will provide more clear insights into the true source of irritation. This is done for a few analyses but not consistently throughout the paper. Flavor-specific effects should be discussed in greater detail. For example, Figure 1E shows that the Fruit flavor group exhibits more severe histological pathology, but similar effects were not corroborated by the single-cell data.

      We thank the reviewer for this query. We agree that PG/VG group is the foundation of the e-liquid formulation and hence comparisons with this group is of significance to understand the effect of individual flavors on the cell population. Though we compared the flavored e-cig groups with PG/VG group, we did not discuss it in detail within the manuscript to avoid confusions in interpretation for such a big dataset. However, we will include the comparisons with the PG/VG group as a Supplement File in our revised manuscript to facilitate proper interpretation of our omics data to interested readers.

      While we agree that flavor-specific effects might be of interest, we did not delve into exploring them in detail as the fruit flavored e-liquids have now been regulated for sale in the US. Thus, from regulatory point of view, the effects of tobacco- and menthol-flavored e-liquids hold most interest. Since at the time of conducting this study, fruit flavors were in the market, we have still included the data. However, studying it further was not the focus of this work. Nevertheless, interested readers of our manuscript can have access to our dataset to allow further analyses and interpretation of our results.

      The characterization of Ly6g+ vs Ly6g- neutrophils is interesting and potentially very impactful. Key results like this from scRNAseq analyses should be validated by qPCR and flow cytometry.

      Also, a recent study by Ruscitti et al reported Ly6g+ macrophages in the lung which can potentially confound the cell type analysis. A more detailed marker gene and sub-population analysis of the myeloid clusters could rule out this potential confounding factor.

      We agree with the reviewer that the loss of Ly6G on neutrophils is a very interesting find and we are in process of designing neutrophil specific experiments to study the impact of e-cig exposure on neutrophil maturation and function which will be discussed in subsequent work by our group. However, to address the concerns raised by the reviewer, we are staining the lung tissue samples from air-and differently flavored e-cig aerosol exposed mouse lungs with Ly6G and S100A8 (universal marker for neutrophil) to see the infiltration of Ly6g+ vs Ly6g- neutrophils within the lungs of exposed and unexposed mice. This would also address the question if these populations were neutrophils or belong to another myeloid origin as suggested by recent publications. We will share the results from our findings in the revised manuscript and update our interpretations accordingly with better validations.

      Reviewer #2 (Public review):

      This study provides some interesting observations on how different flavors of e-cigarettes can affect lung immunology, however there are numerous flaws including a low number of replicates and a lack of effective validation methods which reduces the robustness and rigor of the findings.

      Strengths:

      The strength of the study is the successful scRNA-seq experiment which gives good preliminary data that can be used to create new hypotheses in this area.

      We appreciate the reviewer for recognizing the strength of this work.

      Weaknesses:

      The major weakness is the low number of replicates and the limited analysis methods. Two biological n per group is not acceptable to base any solid conclusions. Any validatory data was too little (only cell % data) and did not always support the findings (e.g. Figure 4D does not match 4C). Often n seems to be combined and only one data point is shown, it is not at all clear how the groups were analyzed and how many cells in each group were compared.

      We thank the reviewer for the critique to allow us to improve our analyses. We understand that the low number of replicates in this work makes the analyses difficult to draw solid conclusions, but this was a pilot study to understand the changes in the mouse lung upon acute exposures to flavored e-cig aerosols at a single cell level. So far, the e-cig field has been primarily focused on conducting toxicological studies to help regulatory bodies to set standards and enforce laws to better regulate the manufacture, sale and distribution of e-cig products. However, adolescents and young adults are still getting access to these products, and there is little to no understanding of how this may affect the lung health upon acute and chronic exposures. Single cell technology is a powerful tool to analyze the gene expression changes within cell populations to study cell heterogeneity and function. Yet, it is a costly tool, owing to which, conducting such analyses on large sample sizes is not ideal. This pilot study was designed to get some initial leads for future studies involving larger sample sizes and chronic exposures. Further, we still intend to share our results with the scientific community due to the value of such a dataset for a wider audience interested in learning about the mechanistic underpinnings of e-cig exposures in vivo.

      We understand that the validations are limited in our current work and so we are in process of conducting some immunostaining to validate a few targets made through this work. We also want to add here that validating single cell findings using any of the classical methods of experimentation including ELISA, qPCR or flow cytometry is sometimes difficult as many of these techniques still investigate the tissue while the changes shown in single cell analyses are mainly pertaining to a single cell type. This could be a probable reason for the scRNA seq results not aligning with our findings from flow cytometry. The data/findings from this pilot study have now allowed us to be better informed to design an effective flow panel for our future studies. In terms of the statistics and the number of cells for each analysis, we will share the detailed account and information for each to allow better interpretation of our results.

      Only 71,725 cells means only 7,172 per group, which is 3,586 per animal - how many of these were neutrophils, T-cells, and macrophages? This was not shown and could be too low.

      We do agree that the number of cells could be too low, but to avoid this we never studied the gene expression variations at the finest level of cell identity. We classified the cell clusters into general annotations -myeloid, lymphoid, endothelial, stromal and epithelial- and identified the changes in the gene expressions. Of these, only two clusters (myeloid and lymphoid) with more than ~1000 cells per cell type per group were studied in detail. We will include the cell count information to allow better interpretation of our results in the revised manuscript.

      The dynamic range of RNA measurement using scRNAseq is known to be limited - how do we know whether genes are not expressed or just didn't hit detection? This links into the Ly6G negative neutrophil comment, but in general, the lack of gene expression in this kind of data should be viewed with caution, especially with a low n number and few cells.

      This is a well-made point, and we thank the reviewer for this comment. We agree that the dynamic range RNA measurement is limited and for low cell numbers that could lead to bias. We are in process of validating the findings regarding the presence of Ly6G+ and Ly6G- cells in our control and treated lungs, the outcome of which will be discussed in the revised manuscript. We will also provide the cell number for the Ly6G- cell cluster for each sample with more detailed discussion of our findings. Due to the small sample size and cell capture, few limitations are hard to overcome which will be further elaborated upon in our revisions.

      There is no rigorous quantification of Ly6G+ and Ly6G- cells in the flow cytometry data.

      We understand that flow-based quantification of our scRNA seq findings would be interesting. However, flow cytometry and single cell suspension to perform sequencing were performed parallelly for this study. We used a basic flow panel using single markers to identify individual immune cell type. We did identify changes in the Ly6G population in our treated and control samples using scRNA seq and intend to include it as a marker for our future studies using flow cytometry. But unfortunately, the same analyses could not be performed for the current batch of samples. We will still include results from IHC staining to identify the Ly6G+ and Ly6G- population in the lung tissues from control and treated mice in revised manuscript to address some of the concerns raised here.

      Eosinophils are heavily involved in lung biology but are missing from the analysis.

      We used RBC lysis buffer to remove the excess RBCs during lung digestion for preparation of single cell suspension for scRNA seq in this study. Reports suggest that RBC lysis could adversely affect the eosinophil number and function. We did not identify any cell cluster, representing markers for eosinophils through our scRNA seq data and we believe that our lung digestion protocol could be the reason for the same. We have studied the eosinophil number changes through flow cytometry in these samples and have found significant changes as well. However due to our inability to find cell clusters for eosinophil through scRNA seq data, we did not include these results in the final manuscript. To avoid confusions and maintain transparency we will include our results from flow cytometry experiments in the revised manuscript.

      The figures had no titles so were difficult to navigate.

      We will make necessary adjustments to the data representation and include the titles to enable easy navigation of the Figures.

      PG/VG is not defined and not introduced early enough.

      We agree that PG/VG is an important control to compare in e-cig studies. This was the reason why this group was included, and we performed comparisons with this group for scRNA seq studies as well. However, to reduce the complexity of the study, we only shared the comparisons with Air control in this manuscript. We will include the comparisons made with PG/VG group as a Supplementary File in the revised manuscript to allow the interested readers have access to the study results and make necessary interpretations for future research.

      Neutrophils are not well known to proliferate, so any claims about proliferation need to be accompanied by validation such as BrdU or other proliferation assays.

      We thank the reviewer for this suggestion; however, we cannot perform the BrDU or other proliferation assay on neutrophils for now. We are planning to include these in the study designs of our future work, however we have limitations of funds to continue further experimentation to support this claim for this study. We mention clearly that this is only a scRNA seq finding and requires further study to avoid over-interpretation of our results.

      It was not clear how statistics were chosen and why Table S2 had a good comparison (two-way ANOVA with gender as a variable) but this was not used for other data particularly when looking at more functional RNA markers (Table S2 also lacks the interaction statistic which is most useful here).

      We thank the reviewer for bringing this concern. We understand that this is a valid point and will include all the necessary information regarding the statistics and other related parameters in the revised manuscript.

      Many statistics are only vs air control, but it would be more useful as a flavor comparison to see these vs PG/VG. In some cases, the carrier PG/VG looks worse than some of the flavors (which have nicotine).

      We will include the comparisons with PG/VG as supplementary file in our revised manuscript, however we do not intend to describe all those changes in detail in the main manuscript.

      The n number is a large issue, but in Figures such as 4, 6, and 7 it could be a bigger factor. The number of significant genes identified has been determined by chance rather than any real difference, e.g. Is Il1b not identified in Fruit flavor vs air because there wasn't enough n, while in Air vs Tobacco, it randomly hit the significance mark. This is but an example of the problems with the analysis and conclusions.

      While we agree in part with the concern raised here, we wish to point out that there are limitations to every experiment. In our opinion, an omics study is not necessarily aimed to find the changes at transcript level with absolute certainty, rather to identify probable cell and gene targets to validate with subsequent work. We never claim that our findings are absolute outcomes but rather add the limitation of sample number and need for further research at every step. The strength of this work is to be the first study of its kind looking at changes in the lung cell population at single cell level upon e-cig aerosol exposure. This study has provided us with interesting gene and cell targets that we are now validating with future work. We still strongly believe that a dataset like this is a useful resource for a wider audience to allow efficient study designs and hence it is befitting to be published and discussed amongst our peers.

      The data in Figure 7A is confusing, if this is a comparison to air, then why does air vs air not equal 1? Even if this was the comparison to the average of air between males and females, then this doesn't explain why CCL12 is >1 in both. Is this z-score instead? Regardless the data is difficult to interpret in this format.

      We thank the reviewer for pointing this out. We realize that the data might be difficult to understand due to scaling of the color codes for the heatmap. We will change the graphical representation and include actual number for fold change in our revised manuscript to allow easy interpretation of these results.

      Individual n was not shown for almost all experiments - e.g. Figure 1D - what is this representative of? Figure 2D - is this bulk-grouped data for all cells and all mice? The heatmaps are also pooled from 2n and don't show the variability.

      While we have included a pictorial representation of the n number in Figure 1A and mentioned n number in the Figure legends for each figure, we understand that it maybe difficult to navigate. We will attempt to address this in a better manner in the revised manuscript.

      However, with respect to the second comment we would like to differ from the reviewer’s opinion. Each scRNA seq data had 2 samples – one for male and another for female which has been clearly shown in the current figures. The pooling of cells as mentioned in the comment happened at the stage of preparation of cell suspension from each sex/group at the start of the sequencing. We do not have any means to show the variability amongst pooled samples, which we acknowledge as a shortcoming of our work. So, in terms of representation of the heatmaps and data analyses we have included all the needed information to uphold transparency of our study design and data visualization for each figure and would like to stick to the current representations.

      Reviewer #3 (Public review):

      This work aims to establish cell-type specific changes in gene expression upon exposure to different flavors of commercial e-cigarette aerosols compared to control or vehicle. Kaur et al. conclude that immune cells are most affected, with the greatest dysregulation found in myeloid cells exposed to tobacco-flavored e-cigs and lymphoid cells exposed to fruit-flavored e-cigs. The up-and-down-regulated genes are heavily associated with innate immune response. The authors suggest that a Ly6G-deficient subset of neutrophils is found to be increased in abundance for the treatment groups, while gene expression remains consistent, which could indicate impaired function. Increased expression of CD4+ and CD8+ T cells along with their associated markers for proliferation and cytotoxicity is thought to be a result of activation following this decline in neutrophil-mediated immune response.

      Strengths:

      (1) Single-cell sequencing data can be very valuable in identifying potential health risks and clinical pathologies of lung conditions associated with e-cigarettes considering they are still relatively new.

      (2) Not many studies have been performed on cell-type specific differential gene expression following exposure to e-cig aerosols.

      (3) The assays performed address several factors of e-cig exposure such as metal concentration in the liquid and condensate, coil composition, cotinine/nicotine levels in serum and the product itself, cell types affected, which genes are up- or down-regulated and what pathways they control.

      (4) Considerations were made to ensure clinical relevance such as selecting mice whose ages corresponded with human adolescents so that the data collected was relevant.

      We thank the reviewer for identifying the key strengths of our work and listing it in a concise and well-rounded fashion.

      Weaknesses:

      The exposure period of 1 hour a day for 5 days is not representative of chronic use and this time point may be too short to see a full response in all cell types. The experimental design is not well-supported based on the literature available for similar mouse models.

      This study was not designed to study the effects of chronic exposures on lung tissues. We were interested in delineating the effect of acute exposures for which the proposed study design was chosen. Previous work by our group has performed similar exposures and has been well received by the community. We understand that chronic exposures will be interesting to look at, however that was not the purpose of this pilot study. We will now explicitly mention this aspect in the revised manuscript.

      Several claims lack supporting evidence or use data that is not statistically significant. In particular, there were no statistical analyses to compare results across sex, so conclusions stating there is a sex bias for things like Ly6G+ neutrophil percentage by condition are observational.

      We thank the reviewer for this observation, and we will include the necessary validations and details of the sex-based statistical analyses in the revised version of this manuscript.

      Statistical analyses lack rigor and are not always displayed with the most appropriate graphical representation.

      We thank the reviewer and will include all the necessary statistical details with more details in the revised manuscript.

      Overall, the paper and its discussion are relatively limited and do not delve into the significance of the findings or how they fit into the bigger picture of the field.

      We are in process of performing a few validatory experiments and intend to include few other pieces of data to this manuscript to add to the overall merit of our findings. However as pointed out by the reviewer themselves the strength of this work is in the first ever scRNA seq analyses of mouse exposed to differently flavored e-cig aerosols in vivo. We also show cell-specific differential gene expression and address some of the major queries made around e-cig research including release of metals on a day-to-day basis from the same coil. The limited sample number make it difficult to draw solid conclusions from this work, which has been discussed as a shortcoming. However the major strength of this work is not in identifying specific trends but rather to explore the possible cell and gene targets to expand the study for longer (chronic) exposures with a larger sample group.

      The manuscript lacks validation of findings in tissue by other methods such as staining.

      We are conducting some studies and will include the validatory experiments and staining in the revised manuscript to support our findings.

      This paper provides a foundation for follow-up experiments that take a closer look at the effects of e-cig exposure on innate immunity. There is still room to elaborate on the differential gene expression within and between various cell types.

      We thank the reviewer for this observation. The cell numbers for some cell clusters (especially epithelial cells) were too low. So, though we have performed the differential gene expression analyses on all the cell clusters, we refrained from discussing it in the manuscript to avoid over interpretation of our results. Only clusters with high enough (~1000) cells per sex per group were used to plot the heatmaps. We will also include the cell numbers for each cell type in the revisions to allow better interpretation of our data. Furthermore, the raw data from this study will be freely available to the public upon publication of this manuscript. This would enable the interested readers to access the raw data and study the cell types of interest in detail based on their study requirements. This data will be a useful resource for all in this community to inform and design future studies.

    1. Reviewer #1 (Public review):

      Summary:

      Praegel et al. explore the differences in learning an auditory discrimination task between adolescent and adult mice. Using freely moving (Educage) and head-fixed paradigms, they compare behavioral performance and neuronal responses over the course of learning. The mice were initially trained for seven days on an easy pure frequency tone Go/No-go task (frequency difference of one octave), followed by seven days of a harder version (frequency difference of 0.25 octave). While adolescents and adults showed similar performances on the easy task, adults performed significantly better on the harder task. Quantifying the lick bias of both groups, the authors then argue that the difference in performance is not due to a difference in perception, but rather to a difference in cognitive control. The authors then used neuropixel recordings across 4 auditory cortical regions to quantify the neuronal activity related to the behavior. At the single-cell level, the data shows earlier stimulus-related discrimination for adults compared to adolescents in both the easy and hard tasks. At the neuronal population level, adults displayed a higher decoding accuracy and lower onset latency in the hard task as compared to adolescents. Such differences were not only due to learning, but also to age as concluded from recordings in novice mice. After learning, neuronal tuning properties had changed in adults but not in adolescents. Overall, the differences between adolescent and adult neuronal data correlate with the behavior results in showing that learning a difficult task is more challenging for younger mice.

      Strengths:

      (1) The behavioral task is well designed, with the comparison of easy and difficult tasks allowing for a refined conclusion regarding learning across ages. The experiments with optogenetics and novice mice complete the research question in a convincing way.

      (2) The analysis, including the systematic comparison of task performance across the two age groups, is most interesting and reveals differences in learning (or learning strategies?) that are compelling.

      (3) Neuronal recording during both behavioral training and passive sound exposure is particularly powerful and allows interesting conclusions.

      Weaknesses:

      (1) The presentation of the paper must be strengthened. Inconsistencies, mislabeling, duplicated text, typos, and inappropriate color code should be changed.

      (2) Some claims are not supported by the data. For example, the sentence that says that "adolescent mice showed lower discrimination performance than adults (l.22) should be rewritten, as the data does not show that for the easy task (Figure 1F and Figure 1H).

      (3) The recording electrodes cover regions in the primary and secondary cortices. It is well known that these two regions process sounds quite differently (for example, one has tonotopy, the other does not), and separating recordings from both regions is important to conclude anything about sound representations. The authors show that the conclusions are the same across regions for Figure 4, but is it also the case for the subsequent analysis? In Figure 7 for example, are the quantified properties not distinct across primary and secondary areas? If this is not the case, how is it compatible with the published literature?

      (4) Some analysis interpretations should be more cautious. For example, I do not understand how the lick bias, defined -according to the method- as the inverse normal distribution of the z-score (hit rate) +z-scored (false alarm rate; Figure 1j?, l.749-750), should reflect a cognitive difficulty (l. 161-162, l.171). A lower lick rate in general could reflect a weaker ability to withhold licking- as indicated on l.164, but also so many other things, like a lower frustration threshold, lower satiation, more energy, etc).

    2. Reviewer #3 (Public review):

      Summary:

      In this study, Benedikt et al. sought to understand how adolescents and adult mice differ in auditory cortical processing, performance on a go/nogo sound-guided task, and learning. They report that behavioral performance is superior in adults. They also report that neuronal representations of both the acoustic stimulus and behavioral choice are weaker and sluggish in adolescents compared to adults and that these differences were larger in expert mice than in novices. The neural basis of adolescent auditory cognition is an important topic (both clinically and from a basic science perspective) and vastly understudied. However, many aspects of the study fell short, thereby undermining the primary conclusions drawn by the authors. My major concerns are as follows:

      (1) The authors report that "adolescent mice showed lower auditory discrimination performance compared to adults" and that this performance deficit was due to (among other things) "weaker cognitive control". I'm not fully convinced of this interpretation, for a few reasons. First, the adolescents may simply have been thirstier, and therefore more willing to lick indiscriminately. The high false alarm rates in that case would not reflect a "weaker cognitive control" but rather, an elevated homeostatic drive to obtain water. Second, even the adult animals had relatively high (~40%) false alarm rates on the freely moving version of the task, suggesting that their behavior was not particularly well controlled either. One fact that could help shed light on this would be to know how often the animals licked the spout in between trials. Finally, for the head-fixed version of the task, only d' values are reported. Without the corresponding hit and false alarm rates (and frequency of licking in the intertrial interval), it's hard to know what exactly the animals were doing.

      (2) There are some instances where the citations provided do not support the preceding claim. For example, in lines 64-66, the authors highlight the fact that the critical period for pure tone processing in the auditory cortex closes relatively early (by ~P15). However, one of the references cited (ref 14) used FM sweeps, not pure tones, and even provided evidence that the critical period for this more complex stimulus occurred later in development (P31-38). Similarly, on lines 72-74, the authors state that "ACx neurons in adolescents exhibit high neuronal variability and lower tone sensitivity as compared to adults." The reference cited here (ref 4) used AM noise with a broadband carrier, not tones.

      (3) Given that the authors report that neuronal firing properties differ across auditory cortical subregions (as many others have previously reported), why did the authors choose to pool neurons indiscriminately across so many different brain regions? And why did they focus on layers 5/6? (Is there some reason to think that age-related differences would be more pronounced in the output layers of the auditory cortex than in other layers?)

    3. Author response:

      Reviewer #1:

      A) The presentation of the paper must be strengthened. Inconsistencies, mislabelling, duplicated text, typos, and inappropriate colour code should be changed.

      We will revise the manuscript to correct the abovementioned issues.

      B) Some claims are not supported by the data. For example, the sentence that says that "adolescent mice showed lower discrimination performance than adults (l.22) should be rewritten, as the data does not show that for the easy task (Figure 1F and Figure 1H).

      We will carefully review, verify claims, and correct conclusions where needed.

      C) In Figure 7 for example, are the quantified properties not distinct across primary and secondary areas?

      We will analyse the data in Figure 7 separately for AUDp and secondary auditory cortices to test regional differences. Additionally, we will provide a table summarizing key neuronal firing properties for each area during passive recordings to clarify how activity varies across cortical subregions and developmental stages.

      D) Some analysis interpretations should be more cautious. (..) A lower lick rate in general could reflect a weaker ability to withhold licking- as indicated on l.164, but also so many other things, like a lower frustration threshold, lower satiation, more energy, etc).

      We will address issues around lick bias including alternative explanations, such as differences in motivation or impulsivity.

      Reviewer #2:

      A) For some of the analyses that the authors conducted it is unclear what the rationale behind them is and, consequently, what conclusion we can draw from them.

      We will edit the discussion and clarify these points. In addition, we will adjust and extend the methodology section to clarify the rationale of our analysis.

      B) The results of the optogenetic manipulation, while very interesting, warrant a more in-depth discussion.

      We agree that the effects observed in our optogenetic manipulation warrant further discussion. We will extend on the analysis and discussion of ACx silencing.

      Reviewer #3:

      A) One fact that could help shed light on this would be to know how often the animals licked the spout in between trials. Finally, for the head-fixed version of the task, only d' values are reported. Without the corresponding hit and false alarm rates (and frequency of licking in the intertrial interval), it's hard to know what exactly the animals were doing.

      We recognize the need for a more nuanced analysis for the head-fixed version of the task. We will extend the behavioral analysis and provide more details to clarify these points.

      B) There are some instances where the citations provided do not support the preceding claim. For example, in lines 64-66, the authors highlight the fact that the critical period for pure tone processing in the auditory cortex closes relatively early (by ~P15). However, one of the references cited (ref 14) used FM sweeps, not pure tones, and even provided evidence that the critical period for this more complex stimulus occurred later in development (P31-38). Similarly, on lines 72-74, the authors state that "ACx neurons in adolescents exhibit high neuronal variability and lower tone sensitivity as compared to adults." The reference cited here (ref 4) used AM noise with a broadband carrier, not tones.

      We appreciate the reviewer pointing out instances where our citations may not fully support our claims. We will carefully review the relevant citations and revise them to ensure they accurately reflect the findings of the cited studies. We will update references in lines 64–66 and 72–74 to better align with the specific stimulus types and developmental timelines discussed.

      C) Given that the authors report that neuronal firing properties differ across auditory cortical subregions (as many others have previously reported), why did the authors choose to pool neurons indiscriminately across so many different brain regions?

      We agree that pooling neurons from multiple auditory cortical regions could potentially obscure region-specific differences. However, we addressed this concern by analyzing regional differences in neuronal firing properties, as shown in Supplementary Figures S4-1 and S4-2, and Supplementary Tables 2 and 3. Additionally, we examined stimulus-related and choice-related activity across regions and found no significant differences, as presented in Supplementary Figure S4-3. Please see our response to Reviewer 1, where we further elaborate on this point.

      D) And why did they focus on layers 5/6? (Is there some reason to think that age-related differences would be more pronounced in the output layers of the auditory cortex than in other layers?)

      We acknowledge that other cortical layers are also of interest and may contribute differently to auditory processing across development. Our focus on layers 5/6 was motivated by both methodological considerations and biological relevance. These layers contain many of the principal output neurons of the auditory cortex, and are therefore well positioned to influence downstream decision-making circuits. We will clarify this rationale in the revised manuscript and note the limitations of our approach.

    1. Reviewer #3 (Public review):

      Summary:

      Here the authors describe the role of mORs in synaptic glutamate release from substance P and cholinergic neurons in the medial habenula to the interpeduncular nucleus (IPN) circuit in adult mice. They show that mOR activation reduces evoked glutamate release from substance P neurons yet increases evoked glutamate release and Ach release from cholinergic neurons. Unlike glutamate release, Ach release is only detected when potassium channels are blocked with 4-AP or dendrotoxin, implicating Kv1.2. The authors also report a previously unidentified glutamatergic input to IPR mediated from SP neurons and describe the developmental timing of mOR-facilitation in adolescent mice.

      Strengths:

      (1) The experiments provide new insight into the role of mORs in controlling evoked glutamate release in a circuit with high levels of mORs and established roles in relevant behaviors.

      (2) The experimental design is generally rigorous, and the results are clear-cut. The conclusions are largely supported by the data.

      (3) The findings will be of interest to those working in the field.

      Weaknesses:

      (1) The mechanistic underpinnings of the most interesting results are not pursued. For example, the experiments do not provide new insight into the differential effects of evoked and spontaneous glutamate/Ach release by Gi/o coupled mORs, nor the differential threshold for glutamate versus Ach release.

      (2) The significance of the ratio of AMPA versus nACh EPSCs shown in Figure 6 is unclear since nAChR EPSCs measured in the K+ channel blockers are compared to AMPA EPSCs in control (presumably 4-AP would also increase AMPA EPSCs).

      (3) The authors note that blocking Kv1 channels typically enhances transmitter release by slowing action potential repolarization. The idea that Kv1 channels serve as a brake for Ach release in this system would be strengthened by showing that these channels are the target of neuromodulators or that they contribute to activity-dependent regulation that allows the brake to be released.

    1. Reviewer #2 (Public review):

      Summary:

      This study uses functional MRI to evaluate visual contrast sensitivity across the visual field at the level of the visual cortex, testing the method in a small group of normally sighted individuals and one with sight loss as proof of principle. The results suggest a promising technique to measure vision objectively across the visual field and overcomes the requirement for careful fixation which is often challenging in those with low vision or sight loss.

      Strengths:

      (1) Objective measure of central vision: The proposed method may provide a more comprehensive and objective assessment of residual visual function in individuals with sight loss. This may be particularly useful for those with central visual field loss without the requirement of stable fixation or subjective motor responses.

      (2) More sensitive measure: The use of slope to calculate contrast sensitivity across a range of contrasts within the brain is clever and likely more sensitive than single threshold measurements or standard clinical measures of visual acuity using letter charts. Standard supra-threshold (high contrast) tests are not ideal for capturing residual vision or partial vision loss.

      (3) Good agreement with standard atlas: The Benson atlas provides a good estimate of visual field maps within V1 based on anatomical landmarks, and the authors take steps to refine this informed by cortical magnification and V1 surface area (brain size) for each individual participant. This could allow the technique to be generalised without the need to collect lengthy individual mapping data from every participant.

      (4) Within-subject reproducibility: The measurements appear to be sensitive and reproducible, particularly in those with normal vision, and are consistent with known features of visual sensitivity differences in different parts of the visual field.

      (5) Potential tool to measure visual field sensitivity in controls: Even if the proposed methods are not ideal for widespread clinical translation, they do offer an exciting tool to test hypotheses about visual field differences in healthy controls. For example, there seems to be an increase in sensitivity on either side of the simulated ring scotoma (Figure 6 - perhaps due to the release of lateral inhibition?). Reliability measures suggest that individual differences are consistent in healthy controls (although not tested statistically, perhaps due to the small sample size?). Whether they reflect behaviourally meaningful differences in visual field sensitivity could be tested in individuals by comparing them to behavioural measures across the visual field.

      (6) Potential tool to test novel treatments: The proposed techniques could be used to test within-subject changes in visual function in environments that are equipped to measure and analyse fMRI data, including clinical trials aimed at determining the success of novel treatments. Further testing should reveal whether the method is suitable for testing low-vision patients with unstable fixation (e.g., nystagmus) and whether this affects slope and contrast sensitivity estimates. In theory, it should not have a substantial effect, except perhaps in regions near the stimulus edges.

      Weaknesses:

      (1) Questionable sensitivity to differences in patients. The variability in heat maps across healthy control participants is somewhat surprising. Do differences between individuals represent actual visual sensitivity differences, or are they an artifact of the measurement technique, e.g., due to signal-to-noise differences introduced by local variations in brain anatomy? Will the substantial variance across controls allow for a sufficiently stable baseline to detect meaningful differences in individual patients? Also, as the authors rightly point out, Benson atlas does not model differences along meridians, so upper/lower field differences might not be detectable.

      (2) Effects of unstable fixation/eye movements not explicitly tested: The methods state, 'In all tasks, participants were asked to report when the color of a central fixation dot changed', suggesting participants maintained fairly good fixation. Most of the results seem to pertain to measurements where central fixation is required. How does unstable fixation affect measurements?

      (3) Potential for clinical translation. Although it is a sensitive measure, functional MRI is costly, is not available in all clinical settings, requires significant post-processing analyses, and may be contraindicated in some individuals due to safety (e.g., metallic implants) or other concerns (e.g., claustrophobia). These could present significant barriers to widespread clinical translation if this were the ultimate goal of the study.

      (4) Limited range of spatial frequencies. The spatial frequencies tested were still quite low (0.3 and 3cpd) compared to measures such as visual acuity. Extending the measurements to higher spatial frequencies could allow better characterization of central vision, although necessarily for peripheral vision.

    2. Reviewer #3 (Public review):

      Summary:

      Chow-Wing-Bom et al. introduce an innovative wide-field visual stimulation setup for 3T experiments that enables stimulation up to a diameter of 40{degree sign} visual angle while allowing continuous gaze tracking. Using this setup, the authors systematically investigate contrast sensitivity across the visual field by presenting subjects with sinusoidal gratings varying in contrast and spatial frequency. Their findings confirm the expected organization of contrast sensitivity, demonstrating a preference for high spatial frequencies in the central field and lower frequencies in the periphery. They also extend these measurements to eccentricities up to 20{degree sign}, which exceeds previous fMRI-based reports. Moreover, the study explores the potential of using contrast sensitivity calculations as a method for detecting visual field defects, as demonstrated in both a healthy subject with an artificial, ring-shaped scotoma and a patient with LHON.

      Strengths:

      (1) The manuscript is well written and provides comprehensive methodological details, ensuring high transparency and reproducibility.

      (2) The visual stimulation setup represents a significant technical advance by enabling wide-field stimulation with continuous eye tracking, which is crucial for both research and potential clinical applications.

      (3) The study confirms established findings regarding the organization of contrast sensitivity while extending them to a larger eccentricity range.

      (4) The efforts to establish a measure for visual field losses align with current efforts to develop objective alternatives to conventional perimetry.

      Weaknesses:

      (1) The authors should more strongly emphasize their findings on the organization of contrast sensitivity, particularly in light of the stimulation extent provided by the wide-field setup.

      (2) Certain methodological aspects require further clarification, particularly regarding the correction of eccentricity values from the Benson atlas. It's not clear which V1 masks are used for the specific analysis which could have a substantial impact on the reported differences between the two approaches of pRF mapping and atlas-based pRF parameters.

      (3) Minor inconsistencies in reporting, e.g., the introduction of a second session in the Results section.

      (4) The conclusion that high-contrast patterns as in pRF mapping are not optimal to test for subtle but potentially clinically relevant changes in the visual field coverage is very valid. The suggested use of contrast sensitivity can therefore be a potentially well-suited parameter for estimating visual field losses. The presented work is an interesting starting point and the proposed method of using contrast sensitivity as a measure for partial vision loss should further be explored.

    1. Reviewer #1 (Public review):

      Summary:

      The Szczupak lab published a very interesting paper in 2012 (Rodriquez et al. J Neurophysiol 107:1917-1924) on the effects of the segmentally-distributed non-spiking (NS) cell on crawl-related motoneurons. As far as I can tell, the working model presented in 2012, for how the non-spiking (NS) cell impacts the crawling motor pattern, is the same functional model presented in this new paper. Unfortunately, the Discussion does not address any of the findings in the previous paper or cite them in the context of NS alterations of fictive crawling. Aside from different-looking figures and some new analyses, the results and conclusions are the same.

      Strengths:

      The figures are well illustrated.

      Weaknesses:

      The paper is a mix of what appears to be two different studies and abruptly switches gears to examine how closely the crawl patterning is in the intact animal as compared to the fictive crawl patterning in the intact animal. Unfortunately, previous studies in other labs are not cited even though identical results have been obtained and similar conclusions were made. Thus, the novelty of the results is missing for those who are familiar with the leech preparation. The lack of appropriate citations and discussion of previous studies also deprives the scientific community of fully comprehending the impact of the data presented and the science it was built upon.

      (1) Results, Lines 167-170: "While multiple extracellular recordings have been performed previously (Eisenhart et al., 2000), these results present the first quantitative analysis of motor units activated throughout the crawling cycle. The In-Phase units are expected to control the contraction stage by exciting or inhibiting the longitudinal or circular muscles, respectively, and the Anti-Phase units to control the elongation stage by exciting or inhibiting the circular or longitudinal muscles, respectively."

      The first line above is misleading. The study by Puhl and Mesce (2008, J. Neurosci, 28:4192- 420) contains a comprehensive analysis of the motoneurons active during fictive crawling with the aim of characterizing their roles and phase relationships and solidifying the idea that the oscillator for crawling resides in a single ganglion. Intracellular recordings from a number of key crawl-related motoneurons were made in combination with extracellular recordings of motoneuron DE-3, a key monitor of crawling. In their paper, it was shown that motoneurons AE, VE-4, DI-1, VI-2, and CV were all correlated with crawl activity, and fired repeatedly either in phase or out-of-phase with DE-3. They were shown to be either excitatory or inhibitory.

      At a minimum, the above paper should be cited. The submitted paper would be strengthened if some of these previously identified motoneurons were again recorded with intracellular electrodes and concomitant NS cell stimulation. The power of the leech preparation is that cells can be identified as individuals with dual somatic (intracellular) and axonal recordings (extracellular). The shortfall of this aspect of the study (Figure 5) is that the extracellular units have not been identified here. In fact, these units might not even be motoneurons. They could represent activity from the centrally located sensory neurons, dopamine-modulated afferent neurons or peripherally projecting modulatory neurons. Essentially, they may not have much to do with the crawl motor pattern at all.

      (2) Results Lines 206-210: "with the elongation and contraction stages of in vivo behavior. However the isometric stages displayed in vivo have no obvious counterpart in the electrophysiological recordings. It is important to consider that the rhythmic movement of successive segments along the antero-posterior axis of the animal requires a delay signal that allows the appropriate propagation of the metachronal wave, and this signal is probably absent in the isolated ganglion."

      The so-called isometric stages, indeed, have an electrophysiological counterpart due in part to the overlapping activities across segments. This submitted paper would be considerably strengthened if it referred to the body of work that has examined how the individual crawl oscillators operate in a fully intact nerve cord, excised from the body but with all the ganglia (and cephalic ganglion) attached. Puhl and Mesce 2010 (J. Neurosci 30: 2373-2383) and Puhl et al. 2012 (J. Neurosci, 32:17646 -17657) have shown that "appropriate propagation of the metachronal wave" requires the brain, especially cell R3b-1. They also show that the long-distance projecting cell R3b-1 synapses with the CV motoneuron, providing rhythmic excitatory input to it.

      For this and other reasons, the paper would be much more informative and exciting if the impacts of the NS cell were studied in a fully intact nerve cord. Those studies have never been done, and it would be exciting to see how and if the effects of NS cell manipulation deviated from those in the single ganglion.

      (3) Discussion Lines 322-324. "The absence of descending brain signals and/or peripheral signals are assumed as important factors in determining the cycle period and the sequence at which the different behavioral stages take place."

      The authors could strengthen their paper by including a more complete picture of what is known about the control of crawling. For example, Puhl et al. 2012 (J Neurosci, 32:17646-17657) demonstrated that the descending brain neuron R3b-1 plays a major role in establishing the crawl-cycle frequency. With increased R3b-1 cell stimulation, DE-3 periods substantially shortened throughout the entire nerve cord. Thus, the importance of descending brain inputs should not be merely assumed; empirical evidence exists.

      (4) Discussion Lines 325-327: "the sequence of events, and the proportion of the active cycle dedicated to elongation and contraction were remarkably similar in both experimental settings. This suggests that the network activated in the isolated ganglion is the one underlying the motor behavior."

      The results and conclusions drawn in the current manuscript mirror those previously reported by Puhl and Mesce (2008, J. Neurosci, 28:4192- 420) who first demonstrated that the essential pattern-generating elements for leech crawling were contained in each of the segmental ganglia comprising the nerve cord. Furthermore, the authors showed that the duty cycle of DE-3, in a single ganglion treated with dopamine, was statistically indistinguishable from the DE-3 duty cycle measured in an intact nerve cord showing spontaneous fictive crawling, in an intact nerve cord induced to crawl via dopamine, and in the intact behaving animal. What was statistically significant, however, was that the DE-3 burst period was greatly reduced in the intact animal (i.e., a higher crawl frequency), which was replicated in the submitted paper.

      In my opinion, the novelty of the results reported in the submitted manuscript is diminished in the light of previously published studies. At a minimum, the previous studies should be cited, and the authors should provide additional rationale for conducting their studies. They need to explain in the discussion how their approach provided additional insights into what has already been reported.

    1. 原子内の電子の軌道は、実際には量子力学に従っているため、軌道は「量子化されている」(=飛び飛びの値を取る)。電子の軌道は4種類の量子数という自然数値によって特徴づけられる。4つの量子数は、電子がK殻、L殻、M殻…のいずれに入るかを決める主量子数、殻の中のs軌道、p軌道、d軌道…のいずれに入るかを決める方位量子数、軌道角運動量、スピン角運動量がそれぞれ上向きか下向きかを決める磁気量子数、スピン量子数からなる[2]。
    1. Overview of the Nervous System The nervous system, illustrated in Figure 4.1.24.1.2\PageIndex{2}, is the human organ system that coordinates all of the body’s voluntary and involuntary actions by transmitting electrical and chemical signals to and from different parts of the body. Specifically, the nervous system extracts information from the internal and external environments using sensory receptors. It then usually sends signals encoding this information to the brain, which processes the information to determine an appropriate response. Finally, the brain sends signals to muscles, organs, or glands to bring about the necessary response. The two main divisions of the nervous system are the central nervous system (CNS), consisting of the brain and the spinal cord, and the peripheral nervous system (PNS), which includes all other nervous tissue, such as ganglia and nerves, outside the brain and spinal cord. The CNS and PNS are covered in greater detail in separate sections. In the example above, your eyes detected the skateboarder, the information traveled to your brain, and your brain instructed your body to act to avoid a collision.

      Cool how the nervous system aacts as the messenger adn the decision-maker. Also gathering info, processing it, then commanding acation.

    1. 「负空间」

      负空间(Negative Space)是一个艺术和设计中的重要概念,指的是图像中空白未被填充的区域,它与图像的主体部分(正空间)形成对比。虽然负空间通常被认为是“空白”或“无物”的部分,但它在构图中起到了至关重要的作用,并能够对作品的视觉平衡焦点情感表达产生深远影响。

      1. 负空间的定义

      负空间(也称为空白空间)指的是在艺术作品中未被描绘的区域。它是周围的环境或背景,而不是直接展示的主体或焦点部分。负空间不仅仅是空白的地方,它与图像的正空间(图像中的主体或主要元素)相互作用,并且通常被用来定义形状引导视线创造对比增强视觉效果

      在传统艺术、绘画摄影雕塑等多个领域,负空间都是构图的一个核心要素。它不仅仅是“无”——它具有自己的存在感,并且对作品的整体感知产生深刻影响。

      2. 负空间的功能和作用

      1. 视觉平衡与对比

      负空间与正空间的关系有助于平衡画面。设计师和艺术家通常通过调整负空间的位置、形状和大小来确保画面看起来均衡且和谐。如果正空间太拥挤而负空间太少,作品可能会显得混乱过于紧凑,难以引起观众的注意;而合理的负空间使用则能够帮助观众的视线自然流动,突出主体

      2. 创造深度和层次感

      通过合理利用负空间,艺术家可以为作品增添深度层次感。负空间有时被用来传达物体之间的距离感或物体的相对位置,帮助观众感受到画面中的立体性深远感

      3. 强调主体

      负空间可以帮助突出作品中的主体,使其成为焦点。例如,在摄影中,艺术家可以通过将主体置于画面的一侧,并在其周围留出较大的负空间,来引导观众的视线集中于主体,从而强化其视觉效果和情感表达。

      4. 增强意象的意义

      在许多艺术作品中,负空间不仅仅是无形的空白区域,它有时可以通过特定的形状、结构或设计,创造隐含的图像象征性形态。这使得负空间不仅仅是填补背景的空白,而是通过其形态与正空间形成对比,增强作品的象征意义多重解读

      5. 创造趣味与惊喜

      有时,负空间的形态本身也能引起观众的兴趣或惊喜。例如,在一些著名的双重意象视觉谜题作品中,负空间本身可能形成一个意外的图像,这种技巧不仅增强了作品的视觉冲击力,也给观众带来思考的乐趣。

      3. 负空间的应用实例

      1. 视觉艺术中的负空间

      • “白色背景”:许多绘画作品使用白色背景,刻意将负空间保留在画面中,以突出主题元素。通过这些空白区域,画面中的主体更加引人注目,增强了作品的简洁感和深度感。
      • 双重意象艺术:在一些艺术作品中,负空间和正空间共同作用,创造出双重意象,让观众从不同的角度解读图像。例如,埃舍尔(M.C. Escher)的作品经常使用负空间和正空间的巧妙配合,形成视错觉和深刻的哲学寓意。

      2. 设计中的负空间

      • 标志设计:负空间在品牌标志设计中被广泛使用,它能巧妙地传达公司理念增强视觉吸引力。比如FedEx的标志中,字母“E”和“X”之间的负空间巧妙地形成了一个箭头,代表着速度进取心,这种设计让观众产生惊喜并加深对品牌的记忆。
      • 海报设计:负空间在海报设计中帮助聚焦视线和创意展示,设计师通过精心安排负空间来引导观众关注作品的主题或信息,避免画面过于繁杂。

      3. 摄影中的负空间

      • 摄影中,负空间的运用至关重要,它能够帮助摄影师控制构图并突出拍摄的主题。通过在画面中留出空白区域,摄影师可以引导观众的视线,使他们聚焦于拍摄的主体,避免视觉的杂乱无章。
      • 负空间的使用还可以增强情感表达,比如,通过大面积的空白来传达孤独感、寂静感或广阔感。

      4. 负空间与正空间的对比

      • 负空间和正空间并不是对立的,而是互相依赖的关系。在艺术作品中,负空间和正空间共同构成了完整的图像结构。正空间(主体)给人带来直接的视觉感受,而负空间则通过提供视觉“呼吸”来增强正空间的存在感。
      • 在构图中,负空间的合理利用可以让主体更加突出,同时也为作品带来层次感深度

      5. 负空间的哲学与文化意义

      负空间不仅仅是艺术技巧,它还能够表达出深刻的哲学和文化意蕴。空白部分可能代表着未知无形空虚,而这些在文化或哲学层面上具有丰富的象征意义。例如,禅宗艺术中对的重视,正是通过负空间的表现来传递一种的哲学思考,反映了人类对于存在的思考。

      6. 总结

      负空间是艺术和设计中不可忽视的元素,它通过与正空间的关系,创造出平衡、层次感、情感表达和象征意义。无论在绘画设计摄影还是雕塑中,负空间都起到了至关重要的作用。通过巧妙地运用负空间,艺术家可以强化作品的表现力,激发观众的思考,增加作品的视觉冲击力情感深度

    1. 这是一份Dify平台的环境变量配置文件,对平台的各项功能、服务和组件进行参数设置。以下是对其主要部分的详细解读: 1. 通用变量(Common Variables) CONSOLE_API_URL:控制台API的后端URL,用于拼接授权回调。若为空,使用同一域名。CONSOLE_WEB_URL:控制台Web前端的URL,用于拼接前端地址和CORS配置。SERVICE_API_URL、APP_API_URL、APP_WEB_URL:分别为服务API、Web应用API后端及Web应用的URL,用于向前端展示或声明后端地址。FILES_URL:文件预览或下载的URL前缀,用于前端展示或多模型输入。 2. 服务器配置(Server Configuration) 日志相关: LOG_LEVEL:应用的日志级别,可选择 DEBUG、INFO、WARNING、ERROR、CRITICAL。LOG_FILE:日志文件路径。LOG_FILE_MAX_SIZE、LOG_FILE_BACKUP_COUNT:分别指定日志文件最大大小(MB)和最大备份数量。LOG_DATEFORMAT、LOG_TZ:日志日期格式和时区。 调试模式: DEBUG:应用调试模式,默认 false,本地开发建议开启。FLASK_DEBUG:Flask调试模式,开启可在界面输出跟踪信息。 其他: SECRET_KEY:用于安全签名会话cookie和加密数据库敏感信息的密钥。INIT_PASSWORD:管理员用户初始化密码,不设置则创建初始管理员账户时不提示密码。DEPLOY_ENV:部署环境,可选择 PRODUCTION、TESTING,默认 PRODUCTION。CHECK_UPDATE_URL:版本检查的URL。OPENAI_API_BASE:OpenAI API的基础地址,默认是官方地址,可替换为国内镜像或本地兼容API地址。MIGRATION_ENABLED:是否在应用启动前执行数据库迁移。FILES_ACCESS_TIMEOUT:文件访问超时时间(秒)。ACCESS_TOKEN_EXPIRE_MINUTES、REFRESH_TOKEN_EXPIRE_DAYS:分别为访问令牌和刷新令牌的过期时间。APP_MAX_ACTIVE_REQUESTS、APP_MAX_EXECUTION_TIME:应用的最大活动请求数和最大执行时间。 3. 容器启动相关配置(Container Startup Related Configuration) DIFY_BIND_ADDRESS、DIFY_PORT:API服务绑定的地址和端口。SERVER_WORKER_AMOUNT、SERVER_WORKER_CLASS、SERVER_WORKER_CONNECTIONS:API服务器工作进程的数量、类型和默认连接数。CELERY_WORKER_CLASS、CELERY_WORKER_AMOUNT:Celery工作进程的类型和数量。CELERY_AUTO_SCALE、CELERY_MAX_WORKERS、CELERY_MIN_WORKERS:是否启用Celery工作进程自动缩放及最大、最小工作进程数。GUNICORN_TIMEOUT:Gunicorn请求处理超时时间。API_TOOL_DEFAULT_CONNECT_TIMEOUT、API_TOOL_DEFAULT_READ_TIMEOUT:API工具的默认连接和读取超时时间。 4. 数据库配置(Database Configuration) DB_USERNAME、DB_PASSWORD、DB_HOST、DB_PORT、DB_DATABASE:PostgreSQL数据库的用户名、密码、主机、端口和数据库名。SQLALCHEMY_POOL_SIZE、SQLALCHEMY_POOL_RECYCLE、SQLALCHEMY_ECHO:数据库连接池的大小、回收时间和是否打印SQL语句。POSTGRES_MAX_CONNECTIONS、POSTGRES_SHARED_BUFFERS、POSTGRES_WORK_MEM、POSTGRES_MAINTENANCE_WORK_MEM、POSTGRES_EFFECTIVE_CACHE_SIZE:PostgreSQL数据库的相关资源配置参数。 5. Redis配置(Redis Configuration) REDIS_HOST、REDIS_PORT、REDIS_USERNAME、REDIS_PASSWORD、REDIS_USE_SSL、REDIS_DB:Redis的主机、端口、用户名、密码、是否使用SSL及数据库编号。REDIS_USE_SENTINEL、REDIS_SENTINELS、REDIS_SENTINEL_SERVICE_NAME、REDIS_SENTINEL_USERNAME、REDIS_SENTINEL_PASSWORD、REDIS_SENTINEL_SOCKET_TIMEOUT:是否使用Redis Sentinel模式及相关配置。REDIS_USE_CLUSTERS、REDIS_CLUSTERS、REDIS_CLUSTERS_PASSWORD:是否使用Redis Cluster模式及相关配置。 6. Celery配置(Celery Configuration) CELERY_BROKER_URL、BROKER_USE_SSL:Celery使用的消息代理URL及是否使用SSL。CELERY_USE_SENTINEL、CELERY_SENTINEL_MASTER_NAME、CELERY_SENTINEL_SOCKET_TIMEOUT:若使用Redis Sentinel,相关的配置参数。 7. CORS配置(CORS Configuration) WEB_API_CORS_ALLOW_ORIGINS、CONSOLE_CORS_ALLOW_ORIGINS:分别指定Web API和控制台API的跨域请求允许来源。 8. 文件存储配置(File Storage Configuration) STORAGE_TYPE:文件存储类型,如 opendal。OPENDAL_SCHEME、OPENDAL_FS_ROOT:OpenDAL存储方案及本地文件系统根目录。S3_*、AZURE_BLOB_*、GOOGLE_STORAGE_*等:不同云存储服务(S3、Azure Blob、Google Storage等)的配置参数。 9. 向量数据库配置(Vector Database Configuration) VECTOR_STORE:向量存储类型,支持多种类型如 weaviate、qdrant 等。针对不同向量存储类型(如 WEAVIATE_*、QDRANT_*、MILVUS_* 等)的相应配置参数,包括端点URL、API密钥、超时时间等。 10. 知识配置(Knowledge Configuration) UPLOAD_FILE_SIZE_LIMIT、UPLOAD_FILE_BATCH_LIMIT:上传文件的大小限制(MB)和一次上传的最大文件数。ETL_TYPE:ETL类型,可选择 dify 或 Unstructured。UNSTRUCTURED_API_URL、UNSTRUCTURED_API_KEY:当 ETL_TYPE 为 Unstructured 时,Unstructured.io的API路径和密钥。 11. 模型配置(Model Configuration) PROMPT_GENERATION_MAX_TOKENS、CODE_GENERATION_MAX_TOKENS:提示生成和代码生成工具中允许的最大令牌数。 12. 多模态配置(Multi-modal Configuration) MULTIMODAL_SEND_FORMAT:多模态模型输入的图像、视频、音频、文档格式,默认 base64,也可选择 url。UPLOAD_IMAGE_FILE_SIZE_LIMIT、UPLOAD_VIDEO_FILE_SIZE_LIMIT、UPLOAD_AUDIO_FILE_SIZE_LIMIT:分别为上传图像、视频、音频文件的大小限制(MB)。 13. Sentry配置(Sentry Configuration) SENTRY_DSN、API_SENTRY_DSN、WEB_SENTRY_DSN:Sentry服务的DSN地址,用于应用监控和错误日志跟踪。API_SENTRY_TRACES_SAMPLE_RATE、API_SENTRY_PROFILES_SAMPLE_RATE:API服务Sentry事件和配置文件的报告比率。 14. Notion集成配置(Notion Integration Configuration) NOTION_INTEGRATION_TYPE:Notion集成类型,可选择 public 或 internal。NOTION_CLIENT_SECRET、NOTION_CLIENT_ID、NOTION_INTERNAL_SECRET:根据集成类型所需的客户端密钥、ID或内部密钥。 15. 邮件相关配置(Mail related configuration) MAIL_TYPE:邮件类型,支持 resend 或 smtp。MAIL_DEFAULT_SEND_FROM:默认发件人邮箱地址。RESEND_API_URL、RESEND_API_KEY:当 MAIL_TYPE 为 resend 时,Resend邮件服务的API地址和密钥。SMTP_SERVER、SMTP_PORT、SMTP_USERNAME、SMTP_PASSWORD、SMTP_USE_TLS、SMTP_OPPORTUNISTIC_TLS:当 MAIL_TYPE 为 smtp 时,SMTP服务器的相关配置。 16. 其他配置(Others Configuration) INDEXING_MAX_SEGMENTATION_TOKENS_LENGTH:索引时分割令牌的最大长度。INVITE_EXPIRY_HOURS、RESET_PASSWORD_TOKEN_EXPIRY_MINUTES:成员邀请链接和重置密码令牌的有效时间。CODE_EXECUTION_*:代码执行服务的端点、API密钥及各种超时时间、数据限制等配置。WORKFLOW_*:工作流运行时的各种配置,如最大执行步骤、时间、调用深度、变量大小等。HTTP_REQUEST_NODE_*:工作流中HTTP请求节点的配置,如最大二进制和文本大小、SSL验证。SSRF_PROXY_*:SSRF代理服务器的HTTP和HTTPS URL。LOOP_NODE_MAX_COUNT:工作流中循环节点的最大循环次数。MAX_TOOLS_NUM、MAX_PARALLEL_LIMIT:代理中可使用的最大工具数和工作流中最大并行分支数。 17. 各服务特定环境变量 TEXT_GENERATION_TIMEOUT_MS:Web服务中文本生成的超时时间(毫秒)。PGUSER、POSTGRES_PASSWORD、POSTGRES_DB、PGDATA:数据库服务的相关环境变量。SANDBOX_*:沙箱服务的API密钥、运行模式、超时时间等配置。WEAVIATE_*:当使用Weaviate作为向量存储时的相关配置。CHROMA_*:当使用Chroma作为向量存储时的认证和持久化配置。ORACLE_*:当使用Oracle作为向量存储时的相关配置。MILVUS_*:当使用Milvus作为向量存储时的ETCD、MinIO等相关配置。PGVECTOR_*:当使用pgvector或pgvecto - rs作为向量存储时的数据库相关配置。OPENSEARCH_*:当使用OpenSearch作为向量存储时的相关配置。 18. Nginx反向代理配置 NGINX_*:Nginx服务器的配置,包括服务器名称、是否启用HTTPS、端口、SSL证书及密钥文件名、协议、工作进程数、客户端最大请求体大小、保持连接超时时间、代理读写超时时间等。CERTBOT_*:Certbot获取证书的相关配置,如邮箱、域名、命令选项。 19. SSRF代理配置 SSRF_*:SSRF代理的端口、核心转储目录、反向代理端口、超时时间等配置。 20. 其他杂项配置 COMPOSE_PROFILES:根据向量数据库类型指定Docker Compose配置文件。POSITION_TOOL_*、POSITION_PROVIDER_*:用于指定应用中可使用的工具和模型提供商,通过引脚、包含或排除列表配置。CSP_WHITELIST:内容安全策略(CSP)的白名单。CREATE_TIDB_SERVICE_JOB_ENABLED:是否启用创建TiDB服务作业。MAX_SUBMIT_COUNT:并行节点执行时线程池中的最大提交线程数。TOP_K_MAX_VALUE:RAG中top - k值的最大值。 21. 插件守护进程配置 DB_PLUGIN_DATABASE:插件数据库名称。EXPOSE_PLUGIN_DAEMON_PORT、PLUGIN_DAEMON_PORT:插件守护进程暴露的端口。PLUGIN_DAEMON_KEY、PLUGIN_DAEMON_URL:插件守护进程的密钥和URL。PLUGIN_MAX_PACKAGE_SIZE:插件最大包大小。PLUGIN_PPROF_ENABLED:是否启用插件性能分析。PLUGIN_DEBUGGING_*:插件调试相关的主机和端口配置。PLUGIN_DIFY_INNER_API_KEY、PLUGIN_DIFY_INNER_API_URL:插件与Dify内部API通信的密钥和URL。ENDPOINT_URL_TEMPLATE:端点URL模板。MARKETPLACE_ENABLED、MARKETPLACE_API_URL:是否启用插件市场及市场API URL。FORCE_VERIFYING_SIGNATURE:是否强制验证签名。PLUGIN_PYTHON_ENV_INIT_TIMEOUT、PLUGIN_MAX_EXECUTION_TIMEOUT、PIP_MIRROR_URL:插件Python环境初始化超时时间、最大执行超时时间和PIP镜像URL。

      -[ ] dify 学习 #dify #AI

    1. O how good is God, that gives us of this excellent juice! I call him to witness, if I had been in the time of Jesus Christ, I would have kept him from being taken by the Jews in the garden of Olivet. And the devil fail me, if I should have failed to cut off the hams of these gentlemen apostles who ran away so basely after they had well supped, and left their good master in the lurch.

      Rabelais has vague historical background, but what is well known from his lifetime was his transition from a religious lifestyle to the studies of medicine and humanities (Coutinho 4). This shift in mindset reflects heavy on Rabelais work as he pokes fun at religious symbols and uses biblical events in what was considered "poor taste" at the time. This is one of the numerous reasons Rabelais' novels were considered "forbidden books" at the time in frace (Coutinho 4). Gargantua gives thanks to God for liquor and uses Jesus' betrayal in a satirical form of gratitude.

      Coutinho, Léo, Carlos Henrique Ferreira Camargo, and Hélio Afonso Ghizoni Teive. “François Rabelais and His Dystonic Giants.” Arquivos de neuro-psiquiatria 82.7 (2024): 1–2. Web.

    2. (they) were married together. And if they had formerly in Theleme lived in good devotion and amity, they did continue therein and increase it to a greater height in their state of matrimony;

      In this passage, Rabelais is representing the importance of marriage to the proliferation of the French Kingdom. According to Babin of Louisiana State University, marriage was "the glue that held together the household" because marriage represented a "microcosm of society and the government" (2). As illustrated by the quote "at the request of his parents" just prior to his description of marriage, Rabelais indicates his support for legal marriages formulated by parents and his disdain for "clandestine marriages" (Bauschatz 404). Because clandestine marriages were inherently disordered and independent of the state, they threatened the order of the Kingdom of France (Babin 4). Not only that, these clandestine marriages also went against the will of God and were concerned only with "glory, greed [and] sensual enjoyment" (Bauschatz 407). Therefore, while Rabelais did have patronage from the French court, his experience in religious positions likely means that he was more concerned with marriages that ignored God than threatened the state, but both points lend credence towards Rabelais's support of legitimate marriage.

      Sources: Babin, Adam. "Marriage in seventeenth-century French theater." LSU Master's Theses, Louisiana State University, 2011, pp. 1-4. Bauschatz, Cathleen. "Rabelais and Marguerite de Navarre on Sixteenth-Century Views of Clandestine Marriage." University of Maine, vol.XXXIV, no. 2, 2003, pp. 1-14.

    1. Figure 44\PageIndex{4}: Titration of a Weak Polyprotic Acid. Another 10 mL, or a total of 20 mL, of the titrant is added to the weak polyprotic acid to reach the second equivalence point. (CC BY; Heather Yee via LibreTexts)

      This figure is misleading. The Equivalence Point is where the moles of acid and conjugate base are equal. The Midpoint to ionization is NOT where the moles are equal. The midpoint is where the pKa is determined... by sliding left from the midpoint dot to the Y axis and reading the pH. The pKa = pH at the Midpoint (which is found by taking the 1/2 Equivalence Point (or half the distance required (on the X axis) for the ionization) and moving up to the graphed line). Presumably the next diagram also has the same error.

    1. Reviewer #3 (Public Review):

      In their manuscript, Umetani, et al. address the question of the origin of persister bacteria using single-cell approaches. Persistence refers to a physiological state where bacteria are less sensitive to antibiotherapy, although they have not acquired a resistance mutation; importantly, the concept of persistence has been refined in the past decade to distinguish it from tolerance where bacteria are only transiently insensitive. Since persister cells are very rare in growing populations (typically 1e-5 or 1e-6), it is very challenging to observe them directly. It had been proposed that individual cells surviving antibiotics are not growing at the start of the treatment, but recent studies (nicely reviewed in the introduction) where persister bacteria were observed directly do not support this link. Following a similar line, the authors nonetheless still aim at "investigating whether non-growing cells are predominantly responsible for bacterial persistence". Based on new experimental data, they claim the contrary that most surviving cells were "actively growing before drug exposure" and that their work "reveals diverse survival pathways underlying antibiotic persistence".

      The main strengths of the manuscript are in my opinion:

      - To report on direct observation of E. coli persisters to ampicillin (200µg/mL) in 5 different growth media (typically 20 persisters or more per condition, one condition with 12 only), which constitutes without a doubt an experimental tour de force.

      - To aim at bridging the population level and the single-cell level by measuring relevant variables for each and analyzing them jointly.

      - To demonstrate that in most conditions a large fraction of surviving cells was actively growing before drug exposure.

      In addition, although it is well-known that E. coli doesn't need to maintain its rod shape for surviving and dividing, I found very remarkable in their data the extent to which morphology can be affected in persister cells and their progeny, since this really challenges our understanding of E. coli's "lifestyle" (these swimming amoeba-like cells in Supp Video 11 are mind-blowing!).

      Unfortunately, these positive aspects are counter-balanced by several shortcomings in the way experiments are analyzed and interpreted, which I explain below. Moreover, the manuscript is written in a way that makes it very hard to find important information on how experiments are done and is likely to leave the reader with an impression of confusion about what the main findings actually are.

      My major concerns are the following:

      (1) The main interpretation framework proposed by the authors is to assess whether cells not growing before drug exposure (so-called "dormant") are more or less likely to survive the treatment than growing ones ("non-dormant"). Fig 2A and Fig 3G show the main conclusions of the article from this perspective, that growing cells can survive the treatment and that the fraction of persisters in a given condition is not explained by the fraction of "dormant" cells, respectively. With this analysis, the authors essentially assume that "dormant" cells are of the same type in their different conditions, which ignores the progress in this field over the last decade (Balaban et al. 2019). I argue on the contrary that the observation of "diverse modes of survival in antibiotic persistence" is expected from their experimental design. In particular, the sensitivity of E. coli to beta-lactams such as ampicillin is expected to be much lower during the lag out of the stationary phase, a phenomenon which has been coined "tolerance"; hence in the Late Stationary condition, two subpopulations coexist for which different response to ampicillin is expected. I propose steps toward a more compelling interpretation of the experimental data. Should this point be taken seriously by the authors, it, unfortunately, implies a major rewriting of the article, including its title.

      (2) The way the authors describe their experiments with bacteria in the stationary phase is very problematic. For instance, they write that they "sampled cells from early and late stationary phases (...) and exposed them to 200 μg/mL of Amp in both batch and single-cell cultures." For any reader in a hurry (hence skipping methods and/or supplementary figure), this leads to believe that bacteria sampled in the stationary phase were exposed to the drug right away (either by adding the drug to the stationary phase sample, or more classically by transferring cells to fresh media with antibiotics). However, it turns out that, after sampling and loading in the microfluidic device, bacteria are grown 2 h in LB (or 4 h in M9) - I don't know what to think of such a blatant omission. The names chosen for each condition should reflect their most important aspects, here "stationary" is simply not appropriate - maybe something like "post early stationary" instead. In any case, I believe that this point highlights further the misconception pointed out in 1 and implies that the average reader will be at best confused, and probably misled.

      (3) Figures 4 and 5 are of very minor significance, and the methodology used in Fig 4 is questionable. The authors measure the abundance of an Rpos-mCherry translational fusion because its "high expression has been suggested to predict persistence". The rationale for this (that an RpoS-mCherry fusion would be a proxy for intracellular ppGpp levels, and in turn predict persistence) has never been firmly established, and the standards used in the article where this reporter was introduced (Maisonneuve, Castro-Camargo, and Gerdes 2013) are notoriously low (which eventually led to its retraction) - I don't know what to think of the fact that the authors cite a review by this group rather than their retracted article. While transcriptional fusions of promoters regulated by RpoS have been proposed to measure its regulatory activity (Patange et al. 2018), the combination of self-regulation and complex post-translational regulation of rpoS makes the physical meaning of the reporter used here completely unclear. Moreover, this translational fusion is introduced without doing any of the necessary controls to demonstrate that the activity of RpoS is not impaired by the addition of the fluorescent protein. Fig 5 simply reports the existence of persisters to ciprofloxacin growing before the treatment. This might be a new observation but it is not unexpected given that a similar observation has been made with a similar drug, ofloxacin (Goormaghtigh and van Melderen 2019), as pointed out in the introduction. There is no further quantitative claim on this.

      (4) The authors don't mention the dead volume nor the speed of media exchange in their device. Hopefully, it is short compared to the duration of the treatment; however, it is challenging to remove all antibiotics after the treatment and only 1e-3 or 1e-4 of the treatment concentration is already susceptible to affecting regrowth in fresh media. If this is described in another article, it would be worth adding a comment in the main text.

      (5) Fig 2A supports the main finding that a significant fraction of bacteria surviving the treatment are growing before drug exposure, but it uses a poorly chosen representation.<br /> - In order to compare between conditions, one would like to see the fraction of each type in the population.<br /> - The current representation (of a fraction of each type among surviving cells) requires a side-by-side comparison with a random sample (which will practically be equivalent to the fraction of each type among killed cells) in order to be informative.

    2. Author response:

      Reviewer #1 (Public Review):

      The work of Umetani et al. monitors the death of about 100,000 cells caused by lethal antibiotic treatments in a microfluidic device. They observe that the surviving bacteria are either in a dormant or in a non-dormant state prior to the antibiotic treatment. They then study the relative abundances of these different persister cells when varying the physiological state of the culture. In agreement with previous observations, they observe that late stationary phase cultures harbor a high number of dormant persister cells and that this number goes down as the culture is more exponential but remains non-zero, suggesting that cultures at the exponential phase contain different types of persister bacteria. These results were qualitatively similar in a rich and poor medium. Further characterization of the growing persister bacteria shows that they often form Lforms, have low RpoS-mcherry expression levels and grow only slightly more slowly than the non-persister bacteria. Taken together, these results draw a detailed view of persister bacteria and the way they may survive extensive antibiotic treatments. However, in order to represent a substantial advance on previous knowledge, a deeper analysis of the persister bacteria should be done.

      We thank the reviewer for suggesting the addition of more detailed analyses of persister cells. As we wrote in our response to Essential Revision 1, we now include a new section titled “Response of growing persisters to Amp exposure is heterogeneous” (Page 11-12) and present the results of the detailed analyses of single-cell dynamics of growth and cell morphology over the course of the pre-exposure, exposure, and post-exposure periods (Fig. 2D and H, Fig. 4B and D, Fig. 4 – figure supplement 1 and 2, Fig. 5B and D, Fig. 5 – figure supplement 1, Fig. 8B and D, and Figure 8 – figure supplement 1). The new results characterize differential responses to Amp treatment among growing persister cells (Fig. 4A-D, Fig. 4 – figure supplement 1, Fig. 4 – figure supplement 2A, Fig. 5A-D, and Fig. 5 – figure supplement 1), comparable division rates of MG1655 between non-surviving cells and persister cells growing prior to antibiotic treatments (Fig. 4E and Fig. 8E), except for the post-exponential phase cell populations of MF1 to Amp treatment in the LB medium and the post-exponential phase cell populations of MG1655 to Amp treatment in the M9 medium (Fig. 4 – figure supplement 2B and Fig. 5E) and the presence of persister cells to CPFX that avoid filamentation after the treatment (Fig. 8C and D, and Fig. 8 – figure supplement 1). We believe that these new analyses would provide new insights into the diverse dynamics and survival modes of antibiotic persistence at the single-cell level and represent important contributions to the field.

      Reviewer #2 (Public Review):

      The main question asked by Umenati et al. is whether persister cells to ampicillin arise preferentially from dormant, non-dividing cells or from cells that are actively growing before antibiotic exposure. The authors tracked persister cells generated from populations at different growth phases and culture media using a microfluidic device coupled to fluorescence microscopy, which is a challenge due to the low frequency of these persister cells. One of the main conclusions is that the majority of persisters arising in exponentially-growing populations originated from actively-dividing cells before the antibiotic treatment, reinforcing the idea that dormancy is not a prerequisite for persister formation. The authors made use of a fluorescent reporter monitoring RpoS activity (RpoS-mCherry fusion) and observed that RpoS levels in these persister cells were low. In the few lineages that exhibited no growth before the ampicillin treatment, RpoS levels were low as well, indicating that RpoS is not a predictive marker for persistence. By performing the same experiment with early and late stationary phase cultures, the authors observed that the proportion of persister cells that originated from dormant cells before the ampicillin treatment is significantly increased under these conditions. In the late stationary phase condition, dormant cells were expressing high levels of RpoS. The authors suggested that RpoS-mCherry proteins form aggregates which were suggested by the authors to be a characteristic of 'deep dormancy'. These cells were mostly unable to restart growth after the antibiotic removal while others with the lowest levels of RpoS tended to be persister. Confirming that these cells indeed contain protein aggregates as well as determining the physiological state of these cells appears to be crucial.

      We thank reviewer #2 for pointing out the critical issue with the RpoS-mCherry fusion that we used to quantify RpoS expression levels in single cells in the original manuscript. As explained in our reply to the comments below, we performed a suggested experiment and confirmed that the RpoS function was impaired by tagging it with mCherry. To resolve this issue, we repeated almost all the experiments using the wild-type strain MG1655 and confirmed the reproducibility of the main results (Fig. 3, Fig. 3 – figure supplement 1, and Fig. 7). Due to this change of the main strain used in this study, we removed the results on the correlation between RpoS expression and the persistence trait in the revised manuscript because it may not reflect the relationship of intact RpoS. However, we decided to still keep and show some of the results with the MF1 strain, such as the population killing curves and the survival mode analyses, because they also provide insight into the role of RpoS in antibiotic persistence. In particular, we found both beneficial and detrimental effects of RpoS on antibiotic persistence, depending on culture conditions and duration of antibiotic treatment (Fig. 1 – figure supplement 3 and Fig. 6 – figure supplement 1). Therefore, we have included these results and related discussions in the revised manuscript.

      Reviewer #3 (Public Review):

      In their manuscript, Umetani, et al. address the question of the origin of persister bacteria using single-cell approaches. Persistence refers to a physiological state where bacteria are less sensitive to antibiotherapy, although they have not acquired a resistance mutation; importantly, the concept of persistence has been refined in the past decade to distinguish it from tolerance where bacteria are only transiently insensitive. Since persister cells are very rare in growing populations (typically 1e-5 or 1e-6), it is very challenging to observe them directly. It had been proposed that individual cells surviving antibiotics are not growing at the start of the treatment, but recent studies (nicely reviewed in the introduction) where persister bacteria were observed directly do not support this link. Following a similar line, the authors nonetheless still aim at "investigating whether non-growing cells are predominantly responsible for bacterial persistence". Based on new experimental data, they claim the contrary that most surviving cells were "actively growing before drug exposure" and that their work "reveals diverse survival pathways underlying antibiotic persistence".

      We thank the reviewer for this helpful comment, which suggested to us that some revisions in our Introduction would better place our study in the context of previous understanding of antibiotic persistence. As mentioned in our response to Essential Revision 4 and the second comment of Reviewer 1's Recommendations for the authors, we have modified the Introduction to more appropriately place our study in the context of the field.

      The main strengths of the manuscript are in my opinion:

      - To report on direct observation of E. coli persisters to ampicillin (200µg/mL) in 5 different growth media (typically 20 persisters or more per condition, one condition with 12 only), which constitutes without a doubt an experimental tour de force.

      - To aim at bridging the population level and the single-cell level by measuring relevant variables for each and analyzing them jointly.

      - To demonstrate that in most conditions a large fraction of surviving cells was actively growing before drug exposure.

      In addition, although it is well-known that E. coli doesn't need to maintain its rod shape for surviving and dividing, I found very remarkable in their data the extent to which morphology can be affected in persister cells and their progeny, since this really challenges our understanding of E. coli's "lifestyle" (these swimming amoeba-like cells in Supp Video 11 are mind-blowing!).

      We are grateful to the reviewer for the articulation of the strength of this study. 

      Unfortunately, these positive aspects are counter-balanced by several shortcomings in the way experiments are analyzed and interpreted, which I explain below. Moreover, the manuscript is written in a way that makes it very hard to find important information on how experiments are done and is likely to leave the reader with an impression of confusion about what the main findings actually are.

      We thank the reviewer for pointing out these important issues regarding the original manuscript. Please see our replies below regarding how we corresponded to each specific comment to resolve the issue. To make the experimental methods and procedures more accessible and interpretable, we have added more explanations of the experimental details to the Results and Methods sections. Furthermore, since we understood that some of the confusions came from the insufficient explanation of the preculture procedures for the microfluidic experiments, we have modified the schematic illustration of the method shown in Fig. S1 in the original manuscript and moved it as the first main figure in the revised manuscript (Fig. 1C and D). We have also added an illustration that explains the cultivation procedures for the batch culture experiments as Fig.

      6A. 

      My major concerns are the following:

      (1) The main interpretation framework proposed by the authors is to assess whether cells not growing before drug exposure (so-called "dormant") are more or less likely to survive the treatment than growing ones ("non-dormant"). Fig 2A and Fig 3G show the main conclusions of the article from this perspective, that growing cells can survive the treatment and that the fraction of persisters in a given condition is not explained by the fraction of "dormant" cells, respectively. With this analysis, the authors essentially assume that "dormant" cells are of the same type in their different conditions, which ignores the progress in this field over the last decade (Balaban et al. 2019). I argue on the contrary that the observation of "diverse modes of survival in antibiotic persistence" is expected from their experimental design. In particular, the sensitivity of E. coli to beta-lactams such as ampicillin is expected to be much lower during the lag out of the stationary phase, a phenomenon which has been coined "tolerance"; hence in the Late Stationary condition, two subpopulations coexist for which different response to ampicillin is expected. I propose steps toward a more compelling interpretation of the experimental data. Should this point be taken seriously by the authors, it, unfortunately, implies a major rewriting of the article, including its title.

      We thank the reviewer for bringing to our attention the point that may have caused confusion in the original manuscript. 

      The primary purpose of this manuscript was not to assess whether non-growing cells prior to drug exposure are more or less likely to survive treatment than growing cells. Rather, we wanted to examine how different persister cell dynamics emerge at the single-cell level depending on previous cultivation history, growth media, and antibiotic types. We believe that this point is clearer in the revised manuscript with the newly added single-cell dynamics data (Fig. 2D, 2H, 4B, 4D, Fig. 4 – figure supplement 1 and 2A, Fig. 5B, 5D, Fig. 5 – figure supplement 1, Fig. 8B, 8D, and Fig. 8 – figure supplement 1). 

      We also did not mean to imply that "dormant cells" were of the same type under different conditions, as we were aware of the diversity of cellular states of non-growing cells, as well as the reduced sensitivity of cells to antibiotics during the lag out of stationary phase. We believe that one of the reasons this point may have been unclear is that in the previous version we had referred to all cells that were not growing prior to antibiotic treatment as "dormant cells", a term that is often used in a more restricted way to refer to cells under prolonged growth arrest. Therefore, in the revised manuscript, we have avoided the term "dormant cells" and instead simply referred to these as "non-growing cells". Accordingly, we have changed the title of the paper from "Observation of non-dormant persister cells reveals diverse modes of survival in antibiotic persistence" to "Observation of persister cell histories reveals diverse modes of survival in antibiotic persistence".

      To further address these points, we have improved the description of the experimental procedures for the single-cell measurements (see the reviewer's next comment as well). The nongrowing persisters of the MF1 strain found in the post-exponential phase cell populations must be of a different type than those found in the post-early and post-late stationary phase cell populations due to the experimental design. All early and late stationary phase cells were maintained in a non-growing state by flowing conditioned media prepared from the early and late stationary phase cultures until the start of the time-lapse measurements. Thus, aside from potential physiological heterogeneity, the non-growing cells prior to drug treatment are all long lagging cells. On the other hand, for the post-exponential phase condition, we maintained exponential growth conditions during the period from the start of the second pre-culture to the start of antibiotic treatment, including the period during sample preparation for time-lapse measurements. Given the exponential dilution by growth of cell populations, the non-growing persisters are unlikely to be long lagging cells (see our response to Reviewer 2's third comment  in "Recommendations for the authors"). We now describe these experimental procedures in more detail in the Results section (L161-178, L287-297). In addition, we discuss the diversity of cellular states of both non-growing and growing cells in Discussion, citing literature (L545-557).

      (2) The way the authors describe their experiments with bacteria in the stationary phase is very problematic. For instance, they write that they "sampled cells from early and late stationary phases (...) and exposed them to 200 μg/mL of Amp in both batch and single-cell cultures." For any reader in a hurry (hence skipping methods and/or supplementary figure), this leads to believe that bacteria sampled in the stationary phase were exposed to the drug right away (either by adding the drug to the stationary phase sample, or more classically by transferring cells to fresh media with antibiotics). However, it turns out that, after sampling and loading in the microfluidic device, bacteria are grown 2 h in LB (or 4 h in M9) - I don't know what to think of such a blatant omission. The names chosen for each condition should reflect their most important aspects, here "stationary" is simply not appropriate - maybe something like "post early stationary" instead. In any case, I believe that this point highlights further the misconception pointed out in 1 and implies that the average reader will be at best confused, and probably misled.

      We again thank the reviewer for pointing out the insufficient explanation of the method for the single-cell measurements and the helpful recommendation regarding our nomenclature for different conditions. As mentioned above, we now present the previous supplementary figure that schematically explains the experimental procedure as the first main figure to clarify how we prepared the cells loaded into the microfluidic device for single-cell measurements (Fig. 1C and D). Also, following the reviewer's suggestion, we now refer to the conditions as "post-exponential phase," "post-early stationary phase," and "post-late stationary phase" in the revised manuscript. 

      We included a 2-hour (or 4-hour in M9) cultivation period in fresh medium in batch cultures for measuring killing curves to make the cultivation conditions prior to antibiotic treatment as similar as possible between batch and microfluidic experiments. We have clarified the presence of preexposure cultivation of post-early stationary and post-late stationary phase cell populations in the fresh medium before treating them with antibiotics (L264-269, Fig. 6A), so that readers can more easily recognize the experimental conditions.

      (3) Figures 4 and 5 are of very minor significance, and the methodology used in Fig 4 is questionable. The authors measure the abundance of an Rpos-mCherry translational fusion because its "high expression has been suggested to predict persistence". The rationale for this (that an RpoS-mCherry fusion would be a proxy for intracellular ppGpp levels, and in turn predict persistence) has never been firmly established, and the standards used in the article where this reporter was introduced (Maisonneuve, Castro-Camargo, and Gerdes 2013) are notoriously low (which eventually led to its retraction) - I don't know what to think of the fact that the authors cite a review by this group rather than their retracted article. While transcriptional fusions of promoters regulated by RpoS have been proposed to measure its regulatory activity (Patange et al. 2018), the combination of self-regulation and complex post-translational regulation of rpoS makes the physical meaning of the reporter used here completely unclear. Moreover, this translational fusion is introduced without doing any of the necessary controls to demonstrate that the activity of RpoS is not impaired by the addition of the fluorescent protein. Fig 5 simply reports the existence of persisters to ciprofloxacin growing before the treatment. This might be a new observation but it is not unexpected given that a similar observation has been made with a similar drug, ofloxacin (Goormaghtigh and van Melderen 2019), as pointed out in the introduction. There is no further quantitative claim on this.

      We thank the reviewer for pointing out the issue of the RpoS-mCherry fusion. As we mentioned in our response to Essential Revision 2 and also to the comment from reviewer #2, we have tested the sensitivity of this fluorescent reporter strain to oxidative stress and confirmed that it is as sensitive as the rpoS strain (Fig. 1 – figure supplement 1C). Therefore, the RpoS function seems to be defective in this strain, as now explained in Results (L69-79). After confirming the problem with the RpoS-mCherry fusion, we removed all analyses and related arguments that relied on the RpoS expression level (previous Figure 4). In addition, we repeated almost all the experiments with the original MG1655 strain to confirm that the observed results are not specific to the problematic reporter strain. 

      Regarding the experiments with CPFX, we have added a more detailed analysis of single cell dynamics and found that, contrary to the reported results for ofloxacin, not all persistent cells show filamentation after drug withdrawal (Fig. 8C and D, Fig. 8 – figure supplement 1). In addition, we performed new microfluidic experiments in which we treated post-late stationary phase cells with CPFX (Fig. 3). In contrast to the Amp treatment result and the previous study that reported the persistence of post-stationary phase cell populations to ofloxacin (ref. 20), all the persisters for which we identified the pre-exposure growth traits in this condition grew normally prior to CPFX treatment. These newly added analyses and experiments clarify the significance of the CPFX experiments. 

      (4) The authors don't mention the dead volume nor the speed of media exchange in their device. Hopefully, it is short compared to the duration of the treatment; however, it is challenging to remove all antibiotics after the treatment and only 1e-3 or 1e-4 of the treatment concentration is already susceptible to affecting regrowth in fresh media. If this is described in another article, it would be worth adding a comment in the main text.

      We thank the reviewer for bringing up this important point. We have added the perfusion chamber volume and medium flow rate information in the Methods section (L809-817).   

      In the study in which two of the authors participated, the medium exchange rate across the semipermeable membrane was evaluated in a similar device with similar microchamber dimensions (ref. 26). There, we confirmed that the medium exchange was completed within 5 min, which is much shorter than the period of antibiotic treatment and post-antibiotic treatment periods for observing regrowth. We have also included this information in the main text with the reference (L58-63).

      Despite the relatively high medium exchange rate, we cannot formally exclude the possibility that a small amount of antibiotic may remain in the device, e.g. due to non-specific adsorption on the internal surface of the microchambers. In such cases, the residual antibiotics may influence the physiological states of the cells and the regrowth kinetics in the post-exposure periods, as suggested by the reviewer. However, the frequencies of persister cells in the cell populations in our single-cell measurements are comparable to those in the batch culture measurements. Therefore, the removal of antibiotic drugs in our device is at least as efficient as in the batch culture assay. To clarify this point, we have added a paragraph to the Discussion with a reference that reviews the influence of antibiotics at concentrations significantly lower than the MICs (L482-

      489).    

      (5) Fig 2A supports the main finding that a significant fraction of bacteria surviving the treatment are growing before drug exposure, but it uses a poorly chosen representation.

      - In order to compare between conditions, one would like to see the fraction of each type in the population.

      - The current representation (of a fraction of each type among surviving cells) requires a side-byside comparison with a random sample (which will practically be equivalent to the fraction of each type among killed cells) in order to be informative.

      We have changed the style of the previous Fig. 2A to show the fraction of each type in the population instead of the fraction of each type among surviving cells (Fig. 3 and Fig. 3-figure supplement 1).

    1. Briefing Doc : L'avenir des associations face à la raréfaction des subventions publiques

      Thème central : La raréfaction des subventions publiques et privées pour les associations en France représente un risque majeur pour la démocratie, la cohésion sociale et la capacité de la société civile à répondre aux besoins et à innover.

      Sources :

      • Excerpts from "Jeudi du Développement Local "un jour les associations sans subventions publiques : un risque ? "" : Transcription d'une table ronde introduite par Claude Grivel (président de Lunadel) et
      • animée par Carole et Anne Carton (Lunadel), avec les interventions de Cécile Malot (Fondation de France) et Dominique Joseph (membre du CESE et co-rédactrice du rapport "Financement des associations : une urgence démocratique").

      Points clés et Idées Principales :

      1. Introduction et Contexte (Claude Grivel & Carole) :

      Les "Jeudis du Développement Local" sont un rendez-vous régulier depuis 2020, témoignant de l'engagement continu autour des enjeux du développement local et de la vie associative. La situation des associations ne s'est pas améliorée depuis le rapport du CESE de mai 2024, suscitant des inquiétudes quant à leur gestion et leur avenir. L'absence d'associations aurait un coût social considérable, difficile à évaluer précisément, soulignant leur rôle essentiel dans la société française. Le développement local repose sur la capacité à rassembler divers acteurs et habitants autour d'un territoire et d'un désir commun de "faire société", rôle dans lequel les associations sont fondamentales. La matinée est structurée autour des regards croisés de différents intervenants concernés par la vie associative.

      2. Le Rôle et la Fragilisation des Associations (Cécile Malot) :

      • Rôle essentiel : Les associations sont créatrices d'emploi (11% des salariés du secteur privé), offrent un sens à la vie des bénévoles et salariés, assurent la prise en charge de l'intérêt général en complément de l'action publique, sont des lieux d'agilité, d'expérimentation et de créativité, et constituent un gage de cohésion et de lien social. "s'il y avait pas d'association je crois que la société irait très mal".
      • Fragilisation :Baisse des financements publics : Que ce soit de l'État ou des collectivités territoriales, cette baisse ne pourra pas être compensée par la philanthropie. "la Fondation de France ne peut pas faire de fausses promesses de ce point de vue nous n'aurons pas les moyens de compenser la baisse des financement publics et ce n'est pas notre rôle."
      • Mise en concurrence par les appels à projets : Tant publics que privés, les appels à projets peuvent être vertueux lorsqu'ils soutiennent des initiatives libres des associations. Cependant, lorsqu'ils prennent la forme de commandes, ils entravent l'expression citoyenne, baissent la qualité des projets et peuvent nourrir un sentiment de rejet et de ressentiment.
      • Injonction à l'hybridation des ressources : La vente de biens et services, bien que pouvant apporter des financements, peut fragiliser la capacité des associations à recevoir des dons et certaines activités à caractère économique ne sont pas éligibles au mécénat.

      3. La Nouvelle Stratégie de la Fondation de France (Cécile Malot) :

      • Constat : Le fonctionnement basé à 95% sur les appels à projets avait des limites (machine à dire non, temps passé pour les associations et les équipes, enfermement des projets dans des "tuyauteries").
      • Virage stratégique :Développement de collectifs d'action : Neuf collectifs thématiques réunissant des acteurs philanthropiques, des personnalités qualifiées et des personnes concernées pour établir des stratégies de financement communes. "ensemble acteurs philanthropiques personnes qualifiées personnes concernées nous allons établir une stratégie de financement sur le thème qui est le nôtre."
      • Réduction de la part des appels à projets : Objectif de maximum 50%, impliquant le développement d'autres modes d'action.
      • Développement du "repérage direct" : Présence des équipes dans les territoires pour des diagnostics sensibles, soutien à des structures relais (comme les écoutes territoriales de Lunadel), dialogue organique avec le secteur associatif pour nourrir la stratégie.
      • Soutien structurel pluriannuel : Moins de projets mais des subventions plus conséquentes et dans la durée, orientées vers la structure plutôt que vers une action spécifique.
      • Logique de changement systémique : Agir à la racine des problèmes, pas uniquement sur les symptômes.
      • Défis de la nouvelle approche : Question de l'équité, de l'ouverture, de la manière de ne pas être dans un entre-soi et de financer toujours les mêmes, conciliation entre approche systémique et soutien aux besoins urgents.

      4. L'Urgence Démocratique et le Rôle des Associations (Dominique Joseph) :

      • Une France sans associations : Toujours imaginable, mais avec des conséquences désastreuses sur les liens sociaux, la confiance citoyenne (les associations étant un rempart face à la crise de confiance), la réponse aux besoins (au-delà des services, les besoins de lien), et la démocratie (lieux d'apprentissage du débat et de la décision collective). "une France sans association c'est une France dont les liens entre les citoyens et citoyennes seraient complètement brisés".
      • Lien entre fin des subventions et risque démocratique : Le développement des appels à projets a contribué à un changement de paradigme où les associations sont analysées à l'aune de ce qu'elles font plutôt que de ce qu'elles sont, menaçant leur éthique et leur capacité d'initiative citoyenne. Le manque de financement peut entraîner la disparition d'associations essentielles au lien social et à la réponse aux besoins, notamment dans les "diagonales du vide".
      • Éléments marquants du rapport CESE :La professionnalisation/gestionnarisation et la fatigue exprimée par les bénévoles et salariés.
      • Le nombre important de réponses à l'enquête (6500), soulignant l'ampleur du problème.
      • La prise de conscience au sein du CESE de l'urgence démocratique.
      • Réactions et impact du rapport :Sentiment de reconnaissance et de prise en compte des problématiques par les associations.
      • Possibilité de "coalition" et de mobilisation des acteurs associatifs.
      • Intérêt et demandes de présentation du rapport par des collectivités territoriales et des parlementaires.
      • Nécessité d'une prise de conscience des décideurs sur la transformation des modalités de financement au-delà des enveloppes globales.

      5. Perspectives et Mobilisation (Jean-Baptiste Jobard, Collectif des Associations Citoyennes) :

      • Grille de lecture du CAC : Le rapport CESE est lu à travers la grille "Quel monde associatif demain ?" (scénario d'affaiblissement vs. renforcement).
      • Scénario de l'affaiblissement (4+1 mots clés) : Marchandisation, Instrumentalisation, Managérialisation, Répression, Désadministration (conséquences des réformes de l'administration).
      • Antidotes : Démarcheisation, Coconstruction, Démocratisation interne, Droit, Alliances.
      • Appropriation du rapport CESE : Importance de préconisations comme la remise en question du CER (Contrat d'Engagement Républicain) et la transformation de la déductibilité des dons en crédit d'impôts.
      • Mot clé : Subvention. Vue comme l'autre nom de l'initiative citoyenne organisée, l'enjeu est la sécurisation et la démocratisation de la subvention (collectivisation de la délibération sur la répartition des fonds).
      • Convergence avec la philanthropie : Le rapport permet de mieux asseoir l'action associative dans une perspective "polyesque" (économie solidaire au-delà du marché).
      • Mobilisation "Vers des soulèvements associatifs" : Lancement le 1er juillet pour informer, agir et faire entendre la voix des associations comme porteuses de solutions face aux défis sociétaux (et non seulement défendre leur propre financement). Actions symboliques autour de la devise républicaine. Articulation avec les forums associatifs de septembre et une campagne nationale de grande ampleur à venir.

      Conclusion Générale (Claude Grivel) :

      • Nécessité de passer d'une logique de défiance à une logique de confiance envers les associations.
      • Importance de la contractualisation, de la coopération et d'une meilleure connaissance mutuelle entre les sphères publique, privée et associative.
      • Le monde associatif est une source essentielle de créativité et d'innovation au service du développement local et de la réponse aux besoins.
      • Consolider la démocratie passe par la valorisation de la liberté associative et de la capacité à "faire société" ensemble.
      • Le monde associatif est un rempart essentiel pour la défense de la démocratie et des valeurs républicaines.
      • Prochain rendez-vous : Jeudi du Développement Local, 24 avril, 10h, sur le thème des 32 heures et de l'engagement.
    1. Reviewer #2 (Public review):

      Summary

      The authors have used the UK Biobank data to interrogate the association between plasma metabolites and glaucoma.

      (1) They initially assessed plasma metabolites as predictors of glaucoma: The addition of NMR-derived metabolomic data to existing models containing clinical and genetic data was marginal.<br /> (2) They then determined whether certain metabolites might protect against glaucoma in individuals at high genetic risk: Certain molecules in bioenergetic pathways (lactate, pyruvate and citrate) conferred protection.<br /> (3) They provide support for protection conferred by pyruvate in a murine model.

      Weaknesses

      (1) Although it is an invaluable treasure trove of data, selection bias and self-reporting are inescapable problems when using the UK Biobank data for glaucoma research. The high-impact glaucoma-related GWAS publications (Ref 26 and 27) referenced in support of the method suffer the same limitations. This doesn't negate the conclusions but must be taken into consideration. The authors might note that it is somewhat reassuring that the proportion of glaucoma cases (4%) is close to what would be expected in a population-based study of 40-69-year-olds of predominantly white ethnicity.<br /> (2) As noted by the authors, a limitation is the predominantly white ethnicity profile that comprises the UK Biobank.<br /> (3) Also as noted by the authors, the study is cross-sectional and is limited by the "correlation does not imply causation" issue.<br /> (4) The optimal collection, transport and processing of the samples for NMR metabolite analysis is critical for accurate results. Strict policies were in place for these procedures, but deviations from protocol remain an unknown influence on the data.<br /> (5) In addition, all UK Biobank blood samples had unintended dilution during the initial sample storage process at UK Biobank facilities. (Julkunen, H. et al. Atlas of plasma NMR biomarkers for health and disease in 118,461 individuals from the UK Biobank. Nat Commun 14, 604 (2023) Samples from aliquot 3, used for the NMR measurements, suffered from 5-10% dilution. (Allen, Naomi E., et al. Wellcome Open Research 5 (2021): 222.) Julkunen et al. report that "The dilution is believed to come from mixing of participant samples with water due to seals that failed to hold a system vacuum in the automated liquid handling systems. While this issue is likely to have an impact on some of the absolute biomarker concentration values, it is expected to have limited impact on most epidemiological analyses."

      Strengths

      The huge sample size supports a powerful statistical analysis and the opportunity for the inclusion of multiple covariates and interactions without overfitting the models.<br /> The authors have constructed a robust methodology and statistical design.<br /> The manuscript is well-written, and the study is logically presented.<br /> The Figures are of good quality.

      Broadly, the conclusions are justified by the findings.

      Impact<br /> The findings advance personalized prognostics for glaucoma that combine metabolomic and genetic data. In addition, the protective effect of certain metabolites influences further research on novel therapeutic strategies.

      Comments on revisions:

      The authors have thoughtfully and comprehensively addressed my comments. I have no further comments.

    1. Reviewer #2 (Public review):

      Summary:

      This work assesses the genetic interaction between the Bmp signaling pathway and the factor Numb, which can inhibit Notch signalling. It follows up on the previous studies of the group (Tian, eLife, 2014; Tian, PNAS, 2014) regarding BMP signaling in controlling stem cell fate decision as well as on the work of another group (Sallé, EMBO, 2017) that investigated the function of Numb on enteroendocrine fate in the midgut. This is an important study providing evidence of a Numb-mediated back up mechanism for stem cell maintenance.

      Strengths:

      (1) Experiments are consistent with these previous publications while also extending our understanding of how Numb functions in the ISC.<br /> (2) Provides an interesting model of a "back up" protection mechanism for ISC maintenance.

      Weaknesses:<br /> (1) Aspects of the experiments could be better controlled or annotated:<br /> (a) As they "randomly chose" the regions analyzed, it would be better to have all from a defined region (R4 or R2, for example) or to at least note the region as there are important regional differences for some aspects of midgut biology.<br /> (b) It is not clear to me why MARCM clones were induced and then flies grown at 18{degree sign}C? It would help to explain why they used this unconventional protocol.

      (2) There are technical limitations with trying to conclude from double-knockdown experiments in the ISC lineage, such as those in Figure 1 where Dl and put are both being knocked down: depending on how fast both proteins are depleted, it may be that only one of them (put, for example) is inactivated and affects the fate decision prior to the other one (Dl) being depleted. Therefore, it is difficult to definitively conclude that the decision is independent of Dl ligand.

      (3) Additional quantification of many phenotypes would be desired.<br /> (a) It would be useful to see esg-GFP cells/total cells and not just field as the density might change (2E for example).<br /> (b) Similarly, for 2F and 2G, it would be nice to see the % of ISC/ total cell and EB/total cell and not only per esgGFP+ cell.<br /> (c) Fig1: There is no quantification - specifically it would be interesting to know how many esg+ are su(H)lacZ positive in Put- Dl- condition compared to WT or Put- alone. What is the n?<br /> (d) Fig2: Pros + cells are not seen in the image? Are they all DllacZ+?<br /> (e) Fig3: it would be nice to have the size clone quantification instead of the distribution between groups of 2 cell 3 cells 4 cell clones.<br /> (f) How many times were experiments performed?

      (4) The authors do not comment on the reduction of clone size in DSS treatment in Figure 6K. How do they interpret this? Does it conflict with their model of Bleo vs DSS?

      (5) There is probably a mistake on sentence line 314 -316 "Indeed, previous studies indicate that endogenous Numb was not undetectable by Numb antibodies that could detect Numb expression in the nervous system".

      Comments on revisions:

      The authors have by and large addressed my main points.

    2. Author response:

      The following is the authors’ response to the original reviews

      Reviewer #1 (Public review):

      Summary:

      By way of background, the Jiang lab has previously shown that loss of the type II BMP receptor Punt (Put) from intestinal progenitors (ISCs and EBs) caused them to differentiate into EBs, with a concomitant loss of ISCs (Tian and Jiang, eLife 2014). The mechanism by which this occurs was activation of Notch in Put-deficient progenitors. How Notch was upregulated in Put-deficient ISCs was not established in this prior work. In the current study, the authors test whether a very low level of Dl was responsible. But co-depletion of Dl and Put led to a similar phenotype as depletion of Put alone. This result suggested that Dl was not the mechanism. They next investigate genetic interactions between BMP signaling and Numb, an inhibitor of Notch signaling. Prior work from Bardin, Schweisguth and other labs has shown that Numb is not required for ISC self-renewal. However the authors wanted to know whether loss of both the BMP signal transducer Mad and Numb would cause ISC loss. This result was observed for RNAi depletion from progenitors and for mad, numb double mutant clones. Of note, ISC loss was observed in 40% of mad, numb double mutant clones, whereas 60% of these clones had an ISC. They then employed a two-color tracing system called RGT to look at the outcome of ISC divisions (asymmetric (ISC/EB) or symmetric (ISC/ISC or EB/EB)). Control clones had 69%, 15% and 16%, respectively, whereas mad, numb double mutant clones had much lower ISC/ISC (11%) and much higher EB/EB (37%). They conclude that loss of Numb in moderate BMP loss of function mutants increased symmetric differentiation which lead caused ISC loss. They also reported that numb<sup>15</sup> and numb<sup>4</sup> clones had a moderate but significant increase in ISC-lacking clones compared to control clones, supporting the model that Numb plays a role in ISC maintenance. Finally, they investigated the relevance of these observation during regeneration. After bleomycin treatment, there was a significant increase in ISC-lacking clones and a significant decrease in clone size in numb<sup>4</sup> and numb<sup>15</sup> clones compared to control clones. Because bleomycin treatment has been shown to cause variation in BMP ligand production, the authors interpret the numb clone under bleomycin results as demonstrating an essential role of Numb in ISC maintenance during regeneration.

      Strengths:

      (i) Most data is quantified with statistical analysis

      (ii) Experiments have appropriate controls and large numbers of samples

      (iii) Results demonstrate an important role of Numb in maintaining ISC number during regeneration and a genetic interaction between Mad and Numb during homeostasis.

      Weaknesses:

      (i) No quantification for Fig. 1

      Quantification of Fig.1 has been added. 

      (ii) The premise is a bit unclear. Under homeostasis, strong loss of BMP (Put) leads to loss of ISCs, presumably regardless of Numb level (which was not tested). But moderate loss of BMP (Mad) does not show ISC loss unless Numb is also reduced. I am confused as to why numb does not play a role in Put mutants. Did the authors test whether concomitant loss of Put and Numb leads to even more ISC loss than Put-mutation alone.

      We have tested the genetic interaction between put and numb using Put RNAi and Numb RNAi driven by esg<sup>ts</sup>. According to the results in this study and our previously published data, put mutant clone or esg<sup>ts</sup> > Put-RNAi induced a rapid loss of ISC (whin 8 days). We did not observe further enhancement of stem cell loss phenotype in Put and Numb double RNAi guts.

      (iii) I think that the use of the word "essential" is a bit strong here. Numb plays an important role but in either during homeostasis or regeneration, most numb clones or mad, numb double mutant clones still have ISCs. Therefore, I think that the authors should temper their language about the role of Numb in ISC maintenance.

      We have revised the language and changed “essential” to important”.

      Reviewer #2 (Public review):

      Summary:

      This work assesses the genetic interaction between the Bmp signaling pathway and the factor Numb, which can inhibit Notch signalling. It follows up on the previous studies of the group (Tian, Elife, 2014; Tian, PNAS, 2014) regarding BMP signaling in controlling stem cell fate decision as well as on the work of another group (Sallé, EMBO, 2017) that investigated the function of Numb on enteroendocrine fate in the midgut. This is an important study providing evidence of a Numb-mediated back up mechanism for stem cell maintenance.

      Strengths:

      (1) Experiments are consistent with these previous publications while also extending our understanding of how Numb functions in the ISC.

      (2) Provides an interesting model of a "back up" protection mechanism for ISC maintenance.

      Weaknesses:

      (1) Aspects of the experiments could be better controlled or annotated:

      (a) As they "randomly chose" the regions analyzed, it would be better to have all from a defined region (R4 or R2, for example) or to at least note the region as there are important regional differences for some aspects of midgut biology.

      Thank you for the suggestion. In fact, we conducted all the analyses in region 4, we have added statement to clarify this in the revised manuscript.

      (b) It is not clear to me why MARCM clones were induced and then flies grown at 18{degree sign}C? It would help to explain why they used this unconventional protocol.

      We kept the flies at 18°C to avoid spontaneous clone.

      (2) There are technical limitations with trying to conclude from double-knockdown experiments in the ISC lineage, such as those in Figure 1 where Dl and put are both being knocked down: depending on how fast both proteins are depleted, it may be that only one of them (put, for example) is inactivated and affects the fate decision prior to the other one (Dl) being depleted. Therefore, it is difficult to definitively conclude that the decision is independent of Dl ligand.

      In our hand, Dl-RNAi is very effective and exhibited loss of N pathway activity (as determined by the N pathway reporter Su(H)-lacZ ) after RNAi for 8 days (Fig. 1D). Therefore, the ectopic Su(H)-lacZ expression in Punt Dl double RNAi (fig. 1E) is unlikely due to residual Dl expression. Nevertheless, we have changed the statement “BMP signaling blocks ligand-independent N activity” to” Loss of BMP signaling results in ectopic N pathway activity even when Dl is depleted”

      (3) Additional quantification of many phenotypes would be desired.

      (a) It would be useful to see esg-GFP cells/total cells and not just field as the density might change (2E for example).

      We focused on R4 region for quantification where the cell density did not exhibit apparent change in different experimental groups. In addition, we have examined many guts for quantification. It is very unlikely that the difference in the esg-GFP+ cell number is caused by change in cell density.

      (b) Similarly, for 2F and 2G, it would be nice to see the % of ISC/ total cell and EB/total cell and not only per esgGFP+ cell.

      Unfortunately, we didn’t have the suggested quantification. However, we believe that quantification of the percentage of ISC or EB among all progenitor cells, as we did here, provides a meaningful measurement of the self-renewal status of each experimental group.

      (c) Fig1: There is no quantification - specifically it would be interesting to know how many esg+ are su(H)lacZ positive in Put- Dl- condition compared to WT or Put- alone. What is the n?

      Quantification of Fig.1 has been added. 

      (d) Fig2: Pros + cells are not seen in the image? Are they all DllacZ+?

      Anti-Pros and anti-E(spl)mβ-CD2 were stained in the same channel (magenta).  Pros+ exhibited “dot-like” nuclear staining while CD2 staining outlined the cell membrane of EBs. We have clarified this in the revised figure legend.

      (e) Fig3: it would be nice to have the size clone quantification instead of the distribution between groups of 2 cell 3 cells 4 cell clones.

      Because of the heterogeneity of clone size for each genotype, we chose to group clones based on their sizes ( 2, 3-6, 6-8, >8 cells) and quantified the distribution of individual groups for each genotype, which clearly showed an overall reduction in clone size for mad numb double mutant clones. We and others have used the same clone size analysis in previous studies (e.g., Tian and Jiang, eLife 2014).

      (f) How many times were experiments performed?

      All experiments were performed at least 3 times.

      (4) The authors do not comment on the reduction of clone size in DSS treatment in Figure 6K. How do they interpret this? Does it conflict with their model of Bleo vs DSS?

      Guts containing numb<sup>4</sup> clones treated with DSS exhibited a slight reduction of clone size, evident by a higher percentage of 2-cell clones and lower percentage of > 8 cell clones. This reduction is less significant in guts containing numb<sup>15</sup> clones. However, the percentage of Dl<sup>+</sup>-containing clones is similar between DSS and mock-treated guts. It is possible that ISC proliferation is lightly reduced due to numb<sup>4</sup> mutation or the genetic background of this stock.

      (5) There is probably a mistake on sentence line 314 -316 "Indeed, previous studies indicate that endogenous Numb was not undetectable by Numb antibodies that could detect Numb expression in the nervous system".

      We have modified the sentence.

      Reviewer #3 (Public review):

      Summary:

      The authors provide an in-depth analysis of the function of Numb in adult Drosophila midgut. Based on RNAi combinations and double mutant clonal analyses, they propose that Numb has a function in inhibiting Notch pathway to maintain intestinal stem cells, and is a backup mechanism with BMP pathway in maintaining midgut stem cell mediated homeostasis.

      Strengths:

      Overall, this is a carefully constructed series of experiments, and the results and statistical analyses provides believable evidence that Numb has a role, albeit weak compared to other pathways, in sustaining ISC and in promoting regeneration especially after damage by bleomycin, which may damage enterocytes and therefore disrupt BMP pathway more. The results overall support their claim.

      The data are highly coherent, and support a genetic function of Numb, in collaborating with BMP signaling, to maintain the number and proliferative function of ISCs in adult midguts. The authors used appropriate and sophisticated genetic tools of double RNAi, mutant clonal analysis and dual marker stem cell tracing approaches to ensure the results are reproducible and consistent. The statistical analyses provide confidence that the phenotypic changes are reliable albeit weaker than many other mutants previously studied.

      Weaknesses:

      In the absence of Numb itself, the midgut has a weak reduction of ISC number (Fig. 3 and 5), as well as weak albeit not statistically significant reduction of ISC clone size/proliferation. I think the authors published similar experiments with BMP pathway mutants. The mad<sup>1-2</sup> allele used here as stated below may not be very representative of other BMP pathway mutants. Therefore, it could be beneficial to compare the number of ISC number and clone sizes between other BMP experiments to provide the readers with a clearer picture of how these two pathways individually contribute (stronger/weaker effects) to the ISC number and gut homeostasis.

      Thanks for the comment. We have tested other components of BMP pathway in our previously study (Tian et al., 2014). More complete loss of BMP signaling (for example, Put clones, Put RNAi, Tkv/Sax double mutant clones or double RNAi) resulted in ISC loss regardless the status of numb, suggesting a more predominant role of BMP signaling in ISC self-renewal compared with Numb. We speculate that the weak stem cell loss phenotype associated with numb mutant clones in otherwise wild type background could be due to fluctuation of BMP signaling in homeostatic guts.

      The main weakness of this manuscript is the analysis of the BMP pathway components, especially the mad<sup>1-2</sup> allele. The mad RNAi and mad<sup>1-2</sup> alleles (P insertion) are supposed to be weak alleles and that might be suitable for genetic enhancement assays here together with numb RNAi. However, the mad<sup>1-2</sup> allele, and sometimes the mad RNAi, showed weakly increased ISC clone size. This is kind of counter-intuitive that they should have a similar ISC loss and ISC clone size reduction.

      We used mad<sup>1-2</sup> and mad RNAi here to test the genetic interaction with numb because our previous studies showed that partial loss of BMP signaling under these conditions did not cause stem cell loss, therefore, may provide a sensitized background to determine the role of Numb in ISC self-renewal. The increased proliferation of ISC/ clone size associated with mad<sup>1-2</sup> and mad RNAi is due to the fact that reduction of BMP signaling in either EC or EB non-autonomously induces stem cell proliferation. However, in mad numb double mutant clones, there was a reduction in clone size due to loss of ISC in many clones.

      A much stronger phenotype was observed when numb mutants were subject to treatment of tissue damaging agents Bleomycin, which causes damage in different ways than DSS. Bleomycin as previously shown to be causing mainly enterocyte damage, and therefore disrupt BMP signaling from ECs more likely. Therefore, this treatment together with loss of numb led to a highly significant reduction of ISC in clones and reduction of clone size/proliferation. One improvement is that it is not clear whether the authors discussed the nature of the two numb mutant alleles used in this study and the comparison to the strength of the RNAi allele. Because the phenotypes are weak and more variable, the use of specific reagents is important.

      We have included information about the two numb alleles in the “Materials and Methods”. numb<sup>15</sup> is a null allele, and the nature of numb<sup>4</sup> has not been elucidated. According to Domingos, P.M. et al., numb<sup>15</sup> induced a more severe phenotype than numb<sup>4</sup> did. Consistently, we also found that more numb<sup>15</sup> mutant clones were void of stem cell than numb<sup>4</sup> mutant clones.

      Furthermore, the use of possible activating alleles of either or both pathways to test genetic enhancement or synergistic activation will provide strong support for the claims.

      Activation of BMP (esgts>Tkv<sup>CA</sup>) alone induced stem cell tumor (Tian et al., 2014) whereas overexpression of Numb did not induce increase stem cell number although overexpression of Numb in wing discs produced phenotypes indictive of inhibition of N (our unpublished observation), making it difficult to test the synergistic effect of activating both BMP and Numb.

      Reviewer #1 (Recommendations for the authors):

      - Cartoon of RGT in Fig 4 needs to be improved. We need to know what chromosome harbors the esgts. It is not sufficient to simply put the location of the ubi-GFP and ubi-RFP (on 19A) and not show the location of other components of the RGT system.

      Thank you for the suggestion. We have revised the cartoon in Fig. 4 to include all three pairs of chromosomes and indicate where the esgts driver and UAS-RNAi are located. In addition, we have included the genotypes for all the genetic experiments in the Method section.

      - Quantification of the results in Fig. 1

      Quantification of Fig.1 has been added. 

      - The authors need to explain the premise more carefully (see above) and explain whether or not they tested put, numb double knockdowns.

      We have explained why not testing put numb double RNAi (see above).

      Reviewer #2 (Recommendations for the authors):

      The number of times the experiments have been performed would be useful to include.

      This information has been added in the figure legends.

    1. Reviewer #2 (Public review):

      Summary:

      The authors used whole-network imaging to identify sensory neurons that responded to the repellant 1-octanol. While several olfactory neurons responded to the initial onset of odor pulses, two neurons consistently responded to all the pulses, ASH and AWC. ASH typically activates in response to repellants, and AWC typically activates in response to the removal of attractants. However, in this case, AWC activated in response to the removal of 1-octanol, which was unexpected because 1-octanol is a harmful repellant to the worm. The authors further investigated this phenomenon by testing different concentrations of 1-octanol in a chemotaxis assay and found that at lower (less harmful) concentrations the odor is actually an attractant, but becomes repulsive at higher concentrations. The amplitude of the ASH response appeared to be modulated by concentration, but this was not true for AWC. The authors propose a model where the behavioral response of the worm is the result of integrating these two opposing drives, where repulsion is a result of the increased ASH activity overriding the positive drive from AWC. The authors further tested this theory by testing mutants that ablated the AWC response (tax-4) or ASH response (osm-9), which produced results consistent with their hypothesis. While the interneuron(s) that integrate these signals to influence behavior were not identified, the authors did find that increasing concentrations of 1-octanol did increase the likelihood of AVA activity, a neuron that drives reversals (and hence, behavioral repulsion).

      Strengths:

      This was simple and elegant work that identified specific neurons of interest which generated a hypothesis, which was further tested with mutants that altered neuronal activity. The authors performed both neuronal imaging and behavioral experiments to verify their claims.

      Weaknesses:

      tax-4, but not osm-9 mutants were used in chemotaxis and imaging assays. It would have been nice to have osm-9 results as well for these assays. The mutants are not specific to AWC and ASH. Cell-specific rescue of these neurons would have strengthened the proposed model.

    2. Reviewer #3 (Public review):

      Summary:

      This work describes how two chemosensory neurons in C. elegans drive opposite behaviors in response to a volatile cue. Because they have different concentration dependencies, this leads to different behavioral responses (attraction at low concentration and repulsion at high concentration). It has been known that many odorants that are attractive at low concentrations are aversive at high concentrations, and the implicated neurons (at least AWC for attraction and ASH for repulsion) have been well established. Nonetheless, studying behavior and neural responses in a common context (odor pulses, as opposed to gradients) provides a clear picture of how these sensory neurons may guide the dose-dependent response by separately modulating odor entry and odor exit behaviors.

      Strengths:

      (1) There is good evidence that worms are attracted to low concentrations and repelled by high concentrations of 1-oct. Calcium imaging also makes it clear that dose dependence is stronger for ASH than AWC.

      (2) There is good evidence for conc. dependent responses via ASH (Figure 4E) and attractive inhibition via tonic IAA (Figure 7A).

      (3) This work presents calcium imaging and behavior with the same stimulus (sudden pulses in volatile odor concentration), while previous studies often focus on using neuronal responses to pulses to understand the navigation of gentle gradients.

      Weaknesses:

      (1) It is not clear precisely how important AWC is (compared to other cells) for the attractive response, though the presence of odor-off behavior implicates it. This could be resolved by looking at additional mutants (tax-4 is broad).

      (2) Relatedly, dose-dependent chemotaxis data (Figure 4C, D) should be provided for osm-9 animals to get a sense of the degree to which dose-dependence is explained by ASH.

      (3) Figure 4A, B should include average traces with errors, as there are several ways the responses can vary across conditions.

      (4) The data in Figure 6G does not appear to have error bars. Also, it would help to include a more conventional demonstration of AIB responding to stimuli (e.g. averaging stimulus-aligned responses as a percent of the fluorescence value at stimulus onset to perform the desired subtraction). Subtracted calcium traces are harder to interpret. As it stands, the evidence that sensory signals are persisting in AIB and not being shunted by proprioceptive feedback in microfluidic devices is not strong.

    1. City Life works to build the Housing Justice Movement. Our special emphasis withinthat broad current is anti-displacement, anti-eviction organizing. We focus on building ananti-displacement movement for several reasons:1. It is the front line of the housing crisis. It is the point where the drive for maximumreal estate profit confronts tenant resistance to no-fault eviction.2. Working class communities of color, who are directly targeted by the real estateindustry, can be effectively organized through anti-displacement work to play aleading role in housing and other social justice movement struggles.3. City Life’s anti-capitalist orientation, our willingness to challenge market orthodoxy,and our commitment to an intersectional approach, make it possible for us to do thiswork. For the same reason, few organizations can do it.4. Our political orientation helps create a new community of resistance, led by workingclass people of color, that attracts broad sections of the working class: students,homeowners, lawyers, activists from other movements and others drawn to ‘dosomething’ about the out of control housing crisis.5. New leaders rapidly emerge from this community and are trained to becomeorganizers. City Life’s organizing intentionally is structured to develop leaders whocome from our base and who have the skills to sustain radical base building and anti-displacement movement building over the long term.
    2. Organizing requires two other elements – a sense of righteousness and a sense ofcapacity or power. You can’t organize around your grievance if you think it was ‘your fault,’or even if you think it was nobody’s fault. You can’t think about organizing if you don’t thinkthere is any possibility of ‘winning.’ Part of feeling capacity and power is having anunderstanding of strategy.City Life documents have sometimes summed up these two elements with thisshorthand:1. There is a structural crisis of displacement.2. That crisis is not our fault, certainly not the fault of working class people of color.3. That crisis is the fault of people who are identifiable.4. We have the capacity to defeat those people.
    1. The chaotic nature of the game also serves to represent the horrors and chaos that one experiences during trafficking. This page has paths: 1 2025-01-23T03:17:35+00:00 Kristine Kelly 704347a0fb0f4b5c42bc63d040b84f065ec3a67c Immersion and Simulation Work Kristine Kelly 6 plain 2025-03-27T18:42:55+00:00 Kristine Kelly 704347a0fb0f4b5c42bc63d040b84f065ec3a67c Contents of this path: 1 2025-03-25T19:09:10+00:00 Immersion Exercise Between “C ya laterrrr” and Murray 6 Yash, Arnesa, Shaun plain 2025-03-25T19:37:41+00:00 1 2025-03-25T19:07:57+00:00 Immersion and Simulation Group 2 9 Rhetorical Analysis of "Motions" plain 2025-03-27T18:46:24+00:00 1 2025-03-25T19:23:04+00:00 Group 3 - "c ya laterrrr" and Frasca - Julia, Ava, Sanjana, and Sriya 8 plain 2025-03-27T18:47:17+00:00 1 2025-03-25T19:06:23+00:00 Immersion and Simulation Work (Frasca & Motions Group 4) 19 Reef / Alex / Justin plain 2025-03-27T18:44:34+00:00 This page references: 1 2025-03-25T19:08:58+00:00 Motions - Register Your Child 1 Hazel Smith - Motions plain 2025-03-25T19:08:58+00:00 1 2025-03-25T19:10:42+00:00 Motions - Djamel Hides 1 Hazel Smith - Motions plain 2025-03-25T19:10:42+00:00

      We also agree that the format of the game can illustrate the severities of trafficking and allow users to emphasize with the victims. However, as referred to in the concept of being a "spectactor", the player is still unable to fully understand how a victim of human trafficking may feel.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility, and clarity)

      The manuscript by Song et al presents evidence to show that the predicted cysteine protease type 6 secretion system (T6SS) effector Cpe1 inhibits target cell growth by cleaving type II DNA Topoisomerases GyrB and ParE. The authors determined the structure of the protein complex formed by Cpe1 and its immunity protein Cpi1, which allowed them to reveal the mechanism of inhibition. Moreover, the authors identified type II DNA topoisomerases GyrB and ParE as the targets of Cpe1. Overall, the major conclusions were well supported by experimental data of high quality. The findings have expanded our appreciation of the mechanism utilized by T6SS effectors to inhibit target cell growth.

      We thank the reviewer for their positive remarks and valuable suggestions to improve this manuscript.


      Major comments

      To better establish that GyrB and ParE are the sole targets of Cpe1, the authors should express the GG mutant in target cells and determine whether these cells become resistant to Cpe1-mediated killing (inhibition). They can also determine whether co-expression of the cleavage resistant mutants suppresses the toxicity of Cpe1.

      We appreciate the reviewer’s suggestion to investigate additional substrates of Cpe1 beyond GyrB and ParE, which may not have been fully captured in our crosslinking-mass spectrometry experiments due to technical limitations or low protein abundance. To address this topic, we generated target cells heterologously expressing cleavage-resistant GyrB and ParE variants (GyrBΔG102 and ParEΔG98) that are not susceptible to Cpe1, as described in our original manuscript (Figures 3h, i). We performed both Cpe1 expression assay and competition assay to assess if expression of the cleavage-resistant variants suppresses Cpe1 toxicity (Author Response Figures 1a, b). However, we did not observe a substantial protective effect. While this outcome could suggest that GyrB and ParE are not the sole targets of Cpe1, alternative explanations are also plausible. In the Cpe1 expression assay, high levels of Cpe1 could still act on endogenous wild-type GyrB and ParE, and although we attempted to increase variant expression, precise quantification remains challenging. In the competition assay, highly active Cpe1 may have continued to target wild-type substrates throughout the experiment, potentially masking any protective effect. Additionally, reduced activity of the mutant proteins could contribute to the observed results. Finally, deletion of the global repressor H-NS in the Cpe1-producing E. coli strain may have induced other interbacterial competition mechanisms1, leading to growth inhibition independently of Cpe1. Addressing these questions comprehensively would require a more systematic investigation under a wider range of conditions. We consider this an important avenue for future studies.

      Results in Figure 7 clearly show that Cpi1 is capable of displacing ParE from Cpe1 due to higher affinity. Yet, the "competitive inhibition model" described in the last result section does not completely match what is really happening in Cpe1-mediated interbacterial competition. If Cpi1 is in the target cell, it would more likely engage the incoming Cpe1 before it can interact with ParE or GyrB, so competition does not occur in this scenario. Similarly, in the predatory cells expressing Cpe1 and Cpi1, these two proteins will form a stably protein complex, and no competition with the target will occur. The authors should reconsider their model.

      We thank the reviewer for their comments and appreciate the opportunity to clarify this point. First, we believe the reviewer is referring to Figure 5 rather than Figure 7. In our model, the primary role of immunity proteins in interbacterial competition is to neutralize cognate toxins and prevent self- or kin-intoxication. These immunity proteins exhibit high specificity and strong binding affinity toward their associated toxins, ensuring effective protection2. In predatory cells, immunity proteins are typically co-expressed with their corresponding toxins, likely enabling immediate suppression upon translation. During kin competition, immunity proteins can protect cells even after foreign toxins engage their substrates.

      Our results demonstrate that Cpi1 binds Cpe1 with higher affinity than its substrates and can displace them from pre-formed Cpe1-substrate complexes (Figures 5b-f). This aligns with the established function of immunity proteins in interbacterial competition and provides a mechanistic basis for how they confer protection, even when toxins have initially engaged their targets2. We acknowledge the reviewer’s point that in both scenarios—whether in the recipient cell or the toxin-producing cell—Cpe1 may first encounter Cpi1. However, our model underscores that Cpi1 not only binds at the substrate site but also exhibits superior affinity for Cpe1, ensuring robust protection against Cpe1-mediated toxicity.

      Minor comments

      "Intoxication" was used throughout the text numerous times to describe the activity of Cpe1. Looking in the Marriam-Webster dictionary, "Intoxication" means "a condition of being drunk". This word should be replaced with "toxicity" or some other terms in this line.

      We thank the reviewer for this comment. We acknowledge that the term "intoxication" is commonly associated with alcohol consumption, yet the Merriam-Webster dictionary also defines it as "an abnormal state that is essentially a poisoning" (https://www.merriam-webster.com/dictionary/intoxication). This definition aligns with its well-established usage in the field of interbacterial competition to describe the effects of interbacterial toxins during antagonism3-5, which we have adopted in our manuscript. However, we appreciate the reviewer’s concern and remain open to revising the terminology if deemed necessary for clarity.

      Lines 46-48, references on contact-dependent killings by these systems mentioned should cited. Ref. 9 cited does NOT cover the information at all.

      We thank the reviewer for this comment. We have revised the citation and now reference studies that specifically describe contact-dependent killing systems in the relevant sentences (Lines 45–____50)

      "characterizations" should be "characterization".

      We have now modified the sentence as requested (Line 69)

      Line 229 "Cpe1-Bpa monomers" should be " apo Cpe1-Bpa". The results cannot distinguish whether these bands are monomers or multimers.

      We appreciate the reviewer’s careful assessment of our manuscript. The results in Line 233 (Figure 3c) show the enrichment of His-tagged proteins, including crosslinked complexes and overproduced Cpe1-Bpa. Based on the molecular weight marker, the Cpe1-Bpa bands appear between 10–15 kDa, consistent with the molecular weight of Cpe1 monomers (Figure 3a). Therefore, we have labeled this band as “Cpe1-Bpa monomers” and maintained this terminology throughout the text. This designation aligns with previous studies utilizing site-specific crosslinking via Bpa incorporation6,7

      Line 283, was the mutation deletion? Substitution was used I think.

      We thank the reviewer for highlighting this point. The GyrB and ParE mutants used to confirm the cleavage sites were deletion mutants, with a single glycine removed from the predicted double-glycine motifs. We have now revised the text for clarity (Lines 285–290)

      Lines 439-444 the discussion should be extended to include other bacterial toxins that target type II DNA topoisomerases (e.g. PMID: 26299961 and PMID: 26814232).

      We appreciate the reviewer’s suggestion. The studies referenced (PMID: 26299961 and PMID: 26814232) describe FicT toxin with adenylyl transferase activity that target and post-translationally modify GyrB and ParE at their ATPase domains, highlighting a potential hotspot for topoisomerase inhibition. We have now incorporated an additional paragraph in the Discussion section to describe these findings (Lines 424–439).

      Reviewer #1 (Significance)

      The authors determined the structure of the protein complex formed by Cpe1 and its immunity protein Cpi1, which allowed them to reveal the mechanism of inhibition. Moreover, the authors identified type II DNA topoisomerases GyrB and ParE as the targets of Cpe1. Overall, the major conclusions were well supported by experimental data of high quality. The findings have expanded our appreciation of the mechanism utilized by T6SS effectors to inhibit target cell growth.

      We sincerely thank the reviewer for their positive comments and for the suggestions to improve our manuscript.

      Reviewer #2 (Evidence, reproducibility, and clarity)

      The manuscript, titled "An Interbacterial Cysteine Protease Toxin Inhibits Cell Growth by Targeting Type II DNA Topoisomerases GyrB and ParE", describes how an effector family was identified and characterized as a papain-like cysteine protease (PLCP) that negatively impacts bacterial growth in the absence of its co-encoded immunity protein. This thorough report includes (1) bioinformatic analysis of prevalence, finding this PLCP effector encoded in many gram-negative bacteria, (2) confirming conservation of catalytic active site via structural (crystallographic) analysis, as well as visualizing contacts with the immunity protein, (3) validation of results using growth studies combined with mutagenesis, (4) using a cell-based cross-linking method to pull out potential targets, which were subsequently identified via mass spectrometry, (5) validation of these results using in vitro protease assays with purified (potential) substrates, including verification of the motif recognized on the substrate(s), and cell-based phenotype analyses, and finally, (6) demonstrating competition between immunity protein and ParE substrate using an in vitro pull-down approach. Overall, this is a strong body of work with compelling conclusions that are well supported by multiple experimental approaches.

      We appreciate the reviewer for their positive comments regarding our original submission.

      Major comments

      The claims made based on the presented results are well supported, including that this PLCP effector toxin is widespread, is neutralized in a competitive mechanism by its immunity partner, and that it effectively cleaves both GyrB and ParE (subunits of bacterial type II topoisomerases) at a conserved motif, resulting in suppression of bacterial cell growth via mis-regulating chromosome segregation. No additional experiments are needed to further validate these results, and the authors are commended on the cell-based and in vitro studies to deduce very specific mechanisms and structural details.

      We appreciate the reviewer’s positive feedback.

      Minor comments

      While the writing and data presentation are extremely clear, in general I recommend the authors indicate the level(s) of replication for experiments. Figure legends generally note that mean values with standard deviations are shown, but I did not find where the number of replicates (and independent versus technical) were listed.

      We appreciate the reviewer’s suggestion. We have now revised the manuscript to specify the levels of replication (independent vs. technical) for each experiment in the figure legends, particularly in Figures 2 and 3.

      The figures are very clear, but in many instances the addition of PLCP toxin is indicated as "before" and "after"; while a modest change, I recommend altering this to some type of "-" and "+" type nomenclature rather than a time-based notation (especially as presumably both samples were treated identically, just with or without protease).

      We thank the reviewer for this helpful comment. In Figures 3 and Supplementary Figures 5, 9, we used "before" and "after" to indicate the time points for in vitro cleavage assays verifying Cpe1 cleavage. To minimize variations between reactions, the catalytic mutant Cpe1tox (Cpe1toxC362A) was used as a comparison rather than a reaction without Cpe1tox. In these assays, duplicate reaction mixtures were prepared: one was denatured immediately after preparation ("before" reaction) to serve as a baseline, while the other was incubated to allow enzymatic activity ("after" reaction). This labeling clarifies the comparison between initial and processed samples. We believe this approach clearly distinguishes the effects of Cpe1 activity and provides a reliable basis for assessing proteolysis in our assays.

      I also suggest quantifying the intensities of the gel images presented in Figure 5c, d (for example, Cpe1 intensity as a ratio to that of the ParE ATPase domain), to make the interpretation even more evident.

      We thank the reviewer for the valuable suggestion to quantify the signal intensities of the gel images presented in Figures 5c, d. We have now included the quantification results in Supplementary Figures 9e, f and have updated the respective text in the manuscript (Lines 826-828 and 1066-1087).

      Crystallographic structure: the PDB report notes some higher-than-expected RZR (RSRZ) scores; I interpret this to mean that there was strain around the catalytic site of one of the two toxins in the asymmetric unit, or that this copy was less well ordered. The RZR outliers likely arise from non-optimal weighting for geometric restraints. While no figures of electron density are presented, these modest outliers are not expected to alter the conclusions reached in the current work. One point of interest that is not addressed, however, is if any variance between the two complexes in the asymmetric unit are noted? A passage compares the current toxins to others in the larger subfamily and notes a rotation of a side chain is needed to superpose (Line 159). Can the authors please clarify around which bond this rotation is needed, and if both copies in the asymmetric unit are in the same orientation at this site?

      We appreciate the reviewer’s insightful comments.

      1. We have provided the electron density map for the RSR-Z outlier residues along with the model (Author response Figure 2a). These outlier residues are located at the loop regions of a molecule within the asymmetric unit in the crystal (Chain B). As a result, the electron density for their side chains appears to be noisier compared to residues in the well-folded regions, leading to higher RSR-Z scores. Notably, when we superimposed the models of two complexes within the asymmetric unit, the calculated RMSD value was 0.402 Å (Author response Figure 2b), indicating that the two models are structurally very similar and that these residues are properly assigned. Therefore, the RSR-Z outliers do not significantly impact the overall structure.
      2. Here, we provide a zoomed-in view of Figure 2d, highlighting the superimposed crystal structures of Cpe1 and the closely related PLCPs, ComA and LahT (Author response Figure 2c). As shown, the side chain of the catalytic cysteine residue in ComA adopts a different orientation, positioning it slightly farther from the homologous residues in Cpe1 and LahT. However, since the backbone and catalytic pockets remain structurally intact, we believe that this deviation arises due to results from crystal packing effects rather than an inherent functional distinction. We have now modified the main text (Lines 159-166) to clarify this and prevent any potential misinterpretation.

      Reviewer #2 (Significance)

      Bacteria encode numerous effectors to successfully compete in natural environments or to mediate virulence; these effectors are typically associated with type VI secretion system machinery or referred to as contact dependent inhibition systems. The current work has identified a sub-family of papain-like cysteine protease effectors that are unique by targeting type II topoisomerases. Among the actionable findings is the identification of both the specific site of interaction with the topo substrates, as well as the specific motif recognized for cleavage. This should enable the field to move forward probing for this activity with other toxins and substrates. The insights provided by the competitive neutralization mechanism also stand out as an important contribution that can be more broadly applied. Within the literature, few effector targets are identified, making the current study stand out as impactful by the well-executed experiments that directly support the conclusions.

      While the current study has strong elements of novelty and is complete, it also nicely sets up future studies for remaining open questions. For example, does the nucleotide-bound status of the ATPase domain, or other catalytic intermediate, impact the susceptibility of topoisomerases to cleavage? Is this identified motif found in other ATPase domains? Is the negative supercoiling activity unique to gyrase also impacted, or is the phenotypic mechanism of cell toxicity reliant only on chromosome segregation? What types of kinetic parameters do this class of toxins demonstrate, and does sequence variability alter this? These ideas are a testament to the intriguing study as presented, capturing the readers' curiosity for additional details that are clearly beyond the scope of the current work.

      I anticipate this work will be of interest to the broad field of microbiologists that study interbacterial communication as well as pathogenic mechanisms. While the research is largely fundamental in nature, it is wide in scope with applications to many gram-negative bacteria that inhabit a myriad of niches. The work will also be of interest to specialists in topoisomerases, as the list of toxins that target these essential enzymes is growing and the therapeutic utility of topoisomerase inhibition remains vital. My interest lies in the latter, in toxin-mediated inhibition of topoisomerase enzymes as a means to alter bacterial cell growth. While I have strong expertise in structural biology, I am lacking in expertise for mass spectrometry. I note this because this method was used for the identification of the target substrate.

      We appreciate the reviewer’s insightful discussion and interest in our study. We agree that further investigations are crucial to address the open questions posed, and we have initiated work on some of these avenues.

      For example, considering Cpe1's specificity for the ATPase domain of GyrB and ParE, we have begun examining whether Cpe1 targets other ATPase domains by searching for the consensus sequence or double glycine motifs in the sequences of ATPase domains beyond GyrB and ParE. Among the 42 E. coli ATPase domains identified by the PEC database8, we found several with double glycine residues. However, none contained the exact LHAGGKF consensus sequence identified in GyrB and ParE, which are targeted by Cpe1 (Author Response Figure 3). These findings suggest that Cpe1 is less likely to target other ATPase domains. Nonetheless, due to Cpe1’s potential tolerance of certain variations within the consensus sequence, we cannot draw a definitive conclusion without further investigation into the cleavage sites.

      Another critical open question is the impact of Cpe1-mediated cleavage on the function of GyrB and ParE. To address this topic, we have begun investigating if Cpe1 cleavage affects the ATPase activity of these proteins. As expected, our biochemical analysis has demonstrated a significant decrease in ATP hydrolysis in the presence of active Cpe1tox, but not in the presence of the catalytic mutant Cpe1toxC362A (Author response Figures 4a, b). These results confirm that the ATP-dependent activities of both GyrB and ParE are disrupted following Cpe1 cleavage9. Previous work on FicT toxin that inhibits GyrB and ParE ATPase activity through post-translational modification found that ATP-dependent activities such as DNA supercoiling, relaxation, and decatenation were inhibited10,11. Interestingly, GyrB’s relaxation of negative supercoiled DNA, which does not require ATP, was also affected to some extent. This outcome raises the question as to whether Cpe1-cleaved GyrB results in similar downstream defects. Investigating this possibility would provide valuable insights into Cpe1’s mode of action, although we feel doing so is beyond the scope of the current study. Consequently, we view this as an important area for future research.

      Finally, regarding the potential applications of Cpe1, we are interested in further investigating its enzymatic specificity and properties. In this study, we analyzed the binding kinetics between Cpe1 and its substrate (Figure 5f) and currently we are endeavoring to characterize the kinetics of Cpe1-mediated proteolysis. To better probe hydrolytic dynamics, we plan to utilize a substrate with a reporting group (such as a chromogenic or fluorogenic leaving group) to monitor cleavage over time. We could achieve this by designing a recombinant substrate based on our knowledge of Cpe1’s native substrates (GyrB and ParE) and the target sequence (“LHAGGKF”). Alternatively, a secondary reaction leading to colorimetric changes could be employed for detection. We consider this an exciting research direction and an important next step for this study.

      Overall, we are grateful for the reviewer’s recognition of the novelty and importance of our work in advancing the understanding of interbacterial toxins and their inhibitory effects on topoisomerases. We plan to further investigate the consequences of Cpe1 cleavage on GyrB and ParE and to explore Cpe1 kinetics and its mechanistic actions in more detail. This will not only deepen our understanding of bacterial toxin-mediated inhibition but may also provide critical insights into strategies for targeting type II DNA topoisomerases. The reviewer’s insightful feedback has proven invaluable in shaping our ongoing and future research directions.

      Reviewer #3 (Evidence, reproducibility, and clarity)

      Bacterial warfare in microbial communities has become illuminated by recent discoveries on molecular weapons that allow contact-dependent injection of bacterial toxins between competitors. Among the best characterized systems are the type VI secretion system (T6SS) or the contact-dependent inhibition (CDI) system (i.e. some of the T5SSs). These systems are delivering a plethora of toxins with various biochemical activities and a broad range of targets. In recent years many such toxins have been characterized and their relevance in pointing at appropriate drug targets is increasing.

      In this study the authors built on a previously published association of a family of proteins, papain-like cysteine proteases (PLCPs), with their delivery by T6SS or CDI into target bacterial cells. Whereas this observation is not particularly novel, the findings that this set of proteins, that the authors called now Cpe1, can specifically target bacterial proteins such as ParE and GyrB, so that it affects chromosome partitioning and cell division, is groundbreaking. The authors are clearly demonstrating that Cpe1 cleaves their target proteins at double glycine recognition site which is in line with previous characterization of such proteases when fused to a particular category of ABC transporters. Even more remarkably they can show using biochemical approaches that Cpi1 is a cognate immunity for CpeI, preventing its activity, not by interfering with the catalytic site, but instead with the substrate binding site. The mechanism of competitive inhibition between immunity and substrate is also substantiated by biochemical data.

      We sincerely appreciate the reviewer’s interest in and support of our study.

      Major comments

      • This is a very well conducted study which combines bacterial genetics and phenotypes with excellent biochemical evidence.

      We thank the reviewer for their positive comments.

      • There are 8 targets identified for Cpe1 and yet only two are cleaved by the enzyme. It is intriguing that FtsZ is one identified target by the pull down but not confirmed for cleavage. The authors rules this as false positive but the cell division defect associated with Cpe1 activity would be consistent here. Are there any double glycine in FtsZ that could be identified as cleavage site? Is it possible that slightly different incubation conditions may promote degradation of FtsZ?

      We appreciate the reviewer’s thoughtful comment regarding FtsZ as a potential substrate of Cpe1. This was indeed an intriguing possibility, especially given the cell division defects observed following Cpe1 intoxication. Early on in the project, we also identified FtsZ as a Cpe1 interactor in our proteomic crosslinking assays, which further fueled the hypothesis that FtsZ might be a target.

      To explore this possibility, first we examined the FtsZ protein sequence for potential Cpe1 cleavage sites and identified several double glycine motifs (Author response Figure 5a). However, none of these motifs matched the consensus sequence identified in GyrB and ParE, which is LHAGGKF, a sequence that we have shown to be critical for Cpe1 cleavage activity. In an effort to better understand if FtsZ could still be cleaved by Cpe1, we conducted additional cleavage assays under various conditions (Author response Figure 5b). We tested different incubation temperatures, including increasing the temperature to 37 °C, and extended the reaction time to overnight. However, we did not observe any cleavage of FtsZ under these conditions. Given that FtsZ undergoes significant conformational changes upon binding to GTP12, we also considered the possibility that the GTP-bound form of FtsZ might be cleaved by Cpe1. However, even under those conditions, no significant cleavage of FtsZ was detected (Author response Figure 5b). Based on these results, we do not have any evidence to support that FtsZ is a target of Cpe1. The observed cell division defects are more likely a secondary effect resulting from the cleavage of GyrB and ParE, direct targets of Cpe1 that are crucial for chromosome segregation.

      • Could it be structurally predicted whether the GG of ParE or GyrB is fitted into the catalytic site of Cpe1.

      We appreciate the reviewer’s insightful question regarding the structural prediction of the GG motif of ParE and GyrB fitting into the catalytic site of Cpe1. To address this possibility, we used Alphafold 3 to predict the interaction structure between Cpe1 and its substrates13. The resulting model of Cpe1 interacting with the ATPase domain of GyrB (GyrBATPase) is shown in Supplementary Figure 9c. As illustrated, the loop of the GyrB ATPase domain containing the consensus targeting sequence (“LHAGGKF”) fits into the catalytic site of Cpe1, with the GG motif positioned closest to the catalytic cysteine residue, which likely facilitates hydrolysis. We also attempted to model the interaction between Cpe1 and the ATPase domain of ParE. However, confidence for this model was lower (ipTM = 0.74, pTM = 0.71), possibly due to Alphafold’s preference for certain protein configurations. To gain a more accurate understanding of how Cpe1 binds and recognizes its substrates, we are currently working on co-crystallizing Cpe1tox with GyrB and ParE. This long-term project aims to provide precise structural insights into the Cpe1-substrate interaction and further elucidate the mechanism of cleavage.

      Minor comments

      • The authors described a family of proteases, PLPCs, and characterized one here called Cpe1. Not clear whether this is a generic name or one specific protein from one particular bacterial species. Indeed, it is unclear from which bacterial strain the Cpe1 protein studied here originates.

      We thank the reviewer for this comment and apologize for the lack of clarity. To provide better context, we have now revised the manuscript (Lines 136-137 and 141-145) to clearly state that the Cpe1 protein characterized in this study originates from E. coli strain ATCC 11775.

      • It may be worth to emphasize that the Cpe1 domain is found in all possible configurations as T6SS cargo and that is to be linked to VgrG, PAAR or Rhs.

      Thank you for this suggestion. We have revised the manuscript accordingly to emphasize this point (Lines 106-109).

      • Line 49 the authors could indicate that the Esx system is also known as type VII secretion system (T7SS).

      Thank you for this suggestion. We have revised the manuscript accordingly (Line 48-50).

      • Line 113 it may be better to use Proteobacteria instead of Pseudomonadota

      We have revised the manuscript (Lines 114-115) as suggested by the reviewer. It is important to note that following the recent decision by the International Committee on Systematics of Prokaryotes (ICSP) to amend the International Code of Nomenclature of Prokaryotes (ICNP) and formally recognize "phylum" under official nomenclature rules14,15, the taxonomy database used in our analysis has adopted the updated nomenclature. To ensure consistency, we followed this updated nomenclature throughout the original manuscript.

      Reviewer #3 (Significance)

      This is an excellent piece of work. The characterization of Cpe1 might look poorly novel at the start when compared to previous studies. Yet the findings go crescendo by characterizing original mechanisms of action of the cognate immunity, and by identifying the molecular target of Cpe1. This is providing real conceptual advance in the T6SS field and not just reporting yet another T6SS toxin.

      As a T6SS expert I genuinely feel that these findings are groundbreaking and could be targeted to broad audience since the possible implications of these observations for future antimicrobial drugs discovery or therapeutic approaches is highly relevant.

      We sincerely appreciate the reviewer’s positive remarks and support of our study.

      References

      1. Ishihama, A., and Shimada, T. (2021). Hierarchy of transcription factor network in Escherichia coli K-12: H-NS-mediated silencing and Anti-silencing by global regulators. FEMS Microbiol Rev 45. 10.1093/femsre/fuab032.
      2. Hersch, S.J., Manera, K., and Dong, T.G. (2020). Defending against the Type Six Secretion System: beyond Immunity Genes. Cell Rep 33, 108259. 10.1016/j.celrep.2020.108259.
      3. Russell, A.B., Singh, P., Brittnacher, M., Bui, N.K., Hood, R.D., Carl, M.A., Agnello, D.M., Schwarz, S., Goodlett, D.R., Vollmer, W., and Mougous, J.D. (2012). A widespread bacterial type VI secretion effector superfamily identified using a heuristic approach. Cell Host Microbe 11, 538-549. 10.1016/j.chom.2012.04.007.
      4. Jana, B., Fridman, C.M., Bosis, E., and Salomon, D. (2019). A modular effector with a DNase domain and a marker for T6SS substrates. Nat Commun 10, 3595. 10.1038/s41467-019-11546-6.
      5. Halvorsen, T.M., Schroeder, K.A., Jones, A.M., Hammarlof, D., Low, D.A., Koskiniemi, S., and Hayes, C.S. (2024). Contact-dependent growth inhibition (CDI) systems deploy a large family of polymorphic ionophoric toxins for inter-bacterial competition. PLoS Genet 20, e1011494. 10.1371/journal.pgen.1011494.
      6. Nguyen, T.T., Sabat, G., and Sussman, M.R. (2018). In vivo cross-linking supports a head-to-tail mechanism for regulation of the plant plasma membrane P-type H(+)-ATPase. J Biol Chem 293, 17095-17106. 10.1074/jbc.RA118.003528.
      7. Liu, Y., Yu, J., Wang, M., Zeng, Q., Fu, X., and Chang, Z. (2021). A high-throughput genetically directed protein crosslinking analysis reveals the physiological relevance of the ATP synthase 'inserted' state. FEBS J 288, 2989-3009. 10.1111/febs.15616.
      8. Yamazaki, Y., Niki, H., and Kato, J. (2008). Profiling of Escherichia coli Chromosome database. Methods Mol Biol 416, 385-389. 10.1007/978-1-59745-321-9_26.
      9. Reece, R.J., and Maxwell, A. (1991). DNA gyrase: structure and function. Crit Rev Biochem Mol Biol 26, 335-375. 10.3109/10409239109114072.
      10. Harms, A., Stanger, F.V., Scheu, P.D., de Jong, I.G., Goepfert, A., Glatter, T., Gerdes, K., Schirmer, T., and Dehio, C. (2015). Adenylylation of Gyrase and Topo IV by FicT Toxins Disrupts Bacterial DNA Topology. Cell Rep 12, 1497-1507. 10.1016/j.celrep.2015.07.056.
      11. Lu, C., Nakayasu, E.S., Zhang, L.Q., and Luo, Z.Q. (2016). Identification of Fic-1 as an enzyme that inhibits bacterial DNA replication by AMPylating GyrB, promoting filament formation. Sci Signal 9, ra11. 10.1126/scisignal.aad0446.
      12. Matsui, T., Han, X., Yu, J., Yao, M., and Tanaka, I. (2014). Structural change in FtsZ Induced by intermolecular interactions between bound GTP and the T7 loop. J Biol Chem 289, 3501-3509. 10.1074/jbc.M113.514901.
      13. Abramson, J., Adler, J., Dunger, J., Evans, R., Green, T., Pritzel, A., Ronneberger, O., Willmore, L., Ballard, A.J., Bambrick, J., et al. (2024). Accurate structure prediction of biomolecular interactions with AlphaFold 3. Nature 630, 493-500. 10.1038/s41586-024-07487-w.
      14. Oren, A., Arahal, D.R., Rossello-Mora, R., Sutcliffe, I.C., and Moore, E.R.B. (2021). Emendation of Rules 5b, 8, 15 and 22 of the International Code of Nomenclature of Prokaryotes to include the rank of phylum. Int J Syst Evol Microbiol 71. 10.1099/ijsem.0.004851.
      15. Oren, A., and Garrity, G.M. (2021). Valid publication of the names of forty-two phyla of prokaryotes. Int J Syst Evol Microbiol 71. 10.1099/ijsem.0.005056.
    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      The manuscript, titled "An Interbacterial Cysteine Protease Toxin Inhibits Cell Growth by Targeting Type II DNA Topoisomerases GyrB and ParE", describes how an effector family was identified and characterized as a papain-like cysteine protease (PLCP) that negatively impacts bacterial growth in the absence of its co-encoded immunity protein. This thorough report includes (1) bioinformatic analysis of prevalence, finding this PLCP effector encoded in many gram-negative bacteria, (2) confirming conservation of catalytic active site via structural (crystallographic) analysis, as well as visualizing contacts with the immunity protein, (3) validation of results using growth studies combined with mutagenesis, (4) using a cell-based cross-linking method to pull out potential targets, which were subsequently identified via mass spectrometry, (5) validation of these results using in vitro protease assays with purified (potential) substrates, including verification of the motif recognized on the substrate(s), and cell-based phenotype analyses, and finally, (6) demonstrating competition between immunity protein and ParE substrate using an in vitro pull-down approach. Overall, this is a strong body of work with compelling conclusions that are well supported by multiple experimental approaches.

      Major comments:

      The claims made based on the presented results are well supported, including that this PLCP effector toxin is widespread, is neutralized in a competitive mechanism by its immunity partner, and that it effectively cleaves both GyrB and ParE (subunits of bacterial type II topoisomerases) at a conserved motif, resulting in suppression of bacterial cell growth via mis-regulating chromosome segregation. No additional experiments are needed to further validate these results, and the authors are commended on the cell-based and in vitro studies to deduce very specific mechanisms and structural details.

      Minor comments:

      While the writing and data presentation are extremely clear, in general I recommend the authors indicate the level(s) of replication for experiments. Figure legends generally note that mean values with standard deviations are shown, but I did not find where the number of replicates (and independent versus technical) were listed.

      The figures are very clear, but in many instances the addition of PLCP toxin is indicated as "before" and "after"; while a modest change, I recommend altering this to some type of "-" and "+" type nomenclature rather than a time-based notation (especially as presumably both samples were treated identically, just with or without protease). I also suggest quantifying the intensities of the gel images presented in Figure 5c, d (for example, Cpe1 intensity as a ratio to that of the ParE ATPase domain), to make the interpretation even more evident.

      Crystallographic structure: the PDB report notes some higher-than-expected RZR scores; I interpret this to mean that there was strain around the catalytic site of one of the two toxins in the asymmetric unit, or that this copy was less well ordered. The RZR outliers likely arise from non-optimal weighting for geometric restraints. While no figures of electron density are presented, these modest outliers are not expected to alter the conclusions reached in the current work. One point of interest that is not addressed, however, is if any variance between the two complexes in the asymmetric unit are noted? A passage compares the current toxins to others in the larger subfamily and notes a rotation of a side chain is needed to superpose (Line 159). Can the authors please clarify around which bond this rotation is needed, and if both copies in the asymmetric unit are in the same orientation at this site?

      Significance

      Bacteria encode numerous effectors to successfully compete in natural environments or to mediate virulence; these effectors are typically associated with type VI secretion system machinery or referred to as contact dependent inhibition systems. The current work has identified a sub-family of papain-like cysteine protease effectors that are unique by targeting type II topoisomerases. Among the actionable findings is the identification of both the specific site of interaction with the topo substrates, as well as the specific motif recognized for cleavage. This should enable the field to move forward probing for this activity with other toxins and substrates. The insights provided by the competitive neutralization mechanism also stand out as an important contribution that can be more broadly applied. Within the literature, few effector targets are identified, making the current study stand out as impactful by the well-executed experiments that directly support the conclusions.

      While the current study has strong elements of novelty and is complete, it also nicely sets up future studies for remaining open questions. For example, does the nucleotide-bound status of the ATPase domain, or other catalytic intermediate, impact the susceptibility of topoisomerases to cleavage? Is this identified motif found in other ATPase domains? Is the negative supercoiling activity unique to gyrase also impacted, or is the phenotypic mechanism of cell toxicity reliant only on chromosome segregation? What types of kinetic parameters do this class of toxins demonstrate, and does sequence variability alter this? These ideas are a testament to the intriguing study as presented, capturing the readers' curiosity for additional details that are clearly beyond the scope of the current work.

      I anticipate this work will be of interest to the broad field of microbiologists that study interbacterial communication as well as pathogenic mechanisms. While the research is largely fundamental in nature, it is wide in scope with applications to many gram-negative bacteria that inhabit a myriad of niches. The work will also be of interest to specialists in topoisomerases, as the list of toxins that target these essential enzymes is growing and the therapeutic utility of topoisomerase inhibition remains vital. My interest lies in the latter, in toxin-mediated inhibition of topoisomerase enzymes as a means to alter bacterial cell growth. While I have strong expertise in structural biology, I am lacking in expertise for mass spectrometry. I note this because this method was used for the identification of the target substrate.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The manuscript by Song et al presents evidence to show that the predicted cysteine protease type 6 secretion system (T6SS) effector Cpe1 inhibits target cell growth by cleaving type II DNA Topoisomerases GyrB and ParE. The authors determined the structure of the protein complex formed by Cpe1 and its immunity protein Cpi1, which allowed them to reveal the mechanism of inhibition. Moreover, the authors identified type II DNA topoisomerases GyrB and ParE as the targets of Cpe1. Overall, the major conclusions were well supported by experimental data of high quality. The findings have expanded our appreciation of the mechanism utilized by T6SS effectors to inhibit target cell growth.

      Specific comments:

      Main points:

      1. To better establish that GyrB and ParE are the sole targets of Cpe1, the authors should express the GG mutant in target cells and determine whether these cells become resistant to Cpe1-mediated killing (inhibition). They can also determine whether co-expression of the cleavage resistant mutants suppresses the toxicity of Cpe1.
      2. Results in Figure 7 clearly show that Cpi1 is capable of displacing ParE from Cpe1 due to higher affinity. Yet, the "competitive inhibition model" described in the last result section does not completely match what is really happening in Cpe1-mediated interbacterial competition. If Cpi1 is in the target cell, it would more likely engage the incoming Cpe1 before it can interact with ParE or GyrB, so competition does not occur in this scenario. Similarly, in the predatory cells expressing Cpe1 and Cpi1, these two proteins will form a stably protein complex, and no competition with the target will occur. The authors should reconsider their model.

      Minor points:

      1. "Intoxication" was used throughout the text numerous times to describe the activity of Cpe1. Looking in the Marriam-Webster dictionary, "Intoxication" means "a condition of being drunk". This word should be replaced with "toxicity" or some other terms in this line.
      2. Lines 46-48, references on contact-dependent killings by these systems mentioned should cited. Ref. 9 cited does NOT cover the informatin at all.
      3. "characterizations" should be "characterization".
      4. Line 229 "Cpe1-Bpa monomers" should be " apo Cpe1-Bpa". The results cannot distinguish whether these bands are monomers or multimers.
      5. Line 283, was the mutation deletion? Substitution was used I think.
      6. Lines 439-444 the discussion should be extended to include other bacterial toxins that target type II DNA topoisomerases (e.g. PMID: 26299961 and PMID: 26814232).

      Significance

      The authors determined the structure of the protein complex formed by Cpe1 and its immunity protein Cpi1, which allowed them to reveal the mechanism of inhibition. Moreover, the authors identified type II DNA topoisomerases GyrB and ParE as the targets of Cpe1. Overall, the major conclusions were well supported by experimental data of high quality. The findings have expanded our appreciation of the mechanism utilized by T6SS effectors to inhibit target cell growth.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reply to the Reviewers

      We thank the reviewers for their evaluation of our previous submission and have responded to each point in detail below. Overall, we have revised the manuscript with the addition of several new data and corresponding figure panels that strengthen our previous conclusions and add new insights allowing us to extend the conclusions of the study. Important additions include new data showing the impact of loss of CLU on adapting to additional stressors during metabolic transitions that supports a mechanistic understanding of our omics results; by poly(dT) FISH we show that fly Clu granules indeed contain mRNAs; FRAP microscopy analysis supports that Clu1 granules have dynamic content similar to other LLPS membraneless organelles; and we have re-analysed our data to demonstrate more clearly the impact of Clu1 on translation efficiency and also the relative binding of mRNAs during translation. In addition, we provide some extra control analyses for completeness.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Summary:

      In this manuscript the authors study the Clustered mitochondrial proteins Clu of Drosophila melanogaster and Clu1 of Saccharomyces cerevisiae, two homologues of the mammalian protein CLUH. They show in compelling microscopy analysis that both proteins form granules. This was the case for flies fed on yeast paste after starvation and in yeast in post-diauxic phase, in respiratory media or during mitochondrial stress. They show that these granules are found in proximity to mitochondria and that they behave like liquid-liquid-phase separated condensates. They show by co-staining for P-bodies and stress granules that Clu1-granules are distinct from these RNA granules. Furthermore, they found that the formation required active translation. In the second part, they show that Clu1 interacts with ribosomal and mitochondrial proteins by BioID. The deletion of Clu1 leads to slightly impaired growth on media containing Ethanol as a carbon source. They find that nascent polypeptides of some mitochondrial precursor proteins are decreased in the deletion of Clu1 and conclude that Clu1 regulates translation of these proteins. Using RNA immunoprecipitation of Clu1-GFP in presence of cycloheximid, EDTA and puromycin. The mRNAs of nuclear-encoded mitochondrial proteins found to be interacting with Clu1 were purified in conditions when the ribosomes are intact and the RNAs showed no interaction when ribosomes were disassembled. They show in sucrose gradients that Clu1 co-migrates with polysomes independent of its distribution state or carbon source. However, when cells are grown in conditions of granule formation, then polysomes and Clu1 run less deeply into the gradient. Form these data, the authors conclude that Clu/Clu1 regulates the translation of nuclear-encoded mitochondrial proteins.

      Major comments:

      -The authors state that Clu1 is regulating translation during metabolic shifts. However, it is not clear what the real impact on mitochondrial function is. They show that there is a minor growth defect on ethanol media when CLU1 is deleted. However, if Clu1 is necessary mainly for adaptation, the phenotype will be strongest observed in conditions where cells switch carbon sources. Growth curves would be suitable in which the lag-phase of yeast cells precultured either in glucose or glycerol switched to media of different carbon sources (glucose to glycerol or glycerol to glucose) are measured. One would expect that the deletion mutant shows a longer lag-phase compared to the wild type when shifted from glucose to glycerol media.

      We agree that this is an important question, and, duly, we previously attempted to address this exactly as the reviewer described. Surprisingly, we were not able to observe any substantial differences in the duration of the lag phase between the wild-type and CLU1 knockout strains under these conditions. However, we did note that CLU1 knockout cells consistently reached stationary phase with a lower optical density when switched to ethanol media, consistent with these cells having a different metabolic efficiency during growth on ethanol media.

      To further explore the role of Clu1, we noted that several of the Clu1 mRNA interactors were mitochondrial heat shock proteins (HSPs), which are crucial for mitochondrial protein folding and import during the transition from fermentation to respiration. Hence, we hypothesised that the absence of Clu1 might lead to increased sensitivity to heat shock during the metabolic shift.

      To test this, we subjected both wild-type and CLU1 knockout cells to heat shock under three different conditions: (1) during growth on glucose-containing media (fermentation), (2) after shifting cells to media containing ethanol during the lag phase, when cells are adapting to respiration, and (3) after cells had fully adapted to ethanol and resumed growth. Interestingly, CLU1 knockout cells were more sensitive to heat shock selectively during the adaptation to respiration, which involves the translation of an extensive number of mitochondrial proteins. We think that the small difference in translation of mitochondrial HSPs becomes evident only upon additional heat shock, likely due to a deficient mitochondrial protein folding and import. These findings support our hypothesis that Clu1 is essential for optimal mitochondrial function during metabolic shifts.

      These results have been added to the manuscript and shown in Fig. S6 and described on page 9.

      -In line with this, how different is the mitochondrial proteome of the WT and the mutant? Do hits of the BioID, RIP and Punch-P experiments change at steady state or during metabolic shifts? Either proteomics of isolated mitochondria or western blots of whole cells or isolated mitochondria of WT and the deletion mutant grown in conditions of Clu1-granule formation or no granules for the hits could answer this question.

      We also considered this question during the course of the work. However, in exploratory analyses we saw no obvious differences in overall mitochondrial proteomics at steady-state which is what prompted us to look at more subtle effects on translation. Considering this further, changes in steady-state levels can be complex to interpret as they represent the combined effects of protein production and degradation. Small changes arising from altered production could be masked by compensatory changes in turnover rate. In light of this, we believe that the translational regulation differences identified in our study remain central to understanding the role of Clu1, and any downstream proteomic changes would not alter our primary conclusions.

      -The authors analyze RNAs bound in polysomes to assess translation efficiency. Translation efficiency is usually calculated by the fraction of RNA bound by ribosomes to the total RNA amount of an RNA species. Thus, doing RT-qPCR from whole cells would be necessary to assess if the occupancy of ribosomes on the transcripts is due to changes in RNA abundance or other regulatory pathways and would help to further assess what causes the observed changes.

      Thanks for this recommendation. To address this and expand our analysis to other proteins differentially translated in clu1Δ cells, we measured the mRNA steady-state levels by performing RNAseq on WT and clu1Δ strains grown under the same conditions as used for Punch-P. We then calculated the translation efficiency by dividing the nascent protein levels (Punch-P) by steady-state mRNA levels (RNAseq), as previously described for Punch-P data (PMID: 26824027). The translation efficiency for the majority of proteins with reduced translation in the clu1Δ cells by Punch-P analysis was lower. Similarly, the majority of proteins with increased translation had higher translation efficiency.

      The mRNA quantification in polysomes we originally presented in the manuscript, further showed that the decrease in translation efficiency is not caused by a simple decrease of mRNA engaged in translation and that Clu1 is regulating protein translation at the ribosome level. In contrast, for higher translated proteins, we detected an increase in mRNAs engaged in polysomes, likely underlying the increased translation. These results further support our conclusions regarding the regulatory effects of Clu1 on translation.

      These results have been added to the manuscript and shown in Fig. 7E and described on page 9.

      OPTIONAL:

      -The authors show a co-localization of Clu/Clu1 with mitochondrial fission factors and conclude that the granules appear likely near fission sites. Indeed, CLUH has been implied in the past to play a role in mitochondrial fission (Yang, H., Sibilla, C., Liu, R. et al. Clueless/CLUH regulates mitochondrial fission by promoting recruitment of Drp1 to mitochondria. Nat Commun 13, 1582 (2022). https://doi.org/10.1038/s41467-022-29071-4). Thus, are fission sites required for Clu-granule localizations? What is the role of the mitochondrial network integrity for the granule distribution? Expressing Clu-GFP/Clu1-GFP in cells depleted for the fission factors would provide information on that.

      Thanks for this suggestion. We agree that it would be interesting to know whether Clu1 granules still appear when mitochondrial fission is blocked. We tried to address this question but encountered some technical limitations. First, overexpression of Clu1-GFP via a plasmid did not replicate the endogenous Clu1 behaviour, making it necessary to delete the fission factors in the Clu1-GFP background. While crossing the Clu1-GFP strain with already available knockout strains would be straightforward, we would need access to a tetrad dissecting microscope, which unfortunately was not available to us. We also attempted PCR-based gene deletion but the sequence homology between the GFP-tagging cassette and the deletion cassettes made this very challenging. Given these limitations, and as the lab's yeast expert had already left, we were not able to pursue this experiment further and have removed these observations from our manuscript. We hope that future studies will explore this question in more detail.

      -The author assess convincingly that Clu1 interacts with ribosomes and runs with polysomal fractions. However, how it actually regulates translation is not clear. To answer this question, selective ribosomal profiling would be necessary. The authors have established conditions which would be suitable for the experiment. They could use crosslinking and sucrose cushions to IP ribosomes with Clu1-GFP bound to be used for ribosomal profiling. However, this experiment is quite time-intensive (3-4 months) and expensive, thus, an optional suggestion.

      We thank the reviewer for this suggestion. We agree that ribosome profiling could provide novel insights into the function of Clu1/Clu. While we recognise the potential of this approach, as the reviewer points out, this experiment would indeed be time- and resource-intensive. Based on our initial tests, where we included cross-linked samples (UV and formaldehyde) we anticipate that it could even take longer than the estimated 3-4 months, as the IP using cross-linked lysates was not as successful as the IP using non-cross-linked samples: we were not able to immunoprepitate Clu1 so efficiently likely to the epitope being poorly exposed to the antibody. Although we have optimised working conditions for co-immunoprecipitating Clu1 with ribosomes, performing ribosome profiling using our setup within the timeframe and resources of this study is unfortunately not currently feasible.

      Minor comments:

      Fig1: B, C, please add scale bars into the zoom ins.

      These have been added.

      Fig 2 would profit from inlets of zoom ins to visualize the distribution better.

      These have been added.

      Fig.3: Panel C does not really add much information. I would rather remove it or put it into supplements and therefore show a zoom of Panel E with a line plot showing the rings. It is not clear from the represented images where the rings are formed.

      We think some confusion has arisen from the text description. It seems that the reviewer was under the impression that Fig. 3C and 3E were intended to be showing the Clu1 rings around the mitochondria, but this was shown only in Fig. S3A. We have re-written these sentences for better clarity. To be clear, Fig. 3C is a 3D rendering of the left-hand cell in 3B (3D is a line plot of part of the right-hand cell) and 3E is a different experiment showing the formation of Clu1 granules under a different respiratory stress (galactose plus CCCP). We have also added a line plot showing Clu1-GFP and mito-mCherry fluorescence intensity to highlight the Clu1 rings around the mitochondria in Fig. S3A.

      Fig.3 panel F: Max projections are not appropriate to show colocalization as they can lead to false-positive overlaps. Just remove the max projections.

      We tried a number of different approaches to improve this analysis but, ultimately, we were not able to generate sufficiently robust data to be convincing so we decided to remove this from the manuscript. The coincidence of Clu1 granules with mitochondrial fission factors was an adjunct observation and not a major part of the story and has been discussed by others relating to fly Clu (PMID: 35332133), so removal from the current manuscript does not impact the key conclusions of the study.

      References 21 and 22 are the same.

      Thanks. This has been fixed.

      Reviewer #1 (Significance (Required)):

      This manuscript shows in a convincing way that Clu and Clu1 form RNA granules and that Clu1 interacts with ribosomes. It is written in a clear way and the figures support the conclusions drawn in the text. The finding that Clu/Clu1 is important for metabolic adaptation has not been shown in fly or yeast to my knowledge. It is in line with findings for the mammalian homologue CLUH. Thus, the findings are supported by earlier work. This study is of value for a broader audience of the basic research field, especially of the mitochondrial and RNA granule field, as it supports the idea of post-transcriptional regulation of nuclear-encoded mitochondrial protein gene expression for dynamic adaptation of mitochondrial function. The conditions when Clu granules form is studied in detail, followed up by identification of target RNAs and interaction partners. Though the interaction of Clu1 with ribosomes is shown in a compelling way, a detailed mechanism of the function of Clu/Clu1 is missing and would require more experiments. Thus, even though a detailed mechanism is missing, the study does expand on our understanding of Clu/Clu1 in regulating mitochondrial biogenesis and is therefore of high interest of the mitochondrial field.

      Expertise: mitochondria, yeast, RNA granules, mitochondrial biogenesis, next-generation sequencing, fluorescence microscopy

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary:

      In this manuscript the authors use D. melanogaster and S. cerevisiae to study the role of CLUH in the translation of nuclear-encoded mitochondrial proteins. During conditions requiring aerobic respiration, CLUH forms RNA-dependent granules that localise in the proximity to mitochondria. Furthermore, the authors demonstrate that CLUH interacts with translating ribosomes to facilitate the translation of specific target mRNAs. For this, the authors use a combination of GFP-tagged CLUH models. BioID, polysome translating proteomics, RNA-IP. The authors' main conclusions are that (i) CLUH forms dynamic, membrane-less, RNA-dependent granules under conditions that demand aerobic respiration, (ii) CLUH interacts with specific mRNAs encoding metabolic factors, and (iii) CLUH interacts with the translating ribosome. The manuscript is well written and the conclusions stand in proportion to the experimental output and the results. The main concern is with regards to lack of advancement in relationship to published data.

      We appreciate the reviewer's feedback and specific comments which we respond to individually below. However, we would like to first address the point regarding "lack of advancement" and the use of the "CLUH" terminology which the reviewer uses throughout their critique. We would like to reiterate, as the reviewer states, our work focussed exclusively on yeast Clu1 and Drosophila Clu. None of our data relates to mammalian CLUH. While these proteins share substantial sequence homology, it is imprudent and scientifically unsound to assume cross-species equivalence without directly testing. Indeed, one of the central aims of our study was to characterise the molecular function of yeast Clu1, which remains almost entirely unstudied.

      We acknowledge that some of the observations contained within our study have been described by others and we have appropriately noted and cited these in context. Nevertheless, (a) independent replication is always valuable but easily criticised as lacking novelty, and (b) the majority of the work was analysing the molecular dynamics and function of yeast Clu1 which is almost completely unstudied and may help provide hypotheses for others to test for conservation in mammalian CLUH. Hence, we consider that summarising the work as 'lacking advancement' is misplaced.

      Comments:

      To this reviewer it is not clear how CLUH can regulate the translation of specific mRNAs while being bound to ribosomes, regardless of being in a diffuse or granular state. The authors suggest that under metabolically active conditions, CLUH might aggregate translating ribosomes, forming the granular structures. How CLUH though can both be bound to translating ribosomes and recruit specific mRNAs at the same time is not explained.

      It was indeed surprising to us that the data indicate that Clu1 can bind both mRNAs and ribosomes to affect translation, and we share the reviewer's curiosity about the precise mechanism of how this occurs. While we have provided novel insights into this situation, dissecting the precise molecular mechanisms is beyond the scope of the current study.

      The authors might want to discuss how changes in metabolic demands signal the aggregation of CLUH, and how CLUH can recognise its target mRNAs.

      We appreciate the reviewer's point here but as this would be pure speculation we have made only brief comments on this at the end of the Discussion.

      What was the rationale to perform the RIP or the PUNCH-P experiments only under non-challenged conditions, but not under conditions demanding aerobic respiration?

      We appreciate the reviewer's question. In fact, the Punch-P analysis was carried out on cells that had been transferred to ethanol to induce respiration. This was stated in the Methods, but we appreciate that this may have been missed so we have now clarified this in the main text (p9).

      Regarding the RIP, our initial tests showed that mRNAs encoding proteins found to interact with Clu1 by BioID were interacting with Clu1 in both fermenting and respiring conditions. Due to this consistency, it did not seem necessary to perform the RIP experiments under both metabolic conditions, so we chose to conduct the experiment under the simpler growth condition.

      If CLUH is ubiquitously bound to ribosomes, has CLUH been seen in any structural representation of the cytosolic ribosome?

      This is a good question, and we wondered the same. To our knowledge, Clu1/Clu/CLUH has not been observed in any structural studies of the ribosome, and no formal structure of any Clu family proteins has been resolved.

      Nevertheless, we would like to clarify that we do not think, or suggest in the manuscript, that Clu/Clu1 is ubiquitously bound to ribosomes. First, current evidence supports that Clu/Clu1 only regulates a specific subset of mRNAs. Second, our work, particularly the sucrose gradient experiments, shows that Clu1 interacts transiently with ribosomes, as cross-linking was required to capture the full extent of this interaction. This transient and selective interaction of Clu/Clu1 with the ribosome, together with the fact that transient interactors are often lost during ribosome purification, makes Clu/Clu1 detection in structural studies unlikely. Due to the transient interaction and dynamic localisation of Clu/Clu1, capturing Clu/Clu1 in ribosomal structures will require significant work in the future.

      Reviewer #2 (Significance (Required)):

      CLUH has been studied in various publications, showing data very similar to that presented in this manuscirpt. However, the authors provide a comprehensive analysis on both yeast and fly CLUH. The strength of the manuscript is the combination of several elegant methods and genetically modified model systems in two species to elucidate the role of CLUH during the translation of specific mRNA. In my view through, the advancement of understanding the function of CLUH is limited.

      Although the authors work in yeast and DM, the results seem applicable to other species, including humans, and thus, the presented results will be of interest in a range of researchers working in the field of metabolic regulation and gene expression.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary: This study from Miller-Fleming et al. employs yeast and Drosophila as model systems to explore the function of the RNA-binding protein Clu1, which is involved in mitochondrial biogenesis. The first part of the manuscript characterizes so called "Clu1 granules", and their dependance from metabolic transitions. In particular, using yeast, they find a relocalisation of Clu1 upon starvation and several mitochondrial stress conditions. These granules are not stress granules, and are dissolved by RNAse and puromycin treatment. The second part of the study aims to understand the molecular function of the protein and its link to translation. The results confirm an evolutionary conserved role of Clu1 in binding mRNAs for mitochondrial proteins and in interacting with mitochondrial proteins, ribosomal components and polysomes. In addition, the authors claim that binding of Clu1 to RNA is enhanced when mRNAs are trapped in polysomes by treatment with cycloheximide (CHX), leading to the proposal that Clu1 binds mRNAs during active translation.

      Major comments:

      -The claim of Clu1 granule localization next to mitochondria (Figure 3) would be more convincing if any of the experiment would be quantified. Especially in the case of panel 3G in Drosophila egg chambers where there are a lot of mitochondria, one wonders whether the closeness to mitochondria is just random. Furthermore, mdv1-signal does not look very convincing, being blurry and not dotty as expected. Thus, the conclusion that Clu1 granules partially colocalization with site of fission appears premature.

      The claim that Clu/Clu1 granules are often found in close proximity to mitochondria was inferred from observations from multiple analyses from yeast (we looked at hundreds of cells in several different conditions) and flies, where it had already been demonstrated (Cox and Spradling, 2009). We agree that observations of the fly egg chambers are challenging due to the very high density of mitochondria (and other cellular components - see the new analysis of poly(A) mRNAs) in these highly active cells. These considerations motivated us to take the CLEM approach (in addition to investigating the membraneless nature), to gain a much higher resolution view of the localisation of the granules. This analysis unequivocally showed that the Clu granules were exactly juxtaposed to several mitochondria. It is noteworthy that even in the TEM images shown, there is ample cytoplasm in which the Clu granule could be located if the association with mitochondria was coincidental and all granules had mitochondria in close proximity.

      Regarding the possible coincidence of Clu1 with mitochondrial fission factors, as mentioned above for Reviewer 1, we tried a number of different approaches to improve this analysis but, ultimately, we were not able to generate sufficiently robust data to be convincing so have decided to remove this from the manuscript. Since this was an adjunct observation and not a major part of the story and has been discussed by others relating to fly Clu (PMID: 35332133), removal from the current manuscript does not impact the key conclusions of the study.

      Based on the ability of 1,6-hexanediol to dissolve the granules (Figure 4), the authors conclude that: "Clu1 foci have membraneless nature". As they correctly state in the discussion, treatment with 1,6-hexanediol can have other effects. I suggest to be more cautious with the conclusions or add additional experiments. Are the granules dynamics if using FRAP? Do they fuse?

      The inference that the Clu1 granules are membraneless organelles was not solely based on the observation that they disassemble upon 1,6-hexanediol treatment but was made in conjunction with the CLEM analysis that showed unambiguously that Clu granules are not associated with any detectable membrane, which is strong evidence that these granules are membraneless in nature. Indeed, as the reviewer mentioned, we are cautious in concluding they have been formed by liquid-liquid phase separation (LLPS) and we do acknowledge that 1,6-hexanediol can have other effects in cells. Nevertheless, following the reviewer's suggestion we have analysed Clu1 granule dynamics using FRAP, even though we are aware that FRAP is also not a definitive proof that a structure is formed by LLPS. The FRAP analysis, shown in new Figure 4C, D, revealed approximately 50% recovery over 10 min imaging timeframe. As discussed on page 13, this indicates a dynamic nature of these granules, but this dynamism can vary widely between different types of granules and even different proteins within the same granule. Further work is warranted to fully investigate the dynamic nature of Clu/Clu1 granule components.

      The experiment in which the granules are dissolved by treatment with RNAse is very interesting. However, per se this does not directly demonstrate that the granules contain mRNA. To state this the author should perform FISH experiments for example using a probe to detect poly-A.

      We thank the reviewer for this suggestion. We have performed poly(dT) FISH in egg chambers. Initial analysis showed that the fluorescence was diffuse and widely distributed, as expected for these highly active cells, but with no specific accumulation in Clu granules. Interestingly, we observed that treatment with RNase A, which we initially used to demonstrate probe specificity, revealed an enrichment of poly(A) RNAs in Clu granules. So, while treating the live egg chambers with RNase revealed that granules depend on RNA for their stability, treating fixed egg chambers revealed more directly the presence of RNAs in granules.

      These results have been added to the manuscript and shown in Fig. 5 and described on page 7.

      The authors show that puromycin prevents the granule formation before insulin addition in the fly. Are these results (upon RNAse treatment and puromycin treatment) recapitulated in the yeast system? The authors conclude that Clu1 formation depends on mRNAs being engaged in translation, but never show that the granules are site of active translation. More experiments in this direction (for example using puro-PLA of specific mRNAs) are missing and would clearly improve the manuscript.

      Thanks for this very interesting consideration. We agree that we have not formally shown that the Clu1 granules are sites of active translation. A major limitation to addressing this is that puromycin is not able to penetrate the yeast cell wall, so cannot be used for analysis of intact cells as would be needed in this case. We agree that this would be a welcome addition but is beyond the scope of the current study.

      The interactome of Clu1-neighbouring proteins (Figure 6) is interesting and a valuable addition to data in other organisms. I am wondering why the authors have not used as a control a cytosolic BirA-GFP, which would have been the right control for this experiment, especially since GFP tends to form aggregates.

      We thank the reviewer for this comment. With hindsight, we agree that a cytosolic BirA-GFP would have been a better control. However, we are confident in our results for the following reasons:

      1. The levels of GFP obtained from Clu1-GFP expression are low, and under these conditions, we observed no evidence of GFP aggregation. Even in experiments where GFP is overexpressed from a high-copy 2µ plasmid under a strong promoter, we do not detect aggregation. Aggregation is not a concern in our experimental setup.
      2. Our conclusions are not solely based on the interactome analysis (BioID) but are supported by complementary findings. Specifically, several proteins identified in the proximity to Clu1 in the BioID analysis showed reduced translation in Clu1 knockout cells, and their corresponding mRNAs were found to interact with Clu1 during translation. These complementary results from independent techniques provide strong evidence for Clu1's role and validate the findings of the interactome analysis. Given this robust and complementary dataset, having BirA as a control strain was sufficient to validate our conclusions.

      Figure 7B: The log 2 FC for the changed proteins are in many cases small, implying that the difference in translation for these proteins is not so large. For this reason, it is relevant to know how was the statistical significance calculated for these MS measurements. In the supplementary Tables and in Fig 7B, a p value is indicated and it is not clear if this is a simple p value or an adjusted p value (FDR or q value). If not shown, I recommend showing the adjusted p value, so that one can have an idea of the solidity of the data and the claim. Again, this is an important piece of evidence, since the authors base on this experiment the conclusion that Clu1 controls translation of these mRNAs.

      Thanks for this comment. We have now included the q-value in the supplementary table.

      Minor comments:

      -Figure 1: The change in Clu1 localisation in post-diauxic phase or upon changing of the medium is evident from the images shown. However, it seems that the experiment has been performed only once (the same for Figure 2). Is this the case? An important information would be to show the expression levels of Clu1-GFP in the different conditions. Does recruitment of CLU1 to granules associate to increased expression levels?

      The experiments shown in figures 1 and 2 were performed independently at least three times, as stated in the figure legends. The numbers shown are indicative values from one of the replicate experiments. This has now been added to the figure legends.

      We agree that providing the information regarding the expression levels of Clu1-GFP is important to address whether the recruitment of Clu1 to granules is associated with changes in its abundance. To this end, we have performed an additional experiment to quantify Clu1-GFP levels under the conditions where Clu1 is diffuse (log growth phase in glucose-containing media) and when Clu1 is in granules (sodium azide treatment).

      These results have been added to the manuscript and shown in Fig. S2 and described on page 4.

      Figure 2 A-B. The authors claim that the only stressor capable of inducing Clu1 granules formation alone is inhibition of complex IV activity via sodium azide treatment. Other mitochondrial stresses like CCCP treatment or OA treatment are efficient only when combined to starvation. It should be mentioned that sodium azide treatment is not only capable of inhibiting complex IV but has also uncoupling function.

      Thanks for this comment. We have now mentioned this (p4).

      Figure 2 D-E: investigation of colocalization with Bre5 would help to understand how similar the yeast Clu1 granules are compared to the mammalian CLUH granules (Pla-Martin et al., 2020).

      This is an interesting suggestion and one that we also considered, but with limited time and resources we were not able to pursue this line of inquiry as well.

      Figure 8. This figure summarizes one of the most novel pieces of data about Clu1, the interaction with mRNAs via the ribosome. The way how panel A-C are represented is however a bit misleading. The Y axis in Figure B and C has the same amplitude as the one in A. Therefore, potential differences in Clu1-RNA pull-down in presence of EDTA or puromycin cannot be assessed. It is true that in presence of CHX there is much more pulled down RNA, but one cannot judge from these panels if there is any difference between Clu1 targets and controls also in the other conditions. The graphs should be modified and statistics added.

      We appreciate the reviewer's feedback regarding the presentation of the RIP-qPCR data in Fig. 8. Based on the comments, we have revised how the results are represented, improved the normalisation of the data and added statistical analysis.

      First, it is worth clarifying that the presentation of the original charts was done specifically to highlight the huge differences between RNA-pulldown in CHX versus disrupted ribosomes. It is also important to note that these RIP experiments were performed simultaneously under identical experimental conditions, so any differences lie in the treatments applied. To improve cross-comparison between treatments we have now incorporated an additional normalisation step. We normalised the enrichment levels of each mRNA tested against the non-specific binding observed with the negative control housekeeping genes (UBC6 and TAF10). This ensures that differences in bead loss or other technical variations are accounted for.

      We now show the comparison of the six positive hits and two negative controls normalised as described above, on the same scale (Fig. 8A). We now also present the relative effects of the three conditions (CHX, EDTA, and puromycin) within the same graph for each mRNA tested (Fig. 8B). This format enables direct comparison of Clu1 target mRNA enrichment and two negative controls across treatments, which is the relevant comparison for testing the hypothesis of ribosome-dependent interactions. We have adjusted the Y-axis scaling for each mRNA, as requested by the reviewer, and added statistical comparisons. For clarity, the data shown in Fig. 8A are also represented in the panels of Fig. 8B (CHX). We have amended the text appropriately and hope that these changes improve the comparisons between treatments and more readily demonstrate that Clu1 target enrichment is lost upon ribosome disassembly, either by EDTA or by puromycin.

      In addition, RNAse treatment in panel L does not seem to have really worked.

      These samples were cross-linked prior to treatment to preserve the transient interaction of Clu1 with the ribosome, hence, the normal dramatic effect of RNase to collapse the polysomes is much less pronounced. Nevertheless, the purpose of this experiment was to monitor whether Clu1 co-migrated with ribosomes, which it does.

      The authors should cite Vornlocher et al. (PMID: 10358023), who were the first to implicate Clu1 (Tif31) with translation.

      Thank you for this prompt. We have now added a comment on this in the Discussion (page 13).

      References 21 and 22 are the same.

      Thanks. This has been fixed.

      Reviewer #3 (Significance (Required)):

      The data reported in this manuscript are valuable, because they confirm an evolutionary conserved role of Clu1 in binding mRNAs for mitochondrial proteins and regulating their translation. It is also interesting that in yeast, similar to Drosophila and mammalian cells, Clu1 can form granular structures upon metabolic rewiring. A limitation of the study is that direct experiments to support the claim that Clu1 concentrates ribosomes engaged in translation are not provided. Furthermore, it is not clear what is the functional role of the Clu1 granules, since the proximity interactome and the binding of Clu1 to the polysomes is not affected by treatments that dissolve or stimulate granule formation.

      The study is of interest to a general cell biology audience.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This study from Miller-Fleming et al. employs yeast and Drosophila as model systems to explore the function of the RNA-binding protein Clu1, which is involved in mitochondrial biogenesis. The first part of the manuscript characterizes so called "Clu1 granules", and their dependance from metabolic transitions. In particular, using yeast, they find a relocalisation of Clu1 upon starvation and several mitochondrial stress conditions. These granules are not stress granules, and are dissolved by RNAse and puromycin treatment. The second part of the study aims to understand the molecular function of the protein and its link to translation. The results confirm an evolutionary conserved role of Clu1 in binding mRNAs for mitochondrial proteins and in interacting with mitochondrial proteins, ribosomal components and polysomes. In addition, the authors claim that binding of Clu1 to RNA is enhanced when mRNAs are trapped in polysomes by treatment with cycloheximide (CHX), leading to the proposal that Clu1 binds mRNAs during active translation.

      Major comments:

      • The claim of Clu1 granule localization next to mitochondria (Figure 3) would be more convincing if any of the experiment would be quantified. Especially in the case of panel 3G in Drosophila egg chambers where there are a lot of mitochondria, one wonders whether the closeness to mitochondria is just random. Furthermore, mdv1-signal does not look very convincing, being blurry and not dotty as expected. Thus, the conclusion that Clu1 granules partially colocalization with site of fission appears premature.
      • Based on the ability of 1,6-hexanediol to dissolve the granules (Figure 4), the authors conclude that: "Clu1 foci have membraneless nature". As they correctly state in the discussion, treatment with 1,6-hexanediol can have other effects. I suggest to be more cautious with the conclusions or add additional experiments. Are the granules dynamics if using FRAP? Do they fuse?
      • The experiment in which the granules are dissolved by treatment with RNAse is very interesting. However, per se this does not directly demonstrate that the granules contain mRNA. To state this the author should perform FISH experiments for example using a probe to detect poly-A.
      • The authors show that puromycin prevents the granule formation before insulin addition in the fly. Are these results (upon RNAse treatment and puromycin treatment) recapitulated in the yeast system? The authors conclude that Clu1 formation depends on mRNAs being engaged in translation, but never show that the granules are site of active translation. More experiments in this direction (for example using puro-PLA of specific mRNAs) are missing and would clearly improve the manuscript.
      • The interactome of Clu1-neighbouring proteins (Figure 6) is interesting and a valuable addition to data in other organisms. I am wondering why the authors have not used as a control a cytosolic BirA-GFP, which would have been the right control for this experiment, especially since GFP tends to form aggregates.
      • Figure 7B: The log 2 FC for the changed proteins are in many cases small, implying that the difference in translation for these proteins is not so large. For this reason, it is relevant to know how was the statistical significance calculated for these MS measurements. In the supplementary Tables and in Fig 7B, a p value is indicated and it is not clear if this is a simple p value or an adjusted p value (FDR or q value). If not shown, I recommend showing the adjusted p value, so that one can have an idea of the solidity of the data and the claim. Again, this is an important piece of evidence, since the authors base on this experiment the conclusion that Clu1 controls translation of these mRNAs.

      Minor comments:

      • Figure 1: The change in Clu1 localisation in post-diauxic phase or upon changing of the medium is evident from the images shown. However, it seems that the experiment has been performed only once (the same for Figure 2). Is this the case? An important information would be to show the expression levels of Clu1-GFP in the different conditions. Does recruitment of CLU1 to granules associate to increased expression levels?
      • Figure 2 A-B. The authors claim that the only stressor capable of inducing Clu1 granules formation alone is inhibition of complex IV activity via sodium azide treatment. Other mitochondrial stresses like CCCP treatment or OA treatment are efficient only when combined to starvation. It should be mentioned that sodium azide treatment is not only capable of inhibiting complex IV but has also uncoupling function.
      • Figure 2 D-E: investigation of colocalization with Bre5 would help to understand how similar the yeast Clu1 granules are compared to the mammalian CLUH granules (Pla-Martin et al., 2020).
      • Figure 8. This figure summarizes one of the most novel pieces of data about Clu1, the interaction with mRNAs via the ribosome. The way how panel A-C are represented is however a bit misleading. The Y axis in Figure B and C has the same amplitude as the one in A. Therefore, potential differences in Clu1-RNA pull-down in presence of EDTA or puromycin cannot be assessed. It is true that in presence of CHX there is much more pulled down RNA, but one cannot judge from these panels if there is any difference between Clu1 targets and controls also in the other conditions. The graphs should be modified and statistics added. In addition, RNAse treatment in panel L does not seem to have really worked.
      • The authors should cite Vornlocher et al.. ( PMID: 10358023), who were the first to implicate Clu1 (Tif31) with translation.
      • References 21 and 22 are the same.

      Significance

      The data reported in this manuscript are valuable, because they confirm an evolutionary conserved role of Clu1 in binding mRNAs for mitochondrial proteins and regulating their translation. It is also interesting that in yeast, similar to Drosophila and mammalian cells, Clu1 can form granular structures upon metabolic rewiring. A limitation of the study is that direct experiments to support the claim that Clu1 concentrates ribosomes engaged in translation are not provided. Furthermore, it is not clear what is the functional role of the Clu1 granules, since the proximity interactome and the binding of Clu1 to the polysomes is not affected by treatments that dissolve or stimulate granule formation. The study is of interest to a general cell biology audience.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      In this manuscript the authors study the Clustered mitochondrial proteins Clu of Drosophila melanogaster and Clu1 of Saccharomyces cerevisiae, two homologues of the mammalian protein CLUH. They show in compelling microscopy analysis that both proteins form granules. This was the case for flies fed on yeast paste after starvation and in yeast in post-diauxic phase, in respiratory media or during mitochondrial stress. They show that these granules are found in proximity to mitochondria and that they behave like liquid-liquid-phase separated condensates. They show by co-staining for P-bodies and stress granules that Clu1-granules are distinct from these RNA granules. Furthermore, they found that the formation required active translation. In the second part, they show that Clu1 interacts with ribosomal and mitochondrial proteins by BioID. The deletion of Clu1 leads to slightly impaired growth on media containing Ethanol as a carbon source. They find that nascent polypeptides of some mitochondrial precursor proteins are decreased in the deletion of Clu1 and conclude that Clu1 regulates translation of these proteins. Using RNA immunoprecipitation of Clu1-GFP in presence of cycloheximid, EDTA and puromycin. The mRNAs of nuclear-encoded mitochondrial proteins found to be interacting with Clu1 were purified in conditions when the ribosomes are intact and the RNAs showed no interaction when ribosomes were disassembled. They show in sucrose gradients that Clu1 co-migrates with polysomes independent of its distribution state or carbon source. However, when cells are grown in conditions of granule formation, then polysomes and Clu1 run less deeply into the gradient. Form these data, the authors conclude that Clu/Clu1 regulates the translation of nuclear-encoded mitochondrial proteins.

      Major comments:

      • The authors state that Clu1 is regulating translation during metabolic shifts. However, it is not clear what the real impact on mitochondrial function is. They show that there is a minor growth defect on ethanol media when CLU1 is deleted. However, if Clu1 is necessary mainly for adaptation, the phenotype will be strongest observed in conditions where cells switch carbon sources. Growth curves would be suitable in which the lag-phase of yeast cells precultured either in glucose or glycerol switched to media of different carbon sources (glucose to glycerol or glycerol to glucose) are measured. One would expect that the deletion mutant shows a longer lag-phase compared to the wild type when shifted from glucose to glycerol media. In line with this, how different is the mitochondrial proteome of the WT and the mutant? Do hits of the BioID, RIP and Punch-P experiments change at steady state or during metabolic shifts? Either proteomics of isolated mitochondria or western blots of whole cells or isolated mitochondria of WT and the deletion mutant grown in conditions of Clu1-granule formation or no granules for the hits could answer this question.
      • The authors analyze RNAs bound in polysomes to assess translation efficiency. Translation efficiency is usually calculated by the fraction of RNA bound by ribosomes to the total RNA amount of an RNA species. Thus, doing RT-qPCR from whole cells would be necessary to assess if the occupancy of ribosomes on the transcripts is due to changes in RNA abundance or other regulatory pathways and would help to further assess what causes the observed changes.

      Optional:

      • The authors show a co-localization of Clu/Clu1 with mitochondrial fission factors and conclude that the granules appear likely near fission sites. Indeed, CLUH has been implied in the past to play a role in mitochondrial fission (Yang, H., Sibilla, C., Liu, R. et al. Clueless/CLUH regulates mitochondrial fission by promoting recruitment of Drp1 to mitochondria. Nat Commun 13, 1582 (2022). https://doi.org/10.1038/s41467-022-29071-4). Thus, are fission sites required for Clu-granule localizations? What is the role of the mitochondrial network integrity for the granule distribution? Expressing Clu-GFP/Clu1-GFP in cells depleted for the fission factors would provide information on that.
      • The author assess convincingly that Clu1 interacts with ribosomes and runs with polysomal fractions. However, how it actually regulates translation is not clear. To answer this question, selective ribosomal profiling would be necessary. The authors have established conditions which would be suitable for the experiment. They could use crosslinking and sucrose cushions to IP ribosomes with Clu1-GFP bound to be used for ribosomal profiling. However, this experiment is quite time-intensive (3-4 months) and expensive, thus, an optional suggestion.

      Minor comments:

      Fig1: B, C, please add scale bars into the zoom ins.

      Fig 2 would profit from inlets of zoom ins to visualize the distribution better.

      Fig.3: Panel C does not really add much information. I would rather remove it or put it into supplements and therefore show a zoom of Panel E with a line plot showing the rings. It is not clear from the represented images where the rings are formed.

      Fig.3 panel F: Max projections are not appropriate to show colocalization as they can lead to false-positive overlaps. Just remove the max projections.

      References 21 and 22 are the same.

      Significance

      This manuscript shows in a convincing way that Clu and Clu1 form RNA granules and that Clu1 interacts with ribosomes. It is written in a clear way and the figures support the conclusions drawn in the text. The finding that Clu/Clu1 is important for metabolic adaptation has not been shown in fly or yeast to my knowledge. It is in line with findings for the mammalian homologue CLUH. Thus, the findings are supported by earlier work. This study is of value for a broader audience of the basic research field, especially of the mitochondrial and RNA granule field, as it supports the idea of post-transcriptional regulation of nuclear-encoded mitochondrial protein gene expression for dynamic adaptation of mitochondrial function. The conditions when Clu granules form is studied in detail, followed up by identification of target RNAs and interaction partners. Though the interaction of Clu1 with ribosomes is shown in a compelling way, a detailed mechanism of the function of Clu/Clu1 is missing and would require more experiments. Thus, even though a detailed mechanism is missing, the study does expand on our understanding of Clu/Clu1 in regulating mitochondrial biogenesis and is therefore of high interest of the mitochondrial field.

      Expertise: mitochondria, yeast, RNA granules, mitochondrial biogenesis, next-generation sequencing, fluorescence microscopy

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer 1

      1. The structures of the lamina propria of murine colon mucosa are nicely described. However, in the introduction of the manuscript the structures of fibroblasts, myofibroblasts and ECM are not described. The structures of the lamina propria of murine colon mucosa should be well described in the induction and discussed in the discussion.

      We will revise the Introduction to include a more detailed description of fibroblasts, myofibroblasts, and the ECM within the lamina propria of the murine colon mucosa. We will also expand the Discussion section to address these structures in the context of our findings.

      2. The UMAP plot suggests potential heterogeneity within Cluster 1, raising questions about whether the chosen clustering resolution (e.g., parameter settings in Seurat's "FindClusters") optimally captures subpopulations.

      We appreciate this insightful observation. We agree that the UMAP plot suggests potential heterogeneity within Cluster 1 and that the current clustering resolution may not fully capture underlying subpopulations. We could revisit the clustering parameters and explore reclustering at a lower resolution. However, we note that lowering the resolution often increases the total number of clusters, which may introduce noise and complicate biological interpretation. To more precisely dissect the heterogeneity within Cluster 1 while minimizing artificial subdivisions, we propose to perform subclustering specifically within Cluster 1.

      3. Some subpopulations express marker genes characteristic of pericytes and smooth muscle cells (e.g., Desmin). How did the authors ensure proper discrimination between fibroblasts and these other cell types?

      We thank the reviewer for this important comment. We acknowledge the challenge in distinguishing between fibroblasts, pericytes, and smooth muscle cells (SMCs) based solely on single-cell RNA sequencing data, particularly given the overlapping expression of markers such as Desmin.

      Pericytes vs. Myofibroblast/SM-Pericyte-Like Fibroblasts: Due to the highly similar transcriptional profiles of pericytes and pericyte-like fibroblasts, scRNA-seq alone does not allow for unambiguous discrimination between these populations. However, we were able to distinguish them based on morphology and spatial localization observed in high-resolution imaging. Notably, we identified a population of large (50–150 µm), elongated myofibroblast/SM-pericyte-like fibroblasts that, unlike typical pericytes, are not positioned directly on blood vessels but are distributed around the crypts. Some of these cells also appear to contact both blood vessels and the muscle layer, raising the possibility that they represent a specialized pericyte-like population. While their precise function remains uncertain, we agree that further characterization is warranted. To address this, we propose additional staining for canonical pericyte markers to help clarify their identity and spatial relationship to the vasculature.

      Smooth Muscle Cells vs. Myofibroblast/SM-Pericyte-Like Fibroblasts: We are confident that the analyzed fibroblast populations do not include smooth muscle cells. The mucosa was carefully dissected and separated from the underlying smooth muscle layer prior to RNA sequencing, which was performed exclusively on the mucosal compartment. Therefore, contamination by SMCs is unlikely.

      4. The manuscript also did not show the distribution and structures of ECM. It is better to show the relationships of fibroblasts and myofibroblasts with in the lamina propria of murine colon mucosa.

      In the supplementary material we show distribution of main ECM proteins such as Laminin, Collagen I, Collagen IV, and Fibronectin1.

      5. The integration with previously published datasets lacks clear connection to the authors' own findings. A more detailed comparison and discussion of how these integrated analyses relate to the newly generated data would improve the manuscript's coherence.

      We thank the reviewer for this helpful comment. Our RNA-seq dataset shows strong consistency with previously published datasets, supporting the robustness of our fibroblast isolation and transcriptional profiling strategy. We agree that a more explicit integration and comparison will improve the manuscript. We have now revised the Discussion to better highlight the spatial localization and organization of the different fibroblast populations identified in our study, with an emphasis on the duality of their functions. In particular, we discuss how our findings extend existing datasets by providing spatial context and functional insights that were not previously resolved. These comparisons underscore the novelty and value of our integrated approach.

      6. While the authors focus on colonic mucosa, the integrated public datasets include data from both colon and small intestine. Were these distinct tissue sources accounted for in the analysis? Clarification on this point is necessary to ensure the validity of comparisons.

      We thank the reviewer for raising this important point. Among the integrated datasets, only one—McCarthy et al. (GEO GSE130681)—originates from the small intestine; all others, including our own, were derived from the colon. Specifically, we used the following datasets:

      • GEO GSE113043 (Degirmenci et al., PMID: 29875413) – Colon (1 sample)
      • GEO GSE114374 (Kinchen et al., PMID: 30270042) – Colon (3 samples)
      • GEO GSE130681 (McCarthy et al., PMID: 32884148) – Small intestine (2 samples)
      • GEO GSE142431 (Roulis et al., PMID: 32322056) – Colon (5 samples) We selected these datasets based on their relevance to fibroblast biology, particularly those that specifically focused on mural fibroblasts. The inclusion of the McCarthy dataset was guided by its high-quality profiling of fibroblast populations and its utility in expanding our comparative framework.

      Importantly, review by McCarthy et al. (https://doi.org/10.1038/s41556-020-0567-z) reported minimal differences in fibroblast clustering between the small intestine and colon. Our integrated analysis supports this conclusion: fibroblasts from both regions consistently co-cluster, indicating a high degree of transcriptional similarity. This suggests that inclusion of the small intestine dataset did not bias or compromise the integrity of our colon-focused findings.

      Nevertheless, our primary emphasis remains on the colon, particularly due to the relative scarcity of studies addressing fibroblast localization and morphology in this tissue compared to the small intestine. Additionally, at the time of analysis, the datasets we used represented the most comprehensive publicly available single-cell profiles of intestinal mural fibroblasts.

      7. Many aspects of the described fibroblast subpopulations, including their single-cell expression profiles and physiological functions, appear to have been previously reported. The authors should more explicitly highlight the novel contributions of their work to advance our understanding of intestinal fibroblast biology.

      We thank the reviewer for this important observation. While it is true that aspects of fibroblast heterogeneity have been previously reported, our study provides several novel contributions that advance the current understanding of intestinal fibroblast biology. We will revise the manuscript to more explicitly highlight the following key findings:

      1. Functional distinction between ECM production and contractility: Our integrative analysis reveals a clearer separation between fibroblast subpopulations based on their functional specializations—specifically, ECM production versus contractile properties. This distinction has not been well delineated in prior studies and is particularly relevant in the context of inflammatory bowel disease, where fibrosis remains a major complication. Our findings may help identify specific fibroblast subtypes that contribute to pathological remodeling.
      2. Detailed characterization of fibroblast localization and morphology: We provide new spatial insights by demonstrating the lack of overlap between GFP⁺ and CD34⁺ basket cell populations in vivo. Additionally, we highlight the presence of large, elongated myofibroblasts and pericyte/smooth muscle-like fibroblasts that span from the vasculature to the underlying muscle layer—morphologies and arrangements that have not been thoroughly described before. These observations offer a more refined anatomical and functional framework for understanding fibroblast roles within the colonic mucosa. We will revise both the Results and Discussion sections to more explicitly emphasize these novel contributions.

      Reviewer 2:

      Major points:

      1. The order of the present manuscript should be reconstructed. The main message is in the discussion part. It is worth bringing it to the front.

      We appreciate this thoughtful suggestion. We agree that the main message of the manuscript is currently more prominent in the Discussion section, and bringing it forward would improve the overall clarity and impact of the work. We will restructure the manuscript accordingly to ensure that the key findings and their significance are introduced earlier and more clearly communicated throughout the text.

      2. Figure 1A, the authors employed the "vimentin+" filter to distinguish between fibroblasts and other cell types in the single-cell RNA sequencing (scRNA-seq) data. However, they did not provide a rationale for this choice in the manuscript. It would be worthwhile to consider the incorporation of an "Epcam-" or "E-cadherin-" filter as well, given the potential impact on the subsequent analysis's significance. Notably, the original UMAP plot generated before the application of the "vimentin+, Krt8-" filter, is absent from both the main figures and the supplementary data. The availability of this data is crucial for the identification of specific fibroblast populations among the sorted cells.

      The rationale for using the “vimentin⁺” filter is based on its long-standing use as a canonical marker for fibroblasts and mesenchymal cells in both developmental and adult tissues, including the intestinal lamina propria. Vimentin has consistently been used to distinguish fibroblasts from epithelial and immune cell populations in scRNA-seq studies.

      Regarding the exclusion of epithelial cells, we chose to apply a “Krt8⁻” filter instead of “Epcam⁻” or “E-cadherin⁻”, as Krt8 is a highly specific marker for colonocytes in the intestinal epithelium. We found this to be a reliable criterion for excluding epithelial cells in our dataset. We will revise the Methods section to clearly explain this rationale and selection.

      Additionally, we agree that the original UMAP plot—prior to the application of the “vimentin⁺, Krt8⁻” filter—would provide valuable context. We will include this plot in the supplementary figures to allow better visualization of the initial clustering and to support the identification of fibroblast populations among the sorted cells.

      3. Page4 line 12, the authors claim that they did not find specific markers for the cluster 1, despite the fact that cluster 1 is distinctly separated from clusters 0, 5, 4 and 3 in figure 1B. Furthermore, the cells in the cluster 1 do not cluster together based on the resolution applied in the present manuscript. The authors claim that cells in cluster 1 are in a transition state, and therefore, they did not include them in the functional analysis. However, later they claim that the cluster 1 are multipotent progenitors, which is not clear.

      We appreciate the reviewer’s careful reading and valuable critique. We acknowledge the confusion regarding the identity and interpretation of Cluster 1 and would like to clarify our reasoning and planned revisions.

      When identifying marker genes using Seurat’s FindMarkers() or FindAllMarkers() functions, the output highlights genes that are significantly enriched in a given cluster relative to others—but these genes are not necessarily uniquely or exclusively expressed in that cluster. This is the case with Cluster 1: although it is spatially distinct in the UMAP (Figure 1B), many of the top-ranked marker genes are also expressed in other clusters, albeit at lower levels. As a result, defining Cluster 1 based solely on unique gene expression signatures is challenging, and we initially interpreted this cluster as a “transitional population” due to its ambiguous marker profile.

      However, we acknowledge the apparent inconsistency in referring to Cluster 1 as both "in transition" and "multipotent progenitors." We will clarify our interpretation and terminology in the revised manuscript. Specifically, we will refer to Cluster 1 as a __ transitory population__, and provide a more nuanced discussion of its potential roles.

      As mentioned in our response to Reviewer 1 (Comment 2), we will also perform reclustering within Cluster 1 to better explore its internal heterogeneity. Additionally, we will now include Cluster 1 in the functional enrichment analysis to further assess its biological relevance and contribution to fibroblast diversity.

      4. Figure 1E and F, authors only use gene ontology to define the functions of different clusters of fibroblasts which constrain the present manuscript at the hypothesis stage. To substantiate the claims, it is imperative to conduct more precise experiments. At the very least, co-staining with cluster marker genes and candidate genes identified in GO analysis is necessary. In the event that antibodies are not available, RNA scope can serve as a viable alternative. Further functional experiments will be required to prove their unique function. For instance, the identification of specific cell surface markers to isolate different clusters of fibroblasts for coculture with intestinal organoids in vitro can be facilitated by scRNA-seq data.

      We appreciate the reviewer’s insightful suggestions regarding the functional validation of GO-based predictions.

      While we recognize that RNAscope is a valuable alternative when antibodies are unavailable, its use requires much thinner tissue sections than those employed in our current imaging approach. Our analysis is based on thicker sections, which preserve the 3D architecture and spatial relationships of fibroblasts within the colonic mucosa—an essential aspect of our study. Transitioning to thinner sections would compromise our ability to visualize these cells in their full anatomical context.

      To suppor the GO analysis with experimental validation, we will include __co-staining for cluster __marker genes along with representative candidate genes____ identified through GO analysis to better substantiate the predicted functions of different fibroblast clusters.

      We acknowledge the importance of functional studies such as co-culture assays with intestinal organoids, and indeed, several such experiments have been reported by other groups. Additionally, isolating specific fibroblast populations via FACS sorting for in vitro studies presents practical challenges, including low cell survival rates, which limit the feasibility of downstream functional assays. Thus, we believe that these types of experiments are beyond the scope of the current manuscript. We hope that our integrative approach and spatial validation will serve as a valuable foundation for future functional investigations into fibroblast biology.

      5. DAPI staining is absent in the majority of the images, which complicates the task of distinguishing cells from different clusters. Multiplex staining is necessary to show all specific markers: EGFP, SMA, CD34, Desmin, Pdgfra, Pil6, and Clu, regarding six clusters in one section or image.

      We appreciate the reviewer’s comment and the emphasis on the importance of cellular context in multiplex imaging.

      We acknowledge that DAPI staining is absent in some of the presented images, which may limit nuclear visualization and make it more challenging to distinguish cell boundaries. However, to achieve high-content multiplexing, we employed protocols allowing up to 5–6 fluorophores per section, as previously demonstrated by Chikina et al. (Cell, 2020). Due to spectral limitations and the risk of fluorophore overlap and signal bleed-through, we occasionally excluded DAPI to allocate the 405 nm channel for markers of greater relevance to our study. In these cases, Tomato or EGFP signals served as effective surrogates for cellular localization, as they label cell membranes, providing sufficient morphological context.

      Regarding multiplex staining for Pi16 and Clu, we tested several commercially available antibodies, but unfortunately, none yielded specific or reproducible signals in our hands. As a result, we were unable to include these markers reliably in our multiplex panels.

      6. Figure 4, the authors utilize supervised methods to execute trajectory analysis, defining cluster 1 as the initial point based on its hybrid expression state of genes. This assertion, however, lacks sufficient substantiation, as cluster 1 could also function as a transition point, not necessarily an initial point.The data presented in the current manuscript is inadequate to support the conclusion of multipotency in cluster 1.To substantiate these claims, the authors should employ additional evidence, such as SENIC analysis, to demonstrate the expression of specific transcription factors for each lineage along the trajectory. In order to substantiate the assertion that cluster 1 is a multipotent progenitor capable of differentiating into other specific populations, such as fibroblasts, further functional experiments are required. These experiments could include isolating the population in question and conducting a differentiation test in vitro or tracking the population's response to wound healing.The absence of immunofluorescence images or gene signatures for this cluster in the study is a cause of confusion for the reader.

      We thank the reviewer for this thoughtful and constructive comment. We agree that Cluster 1 could plausibly represent either an initial or transitional state. In trajectory analysis, the starting point must be defined, and we selected Cluster 1 due to its hybrid gene expression profile—exhibiting low-level expression of marker genes associated with multiple other clusters—suggesting a less differentiated or “primed” state. However, we fully acknowledge that this assignment does not preclude its interpretation as a transitional population, and we will revise the manuscript text to reflect both possibilities more clearly and cautiously.

      We appreciate the suggestion to perform __SENIC __analysis (https://www.nature.com/articles/nmeth.4463). This algorithm aims to identify gene regulatory networks and their associated transcription factors for each cell cluster. While interpreting such analysis can be quite challenging, it could provide interesting insights and thus we propose to apply it.

      Regarding functional validation, we agree that experiments such as isolation and in vitro differentiation assays, or in vivo lineage tracing during injury models, would offer more definitive insights. However, as we noted, the lack of specific surface markers currently makes it challenging to isolate Cluster 1 by FACS for such assays.

      We also acknowledge the reviewer’s concern about the __lack of immunofluorescence images or distinct gene signatures__for Cluster 1. We will revise the text to clearly communicate that this limitation.

      7. Figure 5B, the data set of Kinchen et al is from human samples. Is it relevant and significant to merge murine data and the human data together?

      We appreciate the reviewer’s attention to detail. To clarify, the dataset from Kinchen et al__.__ used in Figure 5B refers exclusively to their murine samples, not the human data. Only murine datasets were included in our analysis to ensure consistency and biological relevance. Therefore, merging the Kinchen murine data with other murine datasets in Figure 5B is both appropriate and justified.

      We will revise the figure legend and Methods section to clearly state that only mouse data were used throughout the analysis.

      Minor points:

      8. The chapter entitled "Subepithelial Fibroblast Do Not Proliferate" is not necessary to be an independent chapter. It can be considered a fusion chapter, as it is combined with Chapter 2. Further experiments such as Brdu or Edu are needed to strengthen the current hypothesis.

      We agree with the reviewer that this section does not require a standalone chapter and would be better integrated into Chapter 2. We will revise the manuscript accordingly.

      In addition, to further support our observations regarding fibroblast proliferation, we will perform a 2-hour EdU pulse-chase experiment and include the results in the revised manuscript. We believe this will strengthen our conclusions and provide more direct evidence regarding the proliferative status of subepithelial fibroblasts.

      1. DAPI staining is absent in majority of the images.

      indeed this is a limitation of the unmixing technique we use.

      10. Put the number of each cluster next to the arrow in all the IF images.

      We will do this.

      11. Immunofluorescent staining of cluster markers identified in the previous studies should be included in the present study such as: CD81, FoxL1, Myh11, Pdgfrb and Gli1.

      We will include those markers.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      The summitted article entitle "Intestinal fibroblast heterogeneity: unifying RNA-seq studies and introducing consensus-driven nomenclature" by Glisovic et al., identify six distinct populations of fibroblast with unique molecular signatures, spatial localization and specific function in mouse colon using scRNA-seq. Moreover, with different bioinformatic methods, they show the potential differentiation trajectories of fibroblast in mouse colon mucosa. Finally, they propose a standardized nomenclature for colonic fibroblast by integrating the data of this manuscript and the four published scRNA-seq data of mouse and human intestinal colonic fibroblast. Several similar studies cited by the authors in the present manuscript have been done and the different populations of colonic fibroblasts have been well characterized in these previous studies. Here the authors utilized another mouse model, the "aSMAcreERT2" to target the murine colonic fibroblast population which is novel compared to previous published data. Although the authors have provided multiple bioinformatic analyses and immunofluorescent staining of certain markers to support their conclusions, many points are overclaimed or not clear based on the data of the present manuscript, especially for the differentiation trajectories and unique function of different clusters of subepithelial colonic fibroblast. Functional experiment data are absent from the present manuscript.

      Major comments

      • The order of the present manuscript should be reconstructed. The main message is in the discussion part. It is worth bringing it to the front.
      • Figure 1A, the authors employed the "vimentin+" filter to distinguish between fibroblasts and other cell types in the single-cell RNA sequencing (scRNA-seq) data. However, they did not provide a rationale for this choice in the manuscript. It would be worthwhile to consider the incorporation of an "Epcam-" or "E-cadherin-" filter as well, given the potential impact on the subsequent analysis's significance. Notably, the original UMAP plot generated before the application of the "vimentin+, Krt8-" filter, is absent from both the main figures and the supplementary data. The availability of this data is crucial for the identification of specific fibroblast populations among the sorted cells.
      • Page4 line 12, the authors claim that they did not find specific markers for the cluster 1, despite the fact that cluster 1 is distinctly separated from clusters 0, 5, 4 and 3 in figure 1B. Furthermore, the cells in the cluster 1 do not cluster together based on the resolution applied in the present manuscript. The authors claim that cells in cluster 1 are in a transition state, and therefore, they did not include them in the functional analysis. However, later they claim that the cluster 1 are multipotent progenitors, which is not clear.
      • Figure 1E and F, authors only use gene ontology to define the functions of different clusters of fibroblasts which constrain the present manuscript at the hypothesis stage. To substantiate the claims, it is imperative to conduct more precise experiments. At the very least, co-staining with cluster marker genes and candidate genes identified in GO analysis is necessary. In the event that antibodies are not available, RNA scope can serve as a viable alternative. Further functional experiments will be required to prove their unique function. For instance, the identification of specific cell surface markers to isolate different clusters of fibroblasts for coculture with intestinal organoids in vitro can be facilitated by scRNA-seq data.
      • DAPI staining is absent in the majority of the images, which complicates the task of distinguishing cells from different clusters. Multiplex staining is necessary to show all specific markers: EGFP, SMA, CD34, Desmin, Pdgfra, Pil6, and Clu, regarding six clusters in one section or image.
      • Figure 4, the authors utilize supervised methods to execute trajectory analysis, defining cluster 1 as the initial point based on its hybrid expression state of genes. This assertion, however, lacks sufficient substantiation, as cluster 1 could also function as a transition point, not necessarily an initial point. The data presented in the current manuscript is inadequate to support the conclusion of multipotency in cluster 1.To substantiate these claims, the authors should employ additional evidence, such as SENIC analysis, to demonstrate the expression of specific transcription factors for each lineage along the trajectory. In order to substantiate the assertion that cluster 1 is a multipotent progenitor capable of differentiating into other specific populations, such as fibroblasts, further functional experiments are required. These experiments could include isolating the population in question and conducting a differentiation test in vitro or tracking the population's response to wound healing. The absence of immunofluorescence images or gene signatures for this cluster in the study is a cause of confusion for the reader.
      • Figure 5B, the data set of Kinchen et al is from human samples. Is it relevant and significant to merge murine data and the human data together?

      Minor comments

      • The chapter entitled "Subepithelial Fibroblast Do Not Proliferate" is not necessary to be an independent chapter. It can be considered a fusion chapter, as it is combined with Chapter 2. Further experiments such as Brdu or Edu are needed to strengthen the current hypothesis.
      • DAPI staining is absent in majority of the images.
      • Put the number of each cluster next to the arrow in all the IF images.
      • Immunofluorescent staining of cluster markers identified in the previous studies should be included in the present study such as: CD81, FoxL1, Myh11, Pdgfrb and Gli1.

      Significance

      In this study, the researchers employed an alternative mouse model, the "aSMAcreERT2," to target the murine colonic fibroblast population. This approach represents a novel contribution to the field, offering a fresh perspective on previous findings. While the authors have presented several bioinformatic analyses and immunofluorescent staining of specific markers to support their conclusions, certain aspects of their argument require further elaboration or clarification, particularly regarding the differentiation trajectories and unique functions of the various clusters of subepithelial colonic fibroblasts. The present manuscript is constrained at the descriptive level due to an absence of functional experiment data.

      Strengths: The authors utilize "aSMAcreETR2" as a research model to target murine colonic fibroblasts, a novel approach that complements previously published data. By comparing and combining four published single-cell RNA sequencing (scRNA-seq) of colonic fibroblasts, they proposed a novel classification with five distinct subpopulations: telocytes, trophocytes/extracellular matrix (ECM) fibroblast, fibroblast, myofibroblast, and smooth muscle/pericyte-like fibroblast.This new classification, together with their unique molecular signature, can be useful for people in the colon and intestine research field. However, the manuscript is not without its limitations. First, the novel classification and unique molecular signature are not substantiated by functional experimentation, which is essential for validating the fibroblast subcluster's functionality. Additionally, the characterization of cluster 1 is lacking, particularly concerning its ability to differentiate into the five distinct subcultures, which is crucial for confirming its status as a multipotent progenitor. Despite the proposal of a novel classification and detailed molecular signature of the colonic fibroblasts, no isolation strategy is proposed in the present manuscript to allow further characterization. If the authors can address these points, the manuscript can make a significant contribution to the field. This study might interest people who perform basic research in the intestine and colon.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This study utilizes scRNA-seq to delineate six fibroblast subpopulations in mouse colonic mucosa, revealing their molecular heterogeneity, functional specialization, and spatial distribution. The high-quality confocal microscopy images effectively illustrate the spatial distribution of cells within the colon mucosa. However, several concerns should be addressed:

      1. The structures of the lamina propria of murine colon mucosa are nicely described. However, in the introduction of the manuscript the structures of fibroblasts, myofibroblasts and ECM are not described. The structures of the lamina propria of murine colon mucosa should be well described in the induction and discussed in the discussion.
      2. The UMAP plot suggests potential heterogeneity within Cluster 1, raising questions about whether the chosen clustering resolution (e.g., parameter settings in Seurat's "FindClusters") optimally captures subpopulations.
      3. Some subpopulations express marker genes characteristic of pericytes and smooth muscle cells (e.g., Desmin). How did the authors ensure proper discrimination between fibroblasts and these other cell types?
      4. The manuscript also did not show the distribution and structures of ECM. It is better to show the relationships of fibroblasts and myofibroblasts with in the lamina propria of murine colon mucosa.
      5. The integration with previously published datasets lacks clear connection to the authors' own findings. A more detailed comparison and discussion of how these integrated analyses relate to the newly generated data would improve the manuscript's coherence.
      6. While the authors focus on colonic mucosa, the integrated public datasets include data from both colon and small intestine. Were these distinct tissue sources accounted for in the analysis? Clarification on this point is necessary to ensure the validity of comparisons.
      7. Many aspects of the described fibroblast subpopulations, including their single-cell expression profiles and physiological functions, appear to have been previously reported. The authors should more explicitly highlight the novel contributions of their work to advance our understanding of intestinal fibroblast biology

      Significance

      The proposed standardized nomenclature for intestinal fibroblasts represents a valuable contribution toward unifying classification in the field.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements

      We thank the editor for handling our manuscript and the reviewers for their constructive critiques. We are deeply convinced that the reviewers’ suggestions have substantially raised the quality and possible impact of our manuscript. We also like to thank the reviewers for their judgements that the subject of our manuscript is biologically and clinically significant and of high importance, and that our manuscript might help to increase focus and visibility for affected individuals.

      New text passages in the manuscript are colored in red. Below is a point-by-point response to the reviewers’ comments.

      2. Point-by-point description of the revisions

      Response to reviewer 1 comments

      Major comments

      Point 1-1

      The authors performed qRT-PCR validation for markers of differentiation and hypoxia, with a major absence of VEGF and HIF1a. The paper would be strengthened by mention of these factors, especially by qRT-PCR or Western blot.

      We thank the reviewer for the suggestion to include the bona fide hypoxia markers Vegfa and Hif1-alpha. We followed the suggestion and performed qRT-PCR on Vegfa transcripts at each tested condition (Figs. 1A,2A,3A,4A,5A,5D,5I,5N). As Hif1α is rather regulated on protein than on transcript level, we followed the advice to perform Western blots. We analyzed Hif1α protein levels on proliferating cells and quantified by normalization to actin (Figs. 1B,C and 5 B,C).

      Point 1-2

      Please provide justification of selection 0.5% as their hypoxic condition or perhaps repeat experiments in a less extreme environment to see if their conclusions still hold true.

      We admit that our approach to use 0.5% hypoxia was a drastic challenge for the cells. It should be noted, however, that physiologic oxygen levels during pregnancy at times drop to lower than 1% (Hansen et al, 2020; Ng et al, 2017). In the first place, we had used oxygen levels lower than this, because we had wanted to ensure that we can detect responses by bulk RNA-seq with a limited number of samples. As we had many conditions to compare, we did not want to use more than 3-4 samples per condition. The fact that the cells showed normal proliferation underscores the fact that 0.5% O2 per se was not so low that it would be overly stressful to the cells.

      Nevertheless, we are very grateful to the reviewer for the suggestion to include a milder hypoxic condition. We chose 2% O2, because this equals the physiological oxygen concentration shortly before the onset of cranial neural crest cell (CNCC) differentiation. We could recapitulate the phenomenon of impaired differentiation to chondrocytes, osteoblasts and smooth muscle cells at these mild hypoxic conditions, as shown by qRT-PCR and immunofluorescence of typical markers (Figs. 5D-R). Moreover, the differentiation-specific induction of the two central hypoxia-attenuated risk genes associated with orofacial clefts that we had identified by our bioinformatic analyses at 0.5% O2 (Boc and Cdo1), was still observable at 2% O2 (Figs. S6C,D). Interestingly, in some rare cases, the attenuation of induction was lost or not as drastic as in 0.5% O2.

      We are convinced that the experiments at 2% O2 strongly increased the relevance of our manuscript, because we thus detected that oxygen levels prevailing shortly before the onset of CNCC differentiation still can influence their differentiation. This leads to the conclusion that only slight decreases of intra-uterine oxygen levels indeed might interfere with correct differentiation of CNCC.

      Point 1-3

      Standard immunohistochemistry or histology of differentiated cells would strengthen the authors' claims of reduced differentiation under hypoxic conditions, e.g., Alcian blue, alk-phos or Alizarin red, and smooth muscle actin or other indicator.

      We are grateful to the reviewer for the suggestion to include stainings of cells, as these stainings visualized the drastic effects of hypoxia on the cells. We performed immunofluorescent stainings against at least one marker protein for each differentiation paradigm. At 0.5% O2, each protein signals were nearly completely absent and cell morphology was disrupted (Figs. 2E,F, 3E, 4E). At 2% O2, we detected some more protein deposition than at 0.5%. Importantly, cells had retained their normal shape at mild hypoxia (Figs. 5H,M,R, S5A).

      Point 1-4

      The authors identify a few genes that appear down-regulated in all three differentiation conditions. If it is within the scope of the study, it would strengthen the claim of these genes' function to show the effect of knock-down or knock-out for validation.

      We thank the reviewer for the suggestion of gene knock-down or knock-out in order to prove functional relevance of our findings. As this would have been too much effort and beyond the scope of our study, we rather followed the suggestion of reviewer 2 (cf. points 2-6, and 2-8) that headed to the same direction: we mined publicly available sequence data on orofacial development for gene expression or marks of active enhancers. We found robust expression of the two central hypoxia-attenuated OFC risk genes Boc and Cdo1 during human craniofacial development (Fig. 7A) and we identified enhancers that are active in embryonic craniofacial mouse tissue (Fig. 7B). Moreover, we detected expression of both genes during murine craniofacial development in undifferentiated mesenchymal cells, osteoblasts, chondrocytes and smooth muscle cells with the help of a single cell RNA-seq dataset (Figs. 7C-E, S6B).

      Thus, we found evidence for the in vivo relevance of Boc and Cdo1 and could rule out a possible important role of Actg2, the third gene we had identified. We therefore are grateful for the suggestion to circumvent gene knockouts by reviewer 2, as we think these data strongly emphasized the importance of our findings.

      Point 1-5

      Another major critique lies in the initial claim that proliferation of O9-1 cells is not significantly impacted by hypoxia. In figures 1E-H, photograms of the cells cultured 24 -72 hours and quantifications of live vs dead cells are shown as evidence for this argument. However, the increased density of cells in normoxic conditions may be a confounding variable in this assay. It would be interesting for the researchers to assess the percent of dead vs alive cells between normoxic and hypoxic conditions when the plates reach equivalent densities.

      We apologize for the use of image sections from photographs with different cell densities. Of course, as demonstrated by our quantification, cell densities between 0.5% and 21% O2 in total were equal (cf. Figs. 1D,E). We therefore replaced the formerly used sections with new image sections with equal cell numbers.

      We thank the reviewer for the suggestion to examine if cell numbers influence cell death rates. We followed this advice by several approaches: first, we seeded cells at different densities, incubated them for 72 h (the same time span where a minimal difference had been detected) and performed live/dead stainings (Fig. S1B). The seeding density did not affect percentages of dead cells and the values were in the same range as in our initial experiment (Fig. 1J). Moreover, we performed TUNEL stainings of apoptotic cells at different time points to have an additional readout of cell death (Figs. 1K,L). As expected, the percentages of TUNEL-positive cells were identical between hypoxic and normoxic cells at all analyzed time points.

      We therefore concluded that hypoxia does not influence the rate of cell death of proliferating CNCC and accordingly specified our wording in the results section.

      Point 1-6

      At end of Fig 1 section authors attempt to tie phenotypes observed in a cell line in vitro to the complex biological processes. They are not comparable and in vivo models would be better suited for these types of comparisons.

      We apologize for the overconfident wording in our manuscript. Of course, our in vitro experiments cannot fully simulate the complex developmental processes taking place in vivo. We therefore changed the text to a more careful formulation. Moreover, we kept the wording in the discussion section that we cannot exclude that in the in vivo situation proliferation of CNCC is also affected by low oxygen levels because nutrients might not be available in such excess as they are in cell culture.

      Point 1-7

      Fig 2: if qRT-PCR did not show statistically different results between experimental and control groups why move on to bulk RNA seq?

      We apologize that the sentence about statistical significance was misleading. What we wanted to express is that there was only a little difference (if any at all) between differentiated cells at 0.5% O2 and proliferating cells at 0.5% O2 or 21% O2. For the sake of clarity and readability, we deleted this misleading sentence.

      Point 1-8

      Fig 5: hypoxia this intense is going to affect broad range of biological processes and genes. Finding a few genes that are affected in extreme hypoxia that are also risk genes is highly unlikely. How can the authors be assured that these overlaps are actually significant and not just by chance?

      We thank the reviewer for the suggestion to test for statistical significance. We tested significance of the overlap of respective gene sets (nsOFC vs. hyp-a; OFC vs. hyp-a) by Fisher’s exact test. We included Venn diagrams depicting the overlap and present the exact p-values (Figs. S5C,D). In each case where overlap of genes occurred, p-values indicated significance.

      Point 1-9

      Would appreciate discussion on how examination of neural crest is relevant for OFC, as most animal models of OFC demonstrate the pathogenesis in embryonic epithelium or periderm, not in the neural crest. Defects in neural crest are associated with other congenital craniofacial anomalies such as craniosynostosis or complex (Tessier) clefts, not the typical orofacial cleft. Please revise rationale of study, interpretation of data and Discussion to specifically state how neural crest cells are involved in the pathogenesis of orofacial cleft.

      We apologize for not pointing out enough the role of epithelial cells in the emergence of orofacial clefts. We revised our introduction, results and discussion sections in this regard and emphasized the role of epithelial cells. Importantly, we addressed the possible influence of the results gained in CNCC on epithelial cells by analyzing scRNA-seq data with the algorithm CellChat, as suggested by reviewer 2 (cf. point 2-8). We detected several cell communication pathways from CNCC to epithelial cells which contain components that are misexpressed upon hypoxia in our dataset (Figs. 7F-I). Therefore, during hypoxia, these pathways might influence epithelial cells and therefore indirectly cause orofacial clefts. We outlined this possible interplay in the discussion and briefly mentioned it in the abstract.

      We have not discussed more strongly the role of CNCC in the emergence of OFC in the revised manuscript, because we did not want to put even more emphasis on this matter. Numerous studies have proven the contribution of cranial neural crest tissue to the emergence of orofacial clefts. This fact is also pointed out in several review articles about orofacial clefts. In most cases, this knowledge was achieved by mouse models, because tissue-specific conditional knockouts are feasible (in contrast to genetic studies on patients), usually via deletion with the Wnt1-Cre driver. Funato et al. give an excellent (but quite old) overview of mouse models in which the neural crest-specific knockout of a gene leads to emergence of OFC and lists 17 genes for which this is the case (Funato et al, 2015). Moreover, several recent studies also report on the emergence of orofacial clefts upon neural crest-specific deletion (Forman et al, 2024; Li et al, 2025). These include genes responsible for DNA methylation (Ulschmid et al, 2024), and a study on subunits of chromatin remodeling complexes that are necessary for correct transcription of their target genes, which was conducted by our group (Gehlen-Breitbach et al, 2023).

      Minor comments

      __Point 1-10 __

      The author should replace "Final proof" in the introduction with "further evidence supporting."

      We apologize for the incorrect wording. Of course, it is highly questionable if there is such a thing as final proof in life sciences. We re-phrased the text according to the reviewer’s suggestion.

      Point 1-11

      Authors are inconsistent when referring to Figures- sometimes they capitalize (i.e. 1J) and other times they leave lower case (i.e. 1i). Needs to be consistent throughout. Figures are not numbered.

      We apologize for the inconsistency. We corrected the references to figures. Moreover, we apologize for the missing figure numbers. We also corrected this and included figure numbers.

      Point 1-12

      In figures authors would sometimes list 21% O2 first then 0.5% O2 or vice versa. (i.e. Fig on page 21 panels I, J, K). Needs to be consistent.

      We again apologize for being inconsistent. We corrected the inconsistency in Fig. 1D. Now, 21% O2 is presented before/above 0.5% O2.

      Point 1-13

      Figures on pages 28, 29, 30 panel J and page 31 panel F: there is no legend on what the scale/measurement is for the difference in expression level other than it ranges from -1 to +3.

      We thank the reviewer for the hint. We are aware that from the heatmaps we used one cannot infer relative expression rates of different genes or similar. If we would have considered expression strength of single genes, many of the gene-specific differing expression rates under the different conditions would have been hard to detect, as presentation would have been dominated by the differences in expression rates between genes. We therefore plotted gene-wise scaled expression.

      We included an explanation of the procedure in the materials and methods section.

      Point 1-14

      Will the authors please comment on the one normoxic sample in Figure 1I that did not cluster with the others? Did this meet the standards to merit exclusion as an outlier?

      We regret that the default scale of our plot of the principal component analysis is a bit misleading. This is the case because x-axis accounts for 80.3% of variance and y-axis only accounts for 6.1%. Therefore, the sample that might seem as an outlier actually met our standards. Nevertheless, we decided to keep the default scaling as is, in order not to embellish the graph (Fig. 1M).

      Point 1-15

      The authors refer to DEG as deregulated genes; while not strictly incorrect, the more standard usage is "differentially expressed genes." Please address.

      We apologize for the incorrect explanation of the acronym. Of course, this was corrected in the revised manuscript.

      Significance

      This work on neural crest cells and hypoxia are biologically and clinically significant.

      We are deeply grateful to the reviewer for considering our manuscript significant for both biologists and clinicians. We are convinced that the additional data we gathered in the course of the revision has significantly increased the importance of our work. Therefore, we once again express our gratitude to the reviewer for the valuable suggestions.

      Response to reviewer 2 comments

      Major comments

      Point 2-1

      The conclusions drawn from the experimental data are carefully formulated for the most part. One of the main concerns is that the cells were subjected to extreme hypoxic conditions, while it may be more biologically relevant to include a condition representing more mild hypoxia (e.g. 10%).

      Please refer to the response to point 1-2.

      Point 2-2

      One of the opening claims regarding severe hypoxia only mildly affecting cell proliferation is not shown clearly, since no mitotic markers have been analyzed (i.e. KI67 or PCNA staining or a simple EdU incorporation assay). Thus, the claim that they assessed cell proliferation is not very convincing, even though cell death was analyzed.

      We appreciate the reviewer’s suggestion to include a more thorough analysis of proliferation rates. We followed the advice and performed immunofluorescent stainings against Ki67 (accounting for cells in proliferative state) and phospho-histone H3 (accounting for cells undergoing mitosis). We performed this assay at different time points of culture in order to address the question if cell density might influence proliferation rates (Figs. 1F-H). Neither for Ki67 nor for pHH3 a difference was detected between 21% and 0.5% O2.

      We are convinced that these analyses strengthened our initial findings and provide strong evidence that hypoxia does not influence proliferation rates of CNCC.

      Point 2-3

      Additionally, cellular morphology of the cells could be assessed (brightfield images), since previous studies observed that hypoxia can be an inducive factor in cranial neural crest and driving EMT (Scully et al. 2016; Barriga et al. 2013).

      We thank the reviewer’s hint and followed the advice. We analyzed cellular morphology by the parameters cell length, total number of pseudopodia, number of filopodia and number of lobopodia (Figs. S1C-F). As outlined in the results section, we did not detect a difference in these parameters between 21% and 0.5% O2.

      We included the second reference mentioned by the reviewer (Barriga et al, 2013) additionally to Scully et al. 2016 that had already been cited.

      Point 2-4

      Furthermore, in the RNA seq analysis of chondrogenic fate biased cells the authors draw a conclusion based on the proximity of the samples on the PCA plot, which is not very convincing. More careful analysis of the bulk RNA seq data sets they have generated for key marker genes will be more convincing (for example, a heatmap with selected genes would be a helpful representation).

      We apologize for the rash and inaccurate conclusion based on proximity on PCA plots. We are grateful to the reviewer for the suggestion to include heatmaps with selected marker genes. Following this advice, we generated heatmaps on our bulk RNA-seq data with the GO terms specific for each differentiation paradigm (Figs. S2F, S3F, S4F).

      We are convinced that these maps are perfect additions to the heatmaps of the 200 top differentially-expressed genes that already had been included in the manuscript (Figs. 2K, 3J, 4J) and helped to strengthen our findings. For chondrocytes and smooth muscle cells, the new, GO-specific heatmaps perfectly recapitulated the phenomenon of hypoxia-attenuated induction. Interestingly, for osteoblasts, about half of the induced genes were hypoxia-attenuated, while the other half was induced stronger than under normoxia. This pointed to gene-specific mechanisms of hypoxia-dependent attenuation of transcription. Moreover, it shed light on a hypoxia-evoked complete dysregulation of transcriptional induction in osteoblasts, as nearly none of the genes was induced similar to normoxia.

      __ __

      Point 2-5

      As mentioned above, a straight-forward and not time consuming experiment (given that it was assessed for a maximum of 72 hrs) would be to repeat the culture of NCCs and stain for mitotic markers, and quantify the number of positively stained cells over total cell numbers. Furthermore, it is not that demanding to add an experimental condition of less severe hypoxia in this assay.

      We thank the reviewer for the suggestion and followed the advice (cf. point 2-2). The conducted experiments straightened our results, because the initially detected slight tendency to lower cell numbers at 0.5% O2 could thus be falsified: We did not detect any difference for Ki67 and pHH3 between 0.5% and 21% O2 at any analyzed time point (Figs. 1F-H). Moreover, percentages of dead or apoptotic cells at 0.5% O2 did not vary from 21% (Figs. 1I-L, S1B). As we could not detect any difference in proliferation between 21% and 0.5% O2, we skipped the analysis of proliferating cells at 2% O2.

      Point 2-6

      Without underestimating how time consuming this would be, a major lack of experimental validation of the key genes they identify as important across all conditions may be the limitation of the study (this would be the difference between correlation and a probable underlying mechanism). This can be circumvented by more extensive reference to in situ data sets from mouse or existing data sets of single cell and spatial transcriptomics. A suggested targeted knock-down (for example with siRNA, shRNA or CRISPR) to validate a few of the key genes revealed as important could take a few months, with an estimated cost up to 5,000 euros per targeted gene and replicate.

      We thank the reviewer for the notion that targeted knockdowns are beyond the scope of our manuscript. We are deeply grateful for the reviewer’s constructive criticism and for the suggestion to analyze publicly available data sets in order to gather data depicting in vivo relevance of our identified central hypoxia-attenuated OFC risk genes Boc, Cdo1 and Actg2 (cf. point 1-4). We detected robust expression of Boc and Cdo1 during human craniofacial development (Fig. 7A) and we identified enhancers that are active in embryonic craniofacial mouse tissue (Fig. 7B). Moreover, we detected expression of both genes during murine craniofacial development in undifferentiated mesenchymal cells, osteoblasts, chondrocytes and smooth muscle cells by reanalysis of a scRNA-seq dataset (Figs. 7C-E, S6B). This data comprised scRNA-seq of mouse embryonic maxillary prominence at stages E11.5 and E14.5 (Sun et al, 2023).

      Thus, we found evidence for the in vivo relevance of Boc and Cdo1 and could rule out a possible important role of Actg2, the third gene we had identified. We therefore are deeply grateful for the suggestion, as we think these data strongly emphasize the importance of our findings.

      Point 2-7

      On methods, replicates and statistics: The experimental methods and approach are described efficiently and seem reproducible. All biological and technical replicates are of a minimum of N=3 from independent experiments and statistical tests have been run in all cases.

      We thank the reviewer for the appreciation of our methodology, descriptions and statistical analyses.

      Minor points

      Point 2-8

      One of the key implications of NCCs in palate formation is interaction with orofacial epithelial cells, which the authors also mention. It may be interesting to check if any signaling pathways involved in this crosstalk are affected under hypoxic conditions in their existing data sets of bulk RNA SEQ. This can be done by using available algorithms such as CellChat (Jin et al. 2021; Jin, Plikus, and Nie 2023), which has been reported to work also in bulk RNA seq data analysis (according to GitHub). The authors could mine the literature for existing RNA sequencing data that include osteoblasts, chondrocytes and epithelial cells (Ozekin, O'Rourke, and Bates 2023; Piña et al. 2023).

      We are very grateful to the reviewer for this suggestion. Moreover, we like to thank the reviewer for mentioning exemplary references. We followed the advice by the methodology lined out in results and materials and methods sections: we applied the CellChat algorithm on a scRNA-seq dataset (Pina et al, 2023; Sun et al., 2023) to identify pathways containing components that are hypoxia-attenuated (and associated with a risk for OFC) in our bulk RNA-seq dataset (Figs. 7F-I). We did not use the datasets the reviewer had suggested, because the data were not available for us or the file format was not well-suited for the analysis with CellChat. Importantly, the dataset from Sun et al. has the following advantages over the suggested references: the complete maxillary prominence was used (instead of palatal shelves only), and different time points were included. Thus, we were able to follow the expression of genes of interest at different developmental stages before the onset of differentiation and after (Figs. 7C-E and S6B). By our approach, we identified several OFC-related pathways that contain hypoxia-attenuated components such as BMP and FGF signaling and deposition of collagen and fibronectin (Figs. 7F-I). Importantly, the named pathways (and others) send outgoing communication patterns to epithelial cells. Therefore, hypoxia-attenuated gene induction in CNCC could influence epithelial cells via these pathways.

      We believe that the use of the CellChat algorithm has brought a deeper understanding of how hypoxia can have indirect consequences on the important topic of epithelial cells and thus could also evoke OFC. We therefore once again like to express our gratitude to the reviewer.

      Point 2-9

      Additionally, another process that may be affected is EMT (epithelial-to-mesenchymal-transition) and is possible to assess by re-analysis of bulk RNA-seq data while focusing on key genes implicated in this process (i.e. E-cadherin, vimentin, EpCAM, Snail, Twist, PRRX1).

      We thank the reviewer for the advice. We followed the advice and analyzed cellular morphology by the parameters cell length, total number of pseudopodia, number of filopodia and number of lobopodia (Figs. S1C-F) (cf. point 2-3). As we did not detect any differences between 21% and 0.5% O2, and because the cells we used for our analyses represent mesenchymal cells, i.e. cells that had already undergone EMT, we did not re-analyze our dataset with the focus on EMT.

      Point 2-10

      Lastly, when the authors report on the significantly up- or down-regulated genes, it may be interesting to categorize them by ligands, receptors, intracellular molecules and transcription factors (and use separate plots to visualize them). While a big focus of the manuscript are down-regulated genes, less emphasis was given in upregulated genes (other than the response to hypoxia gene module).

      We thank the reviewer for the advice. Following this advice, we categorized genes according to Panther protein classes "intercellular signal molecule" (PC00207), "transmembrane signal receptor" (PC00197) and "gene-specific transcriptional regulator" (PC00264) and depicted the results with violin plots (Fig. S5B). We could not analyze intracellular molecules, because this protein class does not exist in the Panther database. We had not focused on the genes with stronger induction in hypoxic condition, because the number of genes was low in each differentiation paradigm (7 in chondrocytes, less than 30 in osteoblasts, none in smooth muscle cells) and the transcriptional changes were mostly not as drastic as for the attenuated genes. In order to achieve a broader overview of deregulated processes, we now included GO term analyses of genes downregulated during the differentiation regimes both at 21% and 0.5% O2 (Figs. S2D,E, S3D,E, S4D,E).

      Point 2-11

      The authors are referencing extensively and accurately existing studies in the field and the manuscript is exceptionally well-written, with only a few points of limited clarity or increased complexity. Such an example is when the authors refer to OFC risk genes, because it is not clearly stated how the referenced studies reached their conclusions (for example, are they mouse studies, do they involve mutants, are any of these studies based on GWAS on human cohorts). This matter would significantly improve the flow of the text and highlight the importance of the study and their findings.

      We would like to thank the reviewer very much for the appreciation of our scientific writing. We apologize for not explaining exactly how our OFC risk gene lists had been curated. We included this information for both non-syndromic and other OFC risk genes at the respective sites in the results section. Moreover, we included the Human Phenotype Ontology terms that had been used in the search in the materials and methods section.

      We thank the reviewer for this suggestion, as we agree that this information significantly highlights the importance of our findings.

      Point 2-12

      The figures could be redesigned to be more intuitive to interpret. For example, using violin plots and heatmaps, as discussed, and including references or re-analysis/re-use of existing spatial transcriptomics and in situs for marker genes.

      In all cases where there is a comparison of gene expression levels, violin plots would be a better representation of up- and down-regulated genes (i.e. selected genes from Fig1K, comparison of gene expression between normoxic and hypoxic NCCs, Fig 2G when analyzing chondrogenesis and the respective analysis for osteoblasts and smooth muscle cells, as well as when comparing the three fate-biasing conditions to identify common genes that are misregulated).

      We thank the reviewer for the advice and for the appreciation of the usage of heatmaps (Figs. 2K, 3J, 4J, 6F). Unfortunately, as the number of biological replicates is only three to four, the visualization of gene expression data from our bulk RNA-seq data with violin plots was not intuitive. We therefore retained the heatmaps rather than choosing bar graphs, because they are much clearer when presenting expression data of several to many genes. We included violin plots whenever possible due to high numbers of data points (Figs. S1C, S1D, S1E, S1F, S5B). Moreover, we added additional heatmaps to depict transcriptional changes of genes associated with GO terms with the various differentiation regimes (Figs. S2F, S3F, S4F). Unfortunately, we did not detect the three central hypoxia-attenuated genes in spatial transcriptomics data on craniofacial development. But we used scRNA-seq data of different stages of orofacial mouse tissue where we could identify expression of Boc and Cdo1 (cf. points 1-4 and 2-6). These data helped, together with other in vivo data to gain evidence for the in vivo function of Boc and Cdo1 during CNCC differentiation and helped to dismiss Actg2 as another central player.

      Significance

      Several pieces of evidence have pointed to hypoxia as an environmental factor contributing to congenital orofacial clefts, ranging from studies in mouse to observations in human. The authors are doing an excellent job in putting this information together and the question they are trying to answer is of high importance, given the prevalence of such congenital syndromes.

      We are deeply grateful to the reviewer for the appreciation of our work and for classifying our research topic as highly important.

      In terms of the methods and model employed, there are some limitations, related to the choice of a mouse cell line over one from human, the severe hypoxia induced (over a more mild), and the conditions of directed differentiation not allowing for simultaneous examination of more complex lineage transitions. The methods as a whole are not that up-to-date, given the single cell and multiplexed transcriptomic advances the last couple of decades, advanced bioinformatics that could be used in combination with in vitro lineage tracing methods.

      We thank the reviewer for the honest evaluation of our methods, especially for the constructive suggestions that were given to address our hypotheses with more up-to-date methods and at milder hypoxic conditions. As outlined above, we followed the advice and re-analyzed existing scRNA-seq datasets (cf. points 2-6 and 2-8) and checked our central hypotheses at milder hypoxic conditions (cf. response to point 1-3).

      We are deeply convinced that both significantly increased the biological relevance of our results, because we thus (1) gathered evidence for the in vivo function of Boc and Cdo1 and (2) were able to show that the phenomenon of hypoxia-attenuated gene induction still holds true at biologically relevant hypoxic conditions.

      The audience this work will reach are neural crest experts, developmental biologists, and potentially clinical doctors. The general public outreach of such a paper is also diverse, as more focus and visibility is required for the individuals affected by those syndromes and their families.

      We thank the reviewer for the judgement that our manuscript will not only reach neural crest experts, but also developmental biologists in general and potentially also clinicians. We are very much pleased that the reviewer shares our opinion that affected individuals should be more in the focus of public attention. We like to express our gratitude for the judgement that our manuscript might help to increase focus and visibility for them.

      References

      Barriga EH, Maxwell PH, Reyes AE, Mayor R (2013) The hypoxia factor Hif-1α controls neural crest chemotaxis and epithelial to mesenchymal transition. The Journal of cell biology 201: 759-776, 10.1083/jcb.201212100.

      Forman TE, Sajek MP, Larson ED, Mukherjee N, Fantauzzo KA (2024) PDGFRα signaling regulates Srsf3 transcript binding to affect PI3K signaling and endosomal trafficking. Elife 13, 10.7554/eLife.98531.

      Funato N, Nakamura M, Yanagisawa H (2015) Molecular basis of cleft palates in mice. World journal of biological chemistry 6: 121-138, 10.4331/wjbc.v6.i3.121.

      Gehlen-Breitbach S, Schmid T, Fröb F, Rodrian G, Weider M, Wegner M, Gölz L (2023) The Tip60/Ep400 chromatin remodeling complex impacts basic cellular functions in cranial neural crest-derived tissue during early orofacial development. International Journal of Oral Science 15: 16, 10.1038/s41368-023-00222-7.

      Hansen JM, Jones DP, Harris C (2020) The Redox Theory of Development. Antioxid Redox Signal 32: 715-740, 10.1089/ars.2019.7976.

      Li D, Tian Y, Vona B, Yu X, Lin J, Ma L, Lou S, Li X, Zhu G, Wang Y et al (2025) A TAF11 variant contributes to non-syndromic cleft lip only through modulating neural crest cell migration. Hum Mol Genet 34: 392-401, 10.1093/hmg/ddae188.

      Ng KYB, Mingels R, Morgan H, Macklon N, Cheong Y (2017) In vivo oxygen, temperature and pH dynamics in the female reproductive tract and their importance in human conception: a systematic review. Human Reproduction Update 24: 15-34, 10.1093/humupd/dmx028.

      Pina JO, Raju R, Roth DM, Winchester EW, Chattaraj P, Kidwai F, Faucz FR, Iben J, Mitra A, Campbell K et al (2023) Multimodal spatiotemporal transcriptomic resolution of embryonic palate osteogenesis. Nature communications 14: 5687, 10.1038/s41467-023-41349-9.

      Sun J, Lin Y, Ha N, Zhang J, Wang W, Wang X, Bian Q (2023) Single-cell RNA-Seq reveals transcriptional regulatory networks directing the development of mouse maxillary prominence. J Genet Genomics 50: 676-687, 10.1016/j.jgg.2023.02.008.

      Ulschmid CM, Sun MR, Jabbarpour CR, Steward AC, Rivera-González KS, Cao J, Martin AA, Barnes M, Wicklund L, Madrid A et al (2024) Disruption of DNA methylation-mediated cranial neural crest proliferation and differentiation causes orofacial clefts in mice. Proc Natl Acad Sci U S A 121: e2317668121, 10.1073/pnas.2317668121.

    1. Reviewer #3 (Public review):

      This manuscript reports interesting data on sex differences in expression across several somatic and reproductive tissues among 4 mice species or subspecies. The focus is on sex-biased expression in the somatic tissues, where the authors report high rates of turnover such that the majority of sex-biased genes are only sex-biased in one or two taxa. The authors show sex-biased genes have higher expression variance than unbiased genes but also provide some evidence that sex-bias is likely to evolve from genes with higher expression variance. The authors find that sex-biased genes (both female- and male-biased) experience more adaptive evolution (i.e., higher alpha values) than unbiased genes. The authors develop a summary statistic (Sex-Bias Index, SBI) of each individual's degree of sex-bias for a given tissue. They show that the distribution of SBI values often overlap considerably for somatic (but not reproductive) tissues and that SBI values are not correlated across tissues, which they interpret as indicating an individual can be relatively "male-like" in one tissue and relatively "female-like" in another tissue.

      Though the data are interesting, there are some disappointing aspects to how the authors have chosen to present the work. For example, their criteria for sex-bias requires an expression ratio of one sex to the other of 1.25. A reasonably large fraction of the "sex-biased genes" have ratios just beyond this cut-off (Fig. S1). A gene which has a ratio of 1.27 in taxa 1 can be declared as "sex-biased" but which has a ratio of 1.23 in taxa 2 will not be declared as "sex-biased". It is impossible to know from how the data are presented in the main text the extent to which the supposed very high turnover represents substantial changes in dimorphic expression. A simple plot of the expression sex ratio of taxa 1 vs taxa 2 would be illuminating but the authors declined this suggestion.

      I was particularly intrigued by the authors' inference of the proportion of adaptive substitutions ("alpha") in different gene sets. The show alpha is higher for sex-biased than unbiased genes and nicely shows that the genes that are unbiased in focal taxa but sex-biased in the sister taxa also have low alpha. It would be even stronger that sex-bias is associated with adaptive evolution to estimate alpha for only those genes that are sex-biased in the focal taxa but not in the sister taxa (the current version estimates alpha on all sex-biased genes within the focal taxa, both those that are sex-biased and those that are unbiased in the sister taxa).

      The author's Sex Bias Index is measured in an individual sample as: SBI = median(TPM of female-biased genes) - median(TPM of male-biased genes). This index has some strange properties when one works through some toy examples (though any summary statistic will have limitations). The authors do little to jointly discuss the merits and limitations of this metric. It would have been interesting to examine their two key points (degree of overlapping distributions between sexes and correlation across tissues) using other individual measures of sex-bias.

      Figure 5 shows symmetric gaussian-looking distributions of SBI but it makes me wonder to what extent this is the magic of model fitting software as there are only 9 data points underlying each distribution. Whereas Figure 5 shows many broadly overlapping distributions for SBI, Figure 6 seems to suggest the sexes are quite well separated for SBI (e.g., brain in MUS, heart in DOM).

      Fig. S1 should be shown as the log(F/M) ratio so it is easier to see the symmetry, or lack thereof, of female and male-biased genes.<br /> It is important to note that for the variance analysis that IQR/median was calculated for each gene within each sex for each tissue. This is a key piece of information that should be in the methods or legend of the main figure (not buried in Supplemental Table 17).

    2. Author response:

      The following is the authors’ response to the current reviews.

      We are disappointed that the reviewers do not acknowledge that our data constitute a major step forward for the field. We will prepare a revised version that takes care of the remaining small issues concerning the technical descriptions and a detailed response to the current round of comments. We will also add a summary of the major new findings of our study.


      The following is the authors’ response to the original reviews.

      We appreciate the time of the reviewers and their detailed comments, which have helped to improve the manuscript.

      Our study presents the largest systematic dataset so far on the evolution of sex-biased gene expression in animals. It is also the first that explores the patterns of individual variation in sex-biased gene expression and the SBI is an entirely new procedure to directly visulize these variance patterns in an intuitive way.

      Also, we should like to point out that our study contradicts recent conclusions that had suggested that a substantial set of sex-biased genes has conserved functions between humans and mice and that mice can therefore be informative for gender-specific medicine studies. Our data suggest that only a very small set of genes are conserved in their sex-biased expression between mice and humans in more than one organ.

      In the revised version we have made the following major updates:

      - added a rate comparison of gene regulation turnover between sex-biased and non-sex-biased genes

      - added additional statistics to the variance comparisons and selection tests

      - added a regulatory module analysis that shows that much of the gene turnover happens within modules

      - added a mosaic pattern analysis that shows the individual complexity of sex-biased patterns

      - extended introduction and discussion

      Reviewer #1 (Public Review):<br /> The authors describe a comprehensive analysis of sex-biased expression across multiple tissues and species of mouse. Their results are broadly consistent with previous work, and their methods are robust, as the large volume of work in this area has converged toward a standardized approach.

      I have a few quibbles with the findings, and the main novelty here is the rapid evolution of sex-biased expression over shorter evolutionary intervals than previously documented, although this is not statistically supported. The other main findings, detailed below, are somewhat overstated.

      (1) In the introduction, the authors conflate gametic sex, which is indeed largely binary (with small sperm, large eggs, no intermediate gametic form, and no overlap in size) with somatic sexual dimorphism, which can be bimodal (though sometimes is even more complicated), with a large variance in either sex and generally with a great deal of overlap between males and females. A good appraisal of this distinction is at . This distinction in gene expression has been recognized for at least 20 years, with observations that sex-biased expression in the soma is far less than in the gonad.

      For example, the authors frame their work with the following statement:

      "The different organs show a large individual variation in sex-biased gene expression, making it impossible to classify individuals in simple binary terms. Hence, the seemingly strong conservation of binary sex-states does not find an equivalent underpinning when one looks at the gene-expression makeup of the sexes"

      The authors use this conflation to set up a straw man argument, perhaps in part due to recent political discussions on this topic. They seem to be implying one of two things. a) That previous studies of sex-biased expression of the soma claim a binary classification. I know of no such claim, and many have clearly shown quite the opposite, particularly studies of intra-sexual variation, which are common - see https://doi.org/10.1093/molbev/msx293, https://doi.org/10.1371/journal.pgen.1003697, https://doi.org/10.1111/mec.14408, https://doi.org/10.1111/mec.13919, https://doi.org/10.1111/j.1558-5646.2010.01106.x for just a few examples. Or b) They are the first to observe this non-binary pattern for the soma, but again, many have observed this. For example, many have noted that reproductive or gonad transcriptome data cluster first by sex, but somatic tissue clusters first by species or tissue, then by sex (https://doi.org/10.1073/pnas.1501339112, https://doi.org/10.7554/eLife.67485)

      Figure 4 illustrates the conceptual difference between bimodal and binary sexual conceptions. This figure makes it clear that males and females have different means, but in all cases the distributions are bimodal.

      I would suggest that the authors heavily revise the paper with this more nuanced understanding of the literature and sex differences in their paper, and place their findings in the context of previous work.

      We are sorry that our introduction seems to have been too short to make our points sufficiently clear. Of course, overlapping somatic variation has been shown for morphological characters, but we were aiming to assess this at the sex-biased transcriptome level. Previous studies looking at sex-biased genes were usually limited by the techniques that were available at their times, resulting in a focus on gonads in most studies and almost all have too few individuals included to study within-group variation. We detail this below for the papers that are mentioned by the referee. In view of this, we cite them now as examples for the prevalent focus on gonadal comparisons in most studies. Only Scharmann et al. 2021 on plant leaf dimorphism is indeed relevant for our study with respect to its general findings and we make now extensive reference to it. In addition, we have generally modified the introduction and substantially extended the discussion to make our points clear.

      Snell-Rood 2010: the paper focuses on sex-specific morphological structures in beetles. It samples six somatic tissues for four individuals each of each class. Analysis is done via microarray hybridizations. While categorial differences were traced, variability between individuals was not discussed. By today´s standards, microarrays have anyway too much technical variability to even consider such a discussion.

      Pointer et al. 2013: this paper studies three sexual phenotypes in a bird species, females, dominant males and subordinate males. Tissues include telencephalon, spleen and left gonad. The focus of the analysis is on the gonads, since only few sex-biased genes were found in spleen and brain (according to suppl. Table S1, 0 for the spleen and 2 for the brain). No inferences could be made on somatic variation.

      Harrison 2015: this paper focuses on gonads plus spleen in six bird species with between 2-6 individuals for each sex collected. In the spleen, only one female biased gene and no male biased gene was detected. Hence, the data do not allow to infer patterns of somatic variation.

      Dean et al. 2016: this paper compares four categories of fish caught around nests, with four to seven individuals per category. Only gonads were analyzed, hence no inferences could be made about somatic variability between individuals.

      Cardoso et al. 2017: this paper test categories of fish with alternative reproductive tactics based on brain transcriptomes. While it uses 9-10 individuals per category, it uses pools for sequencing with two pools per category. This does not allow to make any inference on individual variation.

      Todd et al 2017: this paper focuses on three categories of a fish species, females and dominant and sneaker males. It uses brain and gonads as samples with five individuals each for each category. For the brain, more different genes were found between the two types of males, rather than between females and males (3 and 9 respectively). The paper focuses on individual gene descriptions and does not mention somatic variation.

      Scharmann 2021: the paper focuses on 10 species of plants with sexually dimorphic leafs. 5-6 individuals were sampled per sex. The major finding is that sex-biased gene expression does not correlate with the degree of sexual dimorphism of the leafes. The study shows also a fast evolution of sex-biased expression and states that signatures of adaptive evolution are weak. But it does not discuss variance patterns within populations.

      (2) The authors also claim that "sexual conflict is one of the major drivers of evolutionary divergence already at the early species divergence level." However, making the connection between sex-biased genes and sexual conflict remains fraught. Although it is tempting to use sex-biased gene expression (or any form of phenotypic dimorphism) as an indicator of sexual conflict, resolved or not, as many have pointed out, one needs measures of sex-specific selection, ideally fitness, to make this case (https://doi.org/10.1086/595841, 10.1101/cshperspect.a017632). In many cases, sexual dimorphism can arise in one sex only without conflict (e.g. 10.1098/rspb.2010.2220). As such, sex-biased genes alone are not sufficient to discriminate between ongoing and resolved conflict.

      We imply sexual conflict as a driver of genomic divergence patterns in a similar way as it has been done by many authors before (e.g. Mank 2017a, Price et al. 2023, Tosto et al. 2023). While we fully appreciate the point of the referee, we do not really see where we deviate from the standard wording that is used in the context of genomic data. In such data, it is of course usually assumed that they represent solved conflicts (Figure 1D in Cox and Calsbeek) where selection differentials would not be measurable anyway. (Please note also that the phylogenetic approach used in Oliver and Monteiro 2010 becomes rather problematic in view of introgressive hybridization patterns in butterflies), We have extended the discussion to address this.

      (3) To make the case that sex-biased genes are under selection, the authors report alpha values in Figure 3B. Alpha value comparisons like this over large numbers of genes often have high variance. Are any of the values for male- female- and un-biased genes significantly different from one another? This is needed to make the claim of positive selection.

      Sorry, we had accidentally not included the statistics in the final version of the figure. We have added this now in the supplementary table but have also generally changed the statistical approach and the design of the figure.

      Reviewer #2 (Public Review):

      The manuscript by Xie and colleagues presents transcriptomic experiments that measure gene expression in eight different tissues taken from adult female and male mice from four species. These data are used to make inferences regarding the evolution of sex-biased gene expression across these taxa. The experimental methods and data analysis are appropriate; however, most of the conclusions drawn in the manuscript have either been previously reported in the literature or are not fully supported by the data.

      We are not aware of any study that has analyzed somatic sex-biased expression in such a large and taxonomically well resolved closely related taxa of animals. Only the study by Scharman et al. 2021 on plant leaves comes close to it, but even this did not specifically analyze the intragroup variation aspects. Of course, some of our results confirm previous conclusions, but we should still like to point out that they go far beyond them.

      There are two ways the manuscript could be modified to better strengthen the conclusions.

      First, some of the observed differences in gene expression have very little to no effect on other phenotypes, and are not relevant to medicine or fitness. Selectively neutral gene expression differences have been inferred in previous studies, and consistent with that work, sex-biased and between-species expression differences in this study may also be enriched for selectively neutral expression differences. This idea is supported by the analysis of expression variance, which indicates that genes that show sex-biased expression also tend to show more inter-individual variation. This perspective is also supported by the MK analysis of molecular evolution, which suggests that positive selection is more prevalent among genes that are sex-biased in both mus and dom, and genes that switch sex-biased expression are under less selection at the level of both protein-coding sequence and gene expression.

      We have now revisited these points by additional statistical analysis of the variance patterns and an extended discussion under the heading "Neutral or adaptive?". 

      As an aside, I was confused by (line 176): "implying that the enhanced positive selection pressure is triggered by their status of being sex-biased in either taxon." - don't the MK values suggest an excess of positive selection on genes that are sex-biased in both taxa?

      There are different sets of genes that are sex-biased in these two taxa - hence this observation is actually a strong argument for selection on these genes. We have changed the correspondiung text to make this clearer.

      Without an estimate of the proportion of differentially expressed genes that might be relevant for broader physiological or organismal phenotypes, it is difficult to assess the accuracy and relevance of the manuscript's conclusions. One (crude) approach would be to analyze subsets of genes stratified by the magnitude of expression differences; while there is a weak relationship between expression differences and fitness effects, on average large gene expression differences are more likely to affect additional phenotypes than small expression differences.

      We agree that it remains a challenge to show functional effects for the sex-biased genes. The argument that they should have a function is laid out above (and stated in many reviews on the topic). To use the expression level as a proxy of function does not seem justified, given the current literature. For example, genes that are highly conected in modules are not necessrily highly expressed (e.g. transcription factors). Also, genes may be highly expressed in a rare cell type of an organ and have an important funtion there, but this would not show up across the RNA of the whole organ. The most direct functional relationship between sex-biased expression and phenotype comes from the human data in Naqvi et al. 2019 - which we had cited.

      Another perspective would be to compare the within-species variance to the between-species variance to identify genes with an excess of the latter relative to the former (similar logic to an MK test of amino acid substitutions).

      Such an analysis was actually our intial motivation for this study. However, the new (and surprising!) result is that the status of being sex-biased shows such a high turnover that not many genes are left per organ where one could even try to make such a test. However, we have extended the variance analysis with reciprocal gene sets (as we had done it for the MK test) and extended the discussion on the topic, including citation of our prior work on these questions.

      Second, the analysis could be more informative if it distinguished between genes that are expressed across multiple tissues in both sexes that may show greater expression in one sex than the other, versus genes with specialized function expressed solely in (usually) reproductive tissues of one sex (e.g. ovary-specific genes). One approach to quantify this distinction would be metrics like those used defined by [Yanai I, et al. 2005. Genome-wide midrange transcription profiles reveal expression-level relationships in human tissue specification. Bioinformatics 21:650-659.] These approaches can be used to separate out groups of genes by the extent to which they are expressed in both sexes versus genes that are primarily expressed in sex-specific tissue such as testes or ovaries. This more fine-grained analysis would also potentially inform the section describing the evolution/conservation of sex-biased expression: I expect there must be genes with conserved expression specifically in ovaries or testes (these are ancient animal structures!) but these may have been excluded by the requirement that genes be sex-biased and expressed in at least two organs.

      Given that our study focuses on somatic sex-biased genes, we refrain from a comparative analysis of genes that are only expressed in the sex-organs in this paper. With respect to sharing of sex-biased gene expresssion between the somatic tissues, we show in Figure 8 that there are only very few of them (8 female-biased and 3 male-biased). A separate statistical treatment is not possible for this small set of genes.

      There are at least three examples of statements in the discussion that at the moment misinterpret the experimental results.

      The discussion frames the results in the context of sexual selection and sexually antagonistic selection, but these concepts are not synonymous. Sexual selection can shape phenotypes that are specific to one sex, causing no antagonism; and fitness differences between males and females resulting from sexually antagonistic variation in somatic phenotypes may not be acted on by sexual selection. Furthermore, the conditions promoting and consequence of both kinds of selection can be different, so they should be treated separately for the purposes of this discussion.

      We cannot make such a distinction for gene expression patterns - and we are not aware that this was done before in the literature (except gene expression was directly linked to a morphological structure). We have updated this discussion accordingly.

      The discussion claims that "Our data show that sex-biased gene expression evolves extremely fast" but a comparison or expectation for the rate of evolution is not provided. Many other studies have used comparative transcriptomics to estimate rates of gene expression evolution between species, including mice; are the results here substantially and significantly different from those previous studies? Furthermore, the experimental design does not distinguish between those gene expression phenotypes that are fixed between species as compared to those that are polymorphic within one or more species which prevents straightforward interpretation of differences in gene expression as interspecific differences.

      Our statement was in relation to the comparison between somatic and gondadal gene turnover, as well as the comparison to humans. We have now included an additional analysis for a direct comparison with non-sex-biased genes in the same populations (Figure 2B). Note that gene expression variances cannot get fixed anyway, they can only become different in average and magnitude.

      The conclusion that "Our results show that most of the genetic underpinnings of sex differences show no long-term evolutionary stability, which is in strong contrast to the perceived evolutionary stability of two sexes" - seems beyond the scope of this study. This manuscript does not address the genetic underpinnings of sex differences (this would involve eQTL or the like), rather it looks at sex differences in gene expression phenotypes.

      This comes back to the points discussed above about the validity to infer function from sex-biased expression. We have updated the text to clarify this.

      Simply addressing the question of phenotypic evolutionary stability would be more informative if genes expressed specifically in reproductive tissues were separated from somatic sex-biased genes to determine if they show similar patterns of expression evolution.

      Our study is generally focused on somatic gene expression. The comparison with reproductive tissues serves merely as a reference. Since they are of course very different tissues, they should not be compared with each other in the same way. We have now specifically addressed this point in the discussion.

      Reviewer #3 (Public Review):

      This manuscript reports some interesting and important patterns. The results on sex-bias in different tissues and across four taxa would benefit from alternative (or additional) presentation styles. In my view, the most important results are with respect to alpha (fraction of beneficial amino acid changes) in relation to sex-bias (though the authors have made this as a somewhat minor point in this version).

      The part that the authors emphasize I don't find very interesting (i.e., the sexes have overlapping expression profiles in many nongonadal tissues), nor do I believe they have the appropriate data necessary to convincingly demonstrate this (which would require multiple measures from the same individual).

      This is the first study that reports such overlaps and we show that this is not always the case (e.g. liver and kidney data in mice). We are not aware of any preditions of how such patterns would look like and how they would evolve - why should such a new finding not be interesting? Concerning the appropriateness of the data we do not agree with the point the referee makes - see response below.

      This study reports several interesting patterns with respect to sex differences in gene expression across organs of four mice taxa. An alternative presentation of the data would yield a clearer and more convincing case that the patterns the authors claim are legitimate.

      I recommend that the authors clarify what qualifies as "sex-bias".

      This is defined by the statistical criteria that we have applied, following the general standard of papers on this topic.

      Recommendations for the authors:

      Reviewer #1 (Recommendations For The Authors):

      (1) "However, already Darwin has pointed out that the phenotypes of the sexes should evolve fast". I think the authors mean that Darwin was quick to point out that sex-specific phenotypes evolve quickly".

      We have modified this text part.

      (2) Non-gonadal is more often referred to as somatic. I would encourage the authors to use this more common term for accessibility.

      We have adopted this term

      (3) Figure 5 is interesting, however, it is difficult to know whether the decreased bimodality in humans compared to mice is biological or technical due to the differences in the underlying data. For example, the mouse samples tightly controlled age and environmental conditions within each species. It is not possible to do that with human samples, and there are very good reasons to think that these factors will affect variance in both sexes.

      Yes, this is certainly true and we know this also from other comparative data between mice and humans. Still, this is human reality vs mouse artificialness. We pick this now up in the discussion.

      (4) Line 273. The large numbers of cells needed for single-cell analysis require that most studies pool multiple samples, however these pools are helpful in themselves. This approach was used by https://doi.org/10.1093/evlett/qrad013 to quantify the degree of sex-bias within cell types across multiple tissues and to compare how bulk and single-cell sex-bias measures compare. Sex-bias in some somatic cell types was very high, even when bulk sex-bias in those tissues was not. This suggests that the bulk data the authors use in this study may in fact obscure the pattern of sex-bias.

      Yes, we agree, and this is exactly how we did the analysis and interpretation, based on the cited paper.

      (5)- Line 379 "Total RNAs were" should be "Total RNA was"

      Corrected

      References cited in this review and which should be included in the manuscript :

      Sam L Sharpe, Andrew P Anderson, Idelle Cooper, Timothy Y James, Alexandra E Kralick, Hans Lindahl, Sara E Lipshutz, J F McLaughlin, Banu Subramaniam, Alicia Roth Weigel, A Kelsey Lewis, Sex and Biology: Broader Impacts Beyond the Binary, Integrative, and Comparative Biology, Volume 63, Issue 4, October 2023, Pages 960-967.

      Included

      Masculinization of Gene Expression Is Associated with Exaggeration of Male Sexual Dimorphism Pointer MA, Harrison PW, Wright AE, Mank JE (2013) Masculinization of Gene Expression Is Associated with Exaggeration of Male Sexual Dimorphism. PLOS Genetics 9(8): e1003697.

      Included

      Erica V Todd, Hui Liu, Melissa S Lamm, Jodi T Thomas, Kim Rutherford, Kelly C Thompson, John R Godwin, Neil J Gemmell, Female Mimicry by Sneaker Males Has a Transcriptomic Signature in Both the Brain and the Gonad in a Sex-Changing Fish, Molecular Biology and Evolution, Volume 35, Issue 1, January 2018, Pages 225-241.

      Included

      Cardoso SD, Gonçalves D, Goesmann A, Canário AVM, Oliveira RF. Temporal variation in brain transcriptome is associated with the expression of female mimicry as a sequential male alternative reproductive tactic in fish. Mol Ecol. 2018; 27: 789-803.

      Included

      Dean, R., Wright, A.E., Marsh-Rollo, S.E., Nugent, B.M., Alonzo, S.H. and Mank, J.E. (2017), Sperm competition shapes gene expression and sequence evolution in the ocellated wrasse. Mol Ecol, 26: 505-518.

      Included

      Emilie C. Snell‐Rood, Amy Cash, Mira V. Han, Teiya Kijimoto, Justen Andrews, Armin P. Moczek, DEVELOPMENTAL DECOUPLING OF ALTERNATIVE PHENOTYPES: INSIGHTS FROM THE TRANSCRIPTOMES OF HORN‐POLYPHENIC BEETLES, Evolution, Volume 65, Issue 1, 1 January 2011.

      Not included, since its technical approach is not really comparable

      Harrison PW, Wright AE, Zimmer F, Dean R, Montgomery SH, Pointer MA, Mank JE (2015) Sexual selection drives evolution and rapid turnover of male gene expression. Proceedings of the National Academy of Sciences, USA 112: 4393-4398.

      Included

      Mathias Scharmann, Anthony G Rebelo, John R Pannell (2021) High rates of evolution preceded shifts to sex-biased gene expression in Leucadendron, the most sexually dimorphic angiosperms eLife 10:e67485.

      Included

      Sexually Antagonistic Selection, Sexual Dimorphism, and the Resolution of Intralocus Sexual Conflict. Robert M. Cox and Ryan Calsbeek , The American Naturalist 2009 173:2, 176-187.

      Included

      Ingleby FC, Flis I, Morrow EH. Sex-biased gene expression and sexual conflict throughout development. Cold Spring Harb Perspect Biol. 2014 Nov 6;7(1):a017632.

      Included

      Oliver JC, Monteiro A 2011. On the origins of sexual dimorphism in butterflies. Proc Biol Sci 278: 1981-1988.

      Included

      Iulia Darolti, Judith E Mank, Sex-biased gene expression at single-cell resolution: cause and consequence of sexual dimorphism, Evolution Letters, Volume 7, Issue 3, June 2023, Pages 148-156.

      Included

      Reviewer #2 (Recommendations For The Authors):

      I am concerned the smoothed density plots in Figure 4 may be providing a misleading sense of the distributions since each distribution is inferred from only 9 values. A boxplot might better represent the data to the reader.

      Boxplots with 9 values are much more difficult to interpret for a reader, this is the very reason why one tends to smoothen them. In this way, they also become similar to the standard plots that are used for showing morphological variation between the sexes. Note that the original data are availble for the individual values, if these are of special interest in some cases. In addition, our new “mosaic” analysis (Figure 6) provides another presentation for readers.

      Line 235: "the overall numbers are lower" I assume this is the number of genes included in the analyses, but this should be explicitly stated.

      Clarified in the text

      The analysis of gene expression from different brain regions in control individuals from the Alzheimer's study (line 273) suffers from low power and it is not clear to me how much taking samples from different brain regions eliminates the issue of different cell types within a sample (the stated motivation for this analysis). While I support publishing negative results, this section does not feel like it adds much to the manuscript and could be cut in my opinion.

      This is actually a study on single cell types, differentiating each of them. We are sorry that the text was apparently unclear about this. Given that there are studies that show the importance of looking at single cell data, we still think that is a suitable analysis. We have updated the text to make it clearer.

      It might be useful to separate out X-linked genes from autosomal genes to see if they show consistent patterns with regard to sex-bias.

      We have added this information in suppl. Table S2 and include some description in the text.

      Reviewer #3 (Recommendations For The Authors):

      Comments follow the order of the Results section:

      (1) The latter half of this line in the Methods is too vague to be helpful: "We have explored a range of cutoffs and found that a sex-bias ratio of 1.25-fold difference of MEDIAN expression values combined with a Wilcoxon rank sum test and Benjamini-Hochberg FDR correction (using FDR <0.1 as cutoff) (Benjamini & Hochberg, 1995) yields the best compromise between sensitivity and specificity". What precisely is meant by "the best compromise between sensitivity and specificity"?

      We explain now that this was based on pre-tests with comparing randomized with actual data. However, we agree that this is in the end a subjective decision, but there is no single standard used in the literature, especially when somatic organs are included. We consider our criteria as rather stringent.

      (2) The 1.25 number for sex bias is, ultimately, an arbitrary cut-off. It is common in this literature to choose some arbitrary level and, in this sense, the authors are following common practice. The choice of 1.25 should be stated in the main text as it is a lower (but not reasonable) value than has been used in many other papers.

      It is not only the cutoff, but also the Wilcoxon test and FDR correction that defines the threshold. See also comment above.

      (3) In truth, dimorphism is continuous rather than discrete (i.e, greater or less than 1.25 fold different). Thus, where possible it would be useful to present results in a fashion that allows readers to see the continuous range of ratios rather than having to worry about whether the patterns are due to the rather arbitrary choices of how genes were binned into sex-bias categories.

      It is necessary to work with cutoffs in such cases - and this is the usual practice for any such paper. But we provide now in Figure 1 Figure supplement 1 plots with the female/male ratio distributions.

      a) Number of genes that are female- / male-biased. I would like to be able to see a version of Figure 1 showing the full distribution of TPM ratios rather than bar graphs of the numbers of (arbitrarily defined) female- and male-biased genes. This will be, of course, a larger figure (a full distribution rather than 2 bars for each species for each organ) and so could be relegated to Supplementary Material (assuming the message of that figure is the same as the current Figure 1).

      This is a very unusual request, given that no other paper has done this either. It would indeed result in a non-managable figure size, or many separate figures that would be difficult to scrutinize. Note that there would be one plot of two (female and male) TPM distributions for each sex-biased gene in each organ and each taxon, leading to hundreds of thousands of plots. We think that by providing the general distributions as plots (see above), and the original data as supplements is sufficient.

      b) Turnover of genes with sex bias. This important issue is addressed in Figure 2. First, it is not precisely clear what "percentages of sums of shared genes for any pairwise comparison" in Figure 2 legend means and no further detail is given in the Methods; this must be made clearer or the info in Figure 2 is meaningless. Regardless, this approach again relies heavily on the arbitrary criterion of defining sex-bias. Thus, I would like to see correlation plots of the log(TPM ratio) between taxa as done in the classic multispecies fly paper of Zhang et al. 2007. In Figure 2 it is quite clear that male-biased genes evolve with respect to sex bias more rapidly than female-biased genes.

      We have provided a better explanation of this analysis. Note that the Zhang et al. 2007 paper was not focussing on somatic expression and covers a much broader evolutionary spectrum. Hence, the results are not comparable. Also, we doubt that it would be so helpful to generate a huge figure with all these plots.

      (4) Is there a simpler explanation for the results in the "Variance patterns" section? The total variance for any variable can be decomposed into the variance within and among "groups". If we use "sex" as the group, then there are genes - labelled sex-biased genes - that were identified as such, in essence, because they have high among-group variance. Given that we then know a priori at the start of this section of sex-biased genes have high among-group variance, is it at all surprising that they have higher total variance than the unbiased genes (which we know a priori have low among-group variance)? Perhaps I misunderstood the point of this section. Maybe it would be more meaningful to examine the WITHIN-SEX variance (averaged across the two sexes) instead.

      We did calculate IQR/median (“normalized variance”) with the nine mice for each gene and each sex in each organ, hence sex is not a variance factor in this calculation. The algorithm steps are outlined in suppl. Table S17. We have now also added a variance calculation for reciprocal gene sets and added an extended discussion of these results.

      (5) Analysis of alpha for sex-biased genes. This was the most interesting part of this manuscript to me.

      (a) More information about what SNVs were used is required.

      i. Were only sites where SPR was fixed used? (If not, how was polarization done?)

      ii. Were sites only considered diverged if they were fixed for different bases in DOM and MUS? (If not, what was the criteria?)

      iii. Using, say, DOM as the focal species, a site must be polymorphic in DOM. But did its status (polymorphic/fixed) in MUS matter?

      We have added a more detailed description on this in the Methods section. For the direct answers of the three questions: (i) yes; (ii) yes; (iii) no, considering that DOM and MUS are two subspecies of Mus musculus separating recently, a variant might occur before separating and there might be gene flow between them.

      (b) A particularly interesting part of the analysis is the investigation of alpha for genes that are NOT sex-biased in one taxa but are sex-biased in the other. At the moment (as I understand it), alpha is only calculated for these genes in the taxa where they are NOT sex-biased (and this alpha value can be compared to the alpha of sex-biased genes and of unbiased genes in that taxa). I would like to see both sets of genes (set 1: those sex-biased in MUS and not in DOM; set 2: those sex-biased DOM and not in MUS) analyzed in each of the 2 species, with results presented in a 2x2 table.

      By definition of these categories, these genes are sex-biased in the respective other taxon, hence the values are already in the table. They are named as “reciprocal”.

      (c) No confidence intervals are given for the alpha values, despite the legend of Figure 3 referring to them.

      These were accidentally omitted - we now included the full table in suppl. Table S6; Figure 3 was modified to show violin plots of the bootstrap distributions

      The author's creation and use of a "sex-bias index" (SBI). My greatest skepticism of this manuscript is with respect to the value of their manufactured index, SBI. Of course, it is possible to create such an index but does this literature really need this index or does this just add to the "clutter" in the literature for this field? Is it helping to illuminate important patterns? This index is presumably some attempt to quantify how "male-like" or "female-like" overall expression is for a given individual (for a given organ). It is calculated as SBI = (MEDIAN of all female-biased tpm) - (MEDIAN of all male-biased tpm).

      (6) A main result that comes from this is that the sexes tend to overlap for these values for most nongonad tissues but are clearly distinct for gonadal tissues. I do not think this result would come as a surprise to almost anyone and I'm far from convinced that this metric is a good way to quantify that point. Let's consider testes vs. ovaries. Compared to non-gonadal tissues, I am reasonably certain that not only are there many more genes that are classified as "sex-biased" in gonads but also the magnitude of sex-bias among these genes is typically much greater than it is for the so-called sex-biased genes in nongonadal tissue (density plots requested in #3a would make this clear). In other words, males and females are, on average, very different with respect to expression in gonads so even allowing for variation within each sex will still result in a clear separation of all individuals of the two sexes. In contrast, males and females are, on average, much less different in, say, heart so when we consider the variation within each sex, there is overlap. One could imagine a variety of different metrics which could be used to make this point. The merits of "SBI" are unclear. It is a novel metric and its properties are poorly understood. (A simple alternative would be looking at individual scores along the axis separating mean/median males and females; almost certainly, for gonads, this would be very similar to PC scores for PC1.)

      As throughout the text, we use gonadal comparisons only as general reference, not as the main result. The main result that we are stressing is the fast turnover of these patterns, including from binary to overlapping for kidney and liver in mouse. We consider this as a new finding. If it comes "not to a surprise to anyone", isn´t it great that one does not have to guess anymore but has finally real data on this?

      We have now also added a mosaic analysis to show that the SBI can be used as summary measure in different presentations.

      The use of a single PC axis is no good alternative, since it throws away the information from the other axis.

      We have now included an explicit discussion on the usefulness of the SBI.

      (7) For simplicity, let's assume all males are identical and all females are identical. Let's imagine that heart and kidney have the exact same set of sex-biased genes. There are 20 female-biased genes; they all happen to be identical in expression level (within tissue) and look like this:

      Female TPM Male TPM TPM ratio (F:M)

      Heart 4 2 2

      Kidney 40 20 2

      And there are 20 male-biased genes that look like this:

      Female TPM Male TPM TPM ratio (F:M)

      Heart 1 3 1/3

      Kidney 10 30 1/3

      Most people would describe these two tissues as equally sex-biased.

      However, the SBIs would be:

      Female SBI Male SBI Sex difference (F - M)

      Heart 4-1 = 3 2 - 3 = -1 4

      Kidney 40-10 =30 20-30 = -10 40

      Is it a desirable property that by this metric these two tissues have wildly different SBI values for each sex as well as for the difference between sexes? (At the very least, shouldn't you make readers aware of these strange properties of SBI so they can decide how much value they put into them?)

      Actually, in this example the simple ratio between the expression levels has a strange property, since it does not reflect a much higher expression of the relevant genes in the kidney. The SBI is actually more suitable for making such cases clear. Of course, this is under the assumption that expression level has a meaning for the phenotype, but this is the general assumption for all RNA-Seq experiment comparisons.

      (8) With respect to Figure 4, why do females often have mean SBI values close to zero or even negative (e.g., kidney, mammary glands)? Is this simply because the female-biased genes tend to have lower TPM than the male-biased genes? It seems that the value zero for this metric is really not very biologically meaningful because this metric is a difference of two things that are not necessarily expected to be equal.

      This is the extra information about the expression levels that is gained via the SBI values (see comment above). However, we noticed that people can get confused about this. We have now added a re-scaling step to focus completely on the variance information in these plots.

      (9) Interpreting variances. A substantial fraction of the latter half of the manuscript focuses on interpreting variances among individual samples. This is problematic because there is no replication within individuals (i.e.., "repeatability"), thus it is impossible to infer the extent of observed variance among individuals of a given group (e.g., among females) is due to true biological differences among individuals or is simply due to noise (i.e., "measurement error" in the broad sense). Is the larger variance for mammary glands than liver or gonads just due to measurement error? What is the evidence?

      This point was of course a major issue during the times where microarrays were used for transcriptome studies. However, the first systematic RNA-Seq studies showed already that the technical replicability is so high, that technical replicates are not required. In fact, practically all RNA-Seq studies are done without technical replicates for this reason.

      (10) Because I have little confidence in the SBI metric (#7-8) and in interpreting within sex variances (#9), I found little value in the human results and how SBI distributions (and degree of overlap between sexes) compare between humans and mice.

      We disagree - the current published status is that there are thousands of sex-biased gene in humans and this has implications for gender-specific medicine (Oliva et al. 2020). Our results show a much more nuanced picture in this respect.

      (11) I found even less value in the single-cell data. It too suffers from the issues above. Further, as the authors more or less state, the data are too limited to say much of value here. It is impossible to tell to what extent the results are simply due to data limitations.

      We have pointed out that it is still valuable to have them. They are good enough to exclude the possibility that only a small set of cells drives the overall pattern across an organ. We have further clarified this in the text.

      (12) The code for data analysis should be posted on GitHub or some other repository.

      The code for the sex-biased gene detection and analysis has been posted on GitHub (see Code availability in the manuscript).

    1. Author response:

      The following is the authors’ response to the original reviews

      Public reviews:

      Reviewer #1:

      Weaknesses:

      As this paper only uses anatomical analyses, no functional interpretations of cell function are tested.

      The aim of this paper was to describe the ultrastructural organization of compound eyes in the extremely small wasp Megaphragma viggianii. The authors successfully achieved this aim and provided an incredibly detailed description of all cell types with respect to their location, volume, and dimensions. As this is the first of its kind, the results cannot easily be compared with previous work. The findings are likely to be an important reference for future work that uses similar techniques to reconstruct the eyes of other insect species. The FIB-SEM method used is being used increasingly often in structural studies of insect sensory organs and brains and this work demonstrates the utility of this method.

      We thank you for your high assessment of our work. Unfortunately, it is hard to test our functional interpretations and check them with electrophysiological methods due to the extremely small size of the animal. Studies on three-dimensional ultrastructural datasets obtained using vEM have just started to appear, and we hope that a lot of data will become available for comparison in the nearest future.

      Reviewer #2:

      Thank you for your work and for your high assessment of our manuscript.

      Reviewer #3:

      Weaknesses:

      The claim that the large dorsal part of the eye is the dorsal rim area (DRA), supported by anatomical data on rhabdomere geometry and connectomics in authors' earlier work, would eventually greatly benefit from additional evidence, obtained by immunocytochemical staining, that could also reveal a putative substrate for colour vision. The cell nuclei that are located in the optical path in the DRA crystalline cone have only a putative optical function, which may be either similar to pore canals in hymenopteran DRA cornea (scattering) or to photoreceptor nuclei in camera-type eyes (focussing), both explanations being mutually exclusive.

      We thank the Reviewer for high assessment of our study and for detailed analysis of our manuscript. Your comments and recommendations are very valued and helped us to improve the text. We understand that immunocytochemical methods could improve our findings and supply additional evidence, but there is no technical possibility for this in present. Megaphragma is a very complicated model organism for such methods. We are currently working on the optimization of the protocol for staining, which is needed because of the high level of autoluminescence and because of insufficient penetration of dyes into the samples.

      Recommendations for the authors:

      Reviewer #1:

      I do not have any major concerns about the content of the paper.

      There are some minor spelling and grammatical errors throughout the text but these can be identified most readily using a spelling/grammar check.

      We have revised the text, checked the spelling, and fixed the grammatical errors throughout the text.

      I suggest consistency when referring to the capitalization of the term 'non-DRA' as it is sometimes 'Non-DRA' in the text.

      We have fixed the term “non-DRA” throughout the text. Thank you.

      Also, check carefully the spelling of headings in the tables as there are a few mistakes in Table 1 and 5 in particular.

      The grammar errors have been fixed.

      Figure 7 legend: an explanation of the abbreviation RPC should be added.

      We have done so.

      Reviewer #2:

      (1) The paper presents the data in great detail, however, since this is the first time the technique has been applied to get whole insect eyes, even if on a small insect, it would be worth outlining in the methods section what innovations in the staining/ scanning or sample preparation allowed these improvements and a roadmap for extending this method to larger insects if possible.

      The whole method, including sample preparation, staining, and scanning, was described in our previous paper (Polilov et al., 2021), where it was presented in every detail. Due to the complicated methodology we suppose that it is not necessary to include all the stages of the technique in the present paper, and thus described it more briefly.

      (2) The optical modelling needs a statement in the discussion providing a disclaimer on parameters like sensitivity, anatomical measurements can provide limits and some measure, but the inherent optics are also key and it is worth qualifying these as only estimates and measurements that give a sense of the variation in morphology, only coupled with optical and potentially neural measurements could one confirm the true sensitivity and acceptance angle.

      In the absence of experimental data or precise computational models of Megaphragma vision, we try to discuss rather carefully the functions of structures based on their morphology, ultrastructure, first-order visual connectome, and analogies with other species. This is reflected in the methods and those sections of our paper that contain functional interpretations.

      Reviewer #3

      (1) The finding that the CNS neurons are enucleated, while the compound eye contains cell nuclei, deserves another word. I would confidentially say that the optical demands of a miniaturized compound eye (the minimal size of the optics due to diffraction, the rhabdomere size, and the minimal thickness of optically insulating granules) are such that further cellular miniaturization is not possible, and the minimal sizes even render the cells that build the eye sufficiently large to accommodate cell nuclei. This is in my opinion a parsimonious explanation, yet speculative and I leave it up to you to embrace it or not.

      We agree with the Reviewer and understand the limiting factors and the optical demands of a miniaturized compound eye. According to our data, nuclei occupy a considerable volume in the eye (in the cells of compound eye there are more nuclei than in the whole brain), and on average the cell volume is larger than in Trichogramma, which is minute, but larger than Megaphragma. But as the Reviewer rightly assumed, it is speculative; therefore, we would like to avoid it.

      (2) Our current understanding of DRA optics and function is limited and I claim that your interpretation of the cell nuclei in the DRA dioptrical apparatuses is inappropriate. Please consider a few articles on hymenopteran DRA, starting with the one below and the citing literature:

      Meyer, E.P., Labhart, T. Pore canals in the cornea of a functionally specialized area of the honey bee's compound eye. Cell Tissue Res. 216, 491-501 (1981). https://doi.org/10.1007/BF00238646

      Honebyee DRA has a milky appearance under a stereomicroscope and can be discerned from the outside. This is due to pore canals in the cornea. I happen to be studying this exact structure and its function right now. I found that the result of those canals is not so much the extended receptor acceptance angles, but rather a minimized light gain. This is counterintuitive, but think of the following. The DRA photoreceptors must encode the limited range of polarization contrasts with a maximal working dynamic range (= voltage) of the photoreceptors, which results in a very steep stimulus-response curve.

      Physiologically such a curve is due to very high transduction gain and a high cell input resistance. In most of the retina, small contrasts are transcoded by LMC neurons, but DRA receptors are long visual fibres and must do the job themselves. The skylight intensity (especially antisolar, where the polarized pattern is maximal) varies little during the day. Hence, the DRA receptors work almost at a fixed intensity range. In order to prevent receptor saturation and keep steep contrast coding, the corneal lenses in DRA have a built-in diffusor ring, which diminishes the light influx. Unfortunately, I have yet to publish this and I may be wrong, of course. But if I look into your data, I see consistently smaller corneal lenses and crystalline cones in the DRA, plus the cell nuclei obstructing the incident light. I think this is similar to the optics of honeybee DRA.

      You do not support your claim that the nuclei additionally focus light by optical calculations, but cite literature on camera-type eyes, which is not OK.

      In any case, I think it is fair to limit the discussion by saying that the nuclei may have an optical role. Further evidence from hymenopteran and vertebrate literature is controversial. “so that the nuclei act as extra collecting lenses, as was reported for rod cells of nocturnal vertebrates (Solovei et al., 2009; Błaszczak et al., 2014)” - please consider omitting this.

      We thank the Reviewer for this piece of advice. And we have rewritten the text, to omit the comparison with vertebrates, but left the citation as an illustration of the fact that nuclei could perform the optical role.

      “Since the nuclei in DRA and non-DRA ommatidia are arranged differently in cone cells, we suggest that the nuclei of the cone cells of DRA ommatidia in M. viggianii perform some optical role, facilitating the specialization of this group of ommatidia. The optical function for nuclei was described for rod cells of nocturnal vertebrates, where chromatin inside the cell nucleus has a direct effect on light propagation (Solovei et al., 2009; Błaszczak et al., 2014; Feodorova et al., 2020).”

      (3) Please consider comparing the structure and function of ectopic receptors with the eyelet in Drosophila (i.e. https://doi.org/10.1523/JNEUROSCI.22-21-09255.2002 )

      We thank the Reviewer for this advice and have included the comparison fragment into the text:

      “The position of ePR, their morphology and synaptic targets look similar to the eyelet (extraretinal photoreceptor cluster) discovered in Drosophila (Helfrich-Förster et al., 2002). Eyelets are remnants of the larval photoreceptors, Bolwig’s organs in Drosophila (Hofbauer, Buchner, 1989). Unlike Drosophila, Trichogrammatidae are egg parasitoids and their central nervous system differentiation is shifted to the late larva and even early pupa (Makarova et al., 2022). According to the available data on the embryonic development of Trichogrammatidae, no photoreceptors cells were found during the larval stages (Ivanova-Kazas, 1954, 1961).”

      According to this, the analogy question remains open.

      (4) Minor remarks:

      “but also to trace the pathways that connect the analyzer with the brain.” - I find the word analyzer a bit stretched here; sure, the DRA is polarization analyzer, but if the main retina was monochromatic, it would only be a detector, not an analyzer.

      The sentence was changed according to the Reviewer’s advice.

      Table I: thikness -> thickness, wigth -> width

      We have fixed these misprints.

      “The cross-section of Non-DRA ommatidia has a strongly spherical shape” - perhaps circular, not spherical. And not necessary to say “strongly”

      The spelling was changed according to the Reviewer’s advice.

      “which can be rarely visualized in the cell's projections not far from the basement membrane.” - I'd suggest saying “which are nearly absent in retinula axons”

      The spelling was changed according to the Reviewer’s advice.

      “The pigment granules of the retinula cells have an elongated nearly oval shape” - please consider replacing 'elongated nearly oval' with 'prolate' (try googling for “prolate” or “oblate spheroids”; the adjective describes precisely what you wanted to say)

      We thank the Reviewer for this piece of advice but prefer to leave our original phrasing, because it is more readily understandable.

      “The results of our morphological analysis of all ommatidia in Megaphragma are consistent with the light-polarization related features in Hymenoptera and other insects” - please add citations, see my comment on the DRA above.

      We have added the citations according to the Reviewer’s advice.

      “The group of short PRs (R1-R6)” - please consider renaming into “short visual fibre photoreceptors” (as opposed to “long visual fibre PRs”; hence SVFs and LVFs). This naming is quite common.

      The naming was changed according to the Reviewer’s advice.

      “The total rhabdom shortening in M. viggianii ommatidia probably favors polarization and absolute sensitivity,” - please see comments on DRA. Wide rhabdom means also a wider acceptance angle.

      Shortening of DRA rhabdoms does not result in their widening compared to other rhabdoms, so it is difficult to say how this may be related to sensitivity. The comments on DRA given earlier have been taken into account.

      “Ommatidia located across the diagonal area of the eye are more sensitive to light” - I don't understand what is diagonal area.

      We have deleted the sentence.

      “Estimated optical sensitivity of the eyes very close to those reported for diurnal hymenopterans with apposition eyes (Greiner et al., 2004; Gutiérrez et al., 2024) and possess around 0.19 {plus minus} 0.04 μm2 sr. M. viggianii have reasonably huge values of acceptance angle Δρ, and thus should result in a low spatial resolution” - please correct English here. “eyes IS very close”, “should result in a low”

      The grammatical errors were fixed.

      Table 6 legend: “SPC - secondary pigment cells.” -> “SPC – secondary pigment cells.”

      Citation “(Makarova et al., 2025).” - probably 2015

      The typos were fixed.

      Methods, FIB-SEM: I can't understand the sentence “The volumetric data of lenses and cones, some linear measurements (lens thickness, cone length, cone width, curvature radius) and to visualize the complete 3D-model of eye we use (measure or reconstruct) the elements from another eye (left).”

      The sentence is a continuation of the previous one. We have rewritten it as follows to clarify the meaning and move it to the 3D reconstruction section:

      “The right eye, on which the reconstruction was performed, has several damaged regions from milling (see Appendix 1С), which hinder the complete reconstructions of lenses and cones on a few ommatidia. According to this, for the volumetric data on lenses and cones, some linear measurements (lens thickness, cone length, cone width, curvature radius), we use (measure or reconstruct) the corresponding elements from the other (left) eye.”

      “The cells of single interfacet bristles were not reconstructed, because of damaging on right eye and worst quality of section on the left.” - please change to “The cells of the single interfacet bristle were not reconstructed, because of damage to the right eye and inferior quality of the sections of the left eye.”

      The text has been changed as follows:

      “The cells of single interfacet bristles were not reconstructed, because of the damage present in the right eye and because of the generally lower quality of this region on the left eye.”

      “Morphometry. Each ommatidia was” -> “Morphometry. Each ommatidium was”

      The grammatical error has been fixed.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Major comments

      Unfortunately the major conclusions of the article are not well supported by the provided data. Including:

      1. That interhemispheric remodelling occurs in non-mammalian amniotes. It would not surprise me that this may be the case, however, the major evidence for this is a series of horizontal insets that do not evidence this point well. There are broad morphological changes during development that can change the proportions and regionalisation of tissue, and therefore the IHF becoming apparently smaller as development progresses (qualitatively, in single sectioning planes, and without clear n numbers) may easily be explained by sutble differences in sectioning planes, or, for example, more caudal territories of the brain expanding at faster rates than the rostral territories. Quantification of the ratio between the IHF and total midline length across ages and between species may go some way to helping to clarify the degree of potential midline remodelling. Very high quality live imaging of the process would be the definitive way to evidence the claim, although I appreciate this is highly technically difficult and may not be possible. A key opportunity seems to be missed in the Satb2 knockout geckoes, where midline remodelling is purported to not occur. This is shown only qualitatively in a single plane of sectioning and again is not convincing. If the IHF length in these animals was quantified to be longer than wildtype at a comparable age, this would help to evidence the claim that remodelling occurs in these species.

      Our responses

      We take seriously the critique that the series of horizontal section images in the figures do not sufficiently substantiate our claim that interhemispheric remodeling occurs in non-mammalian amniotes. To address this, we plan to create a simplified atlas composed of adjacent sections of various wild-type amniotes as well as Satb2-knockout geckos.

      Additionally, in response to the suggestion that the IHF (interhemispheric fissure) should be quantified relative to the total midline length across developmental stages and species, we note that Figure 1 already presents such an analysis. Specifically, we have quantified changes in the midline collagen content using Principal Component Analysis (PCA) in Satb2 Crispants in geckos(FigureS4). However, if necessary, we also plan to perform a similar analysis on wild-type soft-shelled turtles at developmental stages before and after interhemispheric remodeling.

      That similar cell types contribute to remodelling in non-mammalian amniotes as mice/eutherian mammals. The microphotographs presented are not of very high quality, and it is often difficult to be convinced that the data is showing the strong claims made in the paper. For instance the "MZG-like cells" may in fact be astrocytes or another cell type as it is hard to visualise morphology, and the "intercalation of GFP-positive radial glial fibres" is very unclear from the photos. The colocalization of MMPsense with laminin positive cells is very hard to appreciate from the figure, and again not quantified. Similarly, there is a claim that there was degeneration of laminin-positive leptomeninges during astroglial intercalation, which is an active process that is difficult to infer from a single microphotograph. From the data, I can appreciate that some of the similar broad categories of cell types that exist at the mouse midline (glia, radial glia) are also present in non-mammalian amniote midlines, but it is difficult to be convinced of much more than this from the data presented.

      Our responses

      We take seriously the critique that the degeneration of Laminin-positive leptomeninges close to astroglial components is not accepted and that the evidence for glial fiber intercalation is insufficient.

      Verifying the degeneration of Laminin-positive leptomeninges is highly challenging. However, we have recently developed a method to visualize collagen in the pia mater using μCT and a CHP probe (3Helix Inc.). Preliminary experiments have already revealed pan-collagen deposition in the midline of the telencephalon (with lower amounts in the fusion region) and degeneration of the collagen composing the pia mater. We plan to incorporate these findings into the revised manuscript.


      That the gecko RPC and CPC connect distinct parts of the brain (rostral and caudal). These tracer injections lacked visualisation of the deposition site to confirm specificity, as well as appropriate quantification. Importantly, the absence of axons in the CPC following the rostral dye deposition (and vice versa) was not shown, which is essential to make the claim that these commissures carry axons from specific parts of the brain. The alternative hypothesis is that all axons are intermixed and traverse both commissures, independent of brain area of origin, which is not at all tested or disproved by the data presented.

      Our responses

      Thank you for the valuable critique suggestion. To support our claim that the pallial commissure in geckos consists of axons derived from specific brain regions, we should carefully eliminate the possibility that all axons are intermixed and cross both RPC and CPC regardless of brain region.

      To address this, we are planning additional experiments and will include a schematic diagram clearly indicating the labeling sites.


      Overall, the major conclusions of the study are not well supported by the data. A major effort to quantify phenomena and/or dramatically soften conclusions would be needed in order to make the conclusions well supported.

      Our responses

      We will thoroughly reconsider our conclusions and make significant efforts to revise the manuscript.

      Minor comments

      1. The n numbers are not always clearly reported

      Our responses

      We plan to address the clarification of quantitative data and the exact number of replicates.

      At times important points reference reviews or articles that do not support the statements as well as the most important primary articles might.

      Our responses

      We plan to carefully review the manuscript and, in addition to citing the most important primary papers, revise any descriptions that are not sufficiently supported by the cited reviews or articles, as per the suggestions.

      Figures showing the entire section that insets were taken from would help to convince that sectioning planes were equivalent, and also show the deposition site of neurovue experiments.

      Our response

      We will add a schematic showing the locations labeled in NeuroVue and additional experiments as a similar point made in Major comment 3.

      The fibre direction of GFAP+ fibres in figure 6 is confusing - It seems from the labelling on the figures as if red is used for the WT condition in mouse, but for the Satb2del condition in Gecko? If this is the case, then it would appear that the fibres are more specifically oriented in the del condition in mice, but in the WT condition of geckoes? There are several instances of this where clearer description and labelling would help the reader to interpret the results.

      Our response

      We plan to add clarification and indication of the direction of GFAP+ fibers in Figure 6 to make it easier to understand.

      Reviewer #1 (Significance (Required)):

      This study attempts to address a highly significant, novel and important question, that, if well achieved, would be publishable at a high degree of interest and impact to the basic research fields of brain development and evolution. Unfortunately the major conclusions made by the study are stronger than the data provided is able to evidence, and I remain unconvinced by many of them.

      Our responses

      We take seriously the suggestion that the major claims made by this study are excessive and so strong that they cannot be proven with the data provided. We will revise the manuscript as necessary.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary

      The authors provide a comparative analysis of interhemispheric (IHF) remodeling and its potential role in the generation of commissural axons. Based on histological material from mice, chickens, turtles, and geckos, the IHF remodeling of the midline is divided in two events: caudal and rostral. It is suggested that the rostral event is a preliminary step to the crossing of commissural axons, as it is characteristic of eutherian mammals with a corpus callosum (CC). However, the authors describe similar histologic features in other amniotes during development, particularly reptiles. This is in contrast with the case of the chick, which does not show signs of IHF remodeling nor a rostral pallial commissure. Additionally, deficient transgenic mice and geckos illustrate a potential role of Satb2 in rostral IHF remodeling and subsequent commissural formation. Whereas the topic and the conclusions of the analysis are interesting and provide new knowledge to the evo-devo field, several issues should be addressed prior to publication, such as data precision and presentation to support the main statements in the manuscript.

      Major comments:

      ____-A central point of this article is the splitting of the IHF into rostral and caudal events. The authors suggest that each one can be regulated differentially, and they attribute the rostral remodeling as a step prior to corpus callosum (CC) formation, in contrast to the caudal remodeling. In my opinion, these two events are not sufficiently characterized either in the figures or the manuscript. It is necessary to better describe these two processes that the authors mention. For instance, the authors could add or re-organize information in Figures 1-3 to include wide-field images showing the whole septum from rostral to caudal, and representative dorsoventral sections at important stages (with insets pointing at specific features). Otherwise, a table summarizing the rostral and caudal events would also be helpful to the reader.

      Our responses

      We take the suggestion seriously that the distinction between rostral and caudal remodeling may not be clear, especially regarding rostral remodeling, which is prior to the stage of corpus callosum (CC) formation, in contrast to caudal remodeling. Specifically, we plan to add or restructure the information from Figures 1 to 3 by including wide-field images that show the entire septum from rostral to caudal, as well as representative sagittal sections along the dorsal-ventral axis at key stages, with insets highlighting specific features. These will be added to the Supplementary data. Additionally, a table summarizing the events in both the rostral and caudal regions will also be created and included in the revised manuscript.

      When the authors refer to the reptilian rostral pallial commissure (RPC) and caudal pallial commissure (CPC), are these the same structures as the pallial commissure and anterior commissure described by Lanuza and Halpern (1997), Butler and Hodos (2005) and Puelles et al. (2019)? It is necessary to clarify the nomenclature, given that they are providing data from several species. Also, structures with the same names among species may not be truly homologous. A simple atlas with some horizontal and transverse planes highlighting anatomical landmarks and important structures (commissural tracts in this case) of the non-mammalian species would be extremely useful for the reader.

      Our responses

      As suggested by the reviewer, we are considering to provide a more detailed definition of the nomenclature of the pallial commissure in the revised manuscript, specifically in the introduction. Additionally, as mentioned earlier, we plan to create a simplified atlas with several horizontal and transverse sections, emphasizing anatomical landmarks and important structures (in this case, the commissural pathways) in species other than mammals.

      ____I wonder if the authors tested Fgf8 as marker on any of their sauropsidian tissue samples, as this gene has a known role in murine MZG development, which is required for IHF remodeling (Gobius et al. 2016, already cited in the manuscript). It would be beneficial to test this marker for the study, and if positive, it would open the possibility of designing loss-of-function experiments in avian or reptilian development models to identify mechanisms common to eutherians and support the statements of this work.

      Our responses

      We plan to verify the gene expression necessary for mouse MZG development and IHF remodeling, including Fgf8, DCC, and MMP2, through immunohistochemical staining as suggested.

      It would be really interesting to provide a more elaborate discussion on whether authors consider the sauropsidian IHF as a homologous process to eutherian IHF, and the reptilian RPC as an homologous of the CC.

      Our responses

      Since 3 out of the 4 reviewers consider IHF remodeling in sauropods to be homologous to that in placental mammals, we plan to further emphasize this claim in the revised manuscript. Additionally, we will expand on the discussion regarding whether the process of RPC formation in reptiles is considered homologous to that of the corpus callosum, and I will approach this from the context of character identity mechanisms claimed by Dr. Günter Wagner.

      Data and methods are presented in such a way that, in principle, they could be reproduced. Authors should indicate the number of animals/replicates of each species used in each experiment.


      Our responses

      As suggested, we plan to provide more detailed descriptions of the methods to ensure reproducibility. This will include adding the number of samples and trial repetitions for each animal species used in the experiments, including those for the additional experiments, in the revised manuscript.


      Minor comments:

      In the results section, paragraph 2, line 3: "We detected the accumulation of GFAP-positive cells and phosphorylated vimentin (Ser55) -positive mitotic radial glia in the IHF and telencephalic hinge in developing turtles, geckoes and chicks (Figure 2A)". Figure 2A shows sections from the four analyzed species labeled with radial glia markers at the end of the IHF remodeling. It would be beneficial to have analogous sections at several time points (perhaps before or after the process) to compare and show more clearly the accumulation of glial cells at that location.

      Our responses

      We have prepared serial sections before and after the developmental stages when interhemispheric remodeling occurs, in order to compare and more clearly show the accumulation of glial cells at their respective locations in mice, geckos, and soft-shelled turtles. I plan to add these results to Figure 2A in the revised manuscript.

      The article will improve its quality by adding more comparative information in the introduction about the analyzed sauropsidian structures (rostral pallial commissure and caudal pallial commissure), their relations with the pallial and anterior commissures, the structures/cells connected by them, and homologies previously proposed.

      Our responses

      We will add comparative information regarding the brain structures in sauropod, including the rostral and caudal pallial commissures and their relationship to the pallial commissure and anterior commissure, and the structures they connect, such as pyramidal cells, along with previously proposed homologies. This information will be included in the introduction and summarized in a table.


      In Figure 1 panels A-D, there is a lot of disparity in brain sizes and scales both between sections of the same species and between species. Placing the insets next to their source images is very necessary for clarity.


      Our responses

      As mentioned earlier, I will create a simplified atlas using adjacent sections and continuous μCT tomography images. Additionally, I will adjust the placement of the inset images in the revised manuscript to more visually accessible positions, improving their visibility.

      In the results section, paragraph 2, line 11: "In addition, it was suggested that astroglial intercalation occurs in conjunction with the aforementioned regression of the IHF from st.21 to st.26 in the developing turtle (Figure 2C)." In Figure 2C, all images are at different scales,

      which makes it very hard to properly compare between stages.

      Our responses

      By creating inset images based on the low-magnification images in the upper panel, we will enhance the visibility of GFAP intercalation. Additionally, we will improve the visibility in the revised manuscript by adding scale bars, referencing the simplified atlas in the figure legends, and standardizing the tissue specimen scale. we also plan to correct any typographical errors in the figures.

      In Figure 2D, the authors show the presence of MMP around the leptomeninges, suggesting MMP-mediated degradation. In the images, MMP labeling is revealed in dark blue, which is largely invisible against the black background. Colors should be used properly to allow visualization of this MMP labeling.

      Our responses

      In Figure 2D, we will reconsider the selection of pseudo-colors and use cyan to represent MMPsense.

      In Figure 4, it would really help if the authors provided wide-field images and DAPI counterstaining of the anterograde and retrograde tracings, to provide anatomical landmarks that help readers to identify the midline and understand the orientation of images.


      Our responses

      In addition to the previously mentioned schematic diagram of the gecko's pallial commissure and the additional experiments, we plan to include wide-field images along with forward and retrograde tracing using Hoechst counterstaining.

      In Figure 5B, I understand that the images in the red and blue squares correspond to brain areas in the squares in A. However, some confusion remains, especially with the image in B, which does not seem to be at the same angle as in the diagram representation. This makes it difficult to understand the results.

      Our responses

      According to the comment, we will revise the design of the Figure 5B to be more easily understand, and modify the scheme to match the angle of sections with actual figures.

      In Figure 6D, to better visualize defects in the RPC formation, the asterisk in the middle of the deficient structure needs to be replaced with a more lateral arrow pointing to the malformation.


      Our responses

      To better visualize the absence of RPC formation in Figure 6D, we will replace the asterisk in the center of the missing structure with a horizontal arrow indicating the malformation.

      In Figure S5, violin plots in panel C do not correspond with data in A and B. This needs correction or clarification.

      Our responses

      In Figure S5, the inconsistency between the violin plot in panel C and the data in panels A and B is a clear error, and we will correct this in the revised manuscript.

      In the article, a section appears solely to explain spatial transcriptomics results in a chick coronal section. The conclusion of this experiment is that three markers associated with midline remodeling are present in chick, suggesting that interhemispheric remodeling is conserved between mouse and chick. As these are complementary results and are not deeply analyzed in this manuscript, I think it would be better to summarize these findings in a dedicated paragraph and transfer some of the key images from Figure S2 to one of the main figures. Other problems with Figure S2: color contrast between clusters in the tSNE projection in B is very poor, should be enhanced; color intensity in FeaturePlots of panels D-F is too weak, and it seems that there is not really much expression at all in any cluster for any of these genes.

      Our responses

      In the revised manuscript, we will move some of the key images from Figure S2 to Main Figure 3 to demonstrate that the three markers related to midline remodeling are also present in chickens, showing that interhemispheric remodeling is conserved between mice and chickens. Additionally, we will enhance the contrast between clusters in the tSNE projection of the FeaturePlots in S2B and D-F by increasing the pseudo-color intensity or adjusting the intensity levels to emphasize the color contrast, and incorporate this updated figure into the revised manuscript.

      Reviewer #2 (Significance (Required)):

      The authors identify in the developing brain of sauropsids an event similar to IHF remodeling in eutherians, and suggest a causal relation between the rostral IHF remodeling and the formation of the pallial commissure in reptilian brains. This implies a potential homology between the pallial commissure and the corpus callosum of placental mammals. If this is the intention of the authors, this conclusion should be addressed explicitly and at length in the Discussion section. Whereas the results and conclusions described in the manuscript will be valuable in the field, the data presented in the manuscript needs quite some improvement, particularly for some of the images in the previously mentioned figures. Otherwise, the original data cannot be properly judged and may set reasonable doubt to readers.

      Advance: The findings described in this report are new to my knowledge. The description of the IHF remodeling event prior to corpus callosum development in mice has been published (Gobius et al. 2016, Cell Reports), but not in other mammalian branches or non-mammalian vertebrates. For this reason, the data in this report should be very convincing and better presented.

      Audience: This research will be interesting for a specialized and basic research audience, particularly for researchers in the evo-devo fields.

      My expertise: neuroanatomy, development, evolution, brain, cerebral cortex

      Our responses

      Thank you for your positive feedback on the novelty and high evaluation of identifying phenomena in reptilian development that resemble interhemispheric fissure (IHF) remodeling in placental mammals and demonstrating a causal relationship between rostral IHF remodeling and the formation of the reptilian pallial commissure. we will incorporate the concept of the potential homology between the corpus callosum in placental mammals and the brain commissures in reptiles into the revised manuscript, reflecting this in the context of character Identity mechanisms claimed by Dr. Günter Wagner. This will be clearly and thoroughly discussed in the discussion section. Additionally, we sincerely appreciate the constructive comment about the room for significant improvement, particularly in some of the figures, and we will address these points in the revised manuscript.


      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Conserved interhemispheric morphogenesis in amniotes preceded the evolution

      of the corpus callosum. Noji Kaneko et al., 2025

      The CC is formed exclusively in placental mammals. In other amniotes species, the communication of the two hemispheres is mediated by other structures such as the anterior commissure or the hippocampal commissure. The authors perform anatomical comparisons between species to conclude that interhemispheric fissure remodeling, a prior developmental step for CC formation, is highly conserved in non-mammalian amniotes, such as reptiles and birds. They suggest that might have contributed to the evolution of eutherian-specific CC formation. In an attempt to test their hypothesis, the authors investigate the role of Satb2 in interhemispheric fissure remodeling. They show IH fissure defects in both mice and geckoes. This is a nice manuscript that bridges a gap in the current understanding of CC formation. The study is mostly anatomical and directed at a specialized community.

      Our response

      We appreciate for positive comments on the manuscript.

      I suggest some changes that might contribute to improving the manuscript.

      Main

      1. Much of the most important conclusions are extracted from the anatomical observation of the dynamics of IHF closure and the emergence of the Hinge. It is very clear that the researchers are specialists in the field but for a broader audience, the images they provide are not always easy to interpret. It takes a lot of effort to visualize the anatomical data they use for their conclusions. As an example, perhaps the authors can find ways to explain how to identify the hinge specifically. It is very clear what the hinge is in the schemes (drawings)but forms one picture to the other at different developmental stages neither in the same animal species nor from different species. In Figure 1, it is difficult to see how the hinge in the mouse is similar (i.e. the same structure) to the hinge in the Gecko and chick. Moreover, in panels C , chick brain sections are shown at much greater magnification than the gecko, and thus is very difficult. In addition, in the manuscript text, the authors refer to sequential sectioning, but only one image for each stage is shown. They can show more images in supplementary Figures, otr they can just explain that they show the relevant images of the sectioning. As another example, in Fig2A, in the text, the authors explain that they detect the same specific glial components, but the images show very different co-localizations and distributions. In Figures 1 and 3, there are lines indicating Dorsal to ventral. This refers to the sectioning but in reality, what the sections are illustrating is the anterior-to-posterior differences in the IHF. maybe they can clarify it, because at quick sight it can be confusing.

      Our responses

      We sincerely appreciate the feedback regarding the interpretation of images that show the dynamics of interhemispheric remodeling and the emergence of the hinge, which is central to the most important conclusions of this study, as it may not always be easy to interpret. In the revised manuscript, we plan to address this by making the following revisions. For example, to clarify how the hinge corresponds across different species, we will create a simplified atlas to explain that the sections from the main figure are at the same level within the continuous slices.

      The authors have to revise the manuscript text to be more precise. For example, In the result section quote "To address whether the interhemispheric remodeling in non-mammalian amniotes is dependent on midline glial activities, we next examined the expression of several glial markers in the reptilian and avian midline regions". the anatomical comparison does not address the role of glial.

      Our responses

      Thank you for your feedback. I will correct the expression "midline glial activities" to "midline glial components" and incorporate this more accurate terminology into the revised manuscript.


      As an option to increase the relevance of their work, the authors might want to consider to describe in more detail and moving the results of the RNAseq and the analysis of the Stab2 mutants to the main figures.


      Our responses

      Thank you for your feedback. we will move the RNAseq results and the analysis of Satb2 mutants to the main figures and will describe them in more detail to enhance the relevance of the study. Specifically, we plan to separate Figure 6A-C as independent figures and add Supplementary Figure 5, corresponding to mice and geckos, to the main figures in the revised manuscript.


      Minor:

      Please indicate the length of the scale bars in the figure legends, and not only in the figure panels Fig5. Indicate the animal model in the panel Perhaps they can draw a model of the different mechanisms of caudal and anterior remodeling.


      Our responses

      Thank you for your feedback. I plan to revise the figure legend for Figure 5 by clearly indicating the scale bar length and increasing the font size, as well as including the information in each panel. Additionally, I will add a graphical abstract that illustrates the different mechanisms of caudal and rostral remodeling to enhance visual comprehension.


      Reviewer #3 (Significance (Required)):

      The study addresses a gap in knowledge from an evolutionary perspective. It provides novel hypotheses and an innovative framework for the understanding of cortical development and evolution. however, most of the conclusions are inferred from anatomical observations and the experimental testing of the hypothesis (Mutants and RNAseq analysis) are minor part of the study that could be further developed. The study is interesting for investigators with expertise in brain development and evolution but requires familiarity with comparative anatomy and even then it is difficult to go through the work.

      Reviewer #4 (Evidence, reproducibility and clarity (Required)):

      Overall, this is a well-written manuscript focusing on the evolution of mid-line interhemispheric fusion related to corpus callosum development and evolution from amniotes to eutherian species. The authors also demonstrated that Satb2 plays a critical role in interhemispheric remodeling, which is essential for corpus callosum development. This is a nicely organized and interesting study and the data are compelling. The following are suggestions for improvement, mostly for clarity:

      Minor comments:

      1. Figure 1A: While the E14 and E17 horizontal sections are informative, the addition of the E12 horizontal section does not provide further information. It would be better to place the inset and the whole image side by side, rather than having them far apart across the whole figure. For Figures 1C-D, is it possible to include horizontal sections for chick at

      E14 and Gecko at 45 dpo, as shown in the subsequent images?

      Our responses

      In Figure 1A, we will replace the current figure with a new one that visually enhances the comparison by placing the inset and the full image side by side. we will also add new horizontal sections of the whole image for chicken E14 and gecko 45 dpo, obtained from μCT tomography images and HE staining, to improve visibility between the images.


      When comparing across species it is sometimes helpful to use a standard staging system so that different developmentally staged tissue can be compared. A timeline of how embryonic day or dpo equates to stage might be helpful.

      Our response

      To clarify the developmental stages, I plan to incorporate a time scale into the graphical abstract in the revised manuscript.


      Figure 2B: It is difficult to discern the perspective without a full, lower power section of Gecko at 45 dpo. Adding a full image with an inset would be helpful. In Figure 1C, it would be helpful to define the magnified area by placing a box on the low magnification image.

      Our responses

      We plan to add a low-magnification μCT tomography image or HE-stained whole image of the gecko at 45 dpo in the revised manuscript. As for Figure 1C, it has already been included in the preprint.


      Figures 3B-E: Include the staining methods used for these sections.

      Our response

      We plan to add a note specifying that the image is stained with HE.


      Figure 4B: Add a low magnification image with an inset. The current image is a bit confusing as it is unclear what is being shown.

      Our responses

      We plan to add a low-magnification image showing the entire section and use an inset to indicate the positional relationship of the section's plane in a schematic diagram.

      Figures 6A-E: It would be helpful to denote the genotype as Satb2+/- or heterozygous, rather than Satb2 WT/del, which can be confusing. Ensure consistency in genotyping notation throughout all figures. It is noted that some are CRISPR knockdown and could be denoted as such.

      Our responses

      For CRISPR knockdown, I will adopt the term "CRISPANT" in the revised manuscript. This terminology will be used consistently throughout all figures to avoid confusion in genotype notation.


      Reviewer #4 (Significance (Required)):

      The corpus callosum evolved only in eutherian mammals and its development relies critically on an earlier developmental process known as interhemispheric remodeling. Nomura and colleagues investigate the evolution of these processes and identify that interhemispheric remodeling occurs in reptiles and birds and was therefore already present in the common ancestor of amniotes. This highly conserved developmental process likley evolved early and provided a substrates for major commissures to form throughout evolution.

      3.____Description of the revisions that have already been incorporated in the transferred manuscript.

      Currently we do not incorporate the revision in the transferred manuscript.


      __ Description of analyses that authors prefer not to carry out__

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Major

      That similar cell types contribute to remodelling in non-mammalian amniotes as mice/eutherian mammals. The microphotographs presented are not of very high quality, and it is often difficult to be convinced that the data is showing the strong claims made in the paper. For instance the "MZG-like cells" may in fact be astrocytes or another cell type as it is hard to visualise morphology, and the "intercalation of GFP-positive radial glial fibres" is very unclear from the photos. The colocalization of MMPsense with laminin positive cells is very hard to appreciate from the figure, and again not quantified. Similarly, there is a claim that there was degeneration of laminin-positive leptomeninges during astroglial intercalation, which is an active process that is difficult to infer from a single microphotograph. From the data, I can appreciate that some of the similar broad categories of cell types that exist at the mouse midline____ ____(glia, radial glia) are also present in non-mammalian amniote midlines, but it is difficult to be convinced of much more than this from the data presented.

      Our responses

      We are confident that this paper provides sufficient evidence that cell types similar to those in non-mammalian amniotes, mice, and placental mammals contribute to interhemispheric remodeling and that glial fiber intercalation occurs. This point is also supported by other reviewers.

      In the present study, we have not conducted the MMPsense experiments with the aim of showing the co-localization of MMPsense and laminin-positive cells or pia mater. Contrary to the reviewer's claim, it is important that the non-continuous regions of MMPsense and laminin-positive areas (pia mater), which are extracellular components, are adjacent to each other. Unfortunately, establishing a quantification system using MMPsense is practically impossible.

      Major

      The implication that Satb2 expression at the midline is necessary for appropriate interhemispheric remodeling. Alternative hypotheses for an inappropriately remodeled midline upon whole-brain Satb2 knockout is that it is not dependent on expression at the midline region. Rather, it could be that, for example, the appropriately timed interaction between ingrowing callosal axons and the midline territory is needed for the timely differentiation and/or behavior of midline cells. Other alternatives include that the lack of axonal midline crossing changes the morphology of the midline territory, including potentially "unfusing" the midline. Given the high prevalence of midline remodelling defects concomitant with callosal agenesis referred to be the authors in the literature, it seems like these alternatives would be worth considering. Indeed, the only article the authors reference in their statement that "several studies implicated that agenesis of CC in Satb2-deficient mice is also associated with defects in midline fusion" is an article where Satb2 was knocked out specifically in the cortex and hippocampus. This result is difficult to interpret, as some Emx1 promotors do label some of the midline territory, however the point stands that it is difficult to interpret solely that Satb2 action at the midline is responsible for the effects. I understand that this is a hard question to investigate, so I would suggest allusion to the alternative hypotheses/interpretations as the main priority when interpreting the data.

      Our responses

      This study does not aim to demonstrate the detailed molecular function of Satb2 in the developmental processes of the corpus callosum or pallial commissure. We plan to clearly state this point in the revised manuscript and focus on the findings obtained as a result. Based on the histological relationships, we will classify interhemispheric remodeling and consider adding a section in the Discussion to identify the common character identity mechanisms underlying the development of the pallial commissure and corpus callosum. This addition will help provide a more detailed understanding of the remodeling mechanisms. As is well known, discussions of homology are complex, and we understand that providing concrete evidence is even more challenging. When discussing homology, we will emphasize that it must be handled cautiously, and that discussions on molecular features and homology will rely heavily on future research. As an alternative, we plan to position the results of Satb2 Crispants in mice and geckos as evidence of the disruption of character identity mechanisms. By incorporating this perspective into the revised manuscript, we believe it will deepen our understanding of the role of Satb2 and its molecular mechanisms.

      Reviewer4

      Minor comment 7. There is very valuable data in the supplementary figures. As suggestion is to incorporate Supp. figures S1, S2 and S5 in the main figures.

      Our responses

      Due to space constraints, we plan to move only Supplementary Figure S5 to the supplementary section, and Figures S1 and S2 will not be included in the main figures of the revised manuscript.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #4

      Evidence, reproducibility and clarity

      Overall, this is a well-written manuscript focusing on the evolution of mid-line interhemispheric fusion related to corpus callosum development and evolution from amniotes to eutherian species. The authors also demonstrated that Satb2 plays a critical role in interhemispheric remodeling, which is essential for corpus callosum development. This is a nicely organized and interesting study and the data are compelling. The following are suggestions for improvement, mostly for clarity:

      Minor comments:

      1. Figure 1A: While the E14 and E17 horizontal sections are informative, the addition of the E12 horizontal section does not provide further information. It would be better to place the inset and the whole image side by side, rather than having them far apart across the whole figure. For Figures 1C-D, is it possible to include horizontal sections for chick at E14 and Gecko at 45 dpo, as shown in the subsequent images?
      2. When comparing across species it is sometimes helpful to use a standard staging system so that different developmentally staged tissue can be compared. A timeline of how embryonic day or dpo equates to stage might be helpful.
      3. Figure 2B: It is difficult to discern the perspective without a full, lower power section of Gecko at 45 dpo. Adding a full image with an inset would be helpful. In Figure 1C, it would be helpful to define the magnified area by placing a box on the low magnification image.
      4. Figures 3B-E: Include the staining methods used for these sections.
      5. Figure 4B: Add a low magnification image with an inset. The current image is a bit confusing as it is unclear what is being shown.
      6. Figures 6A-E: It would be helpful to denote the genotype as Satb2 +/- or heterozygous, rather than Satb2 WT/del, which can be confusing. Ensure consistency in genotyping notation throughout all figures. It is noted that some are CRISPR knockdown and could be denoted as such.
      7. There is very valuable data in the supplementary figures. As suggestion is to incorporate Supp. figures S1, S2 and S5 in the main figures.

      Significance

      The corpus callosum evolved only in eutherian mammals and its development relies critically on an earlier developmental process known as interhemispheric remodeling. Nomura and colleagues investigate the evolution of these processes and identify that interhemispheric remodeling occurs in reptiles and birds and was therefore already present in the common ancestor of amniotes. This highly conserved developmental process likley evolved early and provided a substrates for major commissures to form throughout evolution.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      The authors provide a comparative analysis of interhemispheric (IHF) remodeling and its potential role in the generation of commissural axons. Based on histological material from mice, chickens, turtles, and geckos, the IHF remodeling of the midline is divided in two events: caudal and rostral. It is suggested that the rostral event is a preliminary step to the crossing of commissural axons, as it is characteristic of eutherian mammals with a corpus callosum (CC). However, the authors describe similar histologic features in other amniotes during development, particularly reptiles. This is in contrast with the case of the chick, which does not show signs of IHF remodeling nor a rostral pallial commissure. Additionally, deficient transgenic mice and geckos illustrate a potential role of Satb2 in rostral IHF remodeling and subsequent commissural formation. Whereas the topic and the conclusions of the analysis are interesting and provide new knowledge to the evo-devo field, several issues should be addressed prior to publication, such as data precision and presentation to support the main statements in the manuscript.

      Major comments:

      • A central point of this article is the splitting of the IHF into rostral and caudal events. The authors suggest that each one can be regulated differentially, and they attribute the rostral remodeling as a step prior to corpus callosum (CC) formation, in contrast to the caudal remodeling. In my opinion, these two events are not sufficiently characterized either in the figures or the manuscript. It is necessary to better describe these two processes that the authors mention. For instance, the authors could add or re-organize information in Figures 1-3 to include wide-field images showing the whole septum from rostral to caudal, and representative dorsoventral sections at important stages (with insets pointing at specific features). Otherwise, a table summarizing the rostral and caudal events would also be helpful to the reader.
      • When the authors refer to the reptilian rostral pallial commissure (RPC) and caudal pallial commissure (CPC), are these the same structures as the pallial commissure and anterior commissure described by Lanuza and Halpern (1997), Butler and Hodos (2005) and Puelles et al. (2019)? It is necessary to clarify the nomenclature, given that they are providing data from several species. Also, structures with the same names among species may not be truly homologous. A simple atlas with some horizontal and transverse planes highlighting anatomical landmarks and important structures (commissural tracts in this case) of the non-mammalian species would be extremely useful for the reader.
      • I wonder if the authors tested Fgf8 as marker on any of their sauropsidian tissue samples, as this gene has a known role in murine MZG development, which is required for IHF remodeling (Gobius et al. 2016, already cited in the manuscript). It would be beneficial to test this marker for the study, and if positive, it would open the possibility of designing loss-of-function experiments in avian or reptilian development models to identify mechanisms common to eutherians and support the statements of this work
      • It would be really interesting to provide a more elaborate discussion on whether authors consider the sauropsidian IHF as a homologous process to eutherian IHF, and the reptilian RPC as an homologous of the CC.
      • Data and methods are presented in such a way that, in principle, they could be reproduced. Authors should indicate the number of animals/replicates of each species used in each experiment.

      Minor comments:

      • In the results section, paragraph 2, line 3: "We detected the accumulation of GFAP-positive cells and phosphorylated vimentin (Ser55) -positive mitotic radial glia in the IHF and telencephalic hinge in developing turtles, geckoes and chicks (Figure 2A)". Figure 2A shows sections from the four analyzed species labeled with radial glia markers at the end of the IHF remodeling. It would be beneficial to have analogous sections at several time points (perhaps before or after the process) to compare and show more clearly the accumulation of glial cells at that location.
      • The article will improve its quality by adding more comparative information in the introduction about the analyzed sauropsidian structures (rostral pallial commissure and caudal pallial commissure), their relations with the pallial and anterior commissures, the structures/cells connected by them, and homologies previously proposed.
      • In Figure 1 panels A-D, there is a lot of disparity in brain sizes and scales both between sections of the same species and between species. Placing the insets next to their source images is very necessary for clarity.
      • In the results section, paragraph 2, line 11: "In addition, it was suggested that astroglial intercalation occurs in conjunction with the aforementioned regression of the IHF from st.21 to st.26 in the developing turtle (Figure 2C)." In Figure 2C, all images are at different scales, which makes it very hard to properly compare between stages.
      • In Figure 2D, the authors show the presence of MMP around the leptomeninges, suggesting MMP-mediated degradation. In the images, MMP labeling is revealed in dark blue, which is largely invisible against the black background. Colors should be used properly to allow visualization of this MMP labeling.
      • In Figure 4, it would really help if the authors provided wide-field images and DAPI counterstaining of the anterograde and retrograde tracings, to provide anatomical landmarks that help readers to identify the midline and understand the orientation of images.
      • In Figure 5B, I understand that the images in the red and blue squares correspond to brain areas in the squares in A. However, some confusion remains, especially with the image in B, which does not seem to be at the same angle as in the diagram representation. This makes it difficult to understand the results.
      • In Figure 6D, to better visualize defects in the RPC formation, the asterisk in the middle of the deficient structure needs to be replaced with a more lateral arrow pointing to the malformation.
      • In Figure S5, violin plots in panel C do not correspond with data in A and B. This needs correction or clarification.
      • In the article, a section appears solely to explain spatial transcriptomics results in a chick coronal section. The conclusion of this experiment is that three markers associated with midline remodeling are present in chick, suggesting that interhemispheric remodeling is conserved between mouse and chick. As these are complementary results and are not deeply analyzed in this manuscript, I think it would be better to summarize these findings in a dedicated paragraph and transfer some of the key images from Figure S2 to one of the main figures. Other problems with Figure S2: color contrast between clusters in the tSNE projection in B is very poor, should be enhanced; color intensity in FeaturePlots of panels D-F is too weak, and it seems that there is not really much expression at all in any cluster for any of these genes.

      Significance

      The authors identify in the developing brain of sauropsids an event similar to IHF remodeling in eutherians, and suggest a causal relation between the rostral IHF remodeling and the formation of the pallial commissure in reptilian brains. This implies a potential homology between the pallial commissure and the corpus callosum of placental mammals. If this is the intention of the authors, this conclusion should be addressed explicitly and at length in the Discussion section. Whereas the results and conclusions described in the manuscript will be valuable in the field, the data presented in the manuscript needs quite some improvement, particularly for some of the images in the previously mentioned figures. Otherwise, the original data cannot be properly judged and may set reasonable doubt to readers.

      Advance: The findings described in this report are new to my knowledge. The description of the IHF remodeling event prior to corpus callosum development in mice has been published (Gobius et al. 2016, Cell Reports), but not in other mammalian branches or non-mammalian vertebrates. For this reason, the data in this report should be very convincing and better presented.

      Audience: This research will be interesting for a specialized and basic research audience, particularly for researchers in the evo-devo fields.

      My expertise: neuroanatomy, development, evolution, brain, cerebral cortex

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      The authors study several valuable developmental series of non-mammalian amniotes, reaching the conclusion that interhemispheric remodeling occurs in these species and that it is dependent on transcription factor Satb2

      Major comments

      Unfortunately the major conclusions of the article are not well supported by the provided data. Including:

      1. That interhemispheric remodelling occurs in non-mammalian amniotes. It would not surprise me that this may be the case, however, the major evidence for this is a series of horizontal insets that do not evidence this point well. There are broad morphological changes during development that can change the proportions and regionalisation of tissue, and therefore the IHF becoming apparently smaller as development progresses (qualitatively, in single sectioning planes, and without clear n numbers) may easily be explained by sutble differences in sectioning planes, or, for example, more caudal territories of the brain expanding at faster rates than the rostral territories. Quantification of the ratio between the IHF and total midline length across ages and between species may go some way to helping to clarify the degree of potential midline remodelling. Very high quality live imaging of the process would be the definitive way to evidence the claim, although I appreciate this is highly technically difficult and may not be possible. A key opportunity seems to be missed in the Satb2 knockout geckoes, where midline remodelling is purported to not occur. This is shown only qualitatively in a single plane of sectioning and again is not convincing. If the IHF length in these animals was quantified to be longer than wildtype at a comparable age, this would help to evidence the claim that remodelling occurs in these species.
      2. That similar cell types contribute to remodelling in non-mammalian amniotes as mice/eutherian mammals. The microphotographs presented are not of very high quality, and it is often difficult to be convinced that the data is showing the strong claims made in the paper. For instance the "MZG-like cells" may in fact be astrocytes or another cell type as it is hard to visualise morphology, and the "intercalation of GFP-positive radial glial fibres" is very unclear from the photos. The colocalization of MMPsense with laminin positive cells is very hard to appreciate from the figure, and again not quantified. Similarly, there is a claim that there was degeneration of laminin-positive leptomeninges during astroglial intercalation, which is an active process that is difficult to infer from a single microphotograph. From the data, I can appreciate that some of the similar broad categories of cell types that exist at the mouse midline (glia, radial glia) are also present in non-mammalian amniote midlines, but it is difficult to be convinced of much more than this from the data presented.
      3. That the gecko RPC and CPC connect distinct parts of the brain (rostral and caudal). These tracer injections lacked visualisation of the deposition site to confirm specificity, as well as appropriate quantification. Importantly, the absence of axons in the CPC following the rostral dye deposition (and vice versa) was not shown, which is essential to make the claim that these commissures carry axons from specific parts of the brain. The alternative hypothesis is that all axons are intermixed and traverse both commissures, independent of brain area of origin, which is not at all tested or disproved by the data presented.
      4. The implication that Satb2 expression at the midline is necessary for appropriate interhemispheric remodeling. Alternative hypotheses for an inappropriately remodeled midline upon whole-brain Satb2 knockout is that it is not dependent on expression at the midline region. Rather, it could be that, for example, the appropriately timed interaction between ingrowing callosal axons and the midline territory is needed for the timely differentiation and/or behavior of midline cells. Other alternatives include that the lack of axonal midline crossing changes the morphology of the midline territory, including potentially "unfusing" the midline. Given the high prevalence of midline remodelling defects concomitant with callosal agenesis referred to be the authors in the literature, it seems like these alternatives would be worth considering. Indeed, the only article the authors reference in their statement that "several studies implicated that agenesis of CC in Satb2-deficient mice is also associated with defects in midline fusion" is an article where Satb2 was knocked out specifically in the cortex and hippocampus. This result is difficult to interpret, as some Emx1 promotors do label some of the midline territory, however the point stands that it is difficult to interpret solely that Satb2 action at the midline is responsible for the effects. I understand that this is a hard question to investigate, so I would suggest allusion to the alternative hypotheses/interpretations as the main priority when interpreting the data.

      Overall, the major conclusions of the study are not well supported by the data. A major effort to quantify phenomena and/or dramatically soften conclusions would be needed in order to make the conclusions well supported.

      Minor comments

      1. The n numbers are not always clearly reported
      2. At times important points reference reviews or articles that do not support the statements as well as the most important primary articles might.
      3. Figures showing the entire section that insets were taken from would help to convince that sectioning planes were equivalent, and also show the deposition site of neurovue experiments.
      4. The fibre direction of GFAP+ fibres in figure 6 is confusing - It seems from the labelling on the figures as if red is used for the WT condition in mouse, but for the Satb2del condition in Gecko? If this is the case, then it would appear that the fibres are more specifically oriented in the del condition in mice, but in the WT condition of geckoes? There are several instances of this where clearer description and labelling would help the reader to interpret the results.

      Significance

      This study attempts to address a highly significant, novel and important question, that, if well achieved, would be publishable at a high degree of interest and impact to the basic research fields of brain development and evolution. Unfortunately the major conclusions made by the study are stronger than the data provided is able to evidence, and I remain unconvinced by many of them.

    1. Author response:

      The following is the authors’ response to the original reviews

      Reviewer #3 (Recommendations for the authors):

      Major concerns:

      P.6, lines 223-224: The sentence sounds like the authors produced all the OVGP1s by themselves in their laboratories, which is not completely true. The recombinant human and mouse OVGP1s were purchased from OriGene. It is suggested that the authors should state and explain clearly here which OVGP1 is produced by their laboratories and that recombinant human and mouse OVGP1s were obtained and purchased from Origene.

      It is already clearly included in the M&M.

      P6, lines 227-229: The authors stated that "Western blots of the three OVGP1recombinants indicated expected sizes based on those of the proteins: 75 kDa for human and murine OVGP1 and around 60 kDa for bovine OVGP1 (Fig. 4B and S6)." I pointed out in my last review report that the size of the recombinant human OVGP1shown by the authors in their manuscript is not in agreement with what has been published previously in literature regarding the molecular weight of native human OVGP1 as well as that of recombinant human OVGP1. The authors did not address the above concern adequately. In fact, recombinant human OVGP1 has been produced a few years ago (Reproduction (2016) 152:561-573) and it has been previously demonstrated that a single protein band of approximately 110-130 kDa was detected for both native human OVGP1 (see Microscopy Research and Technique (1995) 32:57-69) and recombinant human OVGP1 (Reproduction (2016) 152:561-573; Carbohydrate Research (2012) 358:47-55) using antibodies specific for human OVGP1. Molecular weight of the protein core or polypeptide of human OVGP1 is approximately 75 kDa, but the glycosylated form of native human OVGP1 and recombinant human OVGP1 is approximately 110-130 kDa. Therefore, the authors might have been using the recombinant core protein of human OVGP1 instead of the fully glycosylated recombinant OVGP1 in their study. The same concern also applies to the commercially obtained mouse recombinant OVGP1 used by the authors in their study. I would also like to mention that the mature and fully glycosylated OVGP1s in mammals vary in molecular weight (90-95 kDa in domestic animals; 110-150 kDa in primates; 160-350 kDa in rodents). Again, the 75kDa of mouse OVGP1 detected by the authors could be the core protein or polypeptide of mouse OVGP1 instead of the fully glycosylated mouse OVGP1.

      In our study, as previously mentioned, we included commercially available recombinant proteins from Origene for human and murine OVGP1, which are produced in mammalian cells, and we also produced and purified bovine OVGP1 in mammalian cells. Therefore, these proteins should be properly glycosylated. Moreover, we performed Western blot assays favouring the blotting of higher molecular weight proteins, ensuring the optimal conditions for the assay. Additionally, we tested the size of OVGP1 from murine and bovine oviductal fluids on the same blot. During oestrus, the size of OVGP1 from oviductal fluids matches that of the recombinant proteins, and this band is downregulated during anoestrus, confirming the proper size of recombinant protein.

      P.7, lines 236 and 237: Please provide a figure or source to support the statement "...as confirmed by proteomics of the bands along with PEAKS Studio v11.5 search engine peptide identification software."

      It is included in the text the amount of unique peptides obtained by Proteomics for OVGP1 identification over all protein groups identified.

      P.7, lines 243 to 245: The statement "...using rabbit polyclonal antibody to human OVGP1 for bOVGP1 and endogenous OVGP1, and mouse monoclonal antibody against Flag (DDK)-tag for hOVGP1 and mOVGP1." is confusing and might be inaccurate. First of all, I wondered why the authors did not use an antibody against bovine OVGP1 for the recombinant bOVGP1 instead of using a rabbit polyclonal antibody to human OVGP1. Secondly, what does the "endogenous OVGP1" refer to in the statement? Thirdly, the authors in their study used the commercially available recombinant human OVGP1 and recombinant mouse OVGP1 purchased from Origene. Based on the data sheet provided by Origene, the tag used for both recombinant human OVGP1 and recombinant mouse is C-Myc/DDK-tag and not Flag-tag. Can the authors explain these discrepancies?

      Firstly, for the recombinant protein of bOVGP1 we used the same antibody that we used in the Western blot for all the proteins and oviductal fluids because we do not have anti-His tag working for Immunofluorescence (the one we had only worked for Western blot) and neither we do not have any antibody against bovine OVGP1. In the case of human and murine since we had anti-Flag antibody that worked for Western blot and for immunofluorescence, we used this one. However, as has been shown in our figure and supplementary material, the antibody against human OVGP1 works properly for both techniques (Western blot and Immunofluorescence). Secondly, endogenous OVGP1 is referred to the OVGP1 present in the oviductal fluid. Thirdly, as you can see in the datasheet of the protein, the recombinant proteins purchased from Origene contains a c-myc tag (EQKLISEEDL) some amino acids and a ddk-tag (DYKDDDDK). The sequence of ddk is the same of Flag-tag (DYKDDDDK). Since the proteins have both tags we used the antibody against Flag (or ddk) epitope.

      P12, lines 429-432: The newly added statement at the end of the Discussion saying "Additionally, future studies would be valuable to investigate whether incubating oocytes with oviductal fluid (or OVGP1) could reduce polyspermy in porcine IVF and whether ZPs could be leveraged to naturally enhance sperm selection in human ICSI" is very concerning and requires further attention. The statement reflects that the authors do not keep pace with and do not pay attention to what has been published in literature regarding porcine and human OVGP1s. In fact, porcine oviduct-specific glycoprotein (OVGP1) has already been reported to reduce the incidence of polyspermy in pig oocytes (Biology of Reproduction (2000) 63:242-250). Porcine oviductal fluid, used in porcine IVF, has also been found to exert a beneficial effect on oocytes by reducing the incidence of polyspermy without decreasing the penetration rate. (Theriogenology (2016) 86:495-502). Therefore, the studies deemed valuable by the authors to be investigated in the future have, in fact, already been carried out two decades ago by several other laboratories. I am surprised the authors were not aware of these published work in literature. All the above should have been incorporated in the Discussion.

      This sentence is modified in the discussion and the references are included.

      Furthermore, as mentioned earlier, recombinant human OVGP1 has also been produced (Reproduction (2016) 152:561-573), and recombinant human OVGP1 has been found to increase tyrosine phosphorylation of sperm proteins, a biochemical hallmark of sperm capacitation, and potentiate the subsequent acrosome reaction (Reproduction (2016) 152:561-573) as well as increase sperm-zona binding (Journal of Assisted Reproduction and Genetics (2019) 36:1363-1377). These earlier findings should be incorporated into the Discussion.

      Thank you for your comment, but in this work we had not performed any experimental setting related to tyrosine phosphorylation and despite is a very interesting topic is not directly related to this work.

      P.19, lines 678-683: Since the human and mouse recombinant oviductin proteins were purchased from Origene, the authors should be aware of the fact that these commercially available recombinant OVGP1s might not be fully glycosylated. While I appreciate the fact that the authors wanted to briefly describe how the human and mouse recombinant OVGP1s were prepared by the manufacturer, I strongly suggest that the authors should contact Origene, the manufacturer, for all information regarding the procedures for producing the human and mouse recombinant oviductin proteins. For example, the authors stated on lines 680-681 that "A sequence expressing FLAG-tagged epitope proteins (DYKDDDDK) was cloned into an expression vector." According to the data sheet provided by Origene, it appears that both human and recombinant oviductin proteins are C-Myc/DDK-tagged and not FLAG-tagged.

      Thank you for your comment, as according to the sequence of Flag-tag it is matching with the sequence of the tag in the datasheet corresponding to DDK (this is in detail in previous comment). Besides, the protein is tagged also by C-Myc tag. Among both tags, the antibody selected to detect it was anti-Flag tag.

      P.19, lines 692-697: The description of the primary and secondary antibodies used for detection of the various recombinant OVGP1s is also very confusing and not clearly presented. For example, it is mentioned here that "...membranes were...incubated with anti-OVGP1 rabbit monoclonal antibody for OVGP1,..". What specifically does "OVGP1" refer to here? The authors then stated that anti-Histamine Tag antibody was used to detect bOVGP1 and mOVGP1 and anti-Flag antibody was used to detect hOVGP1. As pointed out earlier, the human and mouse recombinant OVGP1s were produced using C-Myc/DDK tag and not His-tag or Flag-tag. Can the authors clarify these discrepancies?

      We apologise for the complexity of the antibodies, we included in this paragraph the ones used to Western blot for both figures: anti- human OVGP1 was used for the principal figure that contains the three recombinant proteins and oviductal fluids; and the anti-Histidine and anti-Flag antibodies that are included in supplementary figure, specifically for recombinant bovine OVGP1 (Histidine tag) and for recombinant murine and human OVGP (DDK tag). A clarifying sentence has been included in the text.

      P.31, lines 1143-1149: Figure 10 is not mentioned anywhere in the main text of the manuscript. Rewrite the second half of the sentence "...; being this specificity lost when OVGP1 is heterologous to the ZP (right diagram)." Which sounds awkward and grammatically not correct.

      The figure is already mentioned in the text, thank you for your comment. The sentence is also corrected.

      Other comments: P.1, the statement of "All authors contributed equally to this work" on line 14 can be deleted because detailed and specific contributions from each authors are listed in lines 1009-1017 on page 27.

      Both authors contributed equally to this work, now is clear in authors contribution section.

      P.2, lines 43 and 44: Do the authors mean "sperm-oocyte binding protein" instead of "sperm-oocyte fusion protein" in the sentence? "Fusion protein" is a protein composed of two or more domains encoded by different genes, or a hybrid molecule created by combining two different proteins for various purposes. I believe the term "fusion protein" is wrongly used in the sentence which should be rephrased with a proper term.

      Done.

      P2, line 73: Remove the comma after the word "Both".

      Done.

      P.5, line 179: "...mice ZP..." should be written as "...mouse ZP...".  

      Done.

      P.6, heading of 3rd paragraph on line 207: The term "binding" will be a better term than "fusion" used in the heading because the results do not actually show the fusion of the OVGP1 proteins with the ZP glycoprotein. Instead, binding of the OVGP1 proteins to the ZP occurred.

      Done.

      P.6, lines 215-217: Authors, please provide a reference or references to support the statement "Region A, corresponding to the amino acid end, shows high identity among monotremes, marsupials and placentals."

      In the text was indicated a review (29) which includes the supporting idea of this statement for Figure 4. Moreover, we have included some if the references used for the description of the domains when performing the sequence alignment of Figure S5.

      P.6, line 230 and line 233 on P.7: Authors, please be consistent in the use of either American English or British English. The word "oestrus" is British English whereas "estrus" is American English.

      Done.

      P.7, line 264: The word "sticking" used here means non-specific binding. I believe the author means specific binding here. If so, a more appropriate word should be used here instead of "sticking".

      Done.

      P.7, lines 267-269: This newly added sentence sounds very awkward and should be completely rewritten.

      Done.

      P.8, line 288: This reviewer finds it difficult to understand the meaning of the heading. The heading should be rephrased to bring out exactly what the authors want to say in well-written English.

      Done.

      P.8, line 290: The word "would" should be replaced by "could" in the sentence.

      Done.

      P.13, line 437: Authors, please provide the location of Sigma-Aldrich.

      Done.

      P.13, line 457: Here, the authors used "1800 rpm" to indicate the centrifugation speed but used the g-force elsewhere in the Materials and Methods. Please be consistent. The g-force is preferred.

      Done.

      P.14, lines 483-485: The procedure of sacrificing the cats should be provided in the Materials and Methods

      Cats weren’t sacrificed they were vasectomized. It is now included in the text.

      P.17, line 628: "...the ZPs were exposed or no exposed to..." should be written as "...the ZPs were either exposed or not exposed to...".

      Done.

      P.17, line 629: "...each groups were incubated with..." should be "...each group was incubated with...".

      Done.

      P.19, line 700: "As loading control, was used the primary antibody....." is not a complete sentence and it needs to be rewritten.

      Done.

      P.20, lines 744-754: For scanning electron microscopy and image processing, the procedures of prior treatment of the oocytes with and without oviductal fluid and OVGP1 should be included here.

      Done.

      P.21, line 756: It is stated here that "Two hundred isolated ZPs were treated with Clostridium perfringens neuraminidase....". However, it is not clear whether two hundred isolated ZPs of both porcine and murine ZPs were treated. Authors, please clarify.

      We used 200 isolated ZPs of each specie, bovine and murine. It is classified in the text.

      P.28, lines 1039 and 1040: The author only mentioned the use of bovine and murine sperm here. What about human sperm?

      Done.

      P.29, line 1076: "...in mammalian cells..." is very vague. Be specific what exactly the mammalian cells were.

      Done.

      P.29, line 1079: "Oviductal fluid from ovulated cows or anoestrus cows." is not a complete sentence and it needs to be rewritten.

      Done.