10,000 Matching Annotations
  1. Oct 2022
    1. It’s good practice to put the non-optional arguments first in your function call, and to specify the names of all optional arguments. If you don’t, someone reading your code might have to look up the definition of a function with unfamiliar arguments to understand what you’re doing.
    1. The problem is that the caller may write yield instead of block.call. The code I have given is possible caller's code. Extended method definition in my library can be simplified to my code above. Client provides block passed to define_method (body of a method), so he/she can write there anything. Especially yield. I can write in documentation that yield simply does not work, but I am trying to avoid that, and make my library 100% compatible with Ruby (alow to use any language syntax, not only a subset).

      An understandable concern/desire: compatibility

      Added new tag for this: allowing full syntax to be used, not just subset

    1. However, open source licenses do not take the technical nature and capabilities of the ML model as a different artefact to software/source code into account, and are therefore ill-adapted to enabling a more responsible use of ML models.

      This statement assumes that different approaches to ethics / responsible use are directly related to the technical differences / nature between code and ML artefacts. I am not sure this is true.

    1. EisURI EisCode EisNaam EisTekst EisToelichting EisheeftOnderwerp In deze kolom staat de unieke identifier (URI) van de eis. In deze kolom staat de code of het nummer van de eis. In deze kolom staat de de naam oftewel de titel van de eis. In deze kolom staat de eistekst. In deze kolom staat de toelichting op de eis. In deze kolom staat de URI van het Onderwerp (subject) van de eis. https://www.example.org/id/Voorbeeld-Eis1 EIS1099 Voorbeeldeis Dit is de tekst van de voorbeeldeis Dit is de toelichting van de voorbeeldeis, om achtergrond / doel en reden van de eis te kunnen verduidelijken https://www.example.org/id/Voorbeeld-Onderwerp1

      Deze tabel lijkt er dubbel in te staan.

    1. Reusing code instead of repeating code

      This is extremely important because it save a lot of redundancy. We want to keep things as clear as possible. The benefit is that it increases efficiency and reduces the work of the computer.

    1. North observed that a few simple practices, such as naming unit tests as full sen-tences and using the word “should,” can help developers write more meaningful tests,which in turn helps them write higher quality code more efficiently. When you think interms of what the class should do, instead of what method or function is being tested, it’seasier to keep your efforts focused on the underlying business requirements.

      use should in naming your tests, it makes you think about the behaviour of what you are implementing, an NLP technique according to Continuous Delivery guy

    Annotators

    1. Author Response

      Reviewer #1 (Public Review):

      Kohler and Murray present high-throughput image-based measurements of how low-copy F plasmids move (segregate) inside E. coli cell. This active segregation ensures that each daughter cell inherit equal share of the plasmids. Previous work by different labs has shown that faithful F-plasmid segregation (as well as segregation of many other low-copy plasmids, segregation of chromosomes in many bacterial species and segregation of come supramolecular complexes) require ParA and ParB proteins (or proteins similar to them) and is achieved by an active transport mechanism. ParB is known to bind to the cargo (plasmid) and ParA forms a dimer upon ATP binding that binds to DNA (chromosome) non-specifically and also can bind to ParB (associated with cargo). After ATP hydrolysis (stimulated by the interaction with ParB), ParA dimer dissociates to monomers and from ParB and the chromosome. While different mechanisms of the ParA-dependent active transport had been proposed, recently two mechanisms become most popular - one based on the elastic dynamics of the chromatin (Lim et al. eLife 2014, Surovtsev PNAS 2016, Hu et al Biophys.J 2017, Schumaher Dev.Cell 2017) and the other based on a theoretically-derived "chemophoretic" force (Sugawara & Kaneko Biophysics 2011, Walter et al. Phys.Rev.Lett. 2017).

      It is a minor comment, but we would like to point out that we do not consider these two model types as alternatives but rather as models with different levels of coarse-graining. Our interest is in the molecular-level (stochastic) models (Lim et al. eLife 2014, Surovtsev PNAS 2016, Hu et al PNAS 2015, Hu et al Biophys.J 2017, Schumacher Dev.Cell 2017).

      The authors start by following motion of F plasmid with one or two plasmids per cell and by analyzing plasmid spatial distribution, plasmid displacement (referred to as velocity) as a function of their relative position, and autocorrelations of the position and the displacement. They concluded that these metrics are consistent with 'true positioning' (i.e. average displacement is biased toward the target position - center for one plasmid and 1/4 and 3/4 positions for two plasmids ) but not with 'approximate positioning' (i.e. when plasmid moves around target position, for example, in near-oscillatory fashion). This 'true positioning' can be described as a particle moving on the over-dampened spring. They reproduce this behavior by expanding the previous model for 'DNA-relay' mechanism (Lim et al. eLife 2014, Surovtsev PNAS 2016), in which plasmid is actively moved by the elastic force from the chromosome and ParA serves to transmit this force from the chromosome to the plasmid. Now, the authors explicitly consider in the model that the chromosome-bound ParA can diffuse (which the authors refer as 'hopping') and this allows the model to achieve 'true plasmid positioning' for some combination of model parameters in addition to oscillatory dynamics reported in the original paper (Surovtsev PNAS 2016).

      Based on their computational model, the authors proposed that two parameters, diffusion scale of ParA = 2(2Dh/kd)1/2/L (typical length diffused by ParA before dissociation) and ratio of ParB-dependent and independent hydrolysis rates = kh/kd are key control parameters defining what qualitative behavior is observed - random diffusion, near-oscillatory behavior, or overdamped spring ('true positioning'). They vary this two parameters ~30- fold and ~200-fold range by changing Dh and kh respectively, to illustrate how dynamics of the system changes between these 3 modes of motion. While these parameters clearly play important role, the drawback is that the authors did not put either theoretical reasoning why these parameters are truly governing or showed it by varying other model parameters (kh, number of ParA NParA, spring constant of chromosome k, diffusion coefficient of the plasmid Dp) to show that only these combinations define the type of the system behavior. The authors qualitative analysis on importance of relies on the steady state solution for the diffusion equation for ParA. It is really unfortunate that no ParA distribution was measured simultaneously with the plasmid motion, as this would allow to compare experimental ParA profiles to expected quasi-steady-state solutions.

      We spend almost an entire section and a figure explaining the theoretical reasoning behind the identification of the $\lambda=s/(L/2n)$ as an important system parameter (section “Hopping of ParA-ATP on the nucleoid as an explanation of regular positioning” and Figure 2) and predicted that regular positioning could only occur for $\lambda>1$. This was confirmed by parameter sweeps for the cases of 1 (Figure 3I) and multiple plasmids (Figure 5-figure supplement 1), indicating that $\lambda$ is indeed an important system parameter and that our conceptual understanding of this aspect of the system is correct. This point has now been made clearer.

      However, we agree that the reasoning for $\epsilon$ (varied through the hydrolysis rate $k_h$) was not clear. It was chosen to allow us to modulate the ParA concentration at the plasmid compared to elsewhere, motivated by the differences between different ParABS systems. We originally had also considered a third quantity related to the number of nucleoid-bound ParA but we found that this had little effect on the nature of the dynamics. All three quantities describe how the timescale of a reaction/process (ParA hopping/diffusion across the nucleoid, ParB induced hydrolsysis, ParA association to the nucleoid) compares to the timescale of basal hydrolysis, which we use as a reference timescale.

      We have now made this clearer as well as adding supplementary figures showing the effect of varying other system parameters at several locations in the phase diagram (Figure 3-figure supplement 3 and 4). These sweeps justify our identification of $\epsilon$ and $\lambda$ as a useful/important set of quantities for determining the dynamics of the system.

      Additionally, we now add example kymographs showing the ParA distribution (Figure 3-figure supplement 2C).

      The authors also show by simulations that overdamped spring dynamics can transition into oscillatory behavior when decreases, for example by cell growth. Indeed, they observed more oscillatory behavior when they compared single-plasmid dynamics in the longer cells compared to the shorter cells. This was not the case in double-plasmid cells, in eprfect agreement with their analysis. They also calculated ATP consumption in the model and concluded that the system operates close but below (perhaps, "above" should be used as it refers to bigger ) the threshold to oscillatory regime which minimize ATP consumption. While ATP consumption analysis is very intriguing, this statement (Abstract Ln24-25) seems at odds with the authors own analysis that another ParA-dependent plasmid system, pB171, operates mostly in oscillatory regime, and it is actually for this regime the authors' analysis suggest minimal ATP-consumption (Fig. 8).

      To clarify, we found that pB171 (which in our hands has a copy number of 2-3 in the SR1 reduced-copy-number strain) is only clearly oscillatory in cells with a single plasmid (and only mildly so in cells with two plasmids). Otherwise, it behaves very similarly to F plasmid. We therefore believe that these two distantly related ParABS systems exhibit, overall, similar dynamics and differ only in how close the systems are to the threshold of oscillatory instability. This was not clear as we did not specify the copy number of pB171. We now provide this in Figure 7–figure supplement 1.

      We refer to these systems as lying just below, rather than above, the threshold of the oscillatory instability because, on average, plasmids do not oscillate but only do so in cells with the lowest plasmid concentration.

      I think the real strength of the paper is that it can potentially to show that if one considers that the intracellular cargo can be moved by the fluctuating chromosome via ParA-mediated attachments, then various dynamics can be achieved depending on combinations of several control parameters (plasmid diffusion coefficient, ParA diffusion coefficient, rate of hydrolysis and so on) including previously reported 'oscillations' (Surovtsev PNAS 2016), 'local excursions' (Hu et al Biophys.J 2017) and 'true positioning' (Schumaher Dev.Cell 2017). The main drawback (in this reviewer opinion) that this is obscured by the current presentation and discussion of this work and previous modelling work on ParA-dependent systems. For example, instead of using "unifying" potential of the presented model, yet another name 'relay and hopping' is used in addition to previously used 'DNA-relay', 'Brownian ratchet', 'Flux-based positioning', …

      In the abstract and discussion, we already refer to developing a “unified” model (p1 L21, p15 L22 of the original manuscript) and in the discussion we explain how our model contains other models as limiting cases. But we agree with this recommendation - the unifying nature of our model is its main strength. We now emphasise this more.

      Regarding the model name, we felt obliged to refer to the previous named models (DNA-relay and Brownian ratchet) and simply gave our model a name to avoid confusion when making comparisons. We have now removed almost all mention of ‘hopping and relay’ and just refer to ‘our model’. However, our gitlab repository with the code must have a name and therefore is still called ‘Hopping and relay’ and so the same term is used in Table 3.

      … and it appears that the presented model is an alternative to these previously published work. And only in model description (in Methods section) one can find that the "... model is an extension of the previous DNA-relay model (Surovtsev et al., 2016a) that incorporates hopping and basal hydrolysis of ParA and uses analytic expressions for the fluctuations rather than a second order approximation"(p.17, ln15-17).

      We are sorry that this reviewer felt that the fact that our model is an extension of DNA relay is hidden in the methods. However, we wrote in the main text:

      “Motivated by the previous discussion, we decided to develop our own minimal molecular model (‘hopping and relay’) of ParABS positioning, taking the DNA relay model as a starting point … The original scheme is as follows… We supplemented this scheme with two additional components: diffusion (hopping) of DNA-bound ParA-ATP dimers across the nucleoid (with diffusion coefficient Dh, where the subscript indicates diffusion of the home position) and plasmid-independent ATP hydrolysis and dissociation (with rate kd). See Material and Methods for further details of the model. “

      We now make this clearer.

      However, we would argue that as models of the same system, there are naturally overlaps and the models of Hu et al and Schumacher et al could also be thought of as extensions of the DNA relay model.

      While it is of course the authors right to decide how to name their model, it should be explicitly clear to the reader what is a real conceptual difference between presented and previous models from the abstract, introduction and discussion section of the paper, not from the "fine-print" details in the supplementary materials.

      The main conceptual difference is that we have identified the importance of having a finite diffusive length scale for ParA diffusion/hopping on the nucleoid. This allows both oscillations and regular positioning to occur for biologically relevant parameter values and reproduces the length dependent transition from mid-cell positioning to confined oscillations that we observe for F plasmid. The DNA relay model does not have this behaviour as the ParA diffusive length scale in zero while it is infinite in the models of Ietswaart et al 2014 and Schumacher et al 2017. The model of Hu et al 2017 does have a finite length scale but the authors appear not to have realised its importance and never discovered the regular positioning regime at \lambda >1. While we make these points in the discussion in the context of Figure 8A, where we compare our model to the others, we agree with this reviewer that we should have been more explicit in the abstract and introduction. We have now corrected this.

      This would allow to avoid unnecessary confusion (especially for the readers not directly involved into the modelling of ParA/B system) and clarify that all these models rely on the elastic behavior of fluctuating chromosome to drive active transport of the cargo. This reviewer believes that more explicit discussion on the models (one from the authors and previously published) differences and similarities will help with our understanding of how ParA-dependent system operate. This discussion should also include works on PomXYZ system, in which it was shown that similar dynamic system can lead to specific positioning within the cell (Schumaher Dev.Cell 2017, Kober et al. Biophys.J 2019). This will may it explicit that the models results have direct impact beyond the ParA-dependent plasmid segregation.

      To further clarify the differences between the models (beyond the second and third sections of the main text and the discussion), we have now added a section to the methods and a new table (Table 3). We have also included the mentioned PomXYZ model. However, we would like this was not the first stochastic model to have ‘true’ positioning as this reviewer cites above. Though they did not include the mechanism of force generation, the model of Ietswaart et al 2014 produces regularly positioned plasmids and is referenced repeatedly in Schumacher et al. 2017.

      I think that expanded parameter analysis, and explicit model comparison/discussion will make the contribution of this work to the field more clear and with the potential to advance our general understanding of how the same underlying mechanism can lead to various modes of intracellular dynamics and patterning depending on parameters combination.

      Reviewer #2 (Public Review):

      The work presented in this manuscript details an analysis of the partitioning of low copy plasmids under the control of the ParABS system in bacteria. Using a high throughput imaging set up they were able to track the dynamics of the partition complex of one to a few plasmids over many cell cycles. The work provides an impressive amount of quantitative data for this chemo-mechanical system. Using this data, the paper sought to clarify whether the dynamics of plasmids is due to regular positioning or noisy oscillations around a mean position. They supplement their experimental work with an intuitive model that combines elements of previous modelling efforts. Their model relies on diffusion of the ParA substrate on the nucleoid with the dynamics of the ParB partition complex being driven by the underlying elastic force due to the nucleoid on which the substrate is tethered. Their model dynamics depend on two parameters, the ratio of the length over which the substrate can explore to the characteristic length of the space and the ratio of stimulated to non-stimulated hydrolysis rates of the substrate. If the length ratio is large, ParA can fully explore the space before interacting with the ParB complex leading to balanced fluxes and regular positioning. If it gets reduced, for example by lengthening the cell, oscillations can emerge as fluxes of substrates become imbalanced and a net force can pull the partition complex.

      Strengths:

      Given the large amount of data, the observations unambiguously show that one particular ParABS system under the conditions studied is carrying out regular positioning of plasmids. The model synthesizes prior work into a nice intuitive picture. These model parameters can be fit to the data leading to estimates of molecular kinetic parameters that are reasonable and in line with other observations. Lining up the experimental observations with the phase space of the model suggests that the system is poised on the edge of oscillations, allowing for the system to have regular positioning with low resource consumption.

      Weaknesses:

      However, despite the correspondence of the simulated results with the experimental findings, other explanations are not completely ruled out. The paper emphasizes that ParA diffusion/hopping on the nucleoid is essential for the establishment of regular positioning and that without it, only oscillations were possible. Prior simulation efforts, that the paper cites, which include ParA diffusion and mixing in the cytosol but no diffusion on the nucleoid have shown that regular positioning is possible and that oscillations could get triggered as the system lengthened. Thus ParA hopping is not a necessity for regular positioning (as claimed in the paper), but very well might be needed for the given kinetic parameters of the system studied here.

      We now comment on this result. In short, we believe that the mentioned model/regime is not relevant due to stochastic effects. We are not able to produce, with biological relevant parameters, regular positioning without ParA hopping.

      The paper also presents experimental results for a second ParABS system (pB171) that is more likely to show oscillations. They attribute the greater likelihood of oscillations for pB1717 being due to ParA exploring a smaller space than the F plasmid system that showed regular positioning. This is pure conjecture and the paper does not provide any evidence that this is the reason. Thus it is hard to conclude if oscillations may not be due to other factors.

      We do not explicitly make that claim. We did have a point in the phase diagram of Figure 8A representing pB171 with a lower value of lambda than F plasmid and stated “The location of pB171 is an estimate based on a qualitative comparison of its dynamics”. We agree this was unclear.

      We now indicate the region that has oscillations with roughly the same period as single plasmids of pB171. We also make it clear that we speculate, but have not shown, that the length scale of ParA hopping is smaller than for F plasmid.

      An important point here is that we can explain both oscillations and regular positioning in the same model with the same kinetic parameters, the regimes being determined by the cell length and plasmid number in a manner consistent with experimental observations.

    1. Peer review report

      Title: Crossref as a source of open bibliographic metadata

      version: 2

      Referee: Simon Porter

      Institution: Digital Science

      email: s.porter@digital-science.com

      ORCID iD: https://orcid.org/0000-0002-6151-8423


      General assessment

      This is a clear paper that outlines a motivation (assess the metadata completeness of the Crossref record for the purposes of scientometric analysis,) along with providing a set of useful metrics to assess the completeness of each metadata field.


      Essential revisions that are required to verify the manuscript

      No essential revisions identified


      Other suggestions to improve the manuscript

      Minor suggestions: Figures in the interactive version of the preprint do not have headings or captions, or a link back to the paper.

      On data availability, In the context of the paper, making the code used to process the Crossref’s XML Metadata Plus Snapshot would be a useful contribution enabling scientometric analysis of the Crossref dataset.

      The following are offered as suggestions that could be added to the paper at the authors discression, but do not effect the content or the conclusions of the peer review

      The authors have chosen to frame metadata completeness of Crossref records as a ‘good in itself,’ leaning on Waltman, L. (2020b) to do the work of setting this up.

      Within this framework, the analysis is offered as a set of observations to help publishers understand where they need to do better. It might be the case that Publishers do not intrinsically understand why making certain metadata types available is valuable to the community.

      On the question of how Crossref can be used in scientometric analysis, readers are left to make up their own minds on what Crossref can be used for today, vs what it might be capable of providing in the future based on the evidence presented. It would be a stronger conclusion to highlight the types of scientometric analysis that are now possible with Crossref, (for instance bibliometric coupling,) and those that require limits or caveats (analysis by affiliation, abstract.) As this analysis lends itself to being rerun in the future, it would be useful to trace advances (hopefully!) not just in terms of the number of things, but also in terms of how sceintometric analysis capability is progressing because of it.


      Decision

      Verified manuscript: The content is scientifically sound, only minor amendments (if any) are suggested.

    1. This way the database can only confirm that a password was the right one, but it can’t independently up what the password is.

      Each password acts as a key, each character is converted into a "sudo" character that acts as a dummy key for the computer. The sudo code is verified and grants the user permission to access the app or platform if it matches up with the code stored in the computer's database.

    1. Reviewer #2 (Public Review):

      Does our proprioceptive system try to recognize our own actions?

      Proprioception is our sense of the motion and posture of our own body. This sixth sense uses signals from receptors in the joints, tendons, muscles, and skin that measure forces and degrees of extension. These receptors enable us to sense, for example, the posture of our body as we wake from sleep. They also provide feedback signals that help us precisely control our limbs, for example during handwriting.

      Feedback is thought to be essential to motor control, enabling the controller in our brains to rapidly adapt to the unexpected. The unexpected may include changes in the environment (like something pushing our hand that we didn't see coming), changes in our bodies (such as muscle fatigue or injury), and shortcomings of the motor program (such as a lack of precision or a badly planned limb trajectory). Feedback can come from vision and even audition, but proprioception provides an essential additional feedback path that informs us directly about the motion and posture of our limbs, and any forces on them.

      How does feedback control work in the human motor system? I want to write a 'k', but there are forces on my limbs resulting from the friction of chalk on this particular blackboard. Also, my muscles are recovering from tennis practice this morning, and I haven't used chalk on a blackboard in years.

      If the goal is to write a 'k', I have some flexibility. I am committed, not to a precise trajectory, but to a more abstractly defined objective: to write a legible 'k'. This suggests that feedback processing should evaluate to what extent I am succeeding at the action, not at tracing out a particular trajectory. Does what I'm actually doing look like writing a 'k'?

      In a new paper, Sandbrink et al. (pp2022) report on simulations of the human musculoskeletal system and neural network models that suggest that the tuning properties of neurons in the somatosensory cortex (S1) can be explained by assuming that the objective of the proprioceptive system is to recognize the action being performed.

      They used recorded traces of a person writing lower-case letters to simulate the responses of muscle spindles sensing the lengths and velocities of muscles in the human arm as would be present if the hand was moved passively along these trajectories. The physical simulation uses a 3D model of the human arm with two parameters for the direction of the upper arm and two more for the direction of the lower arm. These four parameters are inferred by inverse kinematics from the hand trajectories tracing each letter in a variety of vertical and horizontal planes. A 3D muscle model then enables the authors to compute the expected spindle responses that reflect the lengths and velocities of 25 relevant upper arm muscles.

      The authors then trained neural network models of proprioceptive processing that took the simulated muscle spindle signals as input. The neural net architectures included one that first integrates information over the muscle spindles and then across time ("spatial-temporal"), one that integrated across muscle spindles and time simultaneously ("spatiotemporal") and a recurrent long-short-term-memory model.<br /> Each architecture was trained on two objectives: to decode the trajectory (i.e. the position of the hand tracing a letter as a function of time) or to recognize the action (i.e. the letter being traced). The two objectives correspond to two hypotheses about the function of proprioceptive processing: To inform the feedback controller about either the current position of the hand or the letter being drawn.

      The models trained to recognize the action developed tuning more consistent with what is known about the tuning of neurons in the primary somatosensory cortex in primates. In particular, direction tuning with roughly equal numbers of units preferring each direction emerged in the middle layers of the neural network models trained to recognize the action, similar to what has been observed in primate neural recordings. Direction tuning is already present in the muscle-spindle signals, but the spindle signals do not uniformly represent the directions.

      The task-optimization approach to neural network modeling is inspired by work in vision, where neural networks trained on the task of image classification explained responses to novel images in populations of neurons in the inferior temporal cortex. This result suggested a tentative answer to the why question: Why do inferior temporal neurons exhibit the response profiles and representational geometry they exhibit? Because their function (or one of their functions) is to recognize the objects in the images. Here, similarly, the authors address a why question with task-optimized neural network models: Why do somatosensory cortical neurons exhibit the types of tuning that have been reported in the literature?

      The function of proprioception, of course, is not for the brain to recognize which letter it is trying to write. It already knows that. The function is to sense how the current trajectory - the actual, not the intended one - differs from, say, a legible "k" (if that was the intention), and to map from that difference to a modification vector that will improve the outcome.<br /> Why is action decoding relevant for performing the action? A key reason may be that the goal is not to produce a fixed trajectory, but to produce a legible 'k'. A legible 'k' is not a single trajectory, but a class of trajectories containing an infinity of viable solutions. If someone nudged my arm while writing, adaptive feedback control should not attempt to return me to the originally intended trajectory, but to a new trajectory that traces the most legible 'k' that is still in the cards, which may be a different style of 'k' than I originally intended.

      The paper contributes a useful data set for training models and a qualitative comparison of models to real neurons in terms of tuning properties. It would be good, in follow-up studies, to directly test to what extent each of the models can quantitatively predict either single-neuron responses or population representational geometries, as has been done in vision, and to perform statistical comparisons between models.

      Importantly, this paper develops the idea of combining simulations body and brain, of the musculoskeletal system, and the processing of control-related signals in the nervous system, which provides a very exciting direction for future research.

      Strengths

      • The paper introduces a highly original research program that marries simulation of the musculoskeletal system and neural network modelling to predict neural representations in the proprioceptive pathway.<br /> • The authors performed an architecture search and trained multiple instances of different neural network architectures with each of the two objectives.<br /> • The paper includes comprehensive analyses of the proprioceptive representations from the simulated muscle-spindle signals through the layers of the models. These analyses characterize unit tuning, linear decodability, and representational similarity.<br /> • The results suggest an explanation for the direction tuning with a roughly uniform distribution of the units' direction preferences that has been reported previously for neurons in the primate primary somatosensory (S1) cortex.<br /> • If the simulated muscle-spindle data set, models and analysis code were shared along with the published paper, this work could form the basis for quantitative model evaluation and further model development.

      Weaknesses

      • The models are qualitatively evaluated by comparison of model unit tuning to what is known about the tuning of neurons in the somatosensory cortex. Follow-up studies should quantitatively evaluate the models by inferential analyses of their ability to predict measured responses.<br /> • The two training objectives differ in multiple respects, making it difficult to assess what the necessary requirements are for the emergence of representations similar to primate S1. Decoding the hand position may be too simple, but what about decoding velocity, or trajectory descriptors such as curvature? There may be a middle ground between trajectory decoding and action recognition that also leads to the emergence of tuning properties as found in primate S1.

    1. Reviewer #2 (Public Review):

      Zylbertal and Bianco propose a new model of trial-to-trial neuronal variability that incorporates the spatial distance between neurons. The 7-parameter model is attractive because of its simplicity: A neuron's activity is a function of stimulus drive, neighboring neurons, and global inhibition. A neuroscientist studying almost any brain area in any model organism could make use of this model, provided that they have access to 1) simultaneously-recorded neurons and 2) the spatial locations of those neurons. I could foresee this model being the de-facto model to compare to all future models, as it is easy to code up and interpret. The paper explores the effectiveness of this distance model by modeling neural activity in the zebrafish optic tectum. They find that this distance-based model can capture 1) bursting found in spontaneous activity, 2) ongoing co-fluctuations during stimulus-evoked activity, and 3) adaptation effects during prey-catching behavior.

      Strengths:

      The main strength of the paper is the interpretability of the distance-based model. This model is agnostic to the brain area from which the population of neurons is recorded, making the model broadly applicable to many neuroscientists. I would certainly use this model for any baseline comparisons of trial-to-trial variability.

      The model is assessed in three different contexts, including spontaneous activity and behavior. That the model provides some prediction in all three contexts is a strong indicator that this model will be useful in other contexts, including other model organisms. The model could reasonably be extended to other cognitive states (e.g., spatial attention) or accounting for other neuron properties (such as feature tuning, as mentioned in the manuscript).

      The analyses and intuition to show how the distance-based model explains adaptation were insightful and concise.

      Weaknesses:

      Model evaluation and comparison: The paper does not fully evaluate the model or its assumptions; here, I note details in which evaluation is needed. A key assumption of the model - that correlations fall off in a gaussian manner (Fig. 1C-E - is not supported by Fig. 1C, which appears to have an exponential fall-off. Functions other than gaussian may provide better fits. Furthermore, it is not clear whether the r^2s in Fig. 1E are computed in a held-out manner (more details about what goes into computing r^2 are needed). Assessing the model based on peak location alone (Fig. 1E) is not sufficient, as other smooth monotonically-decreasing functions may perform similarly. Simulating from the model greatly improves the reader's understanding (Fig. 2D), but no explanation is given for why the simulations (Fig. 2D) have almost no background spikes and much fewer, non-co-occurring bursts than those of real data (Fig. 2E). A key assumption of the distance model (Fig. 2A) is that each neuron has the same gaussian fall-off (i.e., sigma_excitation and sigma_inhibition), but it is unclear if the data support this assumption. Although an excitatory and inhibitory gain is assumed (Fig. 2A), it is not clear from the data (Fig. 1C) that an inhibitory gain is needed (no negative correlations are observed in Fig. 1C-D). After optimization (Fig. 3), the model is evaluated on predicting burst properties but not evaluated on predicting held-out responses (R^2s or likelihoods), and no other model (e.g., fitting a GLM or a model with only an excitatory gain) is considered. In particular, one may consider a model in which "assemblies" do exist - does such an assembly model lead to better held-out prediction performance? It is unclear why a genetic algorithm (Fig. 1A-C) is necessary versus a grid search; it appears that solutions in Generation 2 (Fig. 3C, leftmost plot, points close to the origin) are as good as solutions in Generation 30 and that the spreads of points across generations do not shrink (as one would expect from better mutations). Given the small number of parameters (7), a grid search is reasonable, computationally tractable, and easier to understand for all readers (Fig. 3A). It is unclear why the excitatory and inhibitory gains of the temporal profiles (Fig. 3I) appear to be gaussian but are formulated as exponential (formula for I_ij^X in Methods). Overall, comparing this model to other possible (similar) models and reporting held-out prediction performance will support the claim that the distance model is a good explanation for trial-to-trial variability.

      Data results: Data results were clear and straightforward. However, the explanation was not given for certain results. For example, the relationship between pre-stimulus linear drive and delta R was weak; the examples in Fig. 4C do not appear to be representative of the other sessions. The example sessions in Fig. 4C have R^2=0.17 and 0.19, the two outliers in the R^2 histogram (Fig. 4D). The black trace in Fig. 4D has large variations (e.g., a linear drive of 25 and 30 have a change in delta R of ~0.1 - greater than the overall change of the dashed line at both ends, ~0.08) but the SEMs are very tight. This suggests that either this last fluctuation is real and a major effect of the data (although not present in Fig. 4C) or the SEM is not conservative enough. No null distribution or statistics were computed on the R^2 distribution (Fig. 4C, blue distribution) to confirm the R^2s are statistically significant and not due to random fluctuations. The absence of any background activity in Fig. 6B (e.g., during the rest blocks) is confusing, given that in spontaneous activity many bursts and background activity are present (Fig. 2E). Finally, it appears that the anterior optic tectum contributes to convergent saccades (CS) (Fig. 7E) but no post-saccadic activity is shown to assess how activity changes after the saccade (e.g., plotting activity from 0 to 60). No explanation is given why activity drops ~30 seconds before a convergent saccade (Fig. 7E). No statistical test is performed on the R^2 distribution (Fig. 7H) to confirm the R^2s (with a mean close to R^2=0.01) are meaningful and not due to random fluctuations.

      Presentation: A disjointed part of the paper is that for the first part (Figs. 1-3), the focus is on capturing burst activity, but for the second part (Figs. 4-7), the focus is on trial-to-trial variability with no mention of bursts. It is unclear how the reader should relate the two and if bursts serve a purpose for stimulus-evoked activity.

      Citations: The manuscript may cite other relevant studies in electrophysiology that have investigated noise correlations, such as:<br /> - Luczak et al., Neuron 2009 (comparing spontaneous and evoked activity).<br /> - Cohen and Kohn, Nat Neuro 2011 (review on noise correlations).<br /> - Smith and Kohn, JNeurosci 2008 (looking at correlations over distance).<br /> - Lin et al., Neuron 2015 (modeling shared variability).<br /> - Goris et al., Nat Neuro 2014 (check out Fig. 4).<br /> - Umakantha et al., Neuron 2021 (links noise correlation and dim reduction; includes other recent references to noise correlations).

    1. Providers supporting open source systems can be a useful entry point for libraries thatmay not have the resources to do in-house development but still wish to move to an open source solution. It is criticalto ensure that those vendors support the larger open source project community and contribute their work back to theoriginal code base.

      Importance of vendors providing open source support contribute back to the community

    1. <link rel="preconnect" href="https://fonts.googleapis.com"> <link rel="preconnect" href="https://fonts.gstatic.com" crossorigin> <link href="https://fonts.googleapis.com/css2?family=Alkalami&family=Oswald:wght@500&family=Poppins:wght@200&display=swap" rel="stylesheet">

      Combine all google fonts into because to avoid repetition of code

    1. Thus an infant is not just recognizing the mother’s voice. There is also immediate sound discrimination: the beginning of ‘breaking the code’.

      I know nothing about newborns or their neurological functioning. However, I found it interesting how newborns can begin 'breaking the code' of language, at a stage in their lives when they cannot do much for themselves.

      • Dominic
    1. background: linear-gradient( 90deg, rgba(2, 0, 36, 1) 0%, rgba(40, 121, 9, 1) 35%, rgba(0, 212, 255, 1) 100% );

      I think you can comment this. The code below this has the same purpose, giving gradient and adding background image.

    1. <samp class="highlight">live</samp> <small class="color"><b>Breaking</b></small> <samp class="country">UK</samp>

      <samp> is used to highlight code, it may be better to use a simple <span>

    1. the dynamic structure of the programming language was the dynamic structure of the document and it's just it's a different world 00:40:27 he he had to struggle with getting the programming language evolved and adapted into
      • the dynamic structure of the programming langage
      • was the dynamic structure
      • of the doument

      This IS Intentional Programming

      !- claim : NLS was a nascent Intentional Software System - It was not a language work bench - but a system oriented - a collaboration platform designed to bootstrap itself - into existence and continual improvement and in fact - it was kernel based - and at the heart it had a Command Language Interpreter - that allowed thinking and creating software - as a conversational system - driven by a command language - that was used to articulate the intentions - in terms of other intentions, combinations of intentions ultimately - groundrd in primitiver capabilities - that were implemented in machine code for the target system - migration from one machine to the next - system desigtners and devlopers were able to articulate - ideas and means of realizing those ideas in - self structured forms and used the same capabilities - to organize the embodiment of these intentions - in code that when run exhibited the desired behaviour

    1. if you're thinking without 00:03:26 writing chances are you're fooling yourself we're only

      If you're thinking without writing, you only think you're thinking. —Leslie Lamport.“Thinking Above the Code.” Lecture presented at the Microsoft Research Faculty Summit, Microsoft Research, July 15, 2014. https://www.microsoft.com/en-us/research/video/leslie-lamport-thinking-code/. Timestamp: 03:26

      Link to:<br /> https://hypothes.is/a/rvisgFDXEe2s-SuJJGw3cA<br /> https://hypothes.is/a/yEFMHoCkEeyl34fItJe__w

      Note that the spoken quote is different from the written quote.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Summary:

      In this manuscript, authors establish a glyco-profiling platform for the functional analysis of genes involved in pseudaminic (Pse) and legionaminic (Leg) acid biosynthetic pathways. They used B. subvibroides and C. crescentus specific mutants in pseI and legI genes involved in the Pse and Leg biosynthesis, respectively, and cross-complementation assays with orthologous genes from different bacterial species, analysing motility and flagellin glycosylation. These assays show that Pse and Leg biosynthetic pathways are genetically different and recognize the LegX enzyme as a critical element in the Leg-specific enzymatic biosynthesis. Since that legX orthologous were only identified in the genome of bacteria with Leg biosynthetic pathways, it becomes a good marker to distinguish Leg from Pse biosynthesis pathways and a novel bioinformatic criterion for the assignment and discrimination of these two pathways. Reconstitution of Leg biosynthetic pathway of B. subvibroides in the C. crescentus mutant that lack flagellins, PseI and FlmG, complemented with both flagellin and FlmG of B. subvibroides, identified a new class of FlmG protein glycosyltransferases that modify flagellin with legionaminic acid. Furthermore, the construction of a chimeric FlmG through domain substitutions, allowed to reprogram a Pse-dependent FlmG into a Leg-dependent enzyme and reveal two modular determinants that govern flagellin glycosyltransferase specificity: a glycosyltransferase domain that accepts either Leg or Pse, and a specialized flagellin-binding domain to identify the substrate.

      Major comments:

      The conclusions obtained are convincing and well-supported. However, I think some points should be specify or clarify.

      1.- In the mutants (pseI, legI, flmG,...) the non-glycosylated flagellin are exported and assembled in a flagellum filament shorter than the WT strain. However, motility in plates is absent or very reduced. This might be produced by instability of the flagellum filament when rotating in a semi-solid surface. MET was performed from plates or liquid cultures? Do the author analyses motility in liquid media? If they did, changes in motility were observed?

      Response: The Caulobacter ΔpseI mutant accumulates low levels of flagellin in the supernatant. TEM analysis reveals that the flagellar filament is not assembled and only the hook structure is visible (PMID: 33108275). Brevundimonas subvibrioides ΔlegI or ΔflmG cells feature a shorter filament compared to WT by TEM. In all these analyses, TEM was performed on cells grown in broth to exponential growth phase as detailed in the Experimental procedures section. These mutant cells do not swim when analyzed by phase contrast microscopy. While is not known if swimming on semi-solid medium would further destabilize the flagellar structures seen in liquid cultures by TEM, there is more residual motility in B. subvibrioides mutants that make a short filament compared to C. crescentus mutants that lack the flagellar filament. Thus, our analyses point to a positive correlation between the residual motility and residual filament length when comparing the B. subvibrioides and C. crescentus mutants.

      2.- In page 5, lines 158-163, the analysis, by HPLC, of derivatized nonulosonic acid from B. subvibroides flagella, shows a major peak at 9.8 minutes retention and a minor peak at 15.3 minutes. Since that Pse-standard have retentions peaks at 9.7 and 13 minutes, and Leg-standard at 12.3 minutes, the authors cannot infer, only with these data, the flagella sugar is a legionaminic acid derivative. In my opinion, should be included that inference comes from the data obtained by HPLC analysis and genetic approaches. Thanks. Corrected. 3.- In page 5, line 173-175. Authors indicate, "While no difference in the abundance of flagellin was observed in extracts from mutant versus WT cells, flagellin was barely detectable in the supernatants of mutant cultures, suggesting flagellar filament formation is defective in these mutants". MET images show that the flagellum filament length is shorter in the mutants than in the WT strain. Therefore, if the same number of mutants and WT cells has been used in the immunodetection assays, there should be more flagellin monomers in the WT samples than in the mutants ones and flagellin bands should be less intense in mutant samples corresponding to the anchored flagellum. Why bands corresponding to flagellin in mutants and WT show similar intensity in the immunodetection assays (Figure 3C and D)? Furthermore, in lane 177-178, authors suggest that LegI and FlmG govern flagellin glycosylation and export (or stability after export). However, if filament stability is affected, the amount of flagellin monomers in the supernatant of mutants should be higher than in the WT. However, immunodetection assays show less abundance of flagelin monomers in the supernatant of mutants. Please, can you clarify this? In relation to this point, I suggest that authors include, in the experimental procedures, how they obtained the supernatants to flagellin immunodetection, as well as why they used anti- FljKCc anti-serum to detect the B. subvibroides flagellin.

      We thank the reviewer for raising this point. We have now clarified this question in the updated Experimental procedures section. Our immunoblots harbor the same number of cells harvested in exponential phase (OD=0.4). One mL of cells was harvested from cultures by centrifugation at full speed. The supernatant that was used for the immunodetection corresponds to the supernatant after the centrifugation. The supernatant fraction contains flagella that have been shed during the cell cycle at the swarmer cell to stalked cell (G1-S) transition of C. crescentus and B. subvibrioides.

      Thus, it is clear that the majority of flagellins detected by immunoblotting are in fact cell associated and specifically the intracellular flagellins. The evidence for this is that the levels are comparable between WT and ΔflmG mutant cells, even though the latter has shorter or no flagellar filaments. Moreover, while C. crescentus cells are not constantly flagellated during the cell cycle, flagellins are detectable on cell-associated samples by immunoblotting even when cells do not yet or no longer have a flagellar filament. Based on these two points, we conclude that the total flagellin levels associated with cells do not reflect the levels of flagellin assembled into a flagellar filament, but rather the flagellin bulk present in the cytoplasm.

      Consistent with this view, we previously reported that C. crescentus ΔpseI cells have the same amount of flagellins in cell lysates compared to the WT strain (PMID: 33108275), even though the mutant cells lack a flagellar filament. Thus, the results obtained here are consistent with previous observations and indicate that B. subvibrioides flagellin glycosylation mutants also still produce comparable amounts of flagellins intracellularly like the WT strain, despite the absence of flagellin glycosylation and inefficient assembly into a flagellar filament.

      Concerning the potential role of LegI and FlmG in flagellin stability after export, we were referring to protein stability (half-life), not filament stability. Glycosylation may impact the half-life of extracellular flagellins since glycosylation can protect from proteolytic degradation of proteins, possibly in this case by different proteases that may accumulate in the supernatant. Thus, non-glycosylated flagellins could be more easily degraded by extracellular proteases once they are exported, ultimately resulting in a lower amount in the supernatant.

      Addressing the final question about the specificity of the anti-FljKCc antiserum: we used this anti-serum because it detects the B. subvibrioides flagellins owing to the high sequence similarity between B. subvibrioides flagellins and C. crescentus flagellins. We previously showed that the anti-FljKCc anti-serum detects all six flagellins from C. crescentus, as determined by individually expressing each flagellin in a strain deleted for all six flagellin genes (Δfljx6) (PMID: 33108275). FljKCc (against which the antibody was raised) is 65% similar to the most distant C. crescentus flagellin, FljJ. As the similarity of FljKCc to the three B. subvibrioides flagellins ranges from 74% -67% sequence similarity, they should be even better recognized by the anti- FljKCc antibody than C. crescentus FljJ. However, on immunoblots we cannot attribute the signal to any individual B. subvibrioides flagellin as they could all co-migrate on SDS-PAGE and therefore all flagellins might reside in the same immunoblot band. However, we can clearly demonstrate that the immunoblot band corresponds to flagellins: a B. subvibrioides ΔflaF mutant (see below) that we constructed revealed that the flagellin signal is lost, as is the case for a C. crescentus ΔflaF mutant (PMID: 33113346). In the case of C. crescentus, the FlaF secretion chaperone is required for flagellin translation (synthesis) and we suspect that this also the case for B. subvibrioides FlaF. This experiment provides additional evidence that the B. subvibrioides flagellins are recognized by the anti-FljK (C. crescentus) anti-serum.

      4.- Authors demonstrate the specificity of the GT-B domain of FlmG, using a chimeric FlmGCc-Bs in a mutant of C. crescentus that lacks FlmG and harbour the Leg biosynthetic pathway of B. subvibroides. However, since that TPR comes from C. crescentus, this chimeric protein, could be transfer the legionaminic acid to the flagellin of B. subvibroides? Furthermore, the complementation of this mutant with the FlmGBs did not support efficient flagellin modification and this might be related to the TPRCc domain. Therefore, in my opinion, the chimeric protein should be introduced in the B. subvibroides∆flmG background. The answer to the first question is “No” or “very inefficiently” as determined from immunoblot analyses of B. subvibrioides ΔflmG cells expressing the chimeric FlmG_Cc-Bs protein that we now show in Fig S2B.

      Expression of the different FlmG (FlmG_Cc, FlmG_Bs, FlmG_Cc-Bs) in C. crescentus cells producing Pse or Leg revealed that FlmG_Bs does not support efficient flagellin modification with Pse in C. crescentus, likely because FlmG_Bs interacts poorly with the C. crescentus flagellins. By using the FlmG_Cc-Bs chimera we hoped to overcome this interaction problem with the C. crescentus flagellins (because the FlmG chimera harbors the C. crescentus TPR to bind the C. crescentus flagellins), however glycosyltransfer still does not occur efficiently because the GT domain from FlmG_Bs does not function with Pse. However, FlmG_Cc-Bs can modify the C. crescentus flagellins once C. crescentus is genetically modified to produce CMP-Leg (instead of CMP-Pse). This confirms that the FlmG TPR from C. crescentus is important for flagellin modification through the FlmG/flagellin interaction and that GT_B type glycosyltransferase only transfers Leg. In addition, we have now added as Fig S2B an immunoblot and as Fig S2C a motility test of B. subvibrioides ΔflmG cells expressing the FlmG_Cc-Bs chimeric protein in which we only observed little modification of B. subvibrioides flagellins and a poor motility, respectively. We extended our discussion of these results.

      5.- Page 8, line 299-301. Authors point out that C. crescentus that lacks FlmG and harbour the Leg biosynthetic pathway of B. subvibroides and the chimeric FlmGCc-Bs, although it has a glycosylated flagellin, whose mobility in SDS-PAGE is like the WT strain, is non-motile. They suggest that additional factors exist in the flagellation pathway that exhibit specificity towards the glycosyl group that is joined to flagellins. However, would be interesting to see if the flagellum filament has similar length to the WT strain or at least, it has increased in relation to the flagella length of the mutant. If flagella length has not increased, it could suggest that changes in the glycan type might affects the flagellin assembly or the stability of the flagellum filament. Therefore, would be also important to analyse its motility in liquid media.

      To investigate why the C. crescentus cells that produce Leg and express the chimeric FlmGCc-Bs glycosyltransferase are non-motile (Figure S5B) despite flagellin modification (by immunoblotting, Figure 7C), we employed two strategies. First, we performed immunoblot analyses on the supernatant fraction from these cells to determine if flagellins accumulate extracellularly. As now showed in Figure S5A, only low amounts of C. crescentus flagellins modified by Leg are present in the SN fraction. Second, we conducted TEM analyses of cells grown to exponential growth phase in broth. As shown in Figure S5C, the C. crescentus cells producing Leg and expressing FlmG_Cc-Bs glycosyltransferase harbor a shorter flagellum compared to those expressing the FlmG_Cc in which C. crescentus flagellins are modified by Pse. Altogether these results explain why these cells are non-motile both on soft agar plate and in liquid.

      Minor comments: 1.- Pag 3 line102. Please change ".....two predicted synthases, a PseI and LegI homolog, and C. crescentus only encodes only PseI...." to ".....two predicted synthases, a PseI and LegI homolog, and C. crescentus only encodes a PseI...." 2.- Figure 2 A. Plasmid nomenclature (Plac-neuB) is confusing because C.c. ΔpseI cells express predicted LegI or PseI synthases. Please change to Plac, as in Figure 2B and 4. Figure 2A and 2B do not contain any complementation with Bacillus subtilis (Basu), however two complementation are labelled as Bs in Figure 2A and 2B. Furthermore, no Bs are present in the Figure 2 legend. 3.- Legend of figure 3 should include B. subvibrioides abreviation Bs. Line 774: Please change ".......glycosylation and secretion in B. subvibrioides." to ".......glycosylation and secretion in B. subvibrioides (Bs)." 4.- Figure 3. In order to keep a similar nomenclature in all plasmids, plasmid Plac-legI syn and Plac-flmG should be labelled as Plac-legIBs syn and Plac-flmGBs.

      5.- Legend of figure 4 should include B. subvibrioides abreviation Bs. Line 791: Please change "....... complementation of the B.subvibrioides ΔlegI mutant with ...." to "....... complementation of the B.subvibrioides (Bs)ΔlegI mutant with ...." Furthermore, Legend of figure 4 indicate in line 795, that immunoblots reveal the intracellular levels of flagellin, however figure 2 and 3 show immunoblot of cell extracts. Please, correct this sentence. 6.- Legend of figure 5, 6 and 7 should include B. subvibrioides abreviation Bs. Line 808: Please change "Predicted Leg biosynthetic pathway in B. subvibrioides " to"Predicted Leg biosynthetic pathway in B. subvibrioides (Bs)" Line 834: Please change "....affects motility, flagellin glycosylation and secretion in B. subvibrioides."to "....affects motility, flagellin glycosylation and secretion in B. subvibrioides (Bs).Line 852: Please change "...acetyltransferase in flagellar motility of B. subvibrioides cells." to ""...acetyltransferase in flagellar motility of B. subvibrioides (Bs) cells." Furthermore, figure 5 should include C. crescentus abbreviation. Line 815: Please change "....whole cell lysates from C. crescentus mutant cultures......." to "....whole cell lysates from C. crescentus (Cc) mutant cultures......." 7.- In my opinion it would be useful to include a scheme of the gene organization involved in Leg biosynthesis in B. subvibrioides.

      8.- Legend of figure S1 should include B. subvibrioides (Bs) and C. crescentus (Cc) abbreviations. Line 888-867: Please change "...C. crescentus ΔpseI cells and B. subvibrioides ΔlegI cells with plasmids expressing..." to "...C. crescentus (Cc) ΔpseI cells and B. subvibrioides (Bs) ΔlegI cells with plasmids expressing..." Furthermore, the name and abbreviations (Mm, So, Ku, Pi, Dv) of the species used should be included in the legend. Why the authors used a plasmid with a Pvan promoter in these assays? Why the authors changed the code color of pseI and legI orthologous genes? It would be more useful and understandable follow the code color used in figure 2 and 4.

      Page 6 line 200, Please change ".....complementing synthases exhibit greater overall sequence similarity to LegI than Pse of C. jejuni. 22268,....." to ".....complementing synthases exhibit greater overall sequence similarity to LegI than PseI of C. jejuni. 22268,....." 10.- Page 7 line 231, Please change ".....negative bacteria A. baumannii LAC-4 (GCA_000786735.1)[38] and P. sp. Irchel 3E13..." to ".....negative bacteria A. baumannii LAC-4 (GCA_000786735.1)[38] and Pseudomonas sp. Irchel 3E13..." 11.- Introduce a line break between line 503 and 504. 12.- Page 14 line 543, please change "XbaI" to "XbaI" Thanks for the careful editing. We changed the text as suggested by the reviewer. We also added a scheme showing the genetic organization of the genes involved in Leg production and present as Figure 1B. When this study was initiated, the pMT335 plasmid with a Pvan promoter was used before we switched to using the pSRK plasmid with Plac promoter for better induction. Note that the results with Pvan or Plac are comparable regarding the PseI synthases interchangeability. Color code is now homogenous through the manuscript.

      Reviewer #1 (Significance (Required)):

      This is an interesting manuscript that contributes to the knowledge of the legionaminic biosynthetic pathway and establish a glyco-profiling platform for the functional analysis of genes involved in pseudaminic (Pse) and legionaminic (Leg) acid biosynthetic pathways. The analysis of Leg patway allowed to identify a gene (legX) that can be used to distinguish Leg from Pse biosynthesis pathways, becoming a bioinformatic tool for the assignment and discrimination of these two pathways. Furthermore, a new class of FlmG protein glycosyltransferases, able to transfer Leg to the flagellin, has been identified and its analysis reveal two modular determinants that govern flagellin glycosyltransferase specificity: a glycosyltransferase domain that accepts either Leg or Pse, and a specialized flagellin-binding domain to identify the substrate.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary: Viollier and co-workers present a study in which they preform an elegant and rigorous genetic profiling of the the legionaminic and pseudaminic acid biosynthesis and flagellar glycosylation pathways in C. crescentus (native Pse) and B. subvibrioides (native Leg). They use motility as a representative readout for functional flagellar glycosylation with these microbial sialic acids. They discover orthologous Pse synthase genes can replace the function of the native synthase in C. crescentus and orthologous legionaminic acid synthase genes can achieve the same in B. subvibrioides. However, not vice versa indicating a strong preference for each microbial sialic acid stereoisomer in these species. For the Leg biosynthesis pathway, which requires GDP-GlcNAc, the authors also identify LegX as an essential component to synthesize this sugar nucleotide and thus a marker for Leg biosynthesis pathways. Upstream in theses pathways, they also identify a new class of FlmG flagellar protein glycosyltransferases. Importantly, through heterologous reconstitution experiments to uncovered that these glycosyltransferases possess two distinct domains, a transferase domain the determines specificity for either CMP-Leg or CMP-Pse, and a flagellin-binding domain to achieve selectivity for the substrate. Interestingly, by creating chimeric FlmG for these two domains between C. crescentus and B. subvibrioides they show that these two modular parts can be interchanged to adapt flagellin glycosyltransferase specificity in these species. Major comments: The key conclusions of the manuscript by Viollier and co-workers are convincing and well supported by their experiments and used methods, with respect to the insulation of the Leg and Pse biosynthetic pathways, they key role of LegX in launching the Leg pathway and the successful reconstitution of Leg glycosylation in a previously Pse-producing C. crescentus strain. Finally, they convincingly show that a chimeric version of the involved glycosyltransferases is functional, which besides intriguing future glycoengineering possibilities also emphasizes the two discrete domains in these transferases that dictate their sugar nucleotide and acceptor specificity. There is one additional experiment I would suggest with relation to the detection and confirmation of Pse and Leg on flagella of respectively, C. crescentus and B. subvibrioides. In the case of C. crescentus the detected DMB derivatized monosaccharide co-elutes with a validated standard of tri-acetylated Pse, which is convincing evidence of its identity. However, for B. subvibrioides. Their DMB derivatized monosaccharides from its flagella, results in a peak the does not co-elute with the only Leg standard (Leg5Ac7Ac) they have, it does elute at the same time as their Pse standard. Although it cannot of course be Pse as B. subvibrioides. Does not possess a Pse biosynthesis pathway, it also does not provide enough evidence to conclude that it is a Leg derivative. An MS(-MS) measurement of the eluted signal would not be a big investment in time and resources and would provide additional evidence to at least assign this peak to microbial sialic acid related to the present Leg biosynthesis pathway. It the identified mass would lead to identification of the derivative, it would also add to the proper characterization of the flagella glycosylation in the bacterium.

      We have now added the glycopeptide analyses as requested. They are described in the last experimental section and confirm our results.

      The data and the methods presented in this study are presented with sufficient detail so that they can be reproduced? However, I would suggest as is common nowadays in most journals that the authors include images of the raw unprocessed blot in de supporting info.

      *The motility pictures are representative of three independent experiments and the immunoblots are representative of at least two independent experiments. This has now been mentioned in the Experimental procedures. The raw unprocessed blots have now been added as supporting info. *

      Minor comments: There are a few textual errors that the authors should fix: -page 2, line 70: change "used" to "use" -page 11, line 407: add the word "are" after Pse On page 2, line 36, the authors state that "most eubacteria and the archaea typically decorate their cell surface structures with (5-, 7-)diacetamido derivatives, either pseudaminic acid (Pse) and/or its stereoisomer legionaminic acid (Leg,". This should be nuanced as to my knowledge it is not most eubacteria, but more a subset as identified by Varki in his seminal PNAS paper. The authors clearly present their data and conclusions in the figures of this manuscript. However, I would recommend the take a critical look at the drawing of their monosaccharide chair conformations and the positioning of the axial and equatorial groups on these chairs in Figure 1 and 5, as these are in most cases drawn a bit crooked, which can easily be corrected. We corrected the text as the reviewer suggested. We changed the sentence in the introduction to be more nuanced. The drawing of the monosaccharide has been improved.

      Reviewer #2 (Significance (Required)):

      The family of carbohydrates called sialic acids was long thought to exclusively occur in glycoproteins and glycolipids of vertebrates, but has since also been found in specific microbes. Especially symbiotic and pathogenic microbes associated with the humans express a wide array of unique microbial sialic acids for which their functional roles are not well understood and the associated glycosylhydrolase and glycosyltransferase have in most cases not been identified yet. The authors present an impressive insight into flagellar glycosylation with Pseudaminic and Legionaminic acid in two bacterial species, using genomic analysis, rewiring, immunoblots and motility assays as their main tools. They provide compelling evidence on the insulation of the Pse of Leg pathway in these species, the flexibility in exchanging between biosynthetic enzymes from the same pathway between various species. Crucially, most glycosyltransferases that add the Pse or Leg glycoform onto various acceptor sites in bacteria, have up to this point remained elusive in most cases. It is therefore very valuable information that the authors here provide on the involved glycosyltransferases. Especially, on the two domains that govern their sugar nucleotide and acceptor specificity, and that these can be reengineered as chimeric glycosyltransferases. To me as a chemical glycobiologist this provides compelling possibilities for glycoengineering possibilities in future studies in the field to elucidate the functional roles of Pse and Leg glycosylation.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary of the findings and key conclusions (including methodology and model system(s) where appropriate): Kint et al describe a neat study of bacterial flagellin glycosylation by a recently identified class of protein glycosyltransferases called FlmG. The experiments are well designed, the data presented is convincing and the conclusions drawn are mostly in line with the experimental evidence presented. These are the key findings. Kint et al show that genetic tools and motility can be used as a readout to probe the sugar biosynthesis pathway in bacteria. Using the recently characterized system of Caulobacter crescentus, they have performed a survey of different PseI/LegI/NeuB genes from various bacteria, checking whether they could rescue the motility defect in C. crescentus ΔpseI cells. They found that those genes that did confer motility also had higher sequence similarity to C. jejuni PseI than to C. jejuni LegI or C. jejuni NeuB. They also found that these genes also restored flagellin glycosylation as checked by mobility shift on gel electrophoresis with immunoblotting to anti-FljK antibody. This survey brought up an interesting finding that the PseI/LegI/NeuB orthologs of the closely related Brevundimonas species were unable to confer motility to C. crescentus ΔpseI cells, and were more similar to C. jejuni LegI than to C. jejuni PseI or C. jejuni NeuB. They also performed similar glycoprofiling experiments using B. subvibrioides ΔlegIBs cells and various PseI/LegI/NeuB orthologs from different bacteria, which indicated the restoration of motility by putative LegI synthases. Kint et al demonstrate flagellin glycosylation in B. subvibrioides by performing in-frame deletions of FlmG, and LegI genes in B. subvibrioides and checking for motility, presence of flagella, and flagellin glycosylation by motility shift on gel electrophoresis. Further, they confirm the critical nature of GDP-GlcNAc for Leg biosynthesis by assessing flagellin glycosylation and motility in B. subvibrioides with an in-frame deletion in PtmE/LegX and by performing heterologous complementation with an M. humiferra PtmE ortholog. They also reconstitute the legionaminic acid biosynthesis pathway in C. crescentus cells that lack flagellins, PseI and FlmG, and show that the heterologously expressed B. subvibrioides flagellin is glycosylated by heterologously expressed B. subvibrioides FlmG. Finally, they also show that whereas the CcFlmG cannot substitute for BsFlmG and vice versa, a chimeric FlmG bearing the TPR domain from C. crescentus FlmG (that recognizes C. crescentus FljK) and the GT domain from B. subvibrioides FlmG (that transfers CMP-Leg) modifies CcFljK in C. crescentus cells that lack CcFlmG but express both Pse (endogenously) and Leg (from the reconstituted pathway). This demonstrates the modularity of the FlmG glycosyltransferases. Kint et al provide the chemical nature of C. crescentus flagellin glycosylation. Kint et al have analyzed the glycans released from the flagellin by acid hydrolysis and clearly shown the nature of the glycan in C. crescentus flagellin to be Pse4Ac5Ac7Ac by use of Pse standards. The glycan from B. subvibrioides was distinct from the Leg standard used, and could be a Leg derivative distinct from Leg5Ac7Ac.

      Major comments: 1. Table 1 and Text in Results, lines 116-119, "In support of the notion that derivatization occurs after the PEP-dependent condensation reaction to form Pse or Leg, our glyco profiling analysis revealed that putative PseI proteins (identified by sequence comparisons to C. jejuni 11168, Table S1) conferred motility to C. crescentus ΔpseI cells, whereas putative LegI synthases did not." Not clear how putative PseI and LegI synthases were identified. Table 1 only lists overall percent sequence identity and similarity to Cj PseI, LegI and NeuB, and percent identities and similarities of the various nonulosonic synthases to these proteins are in the similar range, as expected. In the absence of sequence alignments indicating the presence of conserved residues, particularly related to the substrate binding region, that are distinct in these paralogs, calling out the type of synthase based on the highest percent identity (to Cj PseI, LegI or NeuB) is speculative. Also, Shewanella oneidensis does not follow the pattern of highest similarity to NeuB3. Second, in the absence of data showing that the Leg and Pse found in these different organisms actually are different derivatives, this does not support that "derivatization occurs after the PEP-dependent condensation reaction to form Pse or Leg". Putative PseI and LegI were proposed based on BlastP analyses in which the protein sequences of interest were aligned to the three experimentally validated synthases from C. jejuni 11168: PseI, LegI, NeuB as well as PseI from C. crescentus, as indicated in Table S1. While, the assignment of the donor sugar is based only on the sequence identity and similarity to LegI or PseI, this assignment corresponds well according to the restoration of the motility of the C. crescentus ΔpseI mutant upon expression of PseI ortholog and B. subvibrioides ΔlegI mutant with heterologous LegI expression.

      It is true that for Shewanella oneidensis the assignment as PseI or LegI is ambiguous, exhibiting nearly identical similarity, but it is quite distinct from NeuB. This actually makes the S. oneidensis synthase a very interesting case to explore the enzymology of its Pse/LegI ortholog, knowing that it has been previously shown that this bacterium glycosylates its flagellins with Pse derivatives (PMID: 24039942). The results from our genetic complementation analysis are however very clear (PseI ortholog) and very consistent with the functional analysis in S. oneidensis.

      Concerning the different derivatives of Pse or Leg: McDonald and Boyd (PMID 32950378) recently published a review giving some examples of Bacteria/Archaea experimentally shown to contain Pse/Leg-derivatives: C. jejuni 11168 modifies its flagellin with 5,7-N-acetyl Pse, Sinorhizobium fredii NGR234 (not used in this study but in our previous work PMID 33113346 and showed to restore the motility of C. crescentus ΔpseI cells) modifies its capsule with 5-acetamido-7-3-hydroxybutyramido-Pse), Treponema denticola modifies its flagellin with 7-(2-metoxy-4,5,6-trihydroxy-hexanoyl-Pse, A. baumannii LAC-4 produces 5,7-N-acetyl-8-epi-Leg to decorate the capsule, Halorubrum sp. PV6 modifies the LPS with N-formylated Leg and L. pneumophila produces 5-acetamidino-Leg.

      The reviewer is right in that we do not know the exact version of Pse or Leg produced in C. crescentus and B. subvibrioides, HOWEVER, the fact that complementation works with the majority of the orthologs of PseI and LegI including many from bacteria that are known to produce modified Pse derivatives for example in Shewanella oneidensis and Treponema denticola, the most likely explanation is that derivatization occurs after the PseI or LegI step, but we concede that the results are also compatible with a promiscuous enzyme that can accept different Pse derivatives or different Leg derivatives.

      1. Related to (1), Text in Results, lines 130-131, "We conclude from our survey that (heterologous) PseI synthase activity generally confers motility to C. crescentus ΔpseI cells, whereas LegI-type (or NeuB-type) synthases are unable to do so." There is no a priori evidence provided indicating that these were PseI or LegI type synthases. So the conclusion really is that assuming only PseI type synthases would be able to rescue the motility defect in C. crescentus ΔpseI cells, this glyco-profiling motility assay now provides the first biochemical evidence telling us which synthases are Pse-type, and which are Neu/Leg-type. And in my view, this is the conclusion of greater significance in the field - to be able to now identify which is a PseI and which a LegI based on these complementation assays. However, if the authors still wish to retain their original conclusion, they could cite or provide evidence (either biochemical evidence in this work or reported literature regarding the sugar synthesized or bioninformatics analysis regarding the presence of distinct genes such as the Ptm genes for legionaminic acid biosynthesis pathway or genes that differ in their enzyme activities and overall fold such as PseB/LegB or PseG/LegG in the gene neighborhood) indicating or suggesting the PseI/LegI/NeuB nature of the different synthases. Also, methods for the bioinformatics analysis (eg. BLASTp settings used, dates of searches, whether regular BLAST or PSI-BLAST was used, etc.) are missing in the manuscript, and need to be included. We agree that for many PseI or LegI tested, there is no provided biochemical evidence. HOWEVER, this is not the case for some of them including the PseI, LegI and NeuB from Campylobacter jejuni (PMID 19282391), some A. baumannii strains (α-epi-legionaminic acid for A. baumannii LAC-4 PMID 24690675), Shewanella oneidensis (Pseudaminic acid with methylation PMID 23543712), Legionella pneumophila (Legionaminic acid PMID 18275154) or Halorubrum sp. PV6 (N-formylated legionaminic acid PMID 30245679). Thus, we maintain the two conclusions: the PseI and LegI synthases are generally interchangeable and the complementation assays can enable to identify and assign PseI and LegI function. BLAST2P was used to compare the protein sequences of the tested NeuB-like synthases with NeuB1, LegI (NeuB2) and PseI (NeuB3) from Campylobacter jejuni but also with PseI from C. crescentus. BLOSUM62 matrix was used as well as a word of size 3 for the comparison. We have now added this procedure in the legend of the Table S1.
      2. It is interesting that there is still a signification amount of flagellin secretion/assembly in the B. subvibrioides LegI and FlmG mutants. It will be good to see a discussion about whether this is likely from due to low level of function despite the in-frame deletion of genes; how many flagellin subunits are likely to have managed secretion and assembly in these short flagella; whether there is any redundancy of LegI / FlmG (perhaps with lower levels of expression); considering Parker and Shaw's findings of glycosylation being required for flagellin binding to the chaperone and subsequence secretion in A. caviae whether there is a FlaJ homolog in B. subvibrioides. Also, can the authors rule out the possibility that absence of glycosylation does not affect flagellin assembly but makes the flagellum prone to shear/breaks in B. subvibriodes, resulting in smaller flagella? How many flagellins are there in B. subvibrioides? Is it possible that one is glycosylated but another/others are not, and that is the reason for the small flagellum in these mutants? The number of flagellin subunits that are assembled into a full-length flagellar filament is unknown in C. crescentus and in B. subvibrioides. There are 3 different flagellin genes that are now presented schematically in Figure 1C. No redundancy has been found for LegI or FlmG. It is possible that the B. subvibrioides is better in exporting non-glycosylated flagellin or that the capping proteins can function better with sugar modification or that the filament of B. subvibrioides mutants is less fragile when it is non-glycosylated or that its flagellins “stick” better. It is also possible that short filaments are not actually containing flagellins mounted on the hook but another protein that polymerizes aberrantly in the absence of Leg or FlmG. This remains to be investigated and compared to the situation of Pse and FlmG mutants of C. crescentus.

      B. subvibrioides possesses an ortholog of the C. crescentus flagellin secretion chaperon FlaF (PMID 33113346). As observed in C. crescentus, FlaF likely has a role in flagellin translation as its inactivation totally prevents flagellins production (see answer to reviewer #1). For C. crescentus, bacterial two hybrid experiments revealed that FlaF can interact with non-glycosylated flagellins in E. coli. Thus, it is strongly possible that FlaF/flagellins interaction is not dependent on the flagellins glycosylation state. In addition, the short flagellum filament observed in B. subvibrioides ΔlegI or ΔflmG mutants argues that at least some flagellins are secreted while not glycosylated.TEM pictures have been performed in liquid medium from exponential growth phase. In this condition, no fragment of flagella was observed in the culture medium by TEM but only small flagella with a hook structure attached. Also, flagella breaks might result in more random length of flagellum.

      Three flagellins are in B. subvibrioides (Bresu_2403 is 59% identical with FljLCc, Bresu_2638 is 57% identical with FljKCc and Bresu_2636 is 62% identical with FljJCc). We now show this genetic organization of the flagellins in Fig. 1C. The three flagellins are all detected by the anti-FljKCc anti serum (see answer and figure to reviewer #1). We cannot attribute the immunoblot signal to any individual B. subvibrioides flagellin as they could all co-migrate on SDS-PAGE. However, the signal often looks like a doublet (as shown in Figure 4B for example) suggesting that at least two flagellins are detected and this doublet is always found to migrate faster in absence of glycosylation that could indicate that all B. subvibrioides flagellins (or at least 2) are modified.

      Text in Results, lines 170-171, "We then probed the resulting ΔlegIBs and ΔflmGBs single mutants for motility defects in soft agar and analyzed flagellin glycosylation by immunoblotting using antibodies to FljKCc". Was the antibody to FljKCc determined to also specifically bind to FljKBs? Also, how many flagellins are there in B. subvibrioides? Are all detected with this antibody? Antibodies raised to FljKCc were raised against His6-FljK produced in E. coli (previously published in Ardissone et al, 2020). This serum recognizes the 6 flagellins from C. crescentus (PMID: 33108275). It recognized the three flagellins from B.s. (see answer to reviewer #1).

      It is interesting that C. cresentus cells expressing Pse (endogenously) and Leg (reconstituted pathway), and BsFlmG and BsFljK (corresponding to Figure 5C) are not motile. Was the motility assay done for the experiment of figure 5B as well? Are the C. crescentus cells lacking Pse and FlmG but with heterologous expression of Leg and BsFljK and BsFlmG also non-motile? Also, it will be good to see the TEM images for these cells.

      C. crescentus cells that produce Pse (endogenously) or Leg (reconstituted pathway) and BsFlmG and BsFljK (formerly Figure 5C and now as Figure 7C) are indeed not motile as shown by the motility tests presented in Figure S5B. Motility assays with cells used in the former Figure 5B (now Figure 7B) have also been done and are now presented Figure S4B. These cells are non-motile because BsFljK is not efficiently secreted (or unstable after secretion) as shown on the immunoblot of the supernatant fraction in Figure S4A lower panel. As a result, flagellar filament is not properly assembled as only a short flagellum was observed by TEM in such cells compared to the WT C. crescentus (Figure S4C and S4D).

      Immunoblotting of the supernatants should be shown (in addition to the cell extracts) for Figures 5B and 5C so that the reader can appreciate whether glycosylation has taken place but secretion/assembly has not. Further, HPLC of the acid extracts from flagellin could be done to unambiguously show whether the CcFlmG has transferred Pse and the BsFlmG and Cc-BsFlmG have transferred Leg on to the CcFljK in Figure 5c, and the identity of the sugar, if any, transferred by CcFlmG in the absence of Pse, and BsLeg genes or BsLegX gene in figure 5B.

      *__ Immunoblots of the supernatants for Figure 5B (now Figure 7B) have been done and been added (Figure S4A lower panel). BsFljK is barely detected in the supernatant whatever its glycosylation state (with or without Leg). Note that in the supporting info where the raw unprocessed blot used for this panel is shown, a positive control of blotting (C. crescentus Δfljx6 mutant expressing CcFljK from pMT463) has been used. Immunoblots of the supernatant from Figure 5C (now 7C) have been done and been added in figure S5A. The CcFljK modified with Leg is poorly secreted (or unstable after secretion). As a result, these cells only harbor a short flagellum compared to those that are able to modify CcFljK with Pse (Figure S5C).

      HPLC of the acid extracts from flagellins have been performed on purified flagella obtained by ultracentrifugation. As C. crescentus cells expressing BsFlmG and Cc-BsFlmG harbor no or short flagellar filament, the purification by ultracentrifugation is limited. Thus, to further confirm that CcFlmG has transferred Pse and Cc-BsFlmG (and BsFlmG) has transferred Leg on CcFljK (former Figure 5C and now Figure 7C), we performed immunoblots on the cell extracts of C. crescentus ΔflmG ΔpseI cells that cannot produce Pse but able to produce Leg (reconstituted pathway). These experiments, now presented in Figure 7C (lower panel) confirmed that no modification of CcFljK was observed in C. crescentus cells expressing CcFlmG whereas CcFljK is modified in C. crescentus expressing Cc-BsFlmG, confirming that Cc-BsFlmG has transferred Leg (the only NulO produced in this condition).__*

      Text in discussion, lines 334-338, "By extension, having recognized the LegX/PtmE enzyme as a critical element in the Leg-specific enzymatic biosynthesis step (Figure 6) likewise offers another functional, but also a novel bioinformatic, criterion for the correct assignment and discrimination of predicted stereoisomer biosynthesis routes residing in ever-expanding genome databases" It will be nice to see a discussion on the prevalence of PtmE versus GlmU (or equivalent gene), PtmF, PtmA, PgmL in the Leg synthesizing organisms. Is the PtmE but not the other genes found in all cases, which makes it better as a molecular determinant for bioinformatics predictions of the type of pathway? Also, on whether PtmE has any homology to genes in other pathways (not associated with flagellin glycosylation) and how reliable a marker it is to differentiate Leg biosynthesis from Neu5Ac biosynthesis pathways.

      GlmU is a potential bifunctional UDP-N-acetylglucosamine diphosphorylase/glucosamine-1-phosphate N-acetyltransferase that can be part of both Pse and Leg pathway (PMID 19282391). Accordingly, a GlmU ortholog is found in C. crescentus and B. subvibrioides that we showed are producing Pse and Leg, respectively. Thus, GlmU cannot be attributed to a Leg pathway signature. On the other hand, PtmE is barely found in the organisms from which PseI orthologs restore the motility of C. crescentus ΔpseI cells.

      PtmF, PtmA, PgmL and GlmS are proposed to act upstream of the production of GlcN-1-P that is a precursor of both UDP-GlcNAc and GDP-GlcNAc, the precursors of Pse and Leg respectively. In addition, orthologs of these genes are not prevalent in the Leg synthetizing organisms present in Table S2 using BlastP analyses with C. jejuni proteins as templates.PtmE ortholog is found in most of the Leg synthetizing organisms as shown in Table S2 and often genetically linked with other genes coding for proteins involved in Leg production (shown with the asterisk * in table S2). Of note, PtmE is found not only in organisms that modify flagellin(s) with Leg but also in organisms that add Leg on capsule such as A. baumannii LAC-4.

      It is not clear from the methods or the figure legends how many times the immunoblotting, motility experiments were done; how many experiments/trials are the images representative of? The motility pictures are representative of three independent experiments. The immunoblots are representative of at least two independent experiments. This information is now added in the Experimental procedures section.

      Minor comments:

      1. The gene for GlcN-1-P guanylyltransferase in the Leg-specific enzymatic biosynthesis step is already known as PtmE from the work of Schoenhofen's group. For the sake of consistency, it would be better to retain the nomenclature as PtmE throughout the manuscript instead of introducing the name LegX, which makes it sound like it is a previously unknown gene.

      2. Text in abstract, lines 15-17: "Sialic acids commonly serve as glycosyl donors, particularly pseudaminic (Pse) or legionaminic acid (Leg) that prominently decorate eubacterial and archaeal surface layers or appendages" The glycosyl donor is the nucleotide sugar and not the nonulosonic acid or sialic acid... rephrasing required for accuracy. Done

      3. Text in abstract, lines 18: "a new class of FlmG protein glycosyltransferases that modify flagellin" The authors are presumably referring to FlmG as the new class of protein glycosyltransferases... rephrasing required for accuracy Corrected
      4. Text in introduction, lines 41-42 "Pse and Leg derivatives synthesized in vitro can be added exogenously in metabolic labeling experiments" It should be "derivatives of Pse and Leg precursors" and not "Pse and Leg derivatives" corrected
      5. Text in introduction, line 46 "Pse- or Leg-decorated flagella may also be immunogenic." This sentence is not referenced and it is not clear why it is written here.

      6. Text in introduction, lines 63-66 "The synthesis of CMP-Pse or CMP-Leg proceeds enzymatically by series of steps [20-22], ultimately ending with the condensation of an activated 6-carbon monosaccharide (typically N-acetyl glucosamine, GlcNAc) with 3-carbon pyruvate (such as phosphoenolpyruvate, PEP) by Pse or Leg synthase paralogs, PseI or LegI, respectively" The synthesis begins with activated GlcNAc. The substrate for condensation is not activated GlcNAc. It is 2,4-diacetamido-2,4,6-trideoxy-D-mannopyranose in case of LegI and 2,4-diacetamido-2,4,6-trideoxy-b-L-altropyranose in case of PseI. Indeed, we modified the sentence.

      7. Text in introduction, line 70 "for used as glycosyl donors" Typographical error, "for use as glycosyl donors" Corrected
      8. Text in Results, line 102, "C. crescentus only encodes only PseI" Do the authors mean "only one PseI"? Corrected
      9. Text in Results, lines 108 and 109, "Such modifications could occur before the PseI synthase acts or afterwards. In the latter case, most (if not all) synthases would be predicted to produce the same Pse molecule," Do the authors know of any reports of modifications occurring before the PseI synthase? Please cite references, if known. Why "most (if not all)"? If the former case is true, the PseI synthase might not be able to accept the substrate. Correct. Because we cannot test all enzymes we must keep the statement non-committing.

      “Most (if not all)” refers to the latter case i.e. the modification occurs after PseI synthase. In this context, PseI should do the same reaction, however, there might be some exceptions.

      There is, to our knowledge, no reports showing that modifications occur before the PseI synthase. The glyco-profiling experiments all suggest that modification occurs after Pse production based on our motility readout. It is possible that PseI enzymes that condense a modified precursor would not be functional in our motility assay.

      Text in Results, lines 141-143, "our bioinformatic searches using C. jejuni 11168 as reference genome identified all six putative enzymes in the B. subvibrioides ATCC15264 genome (CP002102.1) predicted to execute the synthesis of Leg from GDP-GlcNAc" Not clear how this was done. Do the authors mean that they used the genes from C. jejuni 11168 as the query genes to identify homologs in B. subvibrioides ATCC15264 genome (CP002102.1)? Or did they use putative genes from B. subvibrioides ATCC15264 genome (CP002102.1) and pull out homologs from C. jejuni 11168 by using C. jejuni 11168 as the reference genome? We now have modified the sentence to make it clearer.

      At first reading, the flow of the manuscript is difficult to follow due to the figures not appearing in full in order of their occurrence. For instance, Figures 5B and 5C are discussed only in the end of the manuscript after the results of Figures 6 and 7. Other instances also exist. The authors may consider re-ordering the figure parts if possible so that all parts of each figure appear in order of occurrence in the manuscript text. Thanks for raising this issue. We have now tried to address this concern by re-organizing the order of occurrence of the figures. Notably we have now exchanged Figure 5 (on Leg pathway reconstitution and FlmG rewiring) with Figure 7 (on LegB and LegH). We modified the text accordingly. We hope that it makes the manuscript and corresponding figures easier to follow.

      Reviewer #3 (Significance (Required)):

      The nonulosonic acids, Pseudaminic acid and Legionaminic acid, are abundant in bacterial systems in the capsular and lipopolysaccharides as well as in glycoprotein glycans. The Ser/Thr-O-nonulosonic acid glycosylation of flagellins has been studied with respect to the system of Maf glycosyltransferases in Campylobacter jejuni, C. coli, Helicobacter pylori, Aeromonas caviae, Magnetospirillum magneticum, Clostridium botulinum and Geobacillus kaustophilus, and recently with respect to the system of FlmG glycosyltransferases by Viollier's group in Caulobacter crescentus. However, the determinants that govern the glycosyltransferase function are not still well known. Kint et al have performed excellent work using bacterial genetics tools to (1) highlight the "functional insulation" of the Leg and Pse biosynthesis pathways, (2) demonstrate the modularity of the FlmG glycosyltransferase proteins with respect to the flagellin binding and glycosyltransferase domains. This work makes a significant advance in the field with respect to (1) understanding flagellin glycosylation by FlmG, (2) making designer protein Ser/Thr-O-glycosyltransferases, and (3) bioinformatics analysis of genomes with respect to the Pse/Leg/Neu nonulosonic acid biosynthetic potential encoded. The findings will be of great interest to scientific audiences working in the areas of glycobiology and bacteriology. My area of expertise: Maf flagellin glycosyltransferases

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      In this manuscript, authors establish a glyco-profiling platform for the functional analysis of genes involved in pseudaminic (Pse) and legionaminic (Leg) acid biosynthetic pathways. They used B. subvibroides and C. crescentus specific mutants in pseI and legI genes involved in the Pse and Leg biosynthesis, respectively, and cross-complementation assays with orthologous genes from different bacterial species, analysing motility and flagellin glycosylation. These assays show that Pse and Leg biosynthetic pathways are genetically different and recognize the LegX enzyme as a critical element in the Leg-specific enzymatic biosynthesis. Since that legX orthologous were only identified in the genome of bacteria with Leg biosynthetic pathways, it becomes a good marker to distinguish Leg from Pse biosynthesis pathways and a novel bioinformatic criterion for the assignment and discrimination of these two pathways. Reconstitution of Leg biosynthetic pathway of B. subvibroides in the C. crescentus mutant that lack flagellins, PseI and FlmG, complemented with both flagellin and FlmG of B. subvibroides, identified a new class of FlmG protein glycosyltransferases that modify flagellin with legionaminic acid. Furthermore, the construction of a chimeric FlmG through domain substitutions, allowed to reprogram a Pse-dependent FlmG into a Leg-dependent enzyme and reveal two modular determinants that govern flagellin glycosyltransferase specificity: a glycosyltransferase domain that accepts either Leg or Pse, and a specialized flagellin-binding domain to identify the substrate.

      Major comments:

      The conclusions obtained are convincing and well-supported. However, I think some points should be specify or clarify.

        • In the mutants (pseI, legI, flmG,...) the non-glycosylated flagellin are exported and assembled in a flagellum filament shorter than the WT strain. However, motility in plates is absent or very reduced. This might be produced by instability of the flagellum filament when rotating in a semi-solid surface. MET was performed from plates or liquid cultures? Do the author analyses motility in liquid media? If they did, changes in motility were observed?
        • In page 5, lines 158-163, the analysis, by HPLC, of derivatized nonulosonic acid from B. subvibroides flagella, shows a major peak at 9.8 minutes retention and a minor peak at 15.3 minutes. Since that Pse-standard have retentions peaks at 9.7 and 13 minutes, and Leg-standard at 12.3 minutes, the authors cannot infer, only with these data, the flagella sugar is a legionaminic acid derivative. In my opinion, should be included that inference comes from the data obtained by HPLC analysis and genetic approaches.
        • In page 5, line 173-175. Authors indicate, "While no difference in the abundance of flagellin was observed in extracts from mutant versus WT cells, flagellin was barely detectable in the supernatants of mutant cultures, suggesting flagellar filament formation is defective in these mutants". MET images show that the flagellum filament length is shorter in the mutants than in the WT strain. Therefore, if the same number of mutants and WT cells has been used in the immunodetection assays, there should be more flagellin monomers in the WT samples than in the mutants ones and flagellin bands should be less intense in mutant samples corresponding to the anchored flagellum. Why bands corresponding to flagellin in mutants and WT show similar intensity in the immunodetection assays (Figure 3C and D)? Furthermore, in lane 177-178, authors suggest that LegI and FlmG govern flagellin glycosylation and export (or stability after export). However, if filament stability is affected, the amount of flagellin monomers in the supernatant of mutants should be higher than in the WT. However, immunodetection assays show less abundance of flagelin monomers in the supernatant of mutants. Please, can you clarify this? In relation to this point, I suggest that authors include, in the experimental procedures, how they obtained the supernatants to flagellin immunodetection, as well as why they used anti- FljKCc anti-serum to detect the B. subvibroides flagellin.
        • Authors demonstrate the specificity of the GT-B domain of FlmG, using a chimeric FlmGCc-Bs in a mutant of C. crescentus that lacks FlmG and harbour the Leg biosynthetic pathway of B. subvibroides. However, since that TPR comes from C. crescentus, this chimeric protein, could be transfer the legionaminic acid to the flagellin of B. subvibroides? Furthermore, the complementation of this mutant with the FlmGBs did not support efficient flagellin modification and this might be related to the TPRCc domain. Therefore, in my opinion, the chimeric protein should be introduced in the B. subvibroides∆flmG background.
        • Page 8, line 299-301. Authors point out that C. crescentus that lacks FlmG and harbour the Leg biosynthetic pathway of B. subvibroides and the chimeric FlmGCc-Bs, although it has a glycosylated flagellin, whose mobility in SDS-PAGE is like the WT strain, is non-motile. They suggest that additional factors exist in the flagellation pathway that exhibit specificity towards the glycosyl group that is joined to flagellins. However, would be interesting to see if the flagellum filament has similar length to the WT strain or at least, it has increased in relation to the flagella length of the mutant. If flagella length has not increased, it could suggest that changes in the glycan type might affects the flagellin assembly or the stability of the flagellum filament. Therefore, would be also important to analyse its motility in liquid media.

      Minor comments:

        • Pag 3 line102. Please change ".....two predicted synthases, a PseI and LegI homolog, and C. crescentus only encodes only PseI...." to ".....two predicted synthases, a PseI and LegI homolog, and C. crescentus only encodes a PseI...."
        • Figure 2 A. Plasmid nomenclature (Plac-neuB) is confusing because C.c. ΔpseI cells express predicted LegI or PseI synthases. Please change to Plac, as in Figure 2B and 4. Figure 2A and 2B do not contain any complementation with Bacillus subtilis (Basu), however two complementation are labelled as Bs in Figure 2A and 2B. Furthermore, no Bs are present in the Figure 2 legend.
        • Legend of figure 3 should include B. subvibrioides abreviation Bs. Line 774: Please change ".......glycosylation and secretion in B. subvibrioides." to ".......glycosylation and secretion in B. subvibrioides (Bs)."
        • Figure 3. In order to keep a similar nomenclature in all plasmids, plasmid Plac-legI syn and Plac-flmG should be labelled as Plac-legIBs syn and Plac-flmGBs.
        • Legend of figure 4 should include B. subvibrioides abreviation Bs. Line 791: Please change "....... complementation of the B.subvibrioides ΔlegI mutant with ...." to "....... complementation of the B.subvibrioides (Bs)ΔlegI mutant with ...." Furthermore, Legend of figure 4 indicate in line 795, that immunoblots reveal the intracellular levels of flagellin, however figure 2 and 3 show immunoblot of cell extracts. Please, correct this sentence.
        • Legend of figure 5, 6 and 7 should include B. subvibrioides abreviation Bs. Line 808: Please change "Predicted Leg biosynthetic pathway in B. subvibrioides " to"Predicted Leg biosynthetic pathway in B. subvibrioides (Bs)" Line 834: Please change "....affects motility, flagellin glycosylation and secretion in B. subvibrioides."to "....affects motility, flagellin glycosylation and secretion in B. subvibrioides (Bs).Line 852: Please change "...acetyltransferase in flagellar motility of B. subvibrioides cells." to ""...acetyltransferase in flagellar motility of B. subvibrioides (Bs) cells." Furthermore, figure 5 should include C. crescentus abbreviation. Line 815: Please change "....whole cell lysates from C. crescentus mutant cultures......." to "....whole cell lysates from C. crescentus (Cc) mutant cultures......."
        • In my opinion it would be useful to include a scheme of the gene organization involved in Leg biosynthesis in B. subvibrioides.
        • Legend of figure S1 should include B. subvibrioides (Bs) and C. crescentus (Cc) abbreviations. Line 888-867: Please change "...C. crescentus ΔpseI cells and B. subvibrioides ΔlegI cells with plasmids expressing..." to "...C. crescentus (Cc) ΔpseI cells and B. subvibrioides (Bs) ΔlegI cells with plasmids expressing..." Furthermore, the name and abbreviations (Mm, So, Ku, Pi, Dv) of the species used should be included in the legend. Why the authors used a plasmid with a Pvan promoter in these assays? Why the authors changed the code color of pseI and legI orthologous genes? It would be more useful and understandable follow the code color used in figure 2 and 4.
      1. Page 6 line 200, Please change ".....complementing synthases exhibit greater overall sequence similarity to LegI than Pse of C. jejuni. 22268,....." to ".....complementing synthases exhibit greater overall sequence similarity to LegI than PseI of C. jejuni. 22268,....."
        • Page 7 line 231, Please change ".....negative bacteria A. baumannii LAC-4 (GCA_000786735.1)[38] and P. sp. Irchel 3E13..." to ".....negative bacteria A. baumannii LAC-4 (GCA_000786735.1)[38] and Pseudomonas sp. Irchel 3E13..."
        • Introduce a line break between line 503 and 504.
        • Page 14 line 543, please change "XbaI" to "XbaI"

      Significance

      This is an interesting manuscript that contributes to the knowledge of the legionaminic biosynthetic pathway and establish a glyco-profiling platform for the functional analysis of genes involved in pseudaminic (Pse) and legionaminic (Leg) acid biosynthetic pathways. The analysis of Leg patway allowed to identify a gene (legX) that can be used to distinguish Leg from Pse biosynthesis pathways, becoming a bioinformatic tool for the assignment and discrimination of these two pathways. Furthermore, a new class of FlmG protein glycosyltransferases, able to transfer Leg to the flagellin, has been identified and its analysis reveal two modular determinants that govern flagellin glycosyltransferase specificity: a glycosyltransferase domain that accepts either Leg or Pse, and a specialized flagellin-binding domain to identify the substrate.

    1. code string Type of the error. description string Descriptive text about the error. source string The point of failure in the specific operation (payment in this case). Check the card, netbanking, wallets, UPI Collect, UPI Intent, Cardless EMI and Emandate sections to know about the possible values for each method. step string The stage where the transaction failure occurred. The stages can vary depending on the payment method used to complete the transaction. Check the card, netbanking, wallets, UPI Collect, UPI Intent, Cardless EMI and Emandate sections to know about the possible values for each method. reason string The exact error reason. It can be handled programmatically.

      All these should be under error

      Also, all the links in the source and step descriptions are opening in the same tab. Ensure these links open in the new tab.

    2. In this sample code, see the description, source, step and reason, indicating that the API failed due to authentication failure for incorrect OTP.

      In the below sample error response, the description, source, step and reason indicates that the API failed due to authentication failure for incorrect OTP.

    1. Author Response

      Reviewer #1 (Public Review):

      The software presented in this paper is well documented and represents a significant achievement that breaks new ground in terms of what is possible to render and explore in the web browser. This tool is essential for the exploration of SC2 data, but equally useful for the tree of life and other tree-like data sets.

      Thank you for reviewing my work and for this generous assessment.

      Reviewer #2 (Public Review):

      This manuscript describes a web-based tool (Taxonium) for interactively visualizing large trees that can be annotated with metadata. Having worked on similar problems in the analysis and visualization of enormous SARS-CoV-2 data sets, I am quite impressed with the performance and "look and feel" of the Taxonium-powered cov2tree web interface, particularly its speed at rendering trees (or at least a subgraph of the tree).

      Thank you for the kind words.

      The manuscript is written well, although it uses some technical "web 2.0" terminology that may not be accessible to a general scientific readership, e.g., "protobuf" (presumably protocol buffer) and "autoscaling Kubernetes cluster". The latter is like referring to a piece of lab equipment, so the author should provide some sort of reference to the manufacturer, i.e., https://kubernetes.io/.

      Thank you for flagging this. I have now replaced the colloquial "protobuf" with "protocol buffer". I have now provided a URL for Kubernetes. It is always difficult to judge how much to explain technical terms. I certainly agree that many people will be unfamiliar with, for instance, protocol buffers, but an explanation of what they are (which may not be particularly important for understanding Taxonium) can sometimes overshadow more important details. So my preference in that particular case is for an interested reader to research the unfamiliar term.

      In other respects, the manuscript lacks some methodological details, such as exactly how the tree is "sparsified" to reduce the number of branches being displayed for a given range of coordinates.

      This is an important point also raised by Reviewer 3. I have added a new section in the Materials and Methods which discusses this in some detail.

      Some statements are inaccurate or not supported by current knowledge in the field. For instance, it is not true that the phylogeny "closely approximates" the transmission tree for RNA viruses.

      I agree that this was an overly broad claim, and have softened it, now saying:

      "The fundamental representation of a viral epidemic for genomic epidemiology is a phylogenetic tree, which approximates the transmission tree and can allow insights into the direction of migration of viral lineages."

      Mutations are not associated with a "point in the phylogeny", but rather the branch that is associated with that internal node.

      I have changed this as suggested.

      A major limitation of displaying a single phylogenetic tree (albeit an enormous one) is that the uncertainty in reconstructing specific branches is not readily communicated to the user. This problem is exacerbated for large trees where the number of observations far exceeds the amount of data (alignment length). Hence, it would be very helpful to have some means of annotating the tree display with levels of uncertainty, e.g., "we actually have no idea if this is the correct subtree". DensiTree endeavours to do this by drawing a joint representation of a posterior sample of trees, but it would be onerous to map a user interface to this display. I'm raising this point as something for the developers to consider as a feature addition, and not a required revision for this manuscript.

      I entirely agree with this point. I have added a sentence in the discussion:

      "Even where sequences are accurate, phylogenetic topology is often uncertain, and finding ways to communicate this at scale, building on prior work [Densitree citation] would be valuable."

      The authors make multiple claims of novelty - e.g., "[...] existing web-based tools [...] do not scale to the size of data sets now available for SARS-CoV-2" and "Taxonium is the only tool that readily displays the number of independent times a given mutation has occurred [...]" - that are not entirely accurate. For example, RASCL (https://observablehq.com/@aglucaci/rascl) allows users to annotate phylogenies to examine the repeated occurrence of specific mutations. Our own system, CoVizu, also enables users to visualize and explore the evolutionary relationships among millions of SARS-CoV-2 genomes, although it takes a very different approach from Taxonium. Taxonium is an excellent and innovative tool, and it should not be necessary to claim priority.

      I agree that comparisons with existing tools are difficult and often provide a sense of unnecessary competition. I attempted to be quite careful in the specific section focused on comparison, but may have been less careful earlier on. The intent with this first sentence in the abstract was to provide a succinct description of the gap that Taxonium was developed to fill with "however, existing web-based tools for analysing and exploring phylogenies do not scale to the size of datasets now available for SARS-CoV-2". I have now removed the words "analysing and", focusing on the exploration of phylogenies. I think this new sentence is defensible in that valuable tools such as CoVizu intentionally do not explore a phylogeny directly but instead take a multi-level approach, and this new sentence better matches the comparisons in the paper. In the second sentence, I have removed the phrase "is the only tool that", which I agree adds little and may not be accurate, depending on one's interpretation of "readily". Thank you for these points.

      Although the source code (largely JavaScript with some Python) is quite clean and has a consistent style, there is a surprising lack of documentation in the code. This makes me concerned about whether Taxonium can be a maintainable and extensible open-source project since this complex system has been almost entirely written by a single developer. For example, usher_to_taxonium.py has a single inline comment (a command-line example) and no docstring for the main function. JBrowsePanel.jsx has a single inline comment for 293 lines of code. There is some external documentation (e.g., DEVELOPMENT.md) that provides instructions for installing a development build, but it would be very helpful to extend this documentation to describe the relationships among the different files and their specific roles. Again, this is something for the developers to consider for future work and not the current manuscript.

      This is an entirely fair comment. The version of Taxonium presented in the manuscript is "2.0", which is a new version built from scratch with considerably less technical debt than the version that preceded it. Its technical strengths are that (with the exception of the backend) it is relatively well-modularised into functional components. But the limitations that the reviewer notes with respect to commenting are entirely fair. What I would say is that in the time since this manuscript was submitted, several important features have been added by an external collaborator, Alex Kramer, most notably the Treenome Browser (https://www.biorxiv.org/content/10.1101/2022.09.28.509985v1). I hope that the ability of Alex to add these features with little need for support provides some evidence of Taxonium's extensibility. But I acknowledge there is room for improvement.

      Reviewer #3 (Public Review):

      The paper succinctly provides an overview of the current approaches to generating and displaying super-large phylogenies (>10,000 tips). The results presented here provide a comprehensive set of tools to address the display and exploration of such phylogenies. The tools are well-described and comprehensive, and additional online documentation is welcome.

      The technical work to display such large datasets in a responsive fashion is impressive and this is aptly described in the paper. The author rightly decides that displaying large phylogenies is not simply a matter of rendering "more nodes", and so in my eyes, the major advancement is the approach used to downsample trees on-the-fly so that the number of nodes displayed at one time is manageable. This is detailed only briefly (Results section, 1st paragraph, 2 sentences). I would like to see more discussion about the details of this approach.

      Thank you for this point, also raised by Reviewer 2. I have now added a lengthy section on this in the Materials and Methods, which I hope is helpful. The approach is not especially sophisticated, but it does the job and runs quickly.

      Examples that came up while exploring the tool: the (well implemented) search functionality reports results from the entire tree (e.g. in Figure 4, the number of red circles is not a function of zoom level), how does this interact with a tree showing only a subset of nodes?

      Yes, this is an important feature which I perhaps did not do justice to in the write-up. I have included in the new section in the Materials and Methods a paragraph discussing search results:

      "In order to ensure that search results are always comprehensive, but at the same time to avoid overplotting, we take the following approach::

      ● Searches are performed across every single node on the tree to select a set of nodes that match the search. The total number of matches is displayed in the client.

      ● If fewer than 10,000 matches are detected, these are simply displayed in the client as circles

      ● If more than 10,000 matches are detected, the results are sparsified using the method above, and then displayed.

      ● Upon zooming or panning, the sparsification is repeated for the new bounding box."

      How is the node order chosen with regards to "nodes that would be hidden by other nodes are excluded" and could this affect interpretations depending on the colouring used?

      This perhaps was slightly sloppy language which did not directly describe the implementation. I have now rephrased this to "only nodes that overlap other nodes are excluded", as we don't in fact consider a notion of z-index when doing this. The way the sparsification works (now better described) means that the nodes excluded are determined essentially by position and I don't foresee this introducing particular biases, but this was an insightful point to raise.

      Taxonium takes the approach of displaying all available data (sparsification of nodes notwithstanding). Biases in the generation of sequences, especially geographical, will therefore be present (especially so in the two main datasets discussed here - SARS-CoV-2 and monkeypox). This caveat should be made explicit.

      This is certainly true. I have added this new paragraph in the Discussion:

      "A further challenge is the vastly different densities of sampling in different geographic regions. Because Cov2Tree does not downsample sequences from countries which are able to sequence a greater proportion of their cases, the number of tips on a tree is not indicative of the size of an outbreak and in some cases even inferences of the directionality of migration may be confounded. There would be value in the development of techniques that allow visual normalisation of trees for sampling biases, which might allow for less biased phylogenetic representations without downsampling."

      Has the author considered choosing which nodes to exclude for sparsified trees in such a way as to minimise known sampling biases?

      The last sentence of the new paragraph above alludes to a sort-of-similar approach. I hadn't directly considered the approach the reviewer suggests. It is an interesting idea. The downsampling algorithm has to be very computationally inexpensive but it would be interesting to explore ways to do this. I am tracking this in https://github.com/theosanderson/taxonium/issues/437.

      Interoperability between different software tools is discussed in a technical sense but not as it pertains to discovering the questions to ask of the data. As an example, spotting the specific mutations shown in figure 3 + 4 is not feasible by checking every position iteratively; instead, the ability to have mutations flagged elsewhere and then seamlessly explore them in Taxonium is a much more powerful workflow. This kind of interoperability (which Taxonium supports) enhances the claim of "providing insights into the evolution of the virus".

      Thank you for flagging this point -- I am very excited by the growing ecosystem of interoperable tools. You are absolutely right that most of the insights Taxonium can bring into evolution rely also on this broader ecosystem. I have added a florid sentence in the concluding paragraph: "It forms part of an ecosystem of open-source tools that together turn an avalanche of sequencing data into actionable insights into ongoing evolution."

      The prosaic reason I don't discuss Taxonium's interoperability features in more detail in this manuscript is that it aims to describe the version of Taxonium I initially developed, and these features were developed collaboratively by a broader group later on (and after deposition of this preprint).

      Taxonium has been a fantastic resource for the analysis of SARS-CoV-2 and this paper fluently presents the tool in the context of the wider ecosystem of bioinformatic tools in use today, with the interoperability of the different pieces being a welcome direction.

    2. Reviewer #2 (Public Review):

      This manuscript describes a web-based tool (Taxonium) for interactively visualizing large trees that can be annotated with metadata. Having worked on similar problems in the analysis and visualization of enormous SARS-CoV-2 data sets, I am quite impressed with the performance and "look and feel" of the Taxonium-powered cov2tree web interface, particularly its speed at rendering trees (or at least a subgraph of the tree).

      The manuscript is written well, although it uses some technical "web 2.0" terminology that may not be accessible to a general scientific readership, e.g., "protobuf" (presumably protocol buffer) and "autoscaling Kubernetes cluster". The latter is like referring to a piece of lab equipment, so the author should provide some sort of reference to the manufacturer, i.e., https://kubernetes.io/. In other respects, the manuscript lacks some methodological details, such as exactly how the tree is "sparsified" to reduce the number of branches being displayed for a given range of coordinates. Some statements are inaccurate or not supported by current knowledge in the field. For instance, it is not true that the phylogeny "closely approximates" the transmission tree for RNA viruses. Mutations are not associated with a "point in the phylogeny", but rather the branch that is associated with that internal node.

      A major limitation of displaying a single phylogenetic tree (albeit an enormous one) is that the uncertainty in reconstructing specific branches is not readily communicated to the user. This problem is exacerbated for large trees where the number of observations far exceeds the amount of data (alignment length). Hence, it would be very helpful to have some means of annotating the tree display with levels of uncertainty, e.g., "we actually have no idea if this is the correct subtree". DensiTree endeavours to do this by drawing a joint representation of a posterior sample of trees, but it would be onerous to map a user interface to this display. I'm raising this point as something for the developers to consider as a feature addition, and not a required revision for this manuscript.

      The authors make multiple claims of novelty - e.g., "[...] existing web-based tools [...] do not scale to the size of data sets now available for SARS-CoV-2" and "Taxonium is the only tool that readily displays the number of independent times a given mutation has occurred [...]" - that are not entirely accurate. For example, RASCL (https://observablehq.com/@aglucaci/rascl) allows users to annotate phylogenies to examine the repeated occurrence of specific mutations.<br /> Our own system, CoVizu, also enables users to visualize and explore the evolutionary relationships among millions of SARS-CoV-2 genomes, although it takes a very different approach from Taxonium. Taxonium is an excellent and innovative tool, and it should not be necessary to claim priority.

      Although the source code (largely JavaScript with some Python) is quite clean and has a consistent style, there is a surprising lack of documentation in the code. This makes me concerned about whether Taxonium can be a maintainable and extensible open-source project since this complex system has been almost entirely written by a single developer. For example, `usher_to_taxonium.py` has a single inline comment (a command-line example) and no docstring for the main function. `JBrowsePanel.jsx` has a single inline comment for 293 lines of code. There is some external documentation (e.g., `DEVELOPMENT.md`) that provides instructions for installing a development build, but it would be very helpful to extend this documentation to describe the relationships among the different files and their specific roles. Again, this is something for the developers to consider for future work and not the current manuscript.

    1. This means that the existing legacy code of the usersis no longer an albatross retarding innovation, but a valuable asset to which new services can be tied withoutlimit. Once encoded in terms of intentions, software assumes an invariant ―immortal‖ form, free from inherentobsolence.

      !- meme : intentional software as "immortal form" - existing legacy articulations are assets - to which new servicdes can be tied to - composed of, re-mix re-use, re-purpose

      !- meme : intentional software - beyond programming - beyond ontology

      • point to : long tail software
    2. Arbitrary type calculus, meta-work, domain-specific compilation knowledge, and information traditionally keptin ―good‖ comments should be expressible as part of the machine processable part of the program
      • arbitrary type calculus
      • meta-work
      • domain specific compilation knowledge?
      • comments should be expressible
      • as part of the machine processable part

      • not literal programming

      • where narraitve is interpersed and organizing
      • code blocks

      !- response : - but articulate software as a conversations and mutual learning

    1. Skip to content In this repository All GitHub ↵ Jump to ↵ No suggested jump to results In this repository All GitHub ↵ Jump to ↵ In this organization All GitHub ↵ Jump to ↵ In this repository All GitHub ↵ Jump to ↵ Dashboard Pull requests Issues Codespaces Marketplace Explore Sponsors Settings caitgarland Sign out New repository Import repository New gist New organization Sorry, something went wrong. /  ...  /   nic-dgl103-f22  /   assignment-c-dlu-...  /   Clear Command Palette Tip: Type # to search pull requests Type ? for help and tips Tip: Type # to search issues Type ? for help and tips Tip: Type # to search discussions Type ? for help and tips Tip: Type ! to search projects Type ? for help and tips Tip: Type @ to search teams Type ? for help and tips Tip: Type @ to search people and organizations Type ? for help and tips Tip: Type > to activate command mode Type ? for help and tips Tip: Go to your accessibility settings to change your keyboard shortcuts Type ? for help and tips Tip: Type author:@me to search your content Type ? for help and tips Tip: Type is:pr to filter to pull requests Type ? for help and tips Tip: Type is:issue to filter to issues Type ? for help and tips Tip: Type is:project to filter to projects Type ? for help and tips Tip: Type is:open to filter to open content Type ? for help and tips We’ve encountered an error and some results aren't available at this time. Type a new search or try again later. No results matched your search Top result Commands Type > to filter Global Commands Type > to filter This Page Files Pages Access Policies Organizations Repositories Issues, pull requests, and discussions Type # to filter Teams Users Projects Modes Use filters in issues, pull requests, discussions, and projects Search for issues and pull requests # Search for issues, pull requests, discussions, and projects # Search for organizations, repositories, and users @ Search for projects ! Search for files / Activate command mode > Search your issues, pull requests, and discussions # author:@me Search your issues, pull requests, and discussions # author:@me Filter to pull requests # is:pr Filter to issues # is:issue Filter to discussions # is:discussion Filter to projects # is:project Filter to open issues, pull requests, and discussions # is:open nic-dgl103-f22 / assignment-c-dlu-RaviPunia Private Unwatch Stop ignoring Watch 0 Notifications Participating and @mentions Only receive notifications from this repository when participating or @mentioned. All Activity Notified of all notifications on this repository. Ignore Never be notified. Custom Select events you want to be notified of in addition to participating and @mentions. Get push notifications on iOS or Android. Custom Custom Select events you want to be notified of in addition to participating and @mentions. Issues Pull requests Releases Discussions Discussions are not enabled for this repository Security alerts Apply Cancel Fork 0 Starred 0 Star 0 Code Issues 0 Pull requests 0 Actions Projects 0 Security Insights More Code Issues Pull requests Actions Projects Security Insights Open in github.dev Open in a new github.dev tab Permalink main Switch branches/tags Branches Tags View all branches View all tags Name already in use A tag already exists with the provided branch name. Many Git commands accept both tag and branch names, so creating this branch may cause unexpected behavior. Are you sure you want to create this branch? Cancel Create assignment-c-dlu-RaviPunia/index.html Go to file Go to file T Go to line L Copy path Copy permalink This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository. RaviPunia Final code Latest commit 3fef99d 4 days ago History 2 contributors Users who have contributed to this file 101 lines (92 sloc) 3.43 KB Raw Blame Edit this file E Open in github.dev . Open in GitHub Desktop Open with Desktop View raw Copy raw contents Copy raw contents Copy raw contents Copy raw contents View blame <!DOCTYPE html> <html lang="en"> <head> <!-- DGL 103 DLU1 - Ravi Punia - Assignment C --> <meta charset="UTF-8"> <meta http-equiv="X-UA-Compatible" content="IE=edge"> <meta name="viewport" content="width=device-width, initial-scale=1.0"> <link rel="stylesheet" href="./style.css"> <link rel="icon" type="image/x-icon" href="images/favicon.ico"> <title>Hairpins Boutique Salon</title> <link rel="preconnect" href="https://fonts.googleapis.com"> <link rel="preconnect" href="https://fonts.gstatic.com" crossorigin> <link href="https://fonts.googleapis.com/css2?family=Poppins:wght@400;600&display=swap" rel="stylesheet"> <style> .services { background-color: #000000; /* Here I changed the background color using Hexadecimal value */ color: white; } </style> </head> <body> <header> <a href="index.html" ><img src="images/hairpins-salon-logo.png" alt="hairpins Logo" width="300" ></a> <nav> <ul> <li><a href="index.html">Home</a></li> <li><a href="services.html">Services</a></li> <li><a href="contact.html">Contact Us</a></li> </ul> </nav> </header> <main> <h1>Welcome to Hairpins Boutique Salon</h1> <p> Located in the heart of the Comox Valley, Hairpins Boutique Salon offers a high-end experience with competitive prices, top of the line products and a warm, welcoming atmosphere. </p> <figure> <img src="./images/the-hairpins-salon.jpeg" alt="salon image"> <figcaption> The Hairpins hairdressing salon in Courtenay, BC, Canada. </figcaption> </figure> <p> Stylist and owner, Staysea Brown has been overwhelmed by the success Hairpins has received over the past 10 years and is ever grateful to the Comox Valley community for all the support. With over a decade of industry experience, Staysea has the knowledge and drive to run a successful business that's hard to forget. Pop on by! </p> <p class="services"> Time for a new do? <a href="./services.html">Check out our services</a> </p> <h2>Meet the Team</h2> <p> Offering talented stylists with varied personalities, outgoing customer service, and an eclectic, fun atmosphere, Hairpins is striving to be one of a kind. <br> <br> By evolving with their clientele and constantly offering the latest trends and services, they are ensuring every visit is a unique one. Hairpins is filled with its own special brand of magic. Come in and sit down, the Hairpins' Girls are waiting for you! </p> <figure> <img src="./images/the-hairpins-team.jpeg" alt="team members"> <figcaption> We are incredibly proud of our diverse team of stylists who greet each client with a smile. We prioritize inclusivity, community, and sustainability, and make sure that everyone who walks through our door feels welcome. </figcaption> </figure> <a href="./contact.html">Contact us today to book an appointment!</a> </main> <footer> <p> Content taken from <a href="https://www.hairpins.ca/">https://www.hairpins.ca/</a> Used for educational purposes only. </p> </footer> </body> </html> Copy lines Copy permalink View git blame Reference in new issue Go Footer © 2022 GitHub, Inc. Footer navigation Terms Privacy Security Status Docs Contact GitHub Pricing API Training Blog About You can’t perform that action at this time. You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. .user-mention[href$="/caitgarland"] { color: var(--color-user-mention-fg); background-color: var(--color-user-mention-bg); border-radius: 2px; margin-left: -2px; margin-right: -2px; padding: 0 2px; } assignment-c-dlu-RaviPunia/index.html at main · nic-dgl103-f22/assignment-c-dlu-RaviPunia

      I don't see any issues. Great job, Ravi.

    1. Skip to content In this repository All GitHub ↵ Jump to ↵ No suggested jump to results In this repository All GitHub ↵ Jump to ↵ In this organization All GitHub ↵ Jump to ↵ In this repository All GitHub ↵ Jump to ↵ Dashboard Pull requests Issues Codespaces Marketplace Explore Sponsors Settings caitgarland Sign out New repository Import repository New gist New organization Sorry, something went wrong. /  ...  /   nic-dgl103-f22  /   assignment-c-dlu-...  /   Clear Command Palette Tip: Type # to search pull requests Type ? for help and tips Tip: Type # to search issues Type ? for help and tips Tip: Type # to search discussions Type ? for help and tips Tip: Type ! to search projects Type ? for help and tips Tip: Type @ to search teams Type ? for help and tips Tip: Type @ to search people and organizations Type ? for help and tips Tip: Type > to activate command mode Type ? for help and tips Tip: Go to your accessibility settings to change your keyboard shortcuts Type ? for help and tips Tip: Type author:@me to search your content Type ? for help and tips Tip: Type is:pr to filter to pull requests Type ? for help and tips Tip: Type is:issue to filter to issues Type ? for help and tips Tip: Type is:project to filter to projects Type ? for help and tips Tip: Type is:open to filter to open content Type ? for help and tips We’ve encountered an error and some results aren't available at this time. Type a new search or try again later. No results matched your search Top result Commands Type > to filter Global Commands Type > to filter This Page Files Pages Access Policies Organizations Repositories Issues, pull requests, and discussions Type # to filter Teams Users Projects Modes Use filters in issues, pull requests, discussions, and projects Search for issues and pull requests # Search for issues, pull requests, discussions, and projects # Search for organizations, repositories, and users @ Search for projects ! Search for files / Activate command mode > Search your issues, pull requests, and discussions # author:@me Search your issues, pull requests, and discussions # author:@me Filter to pull requests # is:pr Filter to issues # is:issue Filter to discussions # is:discussion Filter to projects # is:project Filter to open issues, pull requests, and discussions # is:open nic-dgl103-f22 / assignment-c-dlu-RaviPunia Private Unwatch Stop ignoring Watch 0 Notifications Participating and @mentions Only receive notifications from this repository when participating or @mentioned. All Activity Notified of all notifications on this repository. Ignore Never be notified. Custom Select events you want to be notified of in addition to participating and @mentions. Get push notifications on iOS or Android. Custom Custom Select events you want to be notified of in addition to participating and @mentions. Issues Pull requests Releases Discussions Discussions are not enabled for this repository Security alerts Apply Cancel Fork 0 Starred 0 Star 0 Code Issues 0 Pull requests 0 Actions Projects 0 Security Insights More Code Issues Pull requests Actions Projects Security Insights Open in github.dev Open in a new github.dev tab Permalink main Switch branches/tags Branches Tags View all branches View all tags Name already in use A tag already exists with the provided branch name. Many Git commands accept both tag and branch names, so creating this branch may cause unexpected behavior. Are you sure you want to create this branch? Cancel Create assignment-c-dlu-RaviPunia/services.html Go to file Go to file T Go to line L Copy path Copy permalink This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository. RaviPunia Final code Latest commit 3fef99d 4 days ago History 2 contributors Users who have contributed to this file 110 lines (102 sloc) 3.88 KB Raw Blame Edit this file E Open in github.dev . Open in GitHub Desktop Open with Desktop View raw Copy raw contents Copy raw contents Copy raw contents Copy raw contents View blame <!DOCTYPE html> <html lang="en"> <head> <meta charset="UTF-8"> <meta http-equiv="X-UA-Compatible" content="IE=edge"> <meta name="viewport" content="width=device-width, initial-scale=1.0"> <link rel="stylesheet" href="./style.css"> <link rel="icon" type="image/x-icon" href="images/favicon.ico"> <title>Services - Hairpins Boutique Salon</title> <link rel="preconnect" href="https://fonts.googleapis.com"> <link rel="preconnect" href="https://fonts.gstatic.com" crossorigin> <link href="https://fonts.googleapis.com/css2?family=Poppins:wght@400;600&display=swap" rel="stylesheet"> <style> main li{ list-style: none; /* Here I removed the bullets from list items */ } </style> </head> <body> <header> <a href="index.html" ><img src="images/hairpins-salon-logo.png" alt="hairpins Logo" width="300" ></a> <nav> <ul> <li><a href="index.html">Home</a></li> <li><a href="services.html">Services</a></li> <li><a href="contact.html">Contact Us</a></li> </ul> </nav> </header> <main> <h1>Our Services</h1> <h2>Below is a list of services we proudly offer.</h2> <p> At Hairpins, we care about the environment and recognize the impact we all have on it. We are continually making strides to reduce where we can and have only aligned ourselves with companies and products we believe in. We are proud to be a CERTIFIED GREEN CIRCLE SALON and through that partnership are able to divert 95% of our salon waste from landfills. Get in touch if you have any questions or want to learn more about the programs and charities we are focusing our efforts on. </p> <h3>Cuts</h3> <p> Range from 45 minutes to 90 minutes. Please call us at 250-338-7467 (PINS) to book a shorter appointment for Kid's Cuts, Dry Cuts, Fringe Trims, Neck trims, or Clipper Cut maintenance.</p> <ul> <li>47.00 = 45 Minute Clipper Cuts and Short Fine Hair</li> <li>$61.00 = 60 Minute Cut for Fine to Medium Hair</li> <li>$76.00 = 75 Minute Cut for Medium to Thick Hair</li> <li>$90.00 = 90 Minute Cut for THICK THICK Hair, you know who are :)</li> </ul> <p> All cuts include shampoo, scalp massage, blowdry, and style. </p> <h3>Colours</h3> <ul> <li>FULL FOIL: $188/$219 with cut</li> <li>3/4 FOIL: $172/$203 with cut</li> <li>1/2 FOIL: $158/$189 with cut</li> <li>1/4 FOIL: $144/$175 with cut</li> </ul> <h3>Styling</h3> <ul> <li>BLOWOUTS ~ 30 mins: $40 - $45</li> <li>BLOWOUTS ~ 45 mins: $47 - $52</li> <li>BLOWOUTS ~ 1 hour: $60 - $67</li> </ul> <br> <h3><strong>* * 48 Hour Cancellation Required * *</strong></h3> <p> We require 48 hours' notice for any cancellations. </p> <br> <ul> <li>If you are a “no show”, you will be required to pay for your missed service in full in order to rebook.</li> <li>If you cancel with less than 48 hours' notice, you will be required to pay for 1/2 of the service you canceled in order to rebook.</li> </ul> <p> We understand that last-minute things happen! We will address each situation on a case-by-case basis. Please communicate with us and we will try our best to help. We appreciate your understanding. </p> <p> <a href="./contact.html">Contact us today to book an appointment!</a> </p> </main> <footer> <p> Content taken from <a href="https://www.hairpins.ca/">https://www.hairpins.ca/</a> Used for educational purposes only. </p> </footer> </body> </html> Copy lines Copy permalink View git blame Reference in new issue Go Footer © 2022 GitHub, Inc. Footer navigation Terms Privacy Security Status Docs Contact GitHub Pricing API Training Blog About You can’t perform that action at this time. You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. .user-mention[href$="/caitgarland"] { color: var(--color-user-mention-fg); background-color: var(--color-user-mention-bg); border-radius: 2px; margin-left: -2px; margin-right: -2px; padding: 0 2px; } assignment-c-dlu-RaviPunia/services.html at main · nic-dgl103-f22/assignment-c-dlu-RaviPunia

      Looks great!

    1. Earlier this year, Police Commissioner James O'Neill admitted that a "theft of services" arrest (the legal code name for turnstile jumping) could in fact lead to an immigrant getting deported. And earlier this month, a series of bills the City Council passed last year encouraging the use of civil summonses instead of arrests for quality of life crimes like public drinking, public urination and littering went into effect.

      Excusing criminality in a matter of deference to foreign nationals who are unlawfully present in the United States is perverse. The immigration laws have many provisions by design to ensure that foreign nationals who violate the laws of the United States in certain ways are not allowed to remain and harm the safety of Americans.

    1. When we’ve been accessing Twitter through Python and the “tweepy” code library.

      We can relate this to one of the first project we completed in this class. We had to retrieve API Keys from Twitter Developer to insert them into Python.

    1. The answer is simple and here is an oversimplified example: // Your browser is identifying itself as this $ua_string = 'EDGE'; // The website logic is this if($ua_string === 'IE') { echo 'You are using IE'; } elseif($ua_string === 'FIREFOX') { echo 'You are using Firefox'; } else { echo 'You are using Chrome, and we don\'t support that.'; }

      answering using source code

    1. Sheis one of the best and wisest friends that the cause of femaleeducation has ever had in India, though she does not writearticles for magazines or make speeches on the subject, forshe has seen that it by no means follows that what is theperfect code for the Western girl is of necessity to be slavishlyfollowed for her of the East

      Lady Harris genuinely tries to form relationships with Indian women and appeal to activities, like theater, that they would enjoy, instead of forcing them to participate in western forms of education

    Annotators

    1. 4 {

      Very clean and organized font organization. I found it quite easy to quickly get to the font or style I'm looking for. I will try to emulate this in my future code organization.

    1. there are many drop-outs that way, that the acquired skill level flattens out quickly, and that there’s no efficiency gain visible in consequent activities of the children involved

      there is a big opportunity cost to unassisted learning to code: many people will drop-out (in frustration). Again this is how the IndieWeb community quickly drives people away too. the skill level acquired this way is limited (due to absence of Vgl deliberate practice?) the acquired skills do not make following activities more efficient. (Vgl how I usually struggle with the same basic coding issues, no matter how many scripts I try to write).

    2. getting stuck and unstuck on your own is the way to go.

      Still very common to observe as example of such 'coder's ethics'. E.g. IndieWeb community discussion is mostly based on figuring things out yourself and creating one's own code (self-dogfooding), with only generic and hand-wavy assistance most of the time.

    3. many of us acquired our own tech skills strongly shapes the assumptions about learning to code

      The way you learned programming shapes assumptions about how to learn programming.

      The fact I self-learned through trial and error without supervision in the early and mid 80s, is common, but not a model for K transfer, and not useful "coder's ethic"

    1. Since you want others to be able to download and run your code without problems, you need to ensure repeatable builds. That’s what freezing is for. It outputs roughly the same set of dependencies with their sub-dependencies in a special format:

      freeze makes the requirements.txt like a package.json.

    1. Do you think there is information that could be discovered through data mining that social media companies should seek out

      sometimes, I get coupons code from McDonald give me special coupon codes, which my other friend don't get, for example, I get a 50% off for a ice cream cone, but my friend just got 30% off, we don't know why and what makes the difference, but maybe McDonald's think I can only afford 50% for the cone, they want me to spend money on it, so they give me a higher discount. I am not sure if this can be considered as "fair", but I think this coupons are based on personal income, and asset.

    1. Valentine, Greta, and Kate Barron. 2022. “An Examination of Academic Library Privacy Policy Compliance With Professional Guidelines”. Evidence Based Library and Information Practice 17 (3):77-96. https://doi.org/10.18438/eblip30122.

      Abstract

      Objective – The tension between upholding privacy as a professional value and the ubiquity of collecting patrons’ data to provide online services is now common in libraries. Privacy policies that explain how the library collects and uses patron records are one way libraries can provide transparency around this issue. This study examines 78 policies collected from the public websites of U.S. Association of Research Libraries’ (ARL) members and examines these policies for compliance with American Library Association (ALA) guidelines on privacy policy content. This overview can provide library policy makers with a sense of trends in the privacy policies of research-intensive academic libraries, and a sense of the gaps where current policies (and guidelines) may not adequately address current privacy concerns.

      Methods – Content analysis was applied to analyze all privacy policies. A deductive codebook based on ALA privacy policy guidelines was first used to code all policies. The authors used consensus coding to arrive at agreement about where codes were present. An inductive codebook was then developed to address themes present in the text that remained uncoded after initial deductive coding.

      Results – Deductive coding indicated low policy compliance with ALA guidelines. None of the 78 policies contained all 20 codes derived from the guidelines, and only 6% contained more than half. No individual policy contained more than 75% of the content recommended by ALA. Inductive coding revealed themes that expanded on the ALA guidelines or addressed emerging privacy concerns such as library-initiated data collection and sharing patron data with institutional partners. No single inductive code appeared in more than 63% of policies.

      Conclusion – Academic library privacy policies appear to be evolving to address emerging concerns such as library-initiated data collection, invisible data collection via vendor platforms, and data sharing with institutional partners. However, this study indicates that most libraries do not provide patrons with a policy that comprehensively addresses how patrons’ data are obtained, used, and shared by the library.

    1. In the example code

      This is linking to the "putting it all together" (pose.ipynb) example. Is it meant to link to the "Point cloud registration with known correspondences" in the icp notebook?

    1. In 1967, the NASW Code of Ethics was amended to include the followingstatement on nondiscrimination: “I will not discriminate because of race, color,religion, sex, or national ancestry and in my job capacity will work to eliminatesuch discrimination in rendering service, in work assignments, and in employ-ment practices”

      use this for explaining values

    Annotators

    1. However, developing a clear understanding of each of these concepts and their role in organizing code is critical to mastering the art of programming.

      对象、函数、解释器,掌握各个概念之间的关系和它们在代码中的角色,对编码能力非常重要。

    1. Doch die hat Goo­gle mit allen Mitteln bekämpft. Wäre sie von vielen Akteuren übernommen worden, wären Google und Konsorten womöglich gezwungen gewesen, den Quellcode der auf ihren Servern laufenden Software freizugeben, auch für Nut­ze­r:in­nen, die über die Cloud mit dieser Software arbeiten. Seinen eigenen Ent­wick­le­r:in­nen hat der Gigant aus dem Silicon Valley daher die Verwendung von AGPL-basiertem Code schlichtweg verboten.

      Meiner Meinung nach ein sehr interessanter Zug von Google. Sie verzichten lieber auf die Weiterentwicklung ihrer Software, die die Konkurrenz potentiell einsehen könnte, um die eigene Machtstellung zu sichern und auszubauen.

    1. <!--Fonts below are for card three--> <link rel="preconnect" href="https://fonts.googleapis.com"> <link rel="preconnect" href="https://fonts.gstatic.com" crossorigin> <link href="https://fonts.googleapis.com/css2?family=Bebas+Neue&family=IBM+Plex+Serif:ital,wght@1,300&family=Lexend+Deca:wght@400;500&display=swap" rel="stylesheet"> <!--Fonts below are for card four--> <link rel="preconnect" href="https://fonts.googleapis.com"> <link rel="preconnect" href="https://fonts.gstatic.com" crossorigin> <link href="https://fonts.googleapis.com/css2?family=Montserrat&display=swap" rel="stylesheet">

      redundant, in longer pages of code it will slow down your page. Just use one <link>

    1. id="head_one"

      This instance, among others, seems like excessive ID use. Consider using descendant selectors like .card-three h1 {} to make your code more concise.

    1. The name “Learn Enough” refers to the philosophy that you don’t have to learn everything about most subjects—you just have to learn enough to be dangerous. In this context, “dangerous” is meant to have a positive meaning, indicating that you are able to get things done. Part of this philosophy involves know what to leave out, at least initially. For example, some HTML tutorials cover code for submitting information, called a form, but never make forms that do anything!

      .c1

    1. Having no if or switch statement is a huge win. (...) You can see how code like that grows and grows as new shapes get added. The code itself is complex, as all conditionals are. They have extra possible execution paths through the code. Each one needs testing. Each one increases the chance of coding mistakes. This switch statement prejudice is a constant of OOP, as far as I can tell. Anytime polymorphism is explained, it's contrasted with a “scary” switch statement.

      Yeah, I've noticed this. The argument is that if a new type is added you may need to update many switch statements to support the new type.

    1. WHO

      Hello SENG.

      This is an annotation added on top of this web page. This is to illustrate how we can add notes on top of documents on the web using Web Standards.

      It's also my way of using code to illustrate ideas.

    1. <a href="Contact us today to book an appointment!">

      this doesn't link to anything. you need to make it <a href="contact.html"> This is also duplicate code.

    1. Even as the app has transformed into a public square for news and conversation, TikTok’s opaque systems of promotion and suppression fuel worries that China’s aggressive model of internet control could warp what appears there. Many users already are self-censoring, adopting a second language of code words — “unalive,” not dead; “procedure,” not abortion — in hopes of dodging the app’s censors and preserving their chances at online fame.

      Anecdotes of self-censorship to avoid algorithmic censors

      A few paragraphs later in the article there is a story from a high school literature teacher that wont use the word "death" lest it might "stunt his reach."

    1. width="360px">

      I agree with Abbey regarding the P element but leave in the div, and add h3 along with the p. I had some help from Claire , Use the div class for both p and h3 right after the image code.

    1. /* not sure but there was probably a shorter code style I could of used */

      for the border declarations it can be shortened by using shorthand: border-top: 3px solid hsl(...); this gives all the info in one line of code

    1. is a terrible drought that strikes so that at this point there there is a sense that for Indian people to survive the world is going to have to be made anew it's going to have to become a new

      The Ghost Dance was created by American Indians, in response to the unwanted presence/intrusions of white settlers on their territory. As more and more settlers pushed further west, Native populations not only lost the land they had occupied for hundreds of years but the resources they relied upon for just as long. White settlers, in this context, can reasonably be seen as an invasive species to the American landscape. However, rather than out-competing Native populations, colonial settlers dwindled their communities with the disease, forced migrations, and environmental degradation. These circumstances were so drastic to American Indians that they came up with a technique or code, in the Ghost Dance, to help bring about an end to the world they have been forced into. In doing so, they would unlock a new existence free of white settlers who are continually encroaching upon and inflicting themselves on your life. With all this in mind, I would be inclined to agree with Sclonit's assessment of the Ghost Dance from a technological standpoint. It is an organized and systematic practice meant to bring about a specific desired result or experience, in this case, a new world.

    1. ShortWeb adds the ability to automate actions on web pages. It allows you to click links, fill in fields, and more. You can write code to make this happen or just tap through a web page while ShortWeb is recording. You can then save a set of actions for a web page as a macro within the app. Then, in Shortcuts, you can call this macro within a Shortcut. This means that you can automate your way through web pages with Shortcuts easily. There are a few bugs in the ShortWeb app, but on the whole, it’s a solid experience. Give this one a look if you want to or already have created Shortcuts that work on the web! It’ll give you a plethora of new tools and ideas to work with.

      Automatizar páginas web.

    1. teachers stop teaching Black students to code-switch! Instead, we must teach Black students about anti-Black linguistic racism and white linguistic supremacy!

      Black student have to code switch to be accepted academically, this promotes linguistic supremacy because it is teaching all student that what is accepted academically and professionally is not black

    2. We Demand that Teachers Stop Teaching Black Students to Code-Switch!

      Code-switching is something that occurs with bilingualism, and this point is demanding that teachers acknowledge and use the connections between language rather than a strict structure of only white mainstream English as that is a form of injustice.

    1. put in your special Twitter bot passwords skip the fake_tweepy step above take the first line of the code below and replace 'from:fake_celebrity' with something like 'from:cnn'

      This is a very good method that I can smoothly complete my work. It let me know how to find bot passwords. And I can learn a skill.

    1. The way we present ourselves to others around us (our behavior, social role, etc.) is called our public persona. We also may change how we behave and speak depending on the situation or who we are around, which is called code switching.

      I believe that in today society, our public persona have gotten very similar to one another, we as a society dislike the idea of being different, and in some country it is frown upon to stand out, like in the case of Japan and many Asian country. people are have different personality depends who they are with, like how you would behave around your parents versus how you are with your friends or significant other.

    2. While modified behaviors to present a persona or code switch may at first look inauthentic, they can be a way of authentically expressing ourselves in each particular setting.

      A person may have different persona for different situations. Though the persona may differ from who he really is, but it doesn't necessarily mean that the person is inauthentic. Sometimes, a modification of one's behavior is needed just like the following examples indicate. A person's persona may subconsciously change with the surrounding environment.

    1. In other words, our study reframes the discussion in terms of(false) remembering (statements about events that are code-termined by memory information and the context of remem-bering, e.g., ref. 61) rather than (false) memories (supposedstable entities within people’s minds). This, again, is not meant tobelittle the importance of false memories in any way (e.g., bysuggesting that they are inherently transitory); quite to thecontrary, social context will often help to sustain them (e.g., refs.70, 79, 82–84), and there is no doubt whatsoever that false re-membering/memories can have devastating real-life conse-quences. It does suggest, however, that—given the situationaldependency of false remembering demonstrated here—focusingon the mere prevalence of false memories (as if it were a con-stant) in attempts to resolve the false memory debate leads to anunproductive dead end. Future research should rather seek(similar to the system factors approach in eyewitness identifica-tion research; e.g., refs. 85–87) to further our understanding ofsituational factors under the potential control of investigatorsthat minimize the risk of producing/retrieving false memoriesand maximize veridical remembering.

      explanation of the relation to they hypothesis and the conclusion drawn from the data seen/measured

  2. pointersgonewild.files.wordpress.com pointersgonewild.files.wordpress.com
    1. protected static function resolveFacadeInstance($name)

      This page has a neat effect, first apparent with this example, where a blur effect is used on most of the text in the code block, except for lines 11–13 which are shown in sharp focus. (You can mouse over the code block to eliminate the blur effect.)

      .torchlight.has-focus-lines .line:not(.line-focus) {
              transition: filter 0.35s, opacity 0.35s;
              filter: blur(.095rem);
              opacity: .65;
      }
      

      Each line is dumped into a div and the line-focus class set on those which are supposed to be unblurred.

      (For ordinary code blocks without any blur/focus effect, the has-focus-class line is simply not used.)

    1. It’s easier now than ever to access archival materials like those in the Shingwauk collection. In Canada, that’s been made possible due in part to a massive push by large institutions such as Library and Archives Canada (LAC) to collect, digitize and share the records that make up Canada’s history. In 2015, LAC partnered with smaller public archives to create the National Heritage Digitization Strategy (NHDS) as a way to harmonize digitization efforts, establish best practices and fund the digitization of material that sometimes gets left out of the national conversation.

      I think this is amazing to do but also could have some draw backs. It would need to have some sort of security measure as I think some text in any language, religion, culture is considered sacred and should be treated with respect but obviously the world does not work as it should all the time. In my opinion the collection should have two layers to it, one which would be open to general public and a second that can only be accessed through a certain code (which could be through application/online quiz/ etc..) that only allows people that actually care enough to learn to get access what is considered to more "sacred" important files. (although all files would be important)

  3. cod.pressbooks.pub cod.pressbooks.pub
    1. <a href="https://www.google.com/maps/place/Hairpins+Boutique+Salon/@49.6904218,-124.9971279,15z/data=!4m2!3m1!1s0x0:0x859b2cfce3bc31ea?sa=X&ved=2ahUKEwiYi_2Ex6T6AhXNMjQIHYkoCUUQ_BJ6BAhSEAc">Check us out on Google Maps</a></p>

      Google has an embedded maps you can add, google maps>click on the share button>click on the embed a map tab and copy the code.

    1. Skip to content In this repository All GitHub ↵ Jump to ↵ No suggested jump to results In this repository All GitHub ↵ Jump to ↵ In this organization All GitHub ↵ Jump to ↵ In this repository All GitHub ↵ Jump to ↵ Dashboard Pull requests Issues Codespaces Marketplace Explore Sponsors Settings caitgarland Sign out New repository Import repository New gist New organization Sorry, something went wrong. /  ...  /   nic-dgl103-f22  /   assignment-c-dlu-...  /   Clear Command Palette Tip: Type # to search pull requests Type ? for help and tips Tip: Type # to search issues Type ? for help and tips Tip: Type # to search discussions Type ? for help and tips Tip: Type ! to search projects Type ? for help and tips Tip: Type @ to search teams Type ? for help and tips Tip: Type @ to search people and organizations Type ? for help and tips Tip: Type > to activate command mode Type ? for help and tips Tip: Go to your accessibility settings to change your keyboard shortcuts Type ? for help and tips Tip: Type author:@me to search your content Type ? for help and tips Tip: Type is:pr to filter to pull requests Type ? for help and tips Tip: Type is:issue to filter to issues Type ? for help and tips Tip: Type is:project to filter to projects Type ? for help and tips Tip: Type is:open to filter to open content Type ? for help and tips We’ve encountered an error and some results aren't available at this time. Type a new search or try again later. No results matched your search Top result Commands Type > to filter Global Commands Type > to filter This Page Files Pages Access Policies Organizations Repositories Issues, pull requests, and discussions Type # to filter Teams Users Projects Modes Use filters in issues, pull requests, discussions, and projects Search for issues and pull requests # Search for issues, pull requests, discussions, and projects # Search for organizations, repositories, and users @ Search for projects ! Search for files / Activate command mode > Search your issues, pull requests, and discussions # author:@me Search your issues, pull requests, and discussions # author:@me Filter to pull requests # is:pr Filter to issues # is:issue Filter to discussions # is:discussion Filter to projects # is:project Filter to open issues, pull requests, and discussions # is:open nic-dgl103-f22 / assignment-c-dlu-chelsieadam Private Unwatch Stop ignoring Watch 0 Notifications Participating and @mentions Only receive notifications from this repository when participating or @mentioned. All Activity Notified of all notifications on this repository. Ignore Never be notified. Custom Select events you want to be notified of in addition to participating and @mentions. Get push notifications on iOS or Android. Custom Custom Select events you want to be notified of in addition to participating and @mentions. Issues Pull requests Releases Discussions Discussions are not enabled for this repository Security alerts Apply Cancel Fork 0 Starred 0 Star 0 Code Issues 0 Pull requests 0 Actions Projects 0 Security Insights More Code Issues Pull requests Actions Projects Security Insights Open in github.dev Open in a new github.dev tab Permalink main Switch branches/tags Branches Tags View all branches View all tags assignment-c-dlu-chelsieadam/contact.html Go to file Go to file T Go to line L Copy path Copy permalink This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository. chelsieadam Fixed Validation issues Latest commit 44427b0 5 days ago History 2 contributors Users who have contributed to this file 63 lines (57 sloc) 2.04 KB Raw Blame Edit this file E Open in github.dev . Open in GitHub Desktop Open with Desktop View raw Copy raw contents Copy raw contents Copy raw contents Copy raw contents View blame <!DOCTYPE html> <html lang="en"> <head> <link rel="stylesheet" href="style.css"> <link rel="icon" type="image/x-icon" href="images/favicon.ico"> <meta charset="UTF-8"> <meta http-equiv="X-UA-Compatible" content="IE=edge"> <meta name="viewport" content="width=device-width, initial-scale=1.0"> <link rel="preconnect" href="https://fonts.googleapis.com"> <link rel="preconnect" href="https://fonts.gstatic.com" crossorigin> <link href="https://fonts.googleapis.com/css2?family=Open+Sans&family=Raleway:wght@700&display=swap" rel="stylesheet"> <title>Hairpins Salon Contact Page</title> </head> <body> <header> <a href="index.html"><img src="images/hairpins-salon-logo.png" alt="hairpins Logo" width="300"></a> <nav> <ul> <li><a href="index.html">Home</a></li> <li><a href="services.html">Services</a></li> <li><a href="contact.html">Contact Us</a></li> </ul> </nav> </header> <h1>Contact Us</h1> <br> Questions, comments, ready for a new do?<br> We look forward to hearing from you!<br> If you're looking to make an appointment online, please do so here.<br> <br> Monday ~ Closed<br> Tuesday 9:00am ~ 5:00pm<br> Wednesday 9:00am ~ 8:00pm<br> Thursday 9:00am ~ 8:00pm<br> Friday 9:00am ~ 5:00pm<br> Saturday 9:00am ~ 4:00pm<br> Sunday ~ Closed<br> <br> Hairpins Boutique Salon<br> #4 - 224 6th Street<br> Courtenay, BC V9N 1M1<br> <br> <a href="https://www.google.com/maps/place/Hairpins+Boutique+Salon/@49.6904218,-124.9971279,15z/data=!4m2!3m1!1s0x0:0x859b2cfce3bc31ea?sa=X&ved=2ahUKEwiYi_2Ex6T6AhXNMjQIHYkoCUUQ_BJ6BAhSEAc)">Check us out on Google Maps</a><br> <br> Tel: (250) 338-7467<br> <a href="tel:250-338-7467">Click to Call</a><br> <br> Email: salon.hairpins@gmail.com <br> <a href="mailto:salon.hairpins@gmail.com">Email us</a><br> <br> <a href="contact.html">Conctact us today to book an appointment!</a> <br> <div class="footer"> Content taken from<a href="http://www.hairpins.ca/">https://www.hairpins.ca/</a>Used for educational purposes only. </div> </body> </html> Copy lines Copy permalink View git blame Reference in new issue Go Footer © 2022 GitHub, Inc. Footer navigation Terms Privacy Security Status Docs Contact GitHub Pricing API Training Blog About You can’t perform that action at this time. You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. .user-mention[href$="/caitgarland"] { color: var(--color-user-mention-fg); background-color: var(--color-user-mention-bg); border-radius: 2px; margin-left: -2px; margin-right: -2px; padding: 0 2px; } assignment-c-dlu-chelsieadam/contact.html at main · nic-dgl103-f22/assignment-c-dlu-chelsieadam

      Similar comments - I would use unordered lists and paragraph elements where necessary!

      Also, I found there is an < address > tag you could use. Learn more: https://www.w3schools.com/tags/tag_address.asp

    1. interactive form created by the code and frictionless data package presented alongside this work [40  Reference40DongR, CameronD, BedoJ Supporting data for “svaRetro and svaNUMT: modular packages for annotating retrotransposed transcripts and nuclear integration of mitochondrial DNA in genome sequencing data”. GigaScience Database, 2022; http://dx.doi.org/10.5524/102318.].

      See GigaBlog forr more http://gigasciencejournal.com/blog/frictionless-data-interactive-figures/

    1. George thought he would be terrified, that every thought of what this means for the algorithm would overtake him.

      I suppose I get it now, there is a subtle underlying theme in their dynamic of an algorithm, the algorithm of them, like when George talked about a deviation in their code or something

    1. Using CRISPick28 (see methods) we selected a sgRNA with a predicted on-target score of 0.7 that cut 40 bp upstream of the BST2 stop codon. We designed homology arms that covered both sides of the predicted Cas9 cut site (Fig. 2a)

      Based on lots of tagging of endogenous loci using a similar plasmid based delivery system in C. elegans, I would think that you should be able to get away with moving the insertion site to the C-terminus (in this case) or N-terminus (in other cases) - it would be great to test this out - but we (and others) have found that if you re-code the homology arm in your repair plasmid between the cut site and insertion point, you can get KIs when you push the distance between the cut site and the insertion site, avoiding having to insert your FP into the gene. Is there a locus of interest with some different sgRNAs at the C- or N-terminus that you could experimentally try this out with (like one that would split the insertion site, one that would be ~10-20 bp away, and one that would be ~50-60bp away)?

    1. 2016 report from the Obama-era Office of Science and Technology Policy warned that the impact of artificial-intelligence-driven algorithms on workers has the potential to worsen inequality, and noted that bias buried in computer code could disadvantage individuals in a host of fields.

      potential government source.

    1. Reviewer #2 (Public Review):

      In this work Kado and colleagues analyzed cell membrane partitioning in Mycobacterium smegmatis. Based on the membrane fluidizing effect of benzyl alcohol they did a transposon sequencing that are sensitive to the treatment. Among a group of genes that code for antiporter, they identify the bifunctional PBP PonA2 to be involved in benzyl alcohol sensitivity. Membrane partitioning in domains with higher and lower fluidity seems to depend on the peptidoglycan cell wall. In particular, de novo partitioning depends on preexisting cell wall, but not on the active synthesis. The authors use a variety of techniques to support their claims.

      The authors claim that the membrane in Msmeg is partitioned in IMDs (intracellular membrane domain) and a PM-CW (apparently a more rigid membrane domain). I know that the term IMD has been used before, but I find this misleading. Intracellular means that something is within the cell. Here we are talking about different fluidities within the 2D space of the membrane. I do not think that this term is meaningful and should be used.

      The authors suggest that PonA2 regulates the density (or heterogeneity - I assume the authors mean degree of crosslinking?) of the peptidoglycan, thereby influencing membrane partitioning (lines 371-372). This claim would require a PG analysis and a comparison of the cross-linking degree. The influence of PonA2 on membrane partitioning remains somewhat unclear. While the authors claim that PonA2 was also shown to provide a protective effect against other stresses, such as heat, it is not certain that this has to do with membrane partitioning. Although increase in temperature has certainly an effect on membrane dynamics, heat also triggers unfolded protein response. Bacteria furthermore adapt their membranes quickly to changes in temperature and likely adaption also takes place when other stressors influence membrane fluidity. Also, only the TG defective PonA2 led to the phenotype and not the TP mutation, which would argue against a change in crosslinking.

    1. Now, three years after that watershed Gizmodo story and the tempest that followed, we’re at a similar juncture—but with few lessons learned.

      This part really stuck out to me as here we are learning about misinformation and disinformation and how prevalent it is in society and what we can do to minimize its impact yet the professionals are still learning too! I have learned so many tools within this course and past courses and I think educating everyone on these matters is the only solution. I think we could be 10 years down the road and in this exact same dilemma if we are not changing foundational things within our society mainly educating on this very subject. We have to have a society that understands the content they are intaking is not real or out of context and one that is better informed overall. Facebook employees although very intelligent can't code a magic algorithm to make its users understand mis/disinformation it has to be bigger than that and we have to do better as a whole to fix these issues that keep happening.

    1. t appears that being a highly religious person is asso-ciated with more stability and satisfaction with close relationships. This may be associat-ed with living according to a spiritually-oriented code, being together as a family in reli-gious worship, sharing in a common purpose, and hearing weekly reminders of Christianvalues

      More stability and satisfaction

    1. Our results pose questions with both theoretical and practicalimpact. The current results show that infants readily learn fromforeign-language exposure, which raises a question about thehuman capacity to acquire more than one language. Because thephonetic contrasts of different languages often require differentperceptual groupings (in Japanese, for example,randlbelongto the same category, whereas in English they are separate),bilingually raised infants must learn two different ways ofclassifying speech sounds. Are there limits on infants’ abilities tolearn the phonetic cues of different languages? Infants in thecurrent tests readily acquired phonetic information from alanguage they had not previously heard, suggesting that, at leastat this age, infants can learn from exposure to two distinct lan-guages. A second question is the resilience of early phoneticlearning; that is, does short-term exposure during a sensitive periodfor learning have lasting effects? In our tests, the ability to discrim-inate the Mandarin contrast was unaffected after an almost 2-wkdelay. Follow-up studies on these infants are now underway toassess the long-term impact of early foreign-language exposure.Early language learning may find a useful biological frameworkin Greenough and Alcantara’s ‘‘experience-expectant’’ learning(51). Experience-expectant learning has two principal components:neural development that occurs in anticipation of the opportunityto learn and environmental information that is reliably present atthat time. Infants’ avid language learning in the second half-year oflife, shown both in the present study and in previous studies (3, 19),could be indicative of neurological development that enables in-fants to neurally code the properties of language. The environmentreliably provides language in a social context. Neural developmenton a maturational timetable and critical environmental informationin a social setting may combine to provide an example of experi-ence-expectant learning for language acquisition.Traditional theories describe phonetic learning as an innate‘‘modular’’ process in infants, one specific to speech and encap-sulated in a way that isolates it from general systems (52, 53). Analternative view, buttressed by data on categorical perceptiontests in infants and animals, argues that infants’ initial capacitiesare based on more general perceptual systems (54). The presentdata support the view that language acquisition initially draws ona broad set of perceptual, cognitive, and social abilities. Acorollary argument, addressing evolutionary origins, suggeststhat language evolved to capitalize on preexisting general sys-tems and then went beyond them (55, 56). The characteristicsthat facilitate language acquisition in children may thus haveinfluenced the nature of language itself, ensuring that languagewas learnable by infants in natural settings.

      conclusion

    1. The "District Student Code of Conduct(Links to an external site.)Links to an external site." for Contra Costa Community College District The "Academic Integrity Policy(Links to an external site.)Links to an external site." from DVC as well as a list of "Academic Dishonesty (Links to an external site.)Links to an external site." resources.

      Are this the only resources we need for class or do we also need something to take notes and turn in class to show proof that we are doing the work ?

    1. The solution to all these problems is the same as the answer to “How do I organise my journals if I don’t use cornflakes boxes?” Use the internet. We can change papers into mini-websites (sometimes called “notebooks”) that openly report the results of a given study. Not only does this give everyone a view of the full process from data to analysis to write-up – the dataset would be appended to the website along with all the statistical code used to analyse it, and anyone could reproduce the full analysis and check they get the same numbers – but any corrections could be made swiftly and efficiently, with the date and time of all updates publicly logged.

      This seems not feasible for a few reasons: lack of tools, lack of incentives for researchers, and variance of the output for readers. The author already mentions the lack of tools/skills to make this view possible. This is perhaps the most easily solvable problem here. With time, new tools could come that would require less skills while scientists could teach themselves new skills as well. Scientists are already a highly skilled group and certainly it is plausible the next generation could become familiar with publishing these kinds of notebook reports. The bigger problems are the lack of incentives for researchers and the variance in output for readers that this will generate. Academia moves slowly and respect is still granted through journal articles and conference papers. Any one researcher moving away from this will not be able to have a career. At the same time, since publishing journal articles and conference papers is already so time consuming, it is often hard for researchers to find time for writing an additional interactive website to document their findings, thereby setting up a new infrastructure. Now, even if we can solve the above two problems, it's still not clear that this would actually lead to an improvement for the readers. Yes, the interactive articles from New York Times are indeed more pleasant and more informative than the average journal article. However, take an average Jupyter notebook. It meanders between code and figures. Often the computations are not in the same order as the clearest story of the findings. The figures are often non-interactive and thus really no better than the figures in journal articles. What do we really gain here? A strong standard for such computational notebooks could resolve this, but it may limit researchers just as much as journal articles. Perhaps a peer review / mentoring process could enforce readability , although it would need some external funding to really happen.

    1. However, the code as it stands associates the block with bar, and if the code is currently working, the programmer should definitely not parenthesize the parameter, because it would change the meaning of the code.
    1. Once we have some types of data representation on a computer, we can create different groupings of data to represent more types of data. We’ll look at two types of groupings here: Lists and Dictionaries.

      I think it is interesting that the last two lessons touch on how to code with data displayed by a tweet. There is so much more to a like, share, and retweet that the basic person would not understand without taking grouping data into account.

    1. Note: Python isn’t by default set up with event style programming. We won’t be directly doing event programming in this book.

      Is it not possible to use code like "whileTrue" and if/else statements to program similar functions? Or does this mean there's simply not a "when" command?

    1. Each cell is ‘run’ or ‘executed’ by either pressing a key combination or selecting one of the buttons at the top of the page.

      It is not clear which button at the top of the page a user should select. I recommend stating in parentheses 'the rocket symbol' at the top of the page for clarification.

      The Live Code button took about 12 mins to run and display the code (maybe this might be worthwhile stating in the notebook). Only the jupyterhub button seems to work perfectly. Binder does not work.

    1. technical innovations

      Digital technology has radically changed our lives. Technology is becoming meaningful in and of itself, which cannot but affect the text by changing the parameters of its internal organisation and the way it is perceived by the reader. At the moment At the moment, technology itself has acquired the capacity to generate media objects At the moment, technology itself has gained the ability to generate media objects (including fiction texts), while software code, along with natural language, participates in the shaping of a statement and its meaning.

    2. Digital technology has radically changed our lives. Technology is becoming meaningful in and of itself, which cannot but affect the text by changing the parameters of its internal organisation and the way it is perceived by the reader. At the moment At the moment, technology itself has acquired the capacity to generate media objects At the moment, technology itself has gained the ability to generate media objects (including fiction texts), while software code, along with natural language, participates in the shaping of a statement and its meaning.

    1. puisqu’il peut s’agir de répondre à des appels d’offre.

      Remplacer par "puisque ces demandes sont régies par le code des marchés publiques et qu'elles demandent de répondre à des appels d'offres"

    1. Moor's reasoning regarding the malleability of computers and the complexity of computer related issues shows why there is a need for a separate branch of ethics dealing directly with cybertechnologies. The many policy vacuums that were created during the spread of the internet show that our existing code of ethics is not extensive enough to govern all cybertechnologies.

    1. demands a fair amount of their time and that of others before they have any idea of what goes on.

      I think that problem still exists today. I once took over a project that a graduate student didn't finish when I was an undergraduate. Although the code was somewhat commented, there were no specific document to introduce the project. As a result, I spent a lot of extra time trying to figure out the technical details and system structure inside. Therefore, it is important to maintain a complete project documentation during the project.

    1. <body> <!--F22 DGL 103 CVS1 - RENU GOPINATHA PILLAI - Assignment A --> <h1>Hello World</h1> <p>This is Renu Gopinatha Pillai speaking.</p> <img src="images/HTML.png" width="400" alt="HTMLCSS"> </body> </html>

      Indenting the h1 and p elements makes it visually easier to read the code.

    1. “I always thought that was sort of black magic,” Vint Cerf, a pioneer of Internet infrastructure, told me.

      Vint Cerf on NTP

      If Vint Cerf thinks it is black magic, you know it is going to be deeply complex code. The rest of the article bears this out.

    1. While the HolyC compiler builds things, it internally keeps a sourcemap (much like webapp sourcemaps or how gcc relates errors at runtime to lines of code for the developer) of all of the functions it compiles.

      Insane! Can interactively browse the code like a UI

    2. ll ram is identity-mapped too, so sharing memory between tasks is as easy as passing a pointer. There's a locking intrinsyc too. It has full documentation (with graphical diagrams) embedded directly in source code.

      Insane mode

    1. so big companies, like Apple, saw new restrictions coming in at the same time as more aggressive enforcement, and said "well shit, we want to base our software on these handy convenient tools like GCC but we can't use GPLv3 software while keeping our hardware and software as locked together as we'd like." so they started pouring money into a new C compiler, LLVM, that was instead open source.

      THis is new for me, and a fascinating case of how open source ecosystems work today. This is no longer about alternatives to dominant, corporate code, but about a standard for creating dominant code.

  4. Sep 2022
    1. Compilers

      I'd like to say that something like the language or even the IDE is important. Some platforms are much easier and more elegant to use, which allows for easier programming and sometimes allows you to code in a more robust format.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2022-01574

      Corresponding author(s): Casey, Greene

      1. General Statements [optional] We thank the reviewers for their thorough feedback. We have addressed all the points raised, revised the manuscript accordingly, and explained our changes below. To aid readability, the reviewers’ comments have been converted to italics, and our responses have been bolded.

      Point-by-point description of the revisions

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The authors systematically evaluate the performance of linear and non-linear ML methods for making predictions from gene expression data. The results are interesting and timely, and the experiments are well designed.

      I have a few minor comments:

      - It was hard for me to understand Figure 1B. I think a figure like this would be very helpful however. What do the numbers represent? If sample ID, then I am not sure why x-axis label is also "samples"

      - For analysis of GTEx data, not sure what "studywise splitting" would mean, since the GTEx dataset is one study? Do you leave out the same individuals from all tissues for evaluation?

      We thank the reviewer for their input on these two points. To make Figure 1B clearer and to elaborate on our stratified splitting methods, we have amended its description to “We stratify the samples into cross-validation folds based on their study (in Recount3) or donor (in GTEx). We also evaluate the effects of sample-wise splitting and pretraining (B).”

      - I found the sample size on x-axis of Fig 2a confusing. If I understand correctly, GTEx has a total of ~1000 subjects. So in some sense, effective sample size can not be bigger than 1000. If you are counting subjects x tissue as sample, then it can be misleading in terms of the effective sample size.

      We thank the reviewer for this point. To incorporate it into the manuscript, we’ve added the following text to the description of Fig. 2: “It is worth noting that "Sample Count" in these figures refers to the total number of RNA-seq samples, some of which share donors. As a result, the effective sample size may be lower than the sample count. “

      - Would be interesting to assess out-of-sample generalizability of linear and non-linear models. Have you tried training on GTEx and predicting on Recount3 or vice versa?

      This question intrigued us. We reran the tissue prediction experiments from the manuscript on a subset of the GTEx and Recount3 datasets in which we performed an intersection over tissues and genes. We found that in the out-of-sample domain the logistic regression model and the three layer neural network performed similarly, while the five layer net generally had a lower accuracy despite having similar accuracy in the training domain. We also found (consistent with our results in the paper) that GTEx predictions are an easier task than their Recount counterparts. Below are plots demonstrating these findings:

      [These plots appear in the PDF but do not appear to work in the ReviewCommons Form].

      Reviewer #1 (Significance (Required)):

      Important and timely study, evaluating linear vs non-linear methods for predicting phenotype from gene expression datasets.

      We appreciate the reviewer’s positive comments on the timeliness of our manuscript.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary

      The authors want to assess the presence of non-linear signal in gene expression values in the task of tissue and sex classification. They use logisitic regression classifiers and two types of neural networks, with 3 and 5 layers, and assess classification performance on two large expression datasets from Recount3 and GTEX and three simulated datasets.

      The authors carefully construct their learning setup in such a way that one can reason about the removal of linear signal from the expression features. The interesting conclusion is, that although the linear approach works well on both datasets, and sometimes even better than the more complex models. The authors convingly show, that there is a significant non-linearity in the gene expression data. However, just because it is "there" does not imply that any non-linear methods performs better.

      Major comments:

      - Are the key conclusions convincing?

      The authors did a good job in showing, that there is non-linear signal in gene expression features for the classification problems studied.

      We thank the reviewer for their positive feedback.

      - Should the authors qualify some of their claims as preliminary or speculative, or

      remove them altogether?

      The overall claims of the authors are justified, the discussion may be improved.

      We appreciate the reviewer’s support for our overall claims and we have adjusted the manuscript as noted point by point below.

      - Would additional experiments be essential to support the claims of the paper?

      No, additional experiments are not essential. But the authors did not compare to other non-linear methods such as SVM or knn-classifiers in the resulst or conclusion section. It is unlikely that the main conclusion would change if those methods were tried. But it is possible that other "simpler" non-linear methods, such as knn for example, are able to outperform the logistic regression classifier on the GTEX and Recount3 data set. Thus, the authors should at least mention this as part of the conclusion and could extend their discussion on the implications of their study concerning other tasks or models.

      We agree that there should be more discussion of other models in the conclusion section. We have updated the fifth paragraph of the conclusion accordingly:

      “We are also unable to make claims about all problem domains or model classes. There are many potential transcriptomic prediction tasks and many datasets to perform them on. While we show that non-linear signal is not always helpful in tissue or sex prediction, and others have shown the same for various disease prediction tasks, there may be problems where non-linear signal is more important. It is also possible that other classes of models, be they simpler nonlinear models or different neural network topologies are more capable of taking advantage of the nonlinear signal present in the data.”

      - Are the suggested experiments realistic in terms of time and resources?

      Not applicable.

      - Are the data and the methods presented in such a way that they can be reproduced?

      There is a separate github repo which has the code to reproduce the analyses. This is good. However, would be nice to explain in more detail in the manuscript how the limma function was used for removing the linear signal, as they mention the "removeBatchEffect" function was used, but it would be good to tell the reader how that works, as this is their way for assessing the effect of linear-signal removal. Are there any limitations for the assessment of signal removal in this way?

      We thank the reviewer for their input, and have updated the model training section on signal removal to read: “We also used Limma[24] to remove linear signal associated with tissues in the data. We ran the ‘removeBatchEffect’ function on the training and validation sets separately, using the tissue labels as batch labels. This function fits a linear model that learns to predict the training data from the batch labels, and uses that model to regress out the linear signal within the training data that is predictive of the batch labels.”

      We have also elaborated on the limitations of signal removal by updating the sentence “This experiment supported our decision to perform signal removal on the training and validation sets separately, as removing the linear signal in the full dataset induced predictive signal (supp. fig. 6)” to read “This experiment supported our decision to perform signal removal on the training and validation sets separately. One potential failure state when using the signal removal method would be if it induced new signal as it removed the old. This state can be seen when removing the linear signal in the full dataset(supp. fig. 6).”

      - Are the experiments adequately replicated and statistical analysis adequate?

      Yes

      Minor comments:

      - Specific experimental issues that are easily addressable.

      no

      - Are prior studies referenced appropriately?

      Yes

      - Are the text and figures clear and accurate?

      *Also, they conducted 3 different experiments in Figure 3. It would be useful to separate the figure into 3) A, 3) B, and 3) C and link that specifically in the text. Figure 4 is an extended version of Figure 2, just with the additional results of the signal removed performances. *

      We appreciate the feedback. To make the figure and the text more clear, we have added A, B, and C subheadings to figure 3, and updated the subfigure’s references within the text accordingly.

      First, the pairwise results in 4B are hard to read as the differences in colors and line type are difficult to see as some lines are short. Second, we did not find it helpful to reproduce the full signal approach in Figure 4. We would suggest to make Figure 4 as Figure 2, and simply only talk about the Full signal mode in the beginning, how it is in the text.

      We agree. We have made Figure 4 our new Figure 2 and updated the references in the text.

      Further, it would be nice to give better names in the legends of these plots. Pytorch_lr is not a nice name.

      We thank the reviewer for pointing this out. We have updated the names in the legends to be “Five Layer Network”, “Three Layer Network”, and “Logistic Regression”

      - Do you have suggestions that would help the authors improve the presentation of

      their data and conclusions?

      As the Recount3 dataset is different in quality and complexity it would be reasonable to show the results of the binary classifcation also in the main paper. In particular, as this behaves different to the GTEX binary classification.

      We have now moved the Recount binary classification figure from the supplement to join the GTEx binary classification data as the new figure 4.

      -The title is somewhat unprecise. It may induce the impression that the paper is about expression-prediction, although that is not the case. Further, in the abstract they don't mention what prediction problem they solve and that these are classification problems. After reading the paper it is clear why the authors choose that, but we are suggesting an alternative title that the authors may consider:

      The effect of nonlinear signal in classification problems using gene expression values

      We agree with the reviewer’s comment and have updated our title to “The effect of non-linear signal in classification problems using gene expression”

      Further, they should give more details on the problem learned in the abstract.

      We thank the reviewer for their feedback, and have added details to the abstract about the problem domains. The relevant sentence now reads “We verified the presence of non-linear signal when predicting tissue and metadata sex labels from expression data by removing the predictive linear signal with Limma, and showed the removal ablated the performance of linear methods but not non-linear ones.”

      *-In addition, the conclusion section, which may be title as Disucssion and Conclusion, could contain additional points concerning the topology and training of the neural networks. *

      We have updated the heading of the final section to Discussion and Conclusion. To expand on the potential drawbacks of our neural network topologies, we have also updated the limitation portion of Discussion and Conclusion to read “We are also unable to make claims about all problem domains or model classes. There are many potential transcriptomic prediction tasks and many datasets to perform them on. While we show that non-linear signal is not always helpful in tissue or sex prediction, and others have shown the same for various disease prediction tasks, there may be problems where non-linear signal is more important. It is also possible that other classes of models, be they simpler nonlinear models or different neural network topologies are more capable of taking advantage of the nonlinear signal present in the data.”

      Obviously, it is possible that other simpler or more complex neural networks have a better performance on the GTEX and Recount3 data sets compared to logistic regression. In fact, the results from Figure4 suggest that, as there is clearly useful non-linear signal in those datasets for the classification problems studied. However, optimizing a non-linear model is inherently more complex and time-consuming, and thus may not be done thoroughly in previously published papers. Compared to a linear model that is easier and faster to optimize, this may be one reason why studies find that, despite non-linear signal, the linear model performs better. Other factors such as the samples size, which the authors already mention, of course also plays a big role, and if hundreds of thousands of datasets would be there , e.g. from single cell measurements, non-linear methods may have a better chance of outcompeting linear models.

      We agree, which is why we consider the signal removal experiment to be so important. By demonstrating that the non-linear methods we used were in fact learning non-linear signal we were able to show that there was something that non-linear models were able to learn that logistic regression was unable to. That is to say that while the presence of non-linearity in the decision boundary is necessary for non-linear models to outperform linear ones, it is not by itself sufficient. Perhaps with more data or a different model non-linear methods would perform better, but there is certainly a class of models and problems where logistic regression is preferable.

      Reviewer #2 (Significance (Required)):

      The submitted manuscript adds to the discussion of the necessity of non-linear models when solving classification problems using gene expression data. The significance is mostly technically, as a comparison of logistic regression and two neural network topologies that are being compared on two large expression datasets. However, there is also a conceptual part of the contribution, which is with regards to the implications of their experiments.

      Interested audience would be computer scientists and bioinformaticians or others, that are involved in creating or interpreting these or similar prediction models.

      Our field of expertise is in the creation of machine learning models using different types of OMICs data. All aspects of the work could be assessed.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      In this manuscript, the authors discuss an interesting problem regarding the comparative performance of linear and non-linear machine learning models. The main conclusion is that logistic regression (linear model) and neural networks (non-linear model) have comparable performance if the data contain both linear and non-linear relations between the features (X) and the prediction target (Y), however, if the linear component in the X-Y relation is removed (e.g. regressed out) the neural networks will outperform logistic regression. This conclusion implies that linear models such as logistic regression mainly relies on the linearity in the X-Y relation.

      However, whether X-Y relation has a linear component and whether the data (e.g. for different Y classes) are linearly separable are two different questions. For example, consider a data generating mechanism, y=x^2+x and label the data points using two classes (y1). Clearly, the data is linearly separable, and any machine learning algorithm should perform very well on this problem. Now remove the linear component form the X-Y relation and use y=x^2 to generate the data. The data is still linearly separable, and the performance of logistic regression should not be affected.

      We agree that there is a difference between optimal linear decision boundaries and linear relationships between elements in the training data. Our use of the term “relationship” in place of “decision boundary” was imprecise. To make this more clear, we have made the following changes:

      Introduction:

      “Unlike purely linear models such as logistic regression, non-linear models should learn more sophisticated representations of the relationships between expression and phenotype.” -> “Unlike purely linear models such as logistic regression, non-linear models can learn non-linear decision boundaries to differentiate phenotypes.”

      “However, upon removing the linear signals relating the phenotype to gene expression we find non-linear signal in the data even when the linear models outperform the non-linear ones.” -> “However, when we remove any linear separability from the data, we find non-linear models are still able to make useful predictions even when the linear models previously outperformed the nonlinear ones.”

      Discussion and conclusion:

      We removed the following paragraph: “Given that non-linear signal is present in our problem domains, why doesn’t that signal allow non-linear models to make better predictions? Perhaps the signal is simply drowned out. Recent work has shown that only a fraction of a percent of gene-gene relationships have strong non-linear correlation despite a weak linear one [23].”

      The point is that the performance of linear models is mainly dependent on whether the data are linearly separable instead of the linearity in X-Y relation as the manuscript suggests.

      We agree that this is the key point and appreciate the reviewer for helping us to more carefully hone the language to convey this point.

      Reviewer #3 (Significance (Required)):

      The performance comparison between linear and non-linear machine learning models is important.

      We appreciate the reviewer’s recognition of the significance of the work.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      The authors want to assess the presence of non-linear signal in gene expression values in the task of tissue and sex classification. They use logisitic regression classifiers and two types of neural networks, with 3 and 5 layers, and assess classification performance on two large expression datasets from Recount3 and GTEX and three simulated datasets. The authors carefully construct their learning setup in such a way that one can reason about the removal of linear signal from the expression features. The interesting conclusion is, that although the linear approach works well on both datasets, and sometimes even better than the more complex models. The authors convingly show, that there is a significant non-linearity in the gene expression data. However, just because it is "there" does not imply that any non-linear methods performs better.

      Major comments:

      • Are the key conclusions convincing?

      The authors did a good job in showing, that there is non-linear signal in gene expression features for the classification problems studied. - Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

      The overall claims of the authors are justified, the discussion may be improved. - Would additional experiments be essential to support the claims of the paper?

      No, additional experiments are not essential. But the authors did not compare to other non-linear methods such as SVM or knn-classifiers in the resulst or conclusion section. It is unlikely that the main conclusion would change if those methods were tried. But it is possible that other "simpler" non-linear methods, such as knn for example, are able to outperform the logistic regression classifier on the GTEX and Recount3 data set. Thus, the authors should at least mention this as part of the conclusion and could extend their discussion on the implications of their study concerning other tasks or models. - Are the suggested experiments realistic in terms of time and resources?

      Not applicable. - Are the data and the methods presented in such a way that they can be reproduced?

      There is a separate github repo which has the code to reproduce the analyses. This is good. However, would be nice to explain in more detail in the manuscript how the limma function was used for removing the linear signal, as they mention the "removeBatchEffect" function was used, but it would be good to tell the reader how that works, as this is their way for assessing the effect of linear-signal removal. Are there any limitations for the assessment of signal removal in this way? - Are the experiments adequately replicated and statistical analysis adequate?

      Yes

      Minor comments:

      • Specific experimental issues that are easily addressable.

      no - Are prior studies referenced appropriately?

      Yes - Are the text and figures clear and accurate?

      Also, they conducted 3 different experiments in Figure 3. It would be useful to separate the figure into 3) A, 3) B, and 3) C and link that specifically in the text. Figure 4 is an extended version of Figure 2, just with the additional results of the signal removed performances. First, the pairwise results in 4B are hard to read as the differences in colors and line type are difficult to see as some lines are short. Second, we did not find it helpful to reproduce the full signal approach in Figure 4. We would suggest to make Figure 4 as Figure 2, and simply only talk about the Full signal mode in the beginning, how it is in the text. Further, it would be nice to give better names in the legends of these plots. Pytorch_lr is not a nice name. - Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      As the Recount3 dataset is different in quality and complexity it would be reasonable to show the results of the binary classifcation also in the main paper. In particular, as this behaves different to the GTEX binary classification. - The title is somewhat unprecise. It may induce the impression that the paper is about expression-prediction, although that is not the case. Further, in the abstract they don't mention what prediction problem they solve and that these are classification problems. After reading the paper it is clear why the authors choose that, but we are suggesting an alternative title that the authors may consider:

      The effect of nonlinear signal in classification problems using gene expression values

      Further, they should give more details on the problem learned in the abstract. - In addition, the conclusion section, which may be title as Disucssion and Conclusion, could contain additional points concerning the topology and training of the neural networks. Obviously, it is possible that other simpler or more complex neural networks have a better performance on the GTEX and Recount3 data sets compared to logistic regression. In fact, the results from Figure4 suggest that, as there is clearly useful non-linear signal in those datasets for the classification problems studied. However, optimizing a non-linear model is inherently more complex and time-consuming, and thus may not be done thoroughly in previously published papers. Compared to a linear model that is easier and faster to optimize, this may be one reason why studies find that, despite non-linear signal, the linear model performs better. Other factors such as the samples size, which the authors already mention, of course also plays a big role, and if hundreds of thousands of datasets would be there , e.g. from single cell measurements, non-linear methods may have a better chance of outcompeting linear models.

      Significance

      The submitted manuscript adds to the discussion of the necessity of non-linear models when solving classification problems using gene expression data. The significance is mostly technically, as a comparison of logistic regression and two neural network topologies that are being compared on two large expression datasets. However, there is also a conceptual part of the contribution, which is with regards to the implications of their experiments.

      Interested audience would be computer scientists and bioinformaticians or others, that are involved in creating or interpreting these or similar prediction models.

      Our field of expertise is in the creation of machine learning models using different types of OMICs data. All aspects of the work could be assessed.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements

      We are grateful to the reviewers for their time and expertise, and we have addressed all points they raised as detailed in our point-to-point response and highlighted the changes in the main manuscript. We have addressed all points raised by the reviewers and elaborated how this was done in a point-by-point reply. There are two new tables and a new supplementary figure. The figures and the text have been reshaped, according to the suggestions.

        We are looking forward to your reply.
      
         Best regards, Yannick Schwab and Anna M Steyer
      

      2. Point-by-point description of the revisions

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      **Summary:**

      Serra Lleti et al. report a new software (CLEMSite) for fully automated FIB-SEM imaging based on locations identified beforehand in LM. The authors have implemented routines for automatically identifying common reference patterns and an automated FIB-SEM quality control. This allows autonomous data acquisition of multiple locations distributed over the entire sample dish. CLEMSite has been developed as a powerful tool for fast and highly efficient screening of morphological variations.

      **Major comments:**

      The performance of CLEMSite has been demonstrated by the authors with two typical biological example applications. The stated performance parameters such as correlation precision and reproducibility are highly convincing and supported by the presented data. The authors give detailed information on their workflow and how on to use CLEMSite, which should allow other researchers to implement this for their own applications. The only comment I have in this regard, and I might have overlooked it, but how will CLEMSite be made available to the scientific community?

      Reply 1.1

      We would like to warmly thank Reviewer #1 for their very supportive feedback. It is important to us to share our work with the community. Our prime intention is to offer CLEMSite as a proof of concept that has been demonstrated on a specific instrument, thus linked to a company (Zeiss). Because we are convinced this code can be adapted to other APIs provided by other vendors, we made it fully available via a Github repository (https://github.com/josemiserra/CLEMSite).

      To make this more visible in the manuscript, we have modified this sentence to the first paragraph of the Results section:

      “To control the FIB-SEM microscope, CLEMSite-EM interfaces commercial software (SmartSEM and ZEISS Atlas 5 from Carl Zeiss Microscopy GmbH) via a specific Application programming interface (API) provided by Zeiss. CLEMSite code is openly accessible and free to download from a Github repository (https://github.com/josemiserra/CLEMSite ).”

      **Minor comments:**

      The author mention that decreasing the z-resolution to 200 nm steps was critical to achieve high throughput. For applications that require higher resolution: is the only disadvantage a longer data acquisition time or are there also other limitations?

      Reply 1.2:

      Reviewer #1 is right, we have designed CLEMSite as a screening tool, where we emphasize the number of cells versus the resolution at which each cell is acquired. By acquiring images every 200 nm, we are gaining speed, but also stability. We have indeed noticed that below 50 nm, on occasions the beginning of the acquisition is not stable enough (the milling has to hit the front of the cross-section at view precisely), and it requires manual intervention to retract/advance the milling. In addition, to gain time in our current workflow, we have opted to not cover the region of interest with a platinum protective layer, which has no consequences when imaging at larger z steps because the overall time spent on one cell is very short. At higher z resolution regimes though, a non-protected block surface is inevitably damaged during the successive numerous mill & view cycles. We have added one sentence in the Methods section to make this point clearer.

      “Note that to gain time in the preparation process for a run, we have not covered the ROI with a platinum protective layer and alternatively we increased the thickness of the gold coating of the full sample. In such cases, only low z-resolution acquisition is possible, as acquiring at a higher resolution would require sputtering of the sample surface.”

      Finally, we may argue that if an experiment requires high-resolution acquisition, the time overhead spent to switch from one cell to the next (a few minutes) is not significant anymore relative to the time spent to acquire one cell (from several days to weeks). In such cases, automation for multi-site acquisitions would lose its relevance.

      I would assume that locating the finer structural details in a much larger data set might also introduce additional challenges in the data analysis pipeline.

      Reply 1.3:

      We fully agree with Reviewer #1. In this proof of concept study though, we are not addressing the image analysis part but assess ultrastructural phenotypes manually using established stereology protocols. At the image resolution that we are using, our analysis is restricted to features such as volumes, surfaces, number of rather large organelles. Finer details, such as microtubules or fine contact sites between organelles would require a higher resolution, and indeed very likely other means to extract the morphometric data. State-of-the-art image analysis of isotropic FIB-SEM datasets is based on computer vision/machine learning. With such tools, the analysis of fine details is indeed accessible with very high accuracy, but at the cost of the throughput, at least for now as already mentioned in the Discussion section of the paper.

      In Table 1 in the supplements, the units are missing for the targeting positions. On page 4, 4th line from the bottom, there is a typo in "reaaching a global targeting...".

      Reply 1.4

      We thank Reviewer#1 for their thorough inspection of the paper. We have corrected it accordingly.

      Reviewer #1 (Significance (Required)):

      With CLEMSite, the authors present a powerful new software tool for the FIB-SEM imaging community. The high level of automation allows high throughput data acquisition with minimal user interaction. To my knowledge, this is the first software that fully automatically recognises reference features and is able to run fully autonomously after points of interest have been selected in FM. This high throughput screening tool for FIB-SEM imaging would make a substantial technical contribution to the field of cellular imaging. My own expertise lies in the field of technical developments for CLEM and super-resolution FM. I am not able to judge the biological content of the manuscript.

      We would like to thank Reviewer #1 for their constructive and encouraging feedback.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Review on "CLEMsite, a software for automated phenotypic screens using light microscopy and FIB-SEM" by Serra Lleti et al. The manuscript describes a toolset to correlate LM data with automated FIB-SEM of selected regions of interest. This allows 3D correlative microscopy of multiple adherent cells from a single resin block. This allows much needed high throughput in CLEM analysis to become quantitative. Two applications on Golgi apparatus morphology are shown.

      **Major questions:**

      -The software has been developed in collaboration between Zeiss/ Fibics in collaboration with academic groups and will only function on Zeiss SEMs that have the proper software. Thus, if I understand correct, it will not be of generic use and a more appropriate title would be 'CLEMsite, a software for automated phenotypic screens using light microscopy and Zeiss FIB-SEM"

      Reply 2.1.

      Reviewer #2 is right about the fact that our work was done on a Zeiss microscope and in CLEMSite’s current version, it would only work with that model, including firmware and software. As already phrased in the manuscript, we would like to stress our work is a proof-of-concept. For example, we wrote in the introduction that CLEMSite is a “software prototype”. We’ve also made clearer the links to Zeiss in the first paragraph of the Results (see also answer to reviewer 1.1)

      CLEMSite is by no means designed to become an integrated part of current or future Zeiss microscopes. On the contrary, we have designed the software as an independent unit. All the parts of the software that are sending commands to the Zeiss API are indeed customized to that brand, but other functions are stand-alone units. In particular, the correlation strategy is independent of the microscope type and can be used generically. Similarly, the principles that we developed for finding the FIB-SEM coincidence point, or for selecting features-rich regions to perform the AFAS function would be valid whichever microscope model would be used.

      For these reasons, we would prefer to avoid mentioning Zeiss already in the title of the manuscript.

      • How is the described approach using FIB-SEM advantageous compared to methods like Serial Block-face EM (SBEM) and array tomography using serial section where larger fields of multiple cells can be imaged? Especially because the axial resolution was set to 200 nm and discussed as essential for the throughput speed.

      Reply 2.2.

      This is a very important point that we tried to bring across in the introduction of the manuscript. Other volume EM methods, such as SBEM and AT, like conventional TEM, require an ultramicrotome to produce thin sections (AT and TEM) or to remove the top layers from the resin block (SBEM). This inevitably requires trimming large specimens in order to accommodate the dimensions of the diamond knife used in the associated microtomes. FIB-SEMs does not have such limitations and selected volumes can be imaged from samples of any size, providing they fit in the chamber of the microscope. In our case, we were screening cells growing on a 1 cm2 surface area, which is already beyond what standard diamond knives can process. We would even argue that larger surfaces are at CLEMSite reach, but we have not tested this.

      • Is the data FAIR available?

      Reply 2.3

      It is one of EMBL’s ambitions to make all data as FAIR as possible. For this study, we saved all the raw images and their corresponding embedded metadata as they came from the original software (ATLAS 5, Fibics for the SEM images and LAS X, Leica microsystems for the confocal images). The images published in this manuscript will be deposited on the EMPIAR data repository upon acceptance. The raw data and unpublished data, due to their size, will be fully available upon request to the authors. Additionally, their data is specifically generated for the correlation workflow, which is stored together with the image information as separated files. To store the information of logs we used text files, for intercommunication between processes, JSON, and XML to store coordinates in a readable format. As far as we know, there is no standard FAIR protocol yet that describes CLEM workflows in microscopy. We made our best possible efforts to archive our data in an understandable folder architecture, with detailed information on how to navigate through it, such that we are confident that our data could be mined by others in the future, thus reaching the goals of the FAIR charter.

      • How is CLEMsite available? Is the code public or for sale?

      Reply 2.4

      It is important to us that our proof-of-concept can be used or adapted by others in the future. For this reason, we are sharing the full code that was developed for CLEMSite - See Reply 1.1 for further details.

      **Other comments:**

      • Can you comment on the flexibility of this method? It is described as a flexible method, but only HeLa cells (quite flat cells) and Golgi apparatus targeting was used. What about different cell types and what about targets with a less obvious EM morphology?

      Reply 2.5

      It is correct that we have demonstrated our workflow only on Hela cells which present a more or less homogeneous topology. Yet our workflow is flexible when it comes to the dimensions of the region of interest and the acquisition field of view, and can accommodate a wide range of cell shapes, as long as they adhere to a culture substrate. Dimensions of ROI and FOV can be adapted in the CLEMSite interface as described in Supplementary Figure 4. Following reviewer 2 question, we realize that this feature may not appear clearly and we have modified the corresponding section of the Result:

      “The dimensions of the image stack, as well as the z resolution are set when initializing the run, via the CLEMSite interface (Supplementary Figure 4). Whilst every cell of one run can be acquired with the same recipe (as defined in ZEISS Atlas 5: sample preparation, total volume to be acquired, slice thickness and FIB currents applied at each step), CLEMSite-EM also offers individual definition of recipes, allowing a per cell adaptation of the shape or volume (Supplementary Fig. 4a).”

      Changing the ROI size would thus accommodate the surface occupancy of a cell (in the plane parallel to the culture substrate) and changing the FOV would accommodate the cell’s height.

      The morphology of the cell as it appears in the EM (SESI) does not alter the targeting strategy, since we are solely relying on the correlation, which means that the position of the target cell is extracted from the light microscopy images and the coordinate system provided by the gridded coverslip. Even if the cells were invisible at the surface of the resin block when inspected in the SEM, CLEMSite would still navigate to the proper region and create an image stack by FIB-SEM imaging.

      • For EM acquisition ZEISS smartSEM with ATLAS was used. LM was recorded with a microscope from a different vendor. Can the software be used regardless of microscope type?

      Reply 2.6

      Yes, the correlation is based on collecting the stage coordinates from the light microscope, and on analyzing the images from the various magnifications and channels. This information can be obtained by most microscope types, but it might involve minor adaptations regarding the specific brand of a microscope (e.g. changes in the coordinate system of the stage used or the naming of the channels).

      • Create less variation in the size of scale bars.

      Reply 2.7

      We have modified all figures to take this comment into account and thank Reviewer #2 for a good suggestion.

      • M&M: High-resolution light microscopy: Why call this 'high resolution'?

      Reply 2.8

      We used this term to differentiate, in the feedback microscopy setup, the first stage where images are acquired at low magnification from the images acquired at high magnification. We agree that the term is misleading, so we decided to update the manuscript and change the term high resolution by higher magnification (the second stage in feedback microscopy).

      Specs given seem randomly chosen: For example objective magnification yes, NA not; excitation wavelength yes, emission not.

      Reply 2.9

      We thank Reviewer #2 for spotting these missing details. We have edited the method section to add the NA and the emission wavelengths.

      Reviewer #2 (Significance (Required)): See above: This depends on the availability of code, as well as the usability in FIB-SEM that is not based on Zeiss.

      Reply 2.10

      We hope our answers have addressed these concerns. When the code is indeed fully available, we can not at this stage presume of the transferability of CLEMSite to microscope from other manufacturers. Yet we would like to stress once more that our main aim is to demonstrate a proof of concept.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      **Summary:**

      Schwab and coworkers present an automation software for correlative light and electron microscopy (CLEM) to acquire high-resolution SEM volume datasets at room temperature. This automation enables large-scale data collection and morphometric analysis of cell phenotypes. The paper is overall well written, but often assumes a lot of prior knowledge of the workflow, which might not be present in a general audience or for newcomers to the technique. This is also seen in the insufficient labeling and explanation of the figures. They seem a bit like presentation slides, which could be well understood with the help of the presenter/narrator, but alone lack a lot of information (see more specific comments below).

      **Major Comments (in no particular order):**

      • Final accuracy of ~ 5 µm ... is this really sufficient? Given that the size of many mammalian cells is ~10-15 µm, this is still a HUGE error. Of course, there is a tradeoff between throughput and accuracy, the area covered and speed. Nonetheless, this means a serious limitation in terms of the kind of targets / biological questions that can be addressed with this technique! (Especially in the context of "rare events") This should be discussed in more detail. Reply 3.1

      We thank reviewer #3 for their constructive criticism of the work. Indeed, our final accuracy is 5 µm at best, which may at first glance appear as a disappointing value. This accuracy is the consequence of a couple of strategic decisions that we have made in designing the workflow, which will be further explained below. We have chosen to constantly opt for a large field of view that would be larger than the average cell size, thus mitigating the potential 5 µm offset in targeting. In our hands, this yielded satisfying results, yet we agree that a higher targeting precision would allow narrower fields of view and potentially an even increased throughput.

      Our correlation strategy fully relies on the coordinate system built from the gridded pattern embossed at the surface of the culture dishes. The precision of CLEMSite automated targeting thus relies on i) its ability to properly detect the grid edges, both at the LM and at the ME, and ii) on the mesh size of the grid. To ensure a wide range of applications, we decided to design CLEMSite on commercial culture dishes, of which the MatTek gridded culture dishes appeared the most convenient, for each grid square presented a unique alphanumeric ID together with a relatively large and flat surface area to accommodate a large number of cells away from the grid pattern. Whilst such dishes showed a topology that satisfied our first criteria, the grid spacing was 600 µm. A smaller mesh size would have undoubtedly resulted in higher precision in the targeting but at the expense of losing free areas. Other commercial dishes with denser meshes unfortunately would not have ID engraved directly inside every square or we experienced difficulties in reproducibility during the sample preparation process to detach the glass from the resin block.

      We also have excluded the option to design our own grids, which would have created another dependency for potential users from other laboratories.

      Another possibility for targeting would be to register the fluorescence maps to the shapes of the cell as visible in the resin block. Adherent cells can be detected in the SEM if high energies are used to scan the surface of the blocks, and also if the block is not coated with a too thick layer of gold. In our experience, switching between voltages for acquiring such overviews and low voltages for acquiring FIB-SEM stacks is another source of imprecision and doesn’t improve the targeting in very confluent areas. Another interesting idea, as shown in Hoffman et al 2020, would be to scan the embedded samples by X-ray prior the FIB-SEM targeting, but not only this would imply that high-end X-ray machines would be available for such tasks, but would still require landmarks to register the X-ray maps to the SEM overviews. This would potentially yield a higher accuracy, but we have opted for the gridded substrates, judged more accessible to a large number of laboratories.

      We tried to explain such a choice in the discussion, by adding this sentence in the description :

      “Detection of local landmarks imprinted in the culture substrate enables automated correlation and targeting with a 5 µm accuracy. We estimate that this number could still be improved by customizing a gridded substrate with a smaller mesh size, as landmarks would be much closer to the targets. The detection algorithm we developed could be extrapolated to other customized dishes or commercial substrates for cell culture in SEM samples41. An advantage of using local landmarks for the correlation is that they mitigate the impact of sample surface defects or optical aberration across long distances. Alternatively, targeting individual cells with a FIB-SEM has been achieved by mapping the resin embedded cells with microscopic X-ray computed tomography44. We speculate that such tools could be an alternative to a gridded substrate, yet cannot predict its adaptability to large resin blocks such as the ones we used in this study. “

      • Given that the whole point of the paper is "large scale automation", I would have preferred a few more examples/higher n-count. A comment on which type of targets the authors envision/have validated would be nice (also in the context of the limitation in accuracy). Reply 3.2

      To our best knowledge, no one has ever imaged multiple cells automatically. So even 5 in a row is a high number.

      In addition to this, we added an extra paragraph in the discussion.

      “We believe that other research questions could benefit from this type of screening. As an example, the 2021 Human Protein Atlas Image Classification competition61 managed to classify multiple organelles of individual cells in fluorescence microscopy. Such machine learning models could be used to find rare events or particularly interesting phenotypes. In another example, in host-pathogen interactions, early infected cells might start to display a recognizable phenotype in a small subpopulation of cells62. In both cases, those marked cells could be used to establish a FIB-SEM screening to discover new morphological differences at the micrometer level.

      To expand the applicability of these screenings beyond the proof-of-concept here presented, we propose two directions of improvement. First, by acquiring smaller enclosed volumes with isotropic resolution, we could target area-delimited organelles, like centrioles21. In this case, the full cell volume is neglected in favor of a small portion of it, but with higher z resolution. At the software level, that would require improving targeting accuracy by using smaller grids and extending the maps to 3D coordinates. 3D registration against a light microscopy Z-stack would considerably help to constrain the field of view during acquisition, thus reducing the imaging time and keeping the field of view position during tracking. At the instrument level, this would require, first, stabilizing the ion beam before the critical region is acquired, to compensate for the change between high currents for milling and fine currents for sectioning. Second, to make sure that the fine current beam hits exactly the front face of the milled cross-section, and then prevent milling artifacts. Finally, the second direction is to increase the number of samples acquired per session. That would imply ion beams that automatically reheat the Gallium source when it is exhausted (like proposed in Xu et al. 60), with faster algorithms for autofocus and autostigmatism in SEM.”

      • It should be mentioned somewhere that "commercial dishes or coverslips" contain an imprinted grid pattern with numbers and letters to locate specific squares. [Again: probably clear to "aficionados" of the technique but totally unclear to newcomers/outsiders] Reply 3.3

      We have added an explanation of the layout of the coordinate system in the part of the correlation strategy and the methods section “gridded dish with numbers and letters' to explain the correlation and targeting strategy better.

      • "It is important to keep the initial number high in order to compensate for the loss of targets" - what % of targets is lost exactly in the final step (FIB-SEM imaging)? The 10 cells out of 35 (29%!) that were not "of sufficient quality for further downstream analysis", were they lost/discarded because of problems in the automation (e.g. autofocus/tracking failure) or for other reasons (e.g. preservation of the cells during fixation/embedding)? Reply 3.4 In the main text we decide to explain the process of filtering better. We have added a supplementary figure showing different causes for problems of coordinate system detection due to scratches, cracks, or dirt. None of the cells in the study were discarded due to bad preservation, but the system being a proof of concept, we dealt with multiple difficulties that forced us to filter the acquired stacks for getting the ones showing the best quality. We also added supplementary material (Sup. Tables 3 and 4 with explanation) about the possible causes of cell losses during different experiments.

      • "One essential paradigm shift for increasing the acquisition throughput is the decision to decrease the resolution in the z-dimension, thus prioritizing the speed of acquisition and ultimately the total number of cells acquired in one run.". Surely, reducing z-resolution is an obvious way to speed up acquisition times. But this is not tied to the use of this software and obviously comes at a price ... this has been discussed before and is nothing novel. Hence "paradigm shift" might be a bit too strong. I however fully agree with CLEMSite's potential as a screening tool. Could a "high resolution" (isotropic) mode not be implemented, too? [then it would be up to the user to decide what to prioritize - throughput or resolution] Reply 3.5

      We have replaced the word “paradigm shift” with “original strategy”. It is indeed up to the user to decide if higher z resolution or higher speed should be achieved by setting up different recipes.

      Additionally, we direct the reviewer to read Reply 1.2.

      • There is no mentioning of why this specific hardware was used. Are there any limitations that currently restrict the approach to Zeiss machines? Any plans supporting other vendors? Of course, there are always certain benefits with certain instruments. Or just simply no others were available... A comment on which part was performed by/at Zeiss and which in the labs would be useful to understand specific contributions. (Since a conflict of interest statement seems missing). Reply 3.6

      The original plan was to set up a proof-of-principle study developing a program that is fully open source. We created an interface, which could plug any control via proprietary API, by simply adapting commands from the API to our interface. The idea is very similar to what is done in light microscopy open-source controllers, like the Micropilot software (https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3086017/).

      That interface would be the place where to modify the software and add the external API and requires only that such API can be used with a .NET framework in C#. The programmer would have to modify only the following file:

      https://github.com/josemiserra/CLEMSite/blob/master/CLEMSiteServer/TestApp/AtlasCom.cs.

      We expect that FIB-SEMs are very similar across companies, at least in basic functionality (get images, get positions, mill execution for trench digging with a recipe file, ...). We thus believe our software could be adapted to other vendors. As an example, we used the Fibics API, but we could also program the same functionality with the Zeiss API to achieve the same goal.

      Zeiss’s contribution to the project was i) providing a system during the initial phase of the project, and allocating time with programmers from FIBICS to help to provide the control API to be used by CLEMSite.

      All experiments were performed at the European Molecular Biology Laboratory or Max-Planck Institute of Experimental Medicine.

      **Figures:**

      Figures should be improved. They often contain too little information to understand the concepts/results discussed and there's lots of white space. The legends should be improved accordingly. In general, a more concise and structured figure design could go a long way of improving the quality of the manuscript. Please find a few suggestions (for the main figures) below (but the same should be applied to the supporting figures):

      Reply 3.7

      We thank the reviewer for the suggestions on the figures in general. We have revisited all the figures and made corresponding changes as highlighted below.

      Fig. 1: While I believe it is clear to me what each scheme is supposed to represent, someone less immersed in this topic (or just entering the field) may have problems navigating the figure. For example: what are all the different letters and numbers? What's the blue box with the trapezoid ("EM targets" - it may become clear later, but here it is not), what are the blue and the red arrowhead, respectively (I suppose EM and focused ion beam?). This should be improved and labeled accordingly.

      We have addressed the queries by explaining the figure more explicitly in the legend (e.g. blue box, blue and red arrowhead). We have added i, ii, iii to separate b into subsections and adjusted the text accordingly.

      Fig. 2: Again, a lot of annotation is missing. E.g. what is the 3rd insert in b exactly (edge-detection? After CNN identification?)? For most of the figure, yellow squares are used to indicate the zoom-in region, why not for b) 1st row? With the "zoo" of scale bars, wouldn't it make sense to either always show the same bar (e.g. 200 µm), or scale the images so things become more comparable? In this regard: a) 2nd column and e) 1st column represent the same FOV. Why are they shown with different magnification/cropping?

      Reply 3.8

      The scale bars have been homogenized when possible, in the case of b the image was zoomed to match a (first image in both cases). A yellow box was added for the zoom in b as suggested. We added descriptions in the figure legend.

      Fig. 3: a) The procedure is well described in the legend, but no motivation is given in the text, why this is necessary. c) There's some floating density in the white space. Is this due to thresholding?

      Already explained in figure legend.

      Reply 3.9

      We have adapted the text in the main manuscript to explain better that the coincidence point is normally found manually and for a routine with as little as possible intervention by an operator this had to be automated. We have also explained figure 3c more in the figure legend.

      “The following steps, usually performed by a trained human operator, are triggered autonomously: localization of the coincidence point, needed to bring the FIB and SEM beams to point at the same position (Fig. 3a); milling of the trench to expose the imaging surface, detection of the trench to ensure a well-positioned imaging field of view (FOV) (Fig. 3b); automated detection of image features in the imaged surface needed to find an optimal location for the initial autofocus and autostigmation (AFAS) (Fig. 3c); and finally the stack acquisition (Fig. 3d).”

      Fig. 4) Again, description/labeling of the figure is poor. E.g. what are the red outlines present in c) row 1-3 (but missing in row 4; why?)? [presumably these are the siRNA spots?] Is there any reason this figure could not be further subdivided into d), e), f) etc? As it stands, a lot of additional descriptors ("second from the left", "two images on the right") are necessary while a simple call to a), b), c) would be much easier...

      Reply 3.10

      We have more precisely described the siRNA spots in the legend more explicitly and have added headings to divide part c into a grid rather than adding letters/numbers to subdivide to make the figure more clear.

      Fig. 5) Additional labeling (a,b,c...) could be helpful here, too. While intuitively I would assume that blue = DAPI and green = GFP, these things should be labeled or described in the legend. Especially in the 3D rendering it is quite unclear what is being portrayed. Is this an overlay of a FIB/SEM segmentation with the confocal 3D-data?

      Reply 3.11

      We have added headings to subdivide the images in b and explanations in the legends to explain the color-dye relation (blue is DAPI).

      **Minor:**

      The first element in the filtered list can thus be stored for the subsequent application of autofocus and autostigmation procedures (AFAS) (Supplementary Fig. 3c). [technically this has been defined before]

      Reply 3.12

      All typos and grammar-related issues have been addressed in the following ways:

      A transformation is computed to register together the LM list and EM landmarks list, ...

      “A transformation is computed to register the positions from the LM and the EM landmarks list, ... “

      “The FOV is changed from a 305 μm by 305 μm, used for the detection of the trench, to a 36.4 μm by 36.4 μm in the exposed cross-section and an image of that cross-section is taken for analysis (Fig. 3c).”

      The sample is positioned at the target coordinates of the first cell, and the Multisite module performs the coincidence point alignment of both the electron and ion beams (Fig. 3a and Supplementary Fig. 2a).

      Reviewer #3 (Significance (Required)):

      **Significance:**

      It is clear that the kind of automation outlined here is necessary to elevate correlative SEM volume imaging to a "high-throughput" technique, which could become valuable for many biological questions. CLEMSite offers a valid technical solution and appears to be a solid implementation of the traditional/manual workflow. However, its presentation needs to be improved before we can support publication.

      Reply 3.13

      We have worked on different aspects of the presentation, rearranged the figures, and extended figure legends and hope this meets the reviewer’s expectations.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Review on "CLEMsite, a software for automated phenotypic screens using light microscopy and FIB-SEM" by Serra Lleti et al.

      The manuscript describes a toolset to correlate LM data with automated FIB-SEM of selected regions of interest. This allows 3D correlative microscopy of multiple adherent cells ¬from a single resin block. This allows much needed high throughput in CLEM analysis to become quantitative. Two applications on Golgi apparatus morphology are shown.

      Major questions:

      • The software has been developed in collaboration between Zeiss/ Fibics in collaboration with academic groups and will only function on Zeiss SEMs that have the proper software. Thus, if I understand correct, it will not be of generic use and a more appropriate title would be 'CLEMsite, a software for automated phenotypic screens using light microscopy and Zeiss FIB-SEM"
      • How is the described approach using FIB-SEM advantageous compared to methods like Serial Block-face EM (SBEM) and array tomography using serial section where larger fields of multiple cells can be imaged? Especially because the axial resolution was set to 200 nm and discussed as essential for the throughput speed.
      • Is the data FAIR available?
      • How is CLEMsite available? Is the code public or for sale?

      Other comments:

      • Can you comment on the flexibility of this method? It is described as a flexible method, but only HeLa cells (quite flat cells) and Golgi apparatus targeting was used. What about different cell types and what about targets with a less obvious EM morphology?
      • For EM acquisition ZEISS smartSEM with ATLAS was used. LM was recorded with a microscope from a different vendor. Can the software be used regardless of microscope type?
      • Create less variation in the size of scale bars.
      • M&M: High resolution light microscopy: Why call this 'high resolution'? Specs given seem randomly chosen: For example objective magnification yes, NA not; excitation wavelength yes, emission not.

      Significance

      See above: This depends on the availability of code, as well as the usability in FIB-SEM that is not based on Zeiss.

    1. The code above is full of English words like “import”, “key”, “secret”, which may help you guess the meaning of the code.

      I'm considering the observation that the words embedded in this example of Python coding are English. I wonder if Python coding only utilizes English words, or if it understands (and translates into binary code) words in other human languages such as Spanish or Mandarin. Are there programming languages built specifically to accommodate other languages? I can't imagine all programmers speak English.

    1. Code is power, and it is white and male.

      White men have been the most powerful people on earth. Take for example the right to vote; one of the most powerful and fundamental things an individual can possess white men have always been allowed to vote while white women were restricted from doing so until the 1920's. More so, black people did not earn that right until the mid 1960's. Even though these groups were given the right to vote, understanding that white men made that happen is important.

    1. First, to clarify - what is "code", what is "data"? In this article, when I say "code", I mean something a human has written, that will be read by a machine (another program or hardware). When I say "data", I mean something a machine has written, that may be read by a machine, a human, or both. Therefore, a configuration file where you set logging.level = DEBUG is code, while virtual machine instructions emitted by a compiler are data. Of course, code is data, but I think this over-simplified view (humans write code, machines write data) will serve us best for now...
    1. EisCode

      Bij alle codes is het zo dat iedere ketenpartner een eigen code kan hangen aan de item (eis, document, stakeholder, etc.). Aannemer, opdrachtgever, IB, eisenbibliotheek kunnen allemaal codes willen toevoegen aan een eis. Dus klopt de kardinaliteit?

    1. 5: Fill in the blank with code to access the cars array.

      solve this again, did solve this but not clear silly mistake the combination starts from 0

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      In this section we list all the comments done by the three referees and our corresponding action.

      Regarding Reviewer #1:

      1. On "mechanical control": The authors show changes in circadian power fraction with changes in YAP and with cytoskeletal inhibitors, but there are no properly-controlled experiments that directly perturb mechanics. The authors show a correlation between YAP nuclear/cytoplasmic ratio and circadian power, but YAP N/C alone is not a readout of mechanotrasndcution, per se. The authors have shown two different experiments where cells are cultured on a stiff (30kPa) substrate and soft substrate (300Pa), but they do not shown a direct comparison of YAP nuclear localization and circadian power under these two conditions in the same experiment. Direct, controlled perturbation of mechanical cues is necessary to support the title's use of the phrase "mechanical control."

      We agree with the referee that further mechanical perturbations could strengthen our conclusions. In our original manuscript we directly controlled the mechanical environment by culturing cells on substrates of 300Pa and 30kPa in stiffness. These differences in stiffness were not sufficient to drive changes in circadian power fraction and YAP localisation, as depicted in Fig. 3C (we note that the direct comparison requested by the referee is shown in that figure). We hypothesise that this negative result is due to a very low “rigidity threshold” or to secretion of extracellular matrix that stiffens the initially soft substrate. In any case, we plan to strengthen the “mechanical control” message of our paper with one or more of the below experiments:

      A) We will measure circadian power fraction and YAP localisation in even extremer stiffness/adhesion conditions, using 300 Pa and 30 kPa polyacrylamide gels with a different fibronectin coating protocol, as described in Elósegui-Artola et al., 2017. This allows a much finer control of the concentration of fibronectin coated, so we can reach low enough levels to compromise the cell adhesion to the substrate and cross down the threshold that would lead to cytosolic localisation of YAP. We will perform this experiment in presence of the FUD peptide, which inhibits matrix deposition (Tomasini-Johansson et al., 2001; this peptide has already been tested in our lab).

      B) We will use the approach described in Fig. 2E to compare the circadian power fraction in cells spread in stadium-shaped islands of 2400 um2 and 1200 um2. Oakes et al., 2014 already showed that traction forces exerted by 3T3 fibroblasts depend on the size of the spread area of the cells, so we expect differences in mechanotransduction that should affect YAP localisation and, if our hypothesis is correct, the RevVNP circadian oscillations.

      C) We will abolish the physical connection between the actin cytoskeleton and the nucleus by disrupting the LINC complex via the overexpression of a dominant negative (DN) nesprin-1 KASH domain (Lombardi et al., 2011). The plasmid designed for the inducible overexpression of the DN KASH domain, originally tested in NIH3T3 cells (Mayer et al., 2019), is available in our lab and has been used to prove that uncoupling cytoskeleton and nucleus leads to nuclear YAP decrease in single cells (Kechagia at al., 2022). We will aim to increase the circadian power fraction in low density cells upon the overexpression of the DN KASH domain.

      Elosegui-Artola A, Andreu I, Beedle AEM, Lezamiz A, Uroz M, Kosmalska AJ, Oria R, Kechagia JZ, Rico-Lastres P, Le Roux AL, et al (2017) Force Triggers YAP Nuclear Entry by Regulating Transport across Nuclear Pores. Cell 171: 1397-1410.e14

      Kechagia Z, Sáez P, Gómez-González M, Zamarbide M, Andreu I, Koorman T, Beedle AEM, Derksen PWB, Trepat X, Arroyo M, et al (2022) The laminin-keratin link shields the nucleus from mechanical deformation and signalling Cell Biology

      Lombardi ML, Jaalouk DE, Shanahan CM, Burke B, Roux KJ & Lammerding J (2011) The Interaction between Nesprins and Sun Proteins at the Nuclear Envelope Is Critical for Force Transmission between the Nucleus and Cytoskeleton*. Journal of Biological Chemistry 286: 26743–26753

      Mayer CR, Arsenovic PT, Bathula K, Denis KB & Conway DE (2019) Characterization of 3D Printed Stretching Devices for Imaging Force Transmission in Live-Cells. Cel Mol Bioeng 12: 289–300

      Oakes PW, Banerjee S, Marchetti MC & Gardel ML (2014) Geometry regulates traction stresses in adherent cells. Biophysical Journal 107: 825–833

      Tomasini-Johansson BR, Kaufman NR, Ensenberger MG, Ozeri V, Hanski E & Mosher DF (2001) A 49-Residue Peptide from Adhesin F1 of Streptococcus pyogenes Inhibits Fibronectin Matrix Assembly*. Journal of Biological Chemistry 276: 23430–23439

      2. On "via YAP/TAZ": In addition to above, it is necessary to show that the changes in Circadian power fraction induced by mechanical cues in fact require YAP/TAZ signaling. Thus, an experiment comparing soft (300Pa) substrate with Stiff (30kPa) substrate in the presence or absence of YAP/TAZ is necessary to state that YAP and TAZ are the mechanistic mediators of mechanical cues on the clock.

      We are currently generating via CRISPR-KO and shRNA silencing a YAP1/TAZ double mutant. We plan to use this cell line in those conditions where YAP is prominently nuclear (low density in stiff substrates) with the purpose of rescuing the RevVNP circadian power fraction.

      1. While the TEAD-binding domain mutant experiment is elegant, to claim that TEAD is the transcriptional mediator, it must be demonstrated that this mutant indeed fails to induce TEAD-mediated transcription. This could be simply executed by demonstrating that the CCD mutant expresses reduced CTGF and Cyr61 (for example), compared to the 5SA, under these conditions. Further, endogenous YAP is still active and available to bind to TEAD in this system, which should be discussed.

      We plan to carry out quantitative real-time PCR of CTGF and Cyr61 in all the YAP mutants and the control. Regarding the presence of endogenous YAP, we will clarify in the text that a) the overexpression of the different YAP mutants was done in high-density conditions, where endogenous YAP is significantly less localised in the nucleus, and that b) the levels of the exogenous YAP are much higher (we already have western blots showing this).

      1. In Figure 3a: The cell perimeter needs to be shown either by actin staining or by brightfield images. The manually marking of cell boundaries is insufficient, specifically because the drugs used in this experiment affect the cytoskeleton. It would be very helpful to see this via actin staining or in the least with brightfield images.

      The cell perimeter was drawn based on the cytosolic YAP immunostaining, whose levels are high enough to infer the cell shape (higher resolution images can be attached if necessary). As stated in the manuscript, the YAP nuclear-to-cytosolic ratio is calculated using two adjacent areas of identical size, one inside the nucleus and the other one just outside (see Materials and Methods/Immunostainings), so the exact cell shape is irrelevant for this particular quantification.

      Regarding Reviewer #2:

      Effects on the circadian clock

      1. The authors use the fluorescent reporter created by Nagoshi from sections of the Rev-erbα gene. This reporter is widely used to estimate relative circadian timing in individual cells but it does not provide direct information on the circadian clock activity. In other words, while Reverb rhythmic expression is driven by the clock, it is not known whether less-rhythmic or non-rhythmic expression or change in expression level of Rev-erbα is affecting the core clock. For example, it has been shown that Rev-erbα knock-down cells are rhythmic as long as Rev-erb-beta is present. Thus, one major shortcoming of the current version of the manuscript is the missing dissection between Rev-erbα rhythmicity/expression and the circadian clock. More concretely, it remains unclear whether the change in Rev-erbα expression is a direct effect or caused by a defect clock. Since the authors presume a direct effect of YAP/TAP on Rev-erb expression, the former is likely. If that is the case, the data could be interpreted as that (missing) mechanic stimuli can lead to nuclear YAP/TAZ, which rises the level of Rev-erbα (and maybe interfere with its rhythmic accumulation). Beyond Rev-erbα expression, there may or may not be an effect on the circadian clock (core clock, CCGs). With the current version we do not know since the authors do not look beyond Rev-erbα expression. Thus, the claims on circadian clock or circadian rhythms in their cells is not studied in this version of the manuscript. The current version is still very interesting and provides insights into the Rev-erbα modulation, but additional work would be needed to show links with the core clock machinery. For this the authors could show influence (or at least correlation) of the YAP/TAZ/REVERBA phenotype on the oscillations of core clock genes or clock-controlled genes. Either through the use of alternative (ideally constitutive) reporters (e.g. PER2, BMAL1, fluorescent or LUC), or/and by analyzing RNA/Protein of core clock genes or output genes. This would not be necessary for all experiments, but at least for some were its possible (e.g. experiments with drugs perturbations). Otherwise, any claim like "YAP/TAZ perturbs the circadian clock ..." or "the circadian clock deregulation in nuclear YAP-enriched cells" is potentially flawed and has to be removed/reformulated.

      We agree with the reviewer. In order to understand if the core clock is affected, beyond REV-ERBA, by YAP/TAZ expression and localisation, we plan to perform the two experimental approaches explained below. For both of them we will use high-density cells with and without YAP-5SA overexpression since the other conditions (drugs, micropatterned cells, low density) may not render enough cells for analytical approaches that are not based on fluorescent microscopy (real-time qPCR or luminescence recordings). Also, the potential results obtained with YAP-5SA overexpression will be more informative regarding causality YAP-circadian clock than those using the other conditions described in the manuscript.

      1. We will use NIH3T3 bmal1::luc cells (already generated in our lab with the pABpuro-BluF plasmid; https://www.addgene.org/46824/) and an adapted microscopy-based system to track bioluminescence. We will need to give our cells a synchronisation shock since the single-cell signal with this reporter is too low and noisy to perform single-cell tracking.
      2. We will check during 48 hours, every 4 hours, the mRNA levels of Bmal1, Clock, Cry1, Per2, Yap1 and Rev-erbα via quantitative real-time PCR. As in A), we will need to synchronise our cells prior RNA collection. In case the expression of the other components of the clock are not affected by YAP-5SA overexpression, we will modify the message of our manuscript to emphasize the role of REV-ERBA. As the referee mentions (and we thank them for that comment), finding that the modulation of Rev-erbα is mechano-sensitive and dependent on YAP/TAZ signalling would be still very relevant, given the role of this factor in metabolism, inflammation, mitochondrial activity, or Alzheimer’s disease, as discussed in lines 231-235 in the manuscript.
      1. The authors aim to discard the possibility of paracrine signals by showing no increase in circadian power fraction of cells growing in low density with conditioned medium (Figure 2D). A paracrine signal coming from an oscillatory system is likely to oscillate and in that case, I do not see how growing cells in constant conditional medium can discard the effects of an oscillatory paracrine signal. I believe the elegant experiment shown in Figure 2E more precisely address this issue.

      The reviewer is right in the sense that paracrine coupling of circadian oscillators would require a circadian paracrine signal, like shown in Finger et al., 2021, and that we provide sufficient experimental evidence of a mechanics- rather than paracrine-driven control of the RevVNP circadian oscillations. Specifically, by using micropatterning (Fig. 2E) and gap closure (Fig. 2A) we show that cells under the same paracrine medium are able to display acute differences in RevVNP expression. The experiment with conditioned medium, which is a traditional technique used in some papers in the field like in Noguchi et al., 2013, was performed to rule out the possibility that secreted factors, even if not circadian, could ultimately impact the low-density cells’ circadian clock. We will rephrase the manuscript to stress out this reasoning.

      Finger AM, Jäschke S, del Olmo M, Hurwitz R, Granada AE, Herzel H & Kramer A (2021) Intercellular coupling between peripheral circadian oscillators by TGF-β signaling. Science Advances 7

      Noguchi T, Wang LL & Welsh DK (2013) Fibroblast PER2 circadian rhythmicity depends on cell density. Journal of Biological Rhythms 28: 183–192

      Data analysis methodology:

      1. Single-cell circadian recordings like the ones analyzed here are characterized by noisy amplitude and non-sinusoidal waveforms with fluctuating period (Bieler et al., 2014; Feillet et al., 2014). The authors interpolate, smooth, detrend and normalize their data; operations that are known to introduce spectral artifacts that can mislead the interpretation of the power spectrum. Moreover, the time-series pre-processing operations described by the authors in the methods sections is incomplete and the authors should more explicitly describe all their operations with exact methods applied, filter parameters and time-windows sizes (if applicable). To validate their pre-processing steps the authors could provide their time-series analysis pipeline code and/or provide a few examples of raw versus pre-processed data together with their respective spectrums before and after pre-processing. In addition, the authors could provide their raw trace signal data together with the corresponding post-processed signal data as plain text files.

      In our response to the reviewers, we will address this point exactly as requested by the reviewer. We will rewrite our methods section to explain better our analysis pipeline, clarifying that we do not apply detrending, that we resort rarely to interpolation of missing points, and stating the specifics of the standard low-pass filter we apply. We will then strengthen Supplementary Figure 1 with more examples of raw-data and processed data, and will provide raw trace signal data and the corresponding processed data to illustrate our approach.

      1. The authors rely on Fourier analysis and a reasonable self-made definition of circadian strength named as "circadian power fraction". Using a stationary-based method for noisy non-stationary data can lead to inaccurate spectrum power estimations. As the current version of the manuscript does not provide any alternative/complementary analysis method nor we have any available raw signal data it is unclear if their analysis appropriately represents the circadian power. The authors could consider implementing complementary data-analysis strategies to validate their conclusions. Fortunately, there are multiple suitable data analysis strategies already available that are exactly designed for this kind of data (eg. (Price et al., 2008; Leise et al., 2012; Leise, 2013; Bieler et al., 2014; Mönke et al., 2020). This time-series analysis methods is a crucial step as all main results on this manuscript rely on the authors self-made definition of circadian power. This is particularly important as there is no standardized method in the circadian field to estimate circadian rhythmicity and/or circadian power of single-cell traces.

      We will take this point into consideration by running a complementary analysis of our data with one of the methods recommended by the reviewer. Our choice is pyBOAT, as presented in Mönke et al. (2020), because on first inspection its implementation of the wavelet method appears to be the most suitable for our dataset type. If we find that our time-series are too short for these methods we will use the RAIN algorithm (Thaben and Westermark, 2014) instead.

      Mönke G, Sorgenfrei FA, Schmal C & Granada AE (2020) Optimal time frequency analysis for biological data - pyBOAT Systems Biology

      Thaben PF & Westermark PO (2014) Detecting Rhythms in Time Series with RAIN. J Biol Rhythms 29: 391–400

      1. The authors mainly show circadian power fraction and analyze rhythmicity scores/powers. Is there the a chance that a rise in the basal expression level of Rev-erbα is reducing the rhythmicity score? Or to phrase it otherwise, the absolute amplitude may remain the same, but the relative amplitude may be reduced? Would that affect the FT analysis power scored? To clarify this the authors could provide an analysis of the relative amplitude in addition to the circadian intensity (as in Fig.1C).

      Our analysis pipeline subtracts the mean signal from each cell’s intensity-time trace, and then divides each trace by its standard deviation. This procedure eliminates any bias due to basal expression of Rev-erbα. We will address this point by clarifying the methods section and providing examples in Supplementary Figure 1 of raw data with high-basal levels and low basal levels, showing their pre- and post-processed spectra.

      Minor points by text-line:

      YAP and TAZ should be introduced to the reader during introduction. by set a of proteins. Here the authors probably meant that cells were not reset nor entrained during the experiment. "..expression depends on..". This is a correlation, not proof of causation is shown until this point. This is an overstatement. Using the term "provoked" suggests a causal relationship not shown. Similarly last sentence "This result established.... is caused..". Again, this is an overstatement as only correlation is shown. According to their description the authors are not using any image-preprocessing steps, eg background subtraction or other filters. Is this correct? It is not clear what image metric for the single-cell signals are the authors using, eg. integrated nuclear intensity or mean/median nuclear intensity. I am not familiar with TrackMate but it might be possible to export and share with the readers the image-analysis pipeline used which would clarify any questions about image processing and signal extraction.

      We thank the reviewer for pointing out all these minor points. We will address each one of them to make the paper clearer.

      Regarding Reviewer #3:

      The authors state in lines 163-165: 'This striking anticorrelation reveals that the robustness of the Rev-erbα circadian expression depends on the nucleocytoplasmic transport of YAP and its mechanosensitive regulation'. Although interesting, the data in figure 3 to which this statement refers is, as the authors identify, correlative, rather than causative. I would strongly suggest altering this statement to better reflect the data.

      We will modify the text to eliminate this overstatement.

      It looks to me as though all experiments were carried out in the same clonal reporter 3T3 line. To avoid possible issues with founder effects, I would ask that the authors repeat the initial experiment in figure 1B, and the associated analysis as in 1C-E with a different clonal 3T3 line. Hopefully this will not be very arduous, as the methods suggest that multiple clonal 3T3 reporter lines were made initially. With time to defrost, plate, record and analyse the data, I would hope that this would not take more than six weeks maximum.

      We will perform the experiments regarding the cell density effect on the RevVNP oscillations (Fig. 1) in another clonal 3T3 line as the reviewer suggests. We have already initiated the experimental repeats with the alternative clone.

      I would note that the custom software used for analysis does not appear to be generally available. I would assume that the authors would make this available upon request.

      We will extend the explanation of our method as suggested by Reviewer #2 and make the code available to the community.

      Experiments appear to have been adequately replicated in terms of n. However, the robustness of these findings would be supported though use of a different clonal reporter line, as discussed above.

      We will solve this problem as stated above.

      Statistical analysis is generally appropriate. I would suggest including statistical analysis in figures 3B and S4B to demonstrate that the pharmacological treatments are indeed having a statistically significant effect on the MAL and YAP nuclear/cytoplasmic ratio.

      We will perform the corresponding statistical analysis on those data.

      For Figure 4, it is not stated which statistical tests have been used, with only P values given in table S1. Please state which test has been used.

      We will specify the statistical test used in the figure legend.

      Furthermore, it would be valuable to see if it is possible to perform statistical analysis looking at the populations should in Figure 4A, to either support or refute the statement made in Line 189-90 that 'we overexpressed 5SA-S94A-YAP, a mutant version of YAP unable to interact with TEAD and observed that the cells recovered, to a large extent, both the RevVNP circadian power fraction and the REV-ERBα basal levels displayed by the wild-type high-density population'

      The p-values corresponding to that dataset are represented in Table S1, but we will move them to the figure legend so the extent of the differences between the YAP mutants and the control becomes more noticeable. This applies too to the next comment of the reviewer.

      Additionally, it is a little unclear to me why exact p values are reported in table S1. It seems that they might be better placed in the relevant figure legend.

      Minor comments:

      Although the authors took good care to try to ensure that there was minimal phase synchrony between cells, it would be good to see some analysis to confirm that these efforts were successful. This is of particular concern, given that many things that commonly happen during cell handling, such as temperature change and media change, even with conditioned media, can act to synchronise cells. Hopefully, this information should be available from your existing analysis.

      All our experiments, except for the gap closure ones (which imply an unavoidable medium shock after the removal of the gasket where the cells are cultured to achieve high density) are carried out in a similar way (see Materials and Methods). This approach does not involve the typical shock of serum, dexamethasone, or other hormones, because we want to avoid biochemical signalling that could mask the “pure” effect of mechanics on the pathways that affect the circadian clock. In any case, a certain level of synchrony should not affect the analysis we perform, since this is single cell-based and does not consider the phase but the strength of its circadian frequency. But as requested by the reviewer we will analyze the phase signal and report the results if relevant to the project.

      It would be informative to see both phase and period analysis for the data shown in figure 2C. Do cells at the edge show differences in relative synchrony following the removal of the PDMS barrier and Rev-erba induction? Is there a period difference between cells at the edge and those that remain confluent?

      We agree with the referee that the “shock” received by the cells at the edge should work as a reset of their circadian phase and we have tried to analyse this effect. However, there are technical limitations that make this analysis difficult, mainly the short duration of the experiment and the fact that these cells transition very fast, upon gap closure, from a non-circadian to a circadian behaviour. We will attempt to better report this interesting effect by using the WAVECLOCK (Price et al., 2008) or the pyBOAT method (Mönke et al., 2020), suggested by Reviewer #2, which are designed to analyse non-stationary data.

      Mönke G, Sorgenfrei FA, Schmal C & Granada AE (2020) Optimal time frequency analysis for biological data - pyBOAT Systems Biology

      Price TS, Baggs JE, Curtis AM, Fitzgerald GA & Hogenesch JB (2008) WAVECLOCK: wavelet analysis of circadian oscillation. Bioinformatics 24: 2794–2795

      Figure 2B - the text states that those cells far from the edge oscillate robustly thoughout the experiment, but this is not easy to see from this kymograph due to the dynamic range. Is there another way of presenting this that might make it easier to confirm?

      We will calculate the circadian power fraction of the “bulk” cells as we do for the other conditions described in the manuscript. We can also show examples of individual traces if the average shown in Fig. 2C or the kymograph in Fig. 2B are not clear enough.

      Figure 1D-E - the text provides periodicity for the high-density cells, but not the low density ones. Could you provide periodicity for both populations - do they differ?

      We will represent in more detail the results of the frequency analysis on the low-density cells so the diversity of periods (frequencies) at this condition gets more evident.

      Figure S3 - it is interesting to note the difference in population rhythmicity between the bulk and edge data here, which is not seen so clearly in cells without thymidine. Could the authors comment on this?

      We agree with the referee that there is an obvious difference regarding RevVNP expression (mainly on the edge cells but also in the bulk) between the experiments with and without thymidine. We hypothesise this is due to the pronounced decrease in cell divisions in the presence of thymidine, which considerably slows down the gap closure and impacts the density of the entire cell population. We will comment this effect in the manuscript.

      Line 148 - it is unclear here what is meant by 'the onset of circadian oscillations'. Could you rephrase this for clarity?

      We will change that sentence.

      Line 173 - a few words to highlight that Lats is a kinase and the function of YAP phosphorylation by Lats would aid clarity here. Similarly, explanation of the functional difference between the protein with 4 Serine to alanine mutations and 5 mutations and why both of these mutants were used would be helpful.

      We will clarify this point following the reviewer’s suggestion.

      Line 174 - for accuracy, this should perhaps read 'fibroblast circadian clock', as this work is only in 3T3 cells, and therefore may not apply more generally.

      We will implement this change.

      Line 202 - could you expand to explain the existing limitations of studying cell signalling cascades in synchronised cells? This is not clear to me. Thanks.

      We will discuss the signalling effects caused by 50% serum shocks and other traditional ways to synchronise the cells as requested by the reviewer.

      Figures 1D and 4B - the choice of colour range used in these kymographs is skewed towards the warmer colours, making it quite hard to discern differences between the groups. I would suggest using the cooler colour range for a greater proportion of the data set, to make rhythmicity, or lack of it, clearer to see.

      We will invest further efforts to finding the optimal colour map and range for our datasets.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      Abenza et al. investigate an important question of how the physical environment affects the properties of the individual circadian clocks. The authors utilize a set of clever experiments, pharmacological manipulations and data analysis techniques to unveil a potential role of YAP/TAZ in the circadian clock.

      Major comments:

      Effects on the circadian clock

      1. The authors use the fluorescent reporter created by Nagoshi from sections of the Rev-erbα gene. This reporter is widely used to estimate relative circadian timing in individual cells but it does not provide direct information on the circadian clock activity. In other words, while Reverb rhythmic expression is driven by the clock, it is not known whether less-rhythmic or non-rhythmic expression or change in expression level of Rev-erbα is affecting the core clock. For example, it has been shown that Rev-erbα knock-down cells are rhythmic as long as Rev-erb-beta is present. Thus, one major shortcoming of the current version of the manuscript is the missing dissection between Rev-erbα rhythmicity/expression and the circadian clock. More concretely, it remains unclear whether the change in Rev-erbα expression is a direct effect or caused by a defect clock. Since the authors presume a direct effect of YAP/TAP on Rev-erb expression, the former is likely. If that is the case, the data could be interpreted as that (missing) mechanic stimuli can lead to nuclear YAP/TAZ, which rises the level of Rev-erbα (and maybe interfere with its rhythmic accumulation). Beyond Rev-erbα expression, there may or may not be an effect on the circadian clock (core clock, CCGs). With the current version we do not know since the authors do not look beyond Rev-erbα expression. Thus, the claims on circadian clock or circadian rhythms in their cells is not studied in this version of the manuscript. The current version is still very interesting and provides insights into the Rev-erbα modulation, but additional work would be needed to show links with the core clock machinery. For this the authors could show influence (or at least correlation) of the YAP/TAZ/REVERBA phenotype on the oscillations of core clock genes or clock-controlled genes. Either through the use of alternative (ideally constitutive) reporters (e.g. PER2, BMAL1, fluorescent or LUC), or/and by analyzing RNA/Protein of core clock genes or output genes. This would not be necessary for all experiments, but at least for some were its possible (e.g. experiments with drugs perturbations). Otherwise, any claim like "YAP/TAZ perturbs the circadian clock ..." or "the circadian clock deregulation in nuclear YAP-enriched cells" is potentially flawed and has to be removed/reformulated.

      2. The authors aim to discard the possibility of paracrine signals by showing no increase in circadian power fraction of cells growing in low density with conditioned medium (Figure 2D). A paracrine signal coming from an oscillatory system is likely to oscillate and in that case, I do not see how growing cells in constant conditional medium can discard the effects of an oscillatory paracrine signal. I believe the elegant experiment shown in Figure 2E more precisely address this issue.

      Data analysis methodology:

      1. Single-cell circadian recordings like the ones analyzed here are characterized by noisy amplitude and non-sinusoidal waveforms with fluctuating period (Bieler et al., 2014; Feillet et al., 2014). The authors interpolate, smooth, detrend and normalize their data; operations that are known to introduce spectral artifacts that can mislead the interpretation of the power spectrum. Moreover, the time-series pre-processing operations described by the authors in the methods sections is incomplete and the authors should more explicitly describe all their operations with exact methods applied, filter parameters and time-windows sizes (if applicable). To validate their pre-processing steps the authors could provide their time-series analysis pipeline code and/or provide a few examples of raw versus pre-processed data together with their respective spectrums before and after pre-processing. In addition, the authors could provide their raw trace signal data together with the corresponding post-processed signal data as plain text files.

      2. The authors rely on Fourier analysis and a reasonable self-made definition of circadian strength named as "circadian power fraction". Using a stationary-based method for noisy non-stationary data can lead to inaccurate spectrum power estimations. As the current version of the manuscript does not provide any alternative/complementary analysis method nor we have any available raw signal data it is unclear if their analysis appropriately represents the circadian power. The authors could consider implementing complementary data-analysis strategies to validate their conclusions. Fortunately, there are multiple suitable data analysis strategies already available that are exactly designed for this kind of data (eg. (Price et al., 2008; Leise et al., 2012; Leise, 2013; Bieler et al., 2014; Mönke et al., 2020). This time-series analysis methods is a crucial step as all main results on this manuscript rely on the authors self-made definition of circadian power. This is particularly important as there is no standardized method in the circadian field to estimate circadian rhythmicity and/or circadian power of single-cell traces.

      3. The authors mainly show circadian power fraction and analyze rhythmicity scores/powers. Is there the a chance that a rise in the basal expression level of Rev-erbα is reducing the rhythmicity score? Or to phrase it otherwise, the absolute amplitude may remain the same, but the relative amplitude may be reduced? Would that affect the FT analysis power scored? To clarify this the authors could provide an analysis of the relative amplitude in addition to the circadian intensity (as in Fig.1C).

      Minor points by text-line:

      1. YAP and TAZ should be introduced to the reader during introduction.
      2. by set a of proteins.
      3. Here the authors probably meant that cells were not reset nor entrained during the experiment.
      4. "..expression depends on..". This is a correlation, not proof of causation is shown until this point. This is an overstatement.
      5. Using the term "provoked" suggests a causal relationship not shown.
      6. Similarly last sentence "This result established.... is caused..". Again, this is an overstatement as only correlation is shown.
      7. According to their description the authors are not using any image-preprocessing steps, eg background subtraction or other filters. Is this correct?
      8. It is not clear what image metric for the single-cell signals are the authors using, eg. integrated nuclear intensity or mean/median nuclear intensity. I am not familiar with TrackMate but it might be possible to export and share with the readers the image-analysis pipeline used which would clarify any questions about image processing and signal extraction.

      References:

      1. Bieler, J, Cannavo, R, Gustafson, K, Gobet, C, Gatfield, D, and Naef, F (2014). Robust synchronization of coupled circadian and cell cycle oscillators in single mammalian cells. Mol Syst Biol 10, 739.

      2. Leise, TL (2013). Wavelet analysis of circadian and ultradian behavioral rhythms. J Circadian Rhythms 11, 5.

      3. Leise, TL, Wang, CW, Gitis, PJ, and Welsh, DK (2012). Persistent Cell-Autonomous Circadian Oscillations in Fibroblasts Revealed by Six-Week Single-Cell Imaging of PER2::LUC Bioluminescence. PLoS One 7, 1-10.

      4. Mönke, G, Sorgenfrei, F, Schmal, C, and Granada, A (2020). Optimal time frequency analysis for biological data - pyBOAT. BioRxiv 179, 985-986.

      5. Price, TS, Baggs, JE, Curtis, AM, FitzGerald, GA, and Hogenesch, JB (2008). WAVECLOCK: wavelet analysis of circadian oscillation. Bioinformatics 24, 2794-2795.

      Significance

      I believe this manuscript is of high significant both for the circadian as well as the mechanobiology fields. Readers from single-cell signalling studies will also be very interested in this work.

      To my knowledge the discussed link has not been studied before at single cell level, which as the authors show can provide multiple new insights.

      I do work with similar single-cell signals, have broad expertise in microscopy, image analysis methods, time series analysis, and the circadian clock mechanisms but very little experience in mechanobiology.

    1. <h1>Assignment B</h1>

      Later on, you may want to add in some semantic html such as header, main, footer. This can become useful for understanding what part of your code you are looking at and also can be used as selectors when we get more into CSS

    1. #000000

      I think the hex number for this colour is the default text colour in browsers. If that is so, it is unnecessary to add this line of code, which would make it more compact.

    1. <p>M

      It looks good to me, what I did notice and some of these areas have spaces

      my and others don'

      between the writing. Not sure if it matters code wise but makes it easier to view with space.

    1. <!DOCTYPE html> <html lang="en"> <head> <meta charset="UTF-8"> <meta http-equiv="X-UA-Compatible" content="IE=edge"> <meta name="viewport" content="width=device-width, initial-scale=1.0"> <!-- F22 DGL 103 DLU1 - Caitlin Garland - Assignment A --> <link rel="stylesheet" href="style.css"> <title>assignment A</title> </head> <body> <h1>Hello World</h1> <p>This is Caitlin speaking.</p> <img src="images/snowboarding.jpeg" width="600" alt="snowboarding"> </body> </html>

      Hello, lovely code. Everything looks good here, I don't think it needs anything (not that I'm currently qualified to say lol)

      I like your photo, it's pretty :)

    1. element to successfully create a list--> <!-- I used the hyperlink

      Hello, very nice code you've got here :)

      At my skill level it seem awfully presumptuous to be making suggestions to anyone else but here goes...

      Line 25: I might split this onto separate (p) lines to make it easier to work with and read without having to scroll over every time

      I'm curious why you chose (em) to add italics on line 22, and (i) on line 25. Is there a difference between the effect each has?

      Why did you add this? - (link rel="stylesheet" href="style.css") *Edit: I see it's because you've added some css :)

      Please note I am using rounded brackets for the purpose of communicating because if I use the angle brackets the editor actually does the edits

    1. head>

      Good morning! It looks like you are missing a large chunk of text before your body. It should read which document type this is (html), language, etc. The skeleton code automatically populates when you start typing html into VS code. Hopefully it will pop up for you!

    1. fax</a>, but I live

      Hello, nice code!

      It seems like a good choice aesthetically to move the text from your first paragraph onto separate lines. And paragraphs 3 and 4 as well.

      I ended up doing the same with mine, started with everything on these big long lines initially then ended up separating them all later.

      I like your use of the unordered list to bullet your pastimes. 

      I'm curious why you made "exploring the mountain" small? Just for aesthetic emphasis?

      Is there a difference between (b) and (strong) that made decide to go with (b)? - I had to use rounded brackets to write this because when I used the angle brackets the editor took them out and made the sentence bold!

    1. Author Response

      Reviewer #1 (Public Review):

      Several questions have remained regarding the characteristics of these cells:

      1) Based on the transcriptome data in Figure 2, the authors inferred that thymic macrophages are "specialized in lysosome degradation of phagocytosed material and antigen presentation" yet did not show functional data to support these claims. Functional assays such as phagocytosis and antigen presentation are desirable, especially in comparison to other well characterized macrophage populations.

      We agree with the reviewer that additional functional characterization of thymic macrophages will strengthen the conclusions of our manuscript. We have performed antigen presentation assay and in vitro phagocytosis assay to functionally characterize the thymic macrophages. Indeed, thymic macrophages seem to be quite good antigen presenting cells – not as good as thymic DCs, but much better than peritoneal macrophages. This is documented in Fig. 3A and B. They were also good phagocytes both in vitro and in vivo as demonstrated in Fig. 3C-G. Surprisingly, peritoneal macrophages were better in the in vitro phagocytosis assay. We attribute this result to thymic macrophages’ poor survival during the sorting and in vitro culture.

      2) Do transcriptomes of CX3CR1+ thymic macrophages in old mice significantly differ from those of young mice?

      This is a very interesting question that we plan to explore in the future, but we feel it is beyond the scope of the current manuscript.

      3) It would be helpful to better graphically show the compositions (both cell number and cell ratio) of thymic macrophage subsets (TIM4+, CX3CR1+, and others) in mice at different ages (1 week, 6 weeks, and 4 months old). It is not straightforward to deduce all the information based on the current data presentation.

      We thank the reviewer for the suggestion! Plotting the cell numbers did reveal a peak in young age and then significant decline in the number of Tim4+ cells and a trend for accumulation of Tim4+ cells with age. Unfortunately, older mice show great variability in thymus size, which prevented the Tim4- result from being statistically significant. We have added these data to Fig. 8F.

      4) The description of the gating strategy of thymic macrophages for Figure 1 is quite verbose. Adding a step-wise gating strategy of thymic macrophages as a figure panel would be helpful for readers to follow the experimental details.

      We thank the reviewer for the suggestion. The description of the gating strategy has been stripped to 2 panels that capture its essence (Fig. 1B).

      Reviewer #2 (Public Review):

      This work provides by far the most thorough characterization of thymic macrophages. The authors used bulk RNA-seq, single-cell seq and fate mapping animal models to demonstrate the phenotype, origin and diversity of thymic macrophages. Overall the manuscript is well written and the conclusions of the paper are mostly well supported by data.

      Some aspects of data acquisition and data analysis need to be clarified.

      1) the authors should state what does row min row max in figure2 b,d refer to. is this expression value on log scale? In figure 2d, the authors compared their own RNAseq data with ImmGen seq data, what kind of normalization did the authors apply?

      We appologize for not making this clear. The values in Fig. 2b and d (current Fig. 2A and C) are expression values on log scale. We have included this information in the figure.

      Our data is part of the IMMGEN dataset. We sorted the cells and sent them to the US for RNA sequencing. That is why we referred to it as “our” data. However, to avoid confusion we changed the wording to clearly reflect that the data are from IMMGEN.

      2)The authors used immunofluorescent to identify the localization of two populations of macrophages, where they used merTK staining to indicate all macrophages. However, MerTK expression may not restrict to immune cells. The authors are encouraged to confirm that MerTK only labels macrophages in thymus by co-staining with F4/80 or CD45. Tim4 can also be used in immunofluorescence.

      We agree that staining with additional macrophage markers will strengthen our conclusions about ThyMacs localization. We have performed staining with CD64 together with MerTK or Tim4. CD64 and MerTK almost completely overlapped and so did CD64 and Tim4 in the cortex. We could not stain MerTK and Tim4 together because the antibodies are raised in the same species (rat). Additional evidence for the specificity of these markers for thymic macrophages comes from Fig. 3E and F showing the high degree of co-localization of apoptotic cells (TUNEL+) with MerTK or Tim4. Finally, Fig. 4 figure supplement 1 also clearly shows the distribution of TIM4 and CD64 in the whole thymus.

      3) The data of Cx3cr1+ cells accumulation with age in thymus is very interesting, and as the author has discussed, might indicate their contribution to thymus involution. However, the authors only showed change of percentage. As the total macrophages numbers decreased with age, it is not clear whether these cells actually "accumulate" with age. It will help us to assess if this increased percentage of Cx3Cr1+ cells is an actual increase of "influx" or due to the decrease of the self-maintain Tim4+ macrophage subsets.

      The reviewer is raising a very important point. As the changes in the Tim4+ and Tim4- thymic macrophages proportions with age occur at the background of thymic involution, it is difficult to judge whether Tim4+ cells self-maintain and whether Tim4- cells accumulate. Plotting the cell numbers revealed a peak in young age and then significant decline in the number of Tim4+ cells and a trend for accumulation of Tim4+ cells with age. Unfortunately, older mice show great variability in thymus size, which prevented the Tim4- result from being statistically significant. We have added these data to Fig. 8F.

      Reviewer #3 (Public Review):

      This study by Zhou et al. focuses on thymic macrophages and shows that two populations can be distinguished with different identities, localization and origin. Authors use several murine reporter and fate-mapping models, coupled with flow cytometry and transcriptomics approach to support their claims.

      Overall, the question tackled by this study is interesting, thymic macrophages having a bit being forgotten in the last decade which has seen many studies similar to the one presented here in other organs. So, the stated aim to closing this gap is relevant. But the actual version of the study suffers from many defects, more or less severe, which affect the clarity and the persuasiveness of it.

      • About the plan, authors study the origin of the thymic population and provide data in fig 2, 3 & 4 assuming that thymic macs form a homogeneous population. But from fig 5, they distinguish 2 populations and study them separately. So the end of the paper renders obsolete the beginning, that asks for a revision of the whole plan.

      We agree with the reviewer that there is more than one way to tell this story and we have been agonizing over our plan. However, we respectfully disagree that the beginning of the paper is made obsolete by the ending for several reasons:

      1) The initial figures in our manuscript contain very fundamental characterizaition of ThyMacs. Just as the revelation of a heterogeneity in liver macrophages or lung macrophages (ref) does not render all prior research on these cells obsolete, the initial figures in our manuscript are an essential part of the story. Such data are available for all other studied tissue resident macrophage populations. Removing them will be a disservice to the community.

      2) Another reviewer asked for deeper characterization of ThyMacs based on the data in Fig. 2. Accommodating this request will be very difficult if we remove this part.

      Nevertheless, we agree that ThyMacs heterogeneity is the central claim of the manuscript and should be introduced earlier. Now, the original figure 5 (current Fig. 4) that described the heterogeneity has been moved before the original figures 3 and 4 (current Fig. 5 and 6). Additional analyses distinguishing Tim4+ and Tim4- ThyMacs has been incorporated in current Fig. 5 and 6.

      • The figure 1 is not very clear. The backgating should be added in 1a. Or why not using the color map axis mode from FlowJo to show 3 parameters at a glance? The gating strategy should be more clearly displayed on the figure. On fig 1S3, there are clearly 2 pops in the CX3CR1-GFP mice. Why not starting from this to introduce the two populations?

      We thank the reviewer for the suggestion. We have included a color map axis to show MerTK, CD64, and F4/80 in one plot. The description of the gating strategy has been stripped to 2 panels that capture its essence. \We agree that there are several indications for heterogeneity among thymic macrophages, starting with Fig. 1E – the expression of Tim4, and Fig S4c – the expression of CX3CR1-GFP. We have added extra text at the beginning of the paragraph describing current Fig. 4 to point out these facts.

      • The figure 2 could be revised also. First, the panel 2a is useless and should be removed. A PC analysis of all the macs would be more useful here. Also, the color code used for the genes is confusing. Why genes up in ThyMacs are red in 2b but only half of them in 2d? Info can be found in the legend but it should be more clear on a graphical point of view.

      We have revised Fig. 2 according to the reviewer’s suggestions. The PCA analysis is consistent with the hierarchical clustering and shows that splenic and liver macrophages are most closesly related to ThyMacs. We agree that the presence of red in both heatmaps is confusing and we have changed the color code – color was removed from current Fig. 2A but retained in Fig. 2C.

      • For figure 3, what is the timepoint of the panel 3b? Here, authors should show microglia and ThyMacs for both timepoints and conclude based on the comparison. If ThyMacs are as stable as the microglia, no replacement. If not, replacement. For the panel 3f, n=3 is too low to be convinced notably with the standard variation here. And displaying the dot plot with 11% of blood mono from donor while the median being around 20 is not fair, authors should present the most representative plot. For the panel 3h, there are more GFP (in term of MFI) for TEC and ThyMacs than for total cells. How is it possible? TECs and ThyMacs should be in the total cells? Or the gating is not clear enough?

      We thank the reviewer for pointing our omissions. Fig. 3b (current Fig. 5B) is from E19.5 and we have added this information to the figure. We also agree that in Fig. 3f (current Fig. 5F) the sample number is too small and the variation too large to make solid conclusions. That is why we have repeated the partial chimeras experiment trying to irradiate as much as possible of the mice without affecting the thymus. We have substituted the data in the Fig. 3e and 3f with the new data. For Fig. 3h, we appologize for not labeling the data clearly. The panels labeled “single, live cells” should be labeled as “thymocytes” as they were obtained without enzymatic digestion that is essential for both TECs and ThyMacs. However, we found an important caveat in the thymus transplant experiment. It appeared that some of the thymus macrophages were GFP positive not because they express GFP but because they have engulfed GFP+ cells. As a result our experiments with embryonic GFP+ thymus transplants overestimate the percentage of donor-derived ThyMacs (all of them were GFP+). We have repeated the thymus transplantation experiments with congenically marked thymuses (CD45.2 donor and CD45.1 host). While this set up did not allow us to use the thymic epithelial cells as positive control because they are CD45-, we did identify host-derived ThyMacs, consistent with Tim4- cells originating from adult HSCs. Thus, we have replaced the previous data in Fig. 3H and 3I with current figures 5H and 5I.

      • For figure 4, the EdU staining (4e) is not convincing at all. The signal is very low (as compared to 4c for example.

      We agree that signal after 21d chase is a lot weaker than after 2 h (Fig. 4c) or 21d (Fig. 4e) of EdU pulse. The reason we decided to keep this data is that: 1) the thymocytes also have much lower EdU staining after 21d chase compared to 2h and 21d of EdU pulse; 2) The results from EdU staining are very consistent with the data from Ki67 staining, cell cycle analysis, and scRNA-Seq revealing a small population (~5%) of cycling ThyMacs.

      • For figure 7, the interpretation of the data and the way to present them are not clear. Authors use an inducible fate-mapping model. The fact that Tim4- loose their signal with time argue for a replacement by non-labelled cells (blood monocytes) whereas Tim4+ ones are stable meaning they self-maintain. It is what authors claim. But how it fits with previous data where they say that Tim4+ derived form CX3CR1+? The explanation that is a bit subtended here but not enough clearly shown is that CX3CR1+ give rise to Tim4+ during embryonic development but is stops after, Tim4 self-renew independently, and CX3CR1+ are slowly replaced by monocytes. As this is the central claim of the paper, it should be most clearly reported and for this, a substantial change of the whole plan is required.

      We thank the reviewer for pointing out the need for better explanation. The maintenance of the different populations of ThyMacs is indeed complex and proceeds in different ways in the different periods of life. We have added some extra data to Fig. 7 (current Fig. 8) that we hope will add some clarity to the maintenance of thymic macrophages with age. The new Fig. 8F shows the dynamics of the cell numbers of Tim4+ and Tim4- macrophages with age. Tim4+ cells reach a peak in young mice and decline significantly as mice age. So, we do not think that they are self-maintaining but instead, undergo slow attrition with very limited replacement. These results are consistent with Fig. 6I showing low levels of Mki67 in Tim4+ cells. Tim4- are a different story: they progressively accumulate with age. Although the variability in thymus size and Tim4- macrophages in very old mice is too great for the data to reach significance, the trend is clear.

      As for the dynamics of the populations in the embryonic period, we added data formally demonstrating that TIM4+CX3XR1- are derived from CX3CR1+ cells by fate mapping (Fig. 7E-G). We induced re-combination in pregnant ROSA26LSL-GFP mice pregnant from Cx3cr1CreER males at E15.5 when almost all ThyMacs are Cx3cr1+ (Fig. 7A). Just before birth, at E19.5, we could find a substantial proportion of TIM4+CX3CR1- cells among the fate mapped GFP+ macrophages, indicating that Cx3cr1+ cells, indeed, give rise to TIM4+CX3CR1- cells. As pointed out before, this pathway gets exhausted by the first week after birth – at d7 all ThyMacs are TIM4+.

    1. <h1><strong>BIO</strong></h1>

      In later assignments, it will be helpful to add some semantic html to what sections of code are what ex. body> header> h1> example nav> your code for your navigation. </nav> /header> main> p> your main content

      /main> /body> This will also come in handy once we start to learn some more CSS