10,000 Matching Annotations
  1. Mar 2025
    1. Reviewer #1 (Public review):

      Summary:

      During early Drosophila pupal development, a subset of larval abdominal muscles (DIOMs) is remodelled using an autophagy-dependent mechanism.

      To better understand this not very well studied process, the authors have generated a transcriptomics time course using dissected abdominal muscles of various stages from wild-type and autophagy-deficient mutants. The authors have further identified a function for BNIP3 in muscle mitophagy using this system.

      Strengths:

      (1) The paper does provide a detailed mRNA time course resource for DIOM remodelling.

      (2) The paper does find an interesting BNIP3 loss of function phenotype, a block of mitophagy during muscle remodelling, and hence identifies a specific linker between mitochondria and the core autophagy machinery. This adds to the mechanism of how mitochondria are degraded.

      (3) Sophisticated fly genetics demonstrates that the larval muscle mitochondria are, to a large extent, degraded by autophagy during DIOM remodelling.

      Weaknesses:

      (1) Mitophagy during DIOM remodelling is not novel (earlier papers from Fujita et al.).

      (2) The transcriptomics time course data are not well connected to the autophagy part. Both could be separated into 2 independent manuscripts.

      (3) The muscle phenotypes need better quantifications, both for the EM and light microscopy data in various figures.

      (4)The transcriptomics data are hard to browse in the provided PDF format.

    2. Reviewer #2 (Public review):

      Summary:

      Autophagy (macroautophagy) is known to be essential for muscle function in flies and mammals. To date, many mitophagy (selective mitochondrial autophagy) receptors have been identified in mammals and other species. While the loss of mitophagy receptors has been shown to impair mitochondrial degradation (e.g., OPTN and NDP52 in Parkin-mediated mitophagy and NIX and BNIP3 in hypoxia-induced mitophagy) at the level of cultured cells, it remains unclear, especially under physiological conditions in vivo. In this study, the authors revealed that one of the receptors BNIP3 plays a critical role in mitochondrial degradation during muscle remodeling in vivo.

      Overall, the manuscript provides solid evidence that BNIP3 is involved in mitophagy during muscle remodeling with in vivo analyses performed. In particular, all experiments in this study are well-designed. The text is well written and the figures are very clear.

      Strengths:

      (1) In each experiment, appropriate positive and negative controls are used to indicate what is responsible for the phenomenon observed by the authors: e.g. FIP200, Atg18, Stx17 siRNAs during DIOM remodeling in Figure 2 and Full, del-LIR, del-MER in Figure 5.

      (2) Although the transcriptional dynamics of DIOM remodeling during metamorphosis is autophagy-independent, the transcriptome data obtained by the authors would be valuable for future studies.

      (3) In addition to the simple observation that loss of BNIP3 causes mitochondrial accumulation, the authors further observed that, by combining siRNA against STX17, which is required for fusion of autophagosomes with lysosomes, BNIP3 KO abolishes mitophagosome formation, which will provide solid evidence for BNIP3-mediated mitophagy. Furthermore, using a Gal80 temperature-sensitive approach, the authors showed that mitochondria derived from larval muscle, but not those synthesized during hypertrophy, remain in BNIP3 KO fly muscles.

      Weaknesses:

      (1) Because BNIP3 KO causes mitochondrial accumulation, it is expected that adult flies will have some physiological defects, but this has not been fully analyzed or sufficiently mentioned in the manuscript.

      (2) In Figure 5, the authors showed that BNIP3 binds to Atg18a by co-IP, but no data are provided on whether MER-mut or del-MER attenuates the affinity for Atg18a.

    3. Reviewer #3 (Public review):

      Summary:

      Fujita et al build on their earlier, 2017 eLife paper that showed the role of autophagy in the developmental remodeling of a group of muscles (DIOM) in the abdomen of Drosophila. Most larval muscles undergo histolysis during metamorphosis, while DIOMs are programmed to regrow after initial atrophy to give rise to temporary adult muscles, which survive for only 1 day after eclosion of the adult flies (J Neurosci. 1990;10:403-1. and BMC Dev Biol 16, 12, 2016). The authors carry out transcriptomics profiling of these muscles during metamorphosis, which is in agreement with the atrophy and regrowth phases of these muscles. Expression of the known mitophagy receptor BNIP3/NIX is high during atrophy, so the authors have started to delve more into the role of this protein/mitophagy in their model. BNIP3 KO indeed impairs mitophagy and muscle atrophy, which they convincingly demonstrate via nice microscopy images. They also show that the already known Atg8a-binding LIR and Atg18a-binding MER motifs of human NIX are conserved in the Drosophila protein, although the LIR turned out to be less critical for in vivo protein function than the MER motif.

      Strengths:

      Established methodology, convincing data, in vivo model.

      Weaknesses:

      The significance for Drosophila physiology and for human muscles remains to be established.

    1. eLife Assessment

      This paper provides a compelling and rigorous quantitative analysis of the turnover and maintenance of CD4+ tissue-resident memory T cell clones, in the skin and the lamina propria. It provides a fundamental advance in our understanding of CD4 T cell regulation. Interestingly, in both tissues, maintenance involves an influx from progenitors on the time scale of months. The evidence that is based on fate mapping and mathematical inference is strong, although open questions on the interpretation of the Ki67-based fate mapping remain.

    2. Reviewer #1 (Public review):

      Summary:

      Compelling and clearly described work that combines two elegant cell fate reporter strains with mathematical modelling to describe the kinetics of CD4+ TRM in mice. The aim is to investigate the cell dynamics underlying the maintenance of CD4+TRM.

      The main conclusions are that:<br /> (1) CD4+ TRM are not intrinsically long-lived.<br /> (2) Even clonal half-lives are short: 1 month for TRM in skin, and even shorter (12 days) for TRM in lamina propria.<br /> (3) TRM are maintained by self-renewal and circulating precursors.

      Strengths:

      (1) Very clearly and succinctly written. Though in some places too succinctly! See suggestions below for areas I think could benefit from more detail.

      (2) Powerful combination of mouse strains and modelling to address questions that are hard to answer with other approaches.

      (3) The modelling of different modes of recruitment (quiescent, neutral, division linked) is extremely interesting and often neglected (for simpler neutral recruitment).

      Weaknesses/scope for improvement:

      (1) The authors use the same data set that they later fit for generating their priors. This double use of the same dataset always makes me a bit squeamish as I worry it could lead to an underestimate of errors on the parameters. Could the authors show plots of their priors and posteriors to check that the priors are not overly-influential? Also, how do differences in priors ultimately influence the degree of support a model gets (if at all)? Could differences in priors lead to one model gaining more support than another?

      (2) The authors state (line 81) that cells were "identified as tissue-localised by virtue of their protection from short-term in vivo labelling (Methods; Fig. S1B)". I would like to see more information on this. How short is short term? How long after labelling do cells need to remain unlabelled in order to be designated tissue-localised (presumably label will get to tissue pretty quickly -within hours?). Can the authors provide citations to defend the assumption that all label-negative cells are tissue-localised (no false negatives)? And conversely that no label-positive cells can be found in the tissue (no false positives)? I couldn't actually find the relevant section in the methods and Figure S1B didn't contain this information.

      (3) Are the target and precursor populations from the same mice? If so is there any way to reflect the between-individual variation in the precursor population (not captured by the simple empirical fit)? I am thinking particularly of the skin and LP CD4+CD69- populations where the fraction of cells that are mTOM+ (and to a lesser extent YFP+) spans virtually the whole range. Would it be nice to capture this information in downstream predictions if possible?

      (4) In Figure 3, estimates of kinetics for cells in LP appear to be more dependent on the input model (quiescent/neutral/division-linked) than the same parameters in the skin. Can the authors explain intuitively why this is the case?

      (5) Can the authors include plots of the model fits to data associated with the different strengths of support shown in Figure 4? That is, I would like to know what a difference in the strength of say 0.43 compared with 0.3 looks like in "real terms". I feel strongly that this is important. Are all the fits fantastic, and some marginally better than others? Are they all dreadful and some are just less dreadful? Or are there meaningful differences?

      (6) Figure 4 left me unclear about exactly which combinations of precursors and targets were considered. Figure 3 implies there are 5 precursors but in Figure 4A at most 4 are considered. Also, Figure 4B suggests skin CD69- were considered a target. This doesn't seem to be specified anywhere.

    3. Reviewer #2 (Public review):

      This manuscript addresses a fundamental problem of immunology - the persistence mechanisms of tissue-resident memory T cells (TRMs). It introduces a novel quantitative methodology, combining the in vivo tracing of T-cell cohorts with rigorous mathematical modeling and inference. Interestingly, the authors show that immigration plays a key role in maintaining CD4+ TRM populations in both skin and lamina propria (LP), with LP TRMs being more dependent on immigration than skin TRMs. This is an original and potentially impactful manuscript. However, several aspects were not clear and would benefit from being explained better or worked out in more detail.

      (1) The key observations are as follows:

      a) When heritably labeling cells due to CD4 expression, CD4+ TRM labeling frequency declines with time. This implies that CD4+ TRMs are ultimately replenished from a source not labeled, hence not expressing CD4. Most likely, this would be DN thymocytes.

      b) After labeling by Ki67 expression, labeled CD4+ TRMs also decline - This is what Figure 1B suggests. Hence they would be replaced by a source that was not in the cell cycle at the time of labeling. However, is this really borne out by the experimental data (Figure 2C, middle row)? Please clarify.

      (2) For potential source populations (Figure 2D): Please discuss these data critically. For example, CD4+ CD69- cells in skin and LP start with a much lower initial labeling frequency than the respective TRM populations. Could the former then be precursors of the latter? A similar question applies to LN YFP+ cells. Moreover, is the increase in YFP labeling in naïve T cells a result of their production from proliferative thymocytes? How well does the quantitative interpretation of YFP labeling kinetics in a target population work when populations upstream show opposite trends (e.g., naïve T cells increasing in YFP+ frequency but memory cells in effect decreasing, as, at the time of labeling, non-activated = non-proliferative T cells (and hence YFP-) might later become activated and contribute to memory)?

      (3) Please add a measure of variation (e.g., suitable credible intervals) to the "best fits" (solid lines in Figure 2).

      (4) Could the authors better explain the motivation for basing their model comparisons on the Leave-One-Out (LOO) cross-validation method? Why not use Bayesian evidence instead?

    1. eLife Assessment

      This study approaches an important topic providing insight into the neuronal circuitry that interconnects memory consolidation and sleep. The data were collected and analysed using a solid methodology, contributing new findings for neurobiologists working on how memories are stored and the roles of sleep. However, the data is incomplete to support the proposed role of the PAM-DPM circuits as the link between sleep state and long-term memory consolidation.

    2. Reviewer #1 (Public review):

      Summary:

      The authors aim to use state-of-the art behaviour, imaging, and connectome techniques to identify the neural interaction between sleep and long-term memory consolidation in the PAM-DPM circuits, a well-known dopaminergic pathway within Drosophila Mushroom Body.

      Strengths:

      From a Drosophila sleep researcher's perspective, the investigation follows a clear and logical strategy to collect a huge dataset of sleep, appetitive memory, and live imaging. The authors clearly identified and showed that activation of a PAM subset: alpha-1 reduces sleep quality and memory consolidation in a starvation-dependent manner. The authors also convincingly demonstrated the corresponding neuronal responses of DPM neurons following PAM alpha-1 activation, and the positive role of DPM neural activity in sleep and memory consolidation. Moreover, the authors applied a new way of sleep statistics to demonstrate hour-by-hour changes between treatment and genotypes. Importantly, the authors demonstrated that memory loss derived from PAM alpha 1 activation can be partly restored by ectopic sleep enhancement via feeding THIP during the memory consolidation period after training.

      Weaknesses:

      Two investigatory gaps relate to the misalignment between circuital activity and behaviours, due to the nature of large circuital functional analysis like this. Firstly, the central observation of the study indicates that PAM alpha1 activation causes DPM inhibition which disrupts sleep and memory consolidation. Therefore one would expect a reduced PAMalpha1 and increased DPM activities after memory training, but the authors found that the endogenous CRTC::GFP reported neuronal activity for PAMalpha1 and DPM are both increased after memory training (Figure 9). This can be due to the difficult functional demarcation among the 14 PAMalpha1 projections. Secondly, the authors acknowledged the contradicting finding that memory defect is detected in PAMalpha1 inactivation (Figure 7C), yet suggested a tight link between sleep and memory consolidation; it is clear loss of PAM subset activity can disrupt memory consolidation without affecting sleep (cf Figure 7C and 7I).

    3. Reviewer #2 (Public review):

      Summary:

      Sleep plays a critical role in memory consolidation, but the neural mechanisms underlying this relationship remain poorly understood. The authors present novel findings implicating two small neuronal groups with inhibitory connections, PAM-a1 to DPM, in sleep regulation and LTM consolidation. However, whether the PAM-a1 to DPM microcircuit promotes LTM consolidation through sleep regulation requires further investigation.

      Strengths:

      The authors report several novel findings. Brief activation or inhibition of PAM-a1 neurons, or brief inhibition of DPM neurons during the first few hours after training, impairs 24-hour LTM. Notably, these brief manipulations disrupt sleep for many hours afterward, particularly at night. Interestingly, disruption of PAM-a1 and DPM neurons impairs sleep and appetitive memory consolidation only under starvation conditions, and pharmacological induction of sleep during the night rescues the LTM defects. These findings suggest that PAM-a1 and DPM neurons are involved in sleep regulation and LTM consolidation under starvation. These are important findings that advance our understanding of the link between sleep and memory consolidation.

      Weaknesses:

      Some claims lack sufficient evidence or clarity:

      (1) All sleep experiments are conducted under the "training" (temperature-change) condition. While genotypic controls are helpful, additional no-training controls are required to confirm that the observed differences are due to training rather than unknown genotype-related factors. The fact that experimental genotypes exhibit significantly altered sleep even before "training" (e.g., Figs. 7H, J, K, 8A, B, D) highlights the necessity of these controls.

      (2) Previous studies on disrupted memory due to sleep reduction have primarily examined conditions with severe sleep deprivation. In contrast, this report claims that relatively small decreases in total sleep accompanied by sleep fragmentation are responsible for impaired memory consolidation. It remains unclear whether sleep fragmentation at this level is truly critical for memory consolidation. The authors should cause sleep loss and fragmentation of similar magnitude through other means and determine whether it can impair LTM.

      (3) The authors employed a neural activity reporter to show that starvation increases the basal activity of PAM-a1 but not DPM neurons in untrained flies (Figures 9C-E). They observed small increases in the activity of both neuron groups immediately after training but not one hour later. Given the inhibitory connection from PAM-a1 to DPM, it is unclear why both neuron groups show increased activity after training. Additionally, as the authors acknowledge, it is puzzling how the inactivation of PAM-a1 produces similar effects on sleep and memory as DPM inhibition and PAM-a1 activation. Further experiments are needed to clarify these findings, such as manipulating PAM-a1 activity during the one-hour post-training period and evaluating the effect on DPM activity. Including data from training under fed conditions would provide a more comprehensive understanding of state-dependent neural activity. Even if certain experiments are not feasible, these issues warrant further discussion. It is also important to clarify that the term "synchronized" does not imply single-spike-level synchrony.

      (4) The authors considered that PAM-a1 and DPM might function in parallel, independent pathways for sleep and LTM. They rejected this possibility based on the lack of additive effects when both neuronal groups were simultaneously inactivated. However, they found that MB299B-labelled neurons exert stronger memory effects than MB043B-labelled neurons, while MB043B neurons have stronger sleep effects. If sleep is a primary driver of memory consolidation, a stronger correlation between memory and sleep effects would be expected. This observation merits further discussion.

      (5) Given prior knowledge that PAM neurons are heterogeneous and that the R58E02 driver is broadly expressed, data in Figures 1-5 concerning PAM are outdated. The use of more restricted PAM-a1 drivers from the outset would make the manuscript easier to read and interpret.

      (6) Some figures lack relevant data, certain experiments are missing necessary controls, and anomalies are present in some data sets.

    4. Reviewer #3 (Public review):

      Summary:

      Understanding the neural circuits that link sleep and memory remains a fundamental challenge in neuroscience. In this study, Lin Yan and colleagues investigate how dopamine signaling in Drosophila regulates long-term memory (LTM) formation in the context of sleep. They identify a specific microcircuit between protocerebral anterior medial dopamine neurons (PAM-DANs) and dorsal paired medial (GABAergic DPM) neurons that modulates memory consolidation. Their findings suggest that disrupting the basal activity of PAM-α1 neurons during early consolidation impairs LTM, with particularly pronounced effects under starvation conditions. Notably, sleep fragmentation caused by this disruption can be pharmacologically rescued, restoring LTM. These results provide compelling evidence that dopamine signaling plays a crucial role in linking sleep and memory, offering new insights into the underlying mechanisms.

      Strengths:

      This study presents a well-executed investigation into sleep-memory interactions, utilizing a combination of connectomics, behavioral assays, functional imaging, and pharmacological manipulations. The authors convincingly demonstrate that the PAM-α1 and DPM circuits interact, highlighting a potential mechanism by which sleep influences memory consolidation. The anatomical and functional dissection of this circuit is of high interest to the field, and the study's integration of sleep and memory processes contributes significantly to our understanding of dopamine's role in cognitive functions.

      Weaknesses:

      While the study is well-designed and presents compelling findings, some aspects require further clarification. The interpretation of dopamine receptor signaling remains incomplete, particularly regarding inhibitory pathways. The role of DPM in memory consolidation is not entirely conclusive, as different genetic approaches yield variable results. Additionally, some inconsistencies in neuronal activity patterns and experimental variability, especially regarding sleep patterns or pharmacological rescue, should be addressed to strengthen the mechanistic framework.

      Conclusion:

      Overall, this study provides valuable new insights into how sleep and dopamine circuits interact to regulate memory consolidation. While the findings are compelling, addressing the points above-particularly receptor signaling and the specific role of DPM and its activity patterns within the microcircuit would further solidify the study's conclusions.

    1. eLife Assessment

      Lu and colleagues developed an important imaging protocol that combines expansion microscopy, light-sheet microscopy, and image segmentation for use with the planarian Schmidtea mediterranea, a powerful model system for regeneration. This represents a substantial improvement on current standards and enables more rapid data acquisition. The utility of this solid protocol is demonstrated by quantifying several aspects of this flatworm's neural anatomy and musculature during homeostasis and regeneration. This work will be of interest to researchers looking to implement more systematic approaches towards imaging and quantifying intact specimens.

    2. Reviewer #1 (Public review):

      Summary:

      The planarian flatworm Schmidtea mediterranea is widely used as a model system for regeneration because of its remarkable ability to regenerate its entire body plan from very small fragments of tissue, including the complete and rapid regeneration of the CNS. Prior to this study, analysis of CNS regeneration in planaria has mostly been performed on a gross anatomical level. Lu et al. describe a careful and detailed analysis of the planarian neuroanatomy and musculature in both the homeostatic and regenerating contexts. To improve the effective resolution of their imaging, the authors optimized a tissue expansion protocol for planaria. Imaging was performed by light sheet microscopy, and the resulting optical sections were tiled to reconstruct whole worms. Labelled tissues and cells were then segmented to allow quantification of neurons, muscle fibers, and all cells in individual worms.

      Strengths:

      The resulting workflow can produce highly detailed and quantifiable 3D reconstructions at a rate that is fast enough to allow the analysis of large numbers of whole animals.

      Weaknesses:

      While Lu et al. have shown how their methodology and workflow can be used to image and quantify features from whole animals, it is unclear how well their technique as described will perform at sub-cellular resolutions based upon the data that they show.

    3. Reviewer #3 (Public review):

      Summary:

      In this manuscript, the authors apply tissue expansion and tiling light sheet microscopy to study allometric growth and regeneration in planaria. They developed image analysis pipelines to help them quantify different neuronal subtypes and muscles in planaria of different sizes and during regeneration. Among the strengths of this work, the authors provide beautiful images that show the potential of the approaches they are taking and their ability to quantify specific cell types in relatively large numbers of whole animal samples. Many of their findings confirm previous results in the literature, which helps validate the techniques and pipelines they have applied here. Among their new observations, they find that the body wall muscles at the anterior and posterior poles of the worm are organized differently and show that the muscle pattern in the posterior head of beta-catenin RNAi worms resembles the anterior muscle pattern. They also show that glial cell processes appear to be altered in beta-catenin or insulin receptor-1 RNAi worms. Weaknesses include some over-interpretation of the data and lack of consideration or citation of relevant previous literature, as discussed below.

      Strengths:

      This method of tissue expansion will be useful for researchers interested in studying this experimental animal. The authors provide high-quality images that show the utility of this technique. Their analysis pipeline permits them to quantify cell types in relatively large numbers of whole animal samples.

      The authors provide convincing data on changes in total neurons and neuronal sub-types in different-sized planaria. They report differences in body wall muscle pattern between the anterior and posterior poles of the planaria, and that these differences are lost when a posterior head forms in beta-catenin RNAi planaria. They also find that glial cell projections are reduced in insulin receptor-1 RNAi planaria.

      Comments on revisions:

      The authors have satisfactorily addressed the major concerns of the previous reviewers.

    4. Author response:

      The following is the authors’ response to the original reviews.

      Reviewer #1(Public review):

      comment 1: Lu et al. use their workflow to visualize RNA expression of five enzymes that are each involved in the biosynthetic pathway of different neurotransmitters/modulators, namely chat (cholinergeric), gad (GABAergic), tbh (octopaminergic), th (dopaminergic), and tph (serotonergic). In this way, they generate an anatomical atlas of neurons that produce these molecules. Collectively these markers are referred to as the "neuronpool." They overstate when they write, "The combination of these five types of neurons constitutes a neuron pool that enables the labeling of all neurons throughout the entire body." This statement does not accurately represent the state of our knowledge about the diversity of neurons in S. mediterranea. There are several lines of evidence that support the presence of glutamatergic and glycinergic neurons, including the following. The glutamate receptor agonists NMDA and AMPA both produce seizure-like behaviors in S. mediterranea that are blocked by the application of glutamate receptor antagonists MK-801 and DNQX (which antagonize NMDA and AMPA glutamate receptors, respectively; Rawls et al., 2009). scRNA-Seq data indicates that neurons in S. mediterranea express a vesicular glutamate transporter, a kainite-type glutamate receptor, a glycine receptor, and a glycine transporter (Brunet Avalos and Sprecher, 2021; Wyss et al., 2022). Two AMPA glutamate receptors, GluR1 and GluR2, are known to be expressed in the CNS of another planarian species, D. japonica (Cebria et al., 2002). Likewise, there is abundant evidence for the presence of peptidergic neurons in S. mediterranea (Collins et al., 2010; Fraguas et al., 2012; Ong et al., 2016; Wyss et al., 2022; among others) and in D. japonica (Shimoyama et al., 2016). For these reasons, the authors should not assume that all neurons can be assayed using the five markers that they selected. The situation is made more complex by the fact that many neurons in S. mediterranea appear to produce more than one neurotransmitter/modulator/peptide (Brunet Avalos and Sprecher, 2021; Wyss et al., 2022), which is common among animals (Vaaga et al., 2014; Brunet Avalos and Sprecher, 2021). However the published literature indicates that there are substantial populations of glutamatergic, glycinergic, and peptidergic neurons in S. mediterranea that do not produce other classes of neurotransmission molecule (Brunet Avalos and Sprecher, 2021; Wyss et al., 2022). Thus it seems likely that the neuronpool will miss many neurons that only produce glutamate, glycine or a neuropeptide.

      In response to your comments, we agree that our initial statement regarding the "neuron pool" overstated the extent of neuronal coverage provided by the five selected markers. We have revised the sentence as “The combination of these five types of neurons constitutes a neuron pool that enables the labeling of most of the neurons throughout the entire body, including the eyes, brain, and pharynx”.

      Furthermore, we chose the five neurotransmitter systems (cholinergic, GABAergic, octopaminergic, dopaminergic, and serotonergic) based on their well-characterized roles in planarian neurobiology and the availability of reliable markers. However, we acknowledge the limitations of this approach and recognize that it does not encompass all neuron types, particularly those involved in glutamatergic, glycinergic, and peptidergic signaling, which have been documented in S. mediterranea. We have also added the content about other neuron types in our revised results section “Additionally, the neuron system of S. mediterranea is complex which characterized by considerable diversity among glutamatergic, glycinergic, and peptidergic neurons in planarians and many neurons in S. mediterranea express more than one neurotransmitter or neuropeptide, which adds further complexity to the system. We used five markers for a proof of concept illustration. By employing Fluorescence in Situ Hybridization (FISH), we successfully visualized a variety of planarian neurons, including cholinergic (chat<sup>+</sup>), serotonergic (tph<sup>+</sup>), octopaminergic (tbh<sup>+</sup>), GABAergic (gad<sup>+</sup>), and dopaminergic (th<sup>+</sup>) neurons based on their well-characterized roles in planarian neurobiology and the availability of reliable markers. (Figure S2A, Supplemental video 2) (Currie et al., 2016). The combination of these five types of neurons constitutes a neuron pool that enables the labeling of most of the neurons throughout the entire body, including the eyes, brain, and pharynx (Figure 1B).”

      comment 2: The authors use their technique to image the neural network of the CNS using antibodies raised vs. Arrestin, Synaptotagmin, and phospho-Ser/Thr. They document examples of both contralateral and ipsilateral projections from the eyes to the brain in the optic chiasma (Figure 1C-F). These data all seem to be drawn from a single animal in which there appears to be a greater than normal number of nerve fiber defasciculatations. It isn't clear how well their technique works for fibers that remain within a nerve tract or the brain. The markers used to image neural networks are broadly expressed, and it's possible that most nerve fibers are too densely packed (even after expansion) to allow for image segmentation. The authors also show a close association between estrella-positive glial cells and nerve fibers in the optic chiasma.

      Thank you for your detailed feedback. While we did not perform segmentation of all neuron fibers, we were able to segment more isolated fibers that were not densely packed within the neural tracts. We use 120 nm resolution to segment neurons along the three axes. Our data show the presence of both contralateral and ipsilateral projections of visual neurons. Although Figure 1C-F shows data from one planarian, we imaged three independent specimens to confirm the consistency of these observations. In the revised manuscript, we have included a discussion on the limitations of TLSM in reconstructing neural networks. In the discussion part, we added “It should be noted that the current resolution for our segmentation may be limited when resolving fibers within densely packed regions of the nerve tracts”.

      comment 3: The authors count all cell types, neuron pool neurons, and neurons of each class assayed. They find that the cell number to body volume ratio remains stable during homeostasis (Figure S3C), and that the brain volume steadily increases with increasing body volume (Figure S3E). They also observe that the proportion of neurons to total body cells is higher in worms 2-6 mm in length than in worms 7-9 mm in length (Figure 2D, S3F). They find that the rate at which four classes of neurons (GABAergic, octopaminergic, dopaminergic, serotonergic) increase relative to the total body cell number is constant (Figure S3G-J). They write: "Since the pattern of cholinergic neurons is the major cell population in the brain, these results suggest that the above observation of the non-linear dynamics between neurons and cell numbers is likely from the cholinergic neurons." This conclusion should not be reached without first directly counting the number of cholinergic neurons and total body cells. Given that glutamatergic, glycinergic, and peptidergic neurons were not counted, it also remains possible that the non-linear dynamics are due (in part or in whole) to one or more of these populations.

      We have revised the statement into “These results suggest that the above observation of the non-linear dynamics between neuron and total cell number is not likely from the octopaminergic, GABAergic, dopaminergic, and serotonergic neurons. Since our neuron pool may not include glutamatergic, glycinergic, and peptidergic neurons, the non-linear dynamics may be from cholinergic neurons or other neurons not included in our staining.”

      Reviewer #2 (Public review):

      Weaknesses:

      (1) The proprietary nature of the microscope, protected by a patent, limits the technical details provided, making the method hard to reproduce in other labs.

      Thank you for your comment. We understand the importance of reproducibility and transparency in scientific research. We would like to point out that the detailed design and technical specifications of the TLSM are publicly available in our published work: Chen et al., Cell Reports, 2020. Additionally, the protocol for C-MAP, including the specific experimental steps, is comprehensively described in the methods section of this paper. We believe that these resources should provide sufficient information for other labs to replicate the method.

      (2) The resolution of the analyses is mostly limited to the cellular level, which does not fully leverage the advantages of expansion microscopy. Previous applications of expansion microscopy have revealed finer nanostructures in the planarian nervous system (see Fan et al. Methods in Cell Biology 2021; Wang et al. eLife 2021). It is unclear whether the current protocol can achieve a comparable resolution.

      Thank you for raising this important point. The strength of our C-MAP protocol lies in its fluorescence-protective nature and user convenience. Notably, the sample can be expanded up to 4.5-fold linearly without the need for heating or proteinase digestion, which helps preserve fluorescence signals. In addition, the entire expansion process can be completed within 48 hours. While our current analysis focused on cellular-level structures, our method can achieve comparable or better resolution and we will add this information in the revised manuscript as “It is important to point out that the strength of our C-MAP protocol lies in its fluorescence-protective nature and user convenience. Notably, the sample can be expanded up to 4.5-fold linearly without the need for heating or proteinase digestion, which helps preserve fluorescence signals. In addition, the entire expansion process can be completed within 48 hours. Based on our research requirement, two spatial resolutions were adopted to image expanded planarians, 2×2×5 μm<sup>3</sup> and 0.5×0.5×1.6 μm<sup>3</sup>. The resolution can be further improved to 500 nm and 120 nm, respectively.”

      (3) The data largely corroborate past observations, while the novel claims are insufficiently substantiated.

      A few major issues with the claims:

      Line 303-304: While 6G10 is a widely used antibody to label muscle fibers in the planarian, it doesn't uniformly mark all muscle types (Scimone at al. Nature 2017). For a more complete view of muscle fibers, it is important to use a combination of antibodies targeting different fiber types or a generic marker such as phalloidin. This raises fundamental concerns about all the conclusions drawn from Figures 4 and 6 about differences between various muscle types. Additionally, the authors should cite the original paper that developed the 6G10 antibody (Ross et al. BMC Developmental Biology 2015).

      We appreciate the reviewer’s insightful comments and acknowledge that 6G10 does not uniformly label all muscle fiber types. We agree that this limitation should be recognized in the interpretation of our results. We have revised the manuscript to explicitly state the limitations of using 6G10 alone for muscle fiber labeling and highlight the need for additional markers. We have included the following statement in the Results section: “It is noted that previous studies reported that 6G10 does not label all body wall muscles equivalently with the limitation of predominantly labeling circular and diagonal fibers (Scimone et al., 2017; Ross et al., 2015). Our observation may be limited by this preference”. We would also clarify that the primary objective of our study was to demonstrate the application of our 3D tissue reconstruction method in addressing traditional research questions. Nonetheless, we agree that expanding the labeling strategy in future studies would allow for a more thorough investigation of muscle fiber diversity. Relevant citations have been properly revised and updated.

      (4) Lines 371-379: The claim that DV muscles regenerate into longitudinal fibers lacks evidence. Furthermore, previous studies have shown that TFs specifying different muscle types (DV, circular, longitudinal, and intestinal) both during regeneration and homeostasis are completely different (Scimone et al., Nature 2017 and Scimone et al., Current Biology 2018). Single-cell RNAseq data further establishes the existence of divergent muscle progenitors giving rise to different muscle fibers. These observations directly contradict the authors' claim, which is only based on images of fixed samples at a coarse time resolution.

      Thank you for your valuable feedback. Our intent was not to suggest that DV muscles regenerate into longitudinal fibers. Our observations focused on the wound site, where DV muscle fibers appear to reconnect, and longitudinal fibers, along with other muscle types, gradually regenerate to restore the structure of the injured area. We have revised the our statement as:“During the regeneration process, DV muscle fibers reconnect at the wound site, with longitudinal fibers and other muscle types gradually restoring the structure at the anterior tip and later integrating with circular and diagonal fibers through small DV fiber branches (Figure S5O1-O3).”

      (5) Line 423: The manuscript lacks evidence to claim glia guide muscle fiber branching.

      We agree with your concerns that our statement may be overestimated. We have removed this statement from the revised version. Instead, we focused on describing our observations of the connections between glial cells and muscle fibers. We have revised the section as follows: “Considering the interaction between glial and muscle cells, the localization of estrella<sup>+</sup> glia and muscle fibers is further investigated. By dual-staining of anti-Phospho (Ser/Thr) and 6G10 in inr-1 RNAi and β-catenin-1 RNAi planarians, we found that the morphologies of neurons are normal, and they have close contact with muscle fibers (Figure 6D, E). However, by dual staining of estrella and 6G10, we found that the structure of glial cells is star-shaped in egfp RNAi planarian, however, glial cells in inr-1 RNAi and β-catenin-1 RNAi planarians have shorter cytoplasmic projections, and their sizes are smaller, lacking the major projection onto the muscles (Figure 6D, E, Figure S6E-K). Especially, in the posterior head of β-catenin-1 RNAi planarians, the glial cell has few axons and can hardly connect with muscle fibers (Figure 6E). These results indicated that proper neuronal guidance and muscle fiber distribution could potentially contribute to facilitating accurate glial-to-muscle projections.

      (6) Lines 432/478: The conclusion about neuronal and muscle guidance on glial projections is similarly speculative, lacking functional evidence. It is possible that the morphological defects of estrella+ cells after bcat1 RNAi are caused by Wnt signaling directly acting on estrella+ cells independent of muscles or neurons.

      We understand that this approach is insufficient and we have revised the this section as follows: “Further investigation is required to distinguish the cell-autonomous and non-autonomous effects of inr-1 RNAi and β-catenin-1 RNAi on muscle and glial cells.”

      (7) Finally, several technical issues make the results difficult to interpret. For example, in line 125, cell boundaries appear to be determined using nucleus images; in line 136, the current resolution seems insufficient to reliably trace neural connections, at least based on the images presented.

      We use two setups for imaging cells and neuron projections. For cellular resolution imaging, we utilized a 1× air objective with a numerical aperture (NA) of 0.25 and a working distance of 60 mm (OLYMPUS MV PLAPO). The voxel size used was 0.8×0.8×2.5 μm<sup>3</sup>. This configuration resulted in a resolution of 2×2×5 μm<sup>3</sup> and a spatial resolution of 0.5×0.5×1.25 μm<sup>3</sup> with 4.5× isotropic expansion. Alternatively, for sub-cellular imaging, we employed a 10×0.6 SV MP water immersion objective with 0.8 NA and a working distance of 8 mm (OLYMPUS). The voxel size used in this configuration was 0.26×0.26×0.8 μm<sup>3</sup>. As a result of this configuration, we achieved a resolution of 0.5×0.5×1.6 μm<sup>3</sup> and a spatial resolution of 0.12×0.12×0.4 μm<sup>3</sup> with a 4.5× isotropic expansion. The higher resolution achieved with sub-cellular imaging allows us to observe finer structures and trace neural connections.

      Regarding your question about cell boundaries, we have revised the manuscript to specify that the boundaries we identified are those of each nucleus.

      Reviewer #3 (Public review):

      Weaknesses:

      (1) The work would have been strengthened by a more careful consideration of previous literature. Many papers directly relevant to this work were not cited. Such omissions do the authors a disservice because in some cases, they fail to consider relevant information that impacts the choice of reagents they have used or the conclusions they are drawing.

      For example, when describing the antibody they use to label muscles (monoclonal 6G10), they do not cite the paper that generated this reagent (Ross et al PMCID: PMC4307677), and instead, one of the papers they do cite (Cebria 2016) that does not mention this antibody. Ross et al reported that 6G10 does not label all body wall muscles equivalently, but rather "predominantly labels circular and diagonal fibers" (which is apparent in Figure S5A-D of the manuscript being reviewed here). For this reason, the authors of the paper showing different body wall muscle populations play different roles in body patterning (Scimone et al 2017, PMCID: PMC6263039, also not cited in this paper) used this monoclonal in combination with a polyclonal antibody to label all body wall muscle types. Because their "pan-muscle" reagent does not label all muscle types equivalently, it calls into question their quantification of the different body wall muscle populations throughout the manuscript. It does not help matters that their initial description of the body wall muscle types fails to mention the layer of thin (inner) longitudinal muscles between the circular and diagonal muscles (Cebria 2016 and citations therein).

      Ipsilateral and contralateral projections of the visual axons were beautifully shown by dye-tracing experiments (Okamoto et al 2005, PMID: 15930826). This paper should be cited when the authors report that they are corroborating the existence of ipsilateral and contralateral projections.

      Thank you for your feedback. We have incorporated these citations and clarifications into the revised manuscript. We acknowledge the limitations of this approach and have added a statement for this limitation in the revised manuscript “It is noted that previous studies reported that 6G10 does not label all body wall muscles equivalently with the limitation of predominantly labeling circular and diagonal fibers (Scimone et al., 2017; Ross et al., 2015). Our observation may be limited by this preference.”

      (2) The proportional decrease of neurons with growth in S. mediterranea was shown by counting different cell types in macerated planarians (Baguna and Romero, 1981; https://link.springer.com/article/10.1007/BF00026179) and earlier histological observations cited there. These results have also been validated by single-cell sequencing (Emili et al, bioRxiv 2023, https://www.biorxiv.org/content/10.1101/2023.11.01.565140v). Allometric growth of the planaria tail (the tail is proportionately longer in large vs small planaria) can explain this decrease in animal size. The authors never really discuss allometric growth in a way that would help readers unfamiliar with the system understand this.

      Thank you for your feedback. We have incorporated these citations and clarifications into the revised manuscript “These findings provide evidence to support the previous prediction and consistency between different planarian species (Baguñà et al., 1981; Emili et al.,2023). Because the tail is proportionately longer in large than in small planarians, the allometric growth of the planarians can be one possibility for this decrease along with the increase in animal size. The phenomenon may also suggest the existence of a threshold in the increase of planarian neuron numbers, which may ultimately contribute to some physiological changes, such as planarian fission.”

      (3) In some cases, the authors draw stronger conclusions than their results warrant. The authors claim that they are showing glial-muscle interactions, however, they do not provide any images of triple-stained samples labeling muscle, neurons, and glia, so it is impossible for the reader to judge whether the glial cells are interacting directly with body wall muscles or instead with the well-described submuscular nerve plexus. Their conclusion that neurons are unaffected by beta-cat or inr-1 RNAi based on anti-phospho-Ser/Thr staining (Fig. 6E) is unconvincing. They claim that during regeneration "DV muscles initially regenerate into longitudinal fibers at the anterior tip" (line 373). They provide no evidence for such switching of muscle cell types, so it is unclear why they say this.

      We acknowledge that some of our conclusions were overclaimed given the current data, and we appreciate the opportunity to clarify and refine these claims in the revised manuscript. Due the technique reason, we have not achieved the triple-staining to address this concern. We hope to make a progress in our future studies. Regarding the statement that "DV muscles initially regenerate into longitudinal fibers at the anterior tip" (line 373), as addressed in our previous response, this statement was unclear. Our intent was not to imply that DV muscles switch into longitudinal fibers. Instead, we observed that muscle fibers reconnect at the wound site, with longitudinal fibers and other muscle types gradually restoring the structure. We have revised this section: “During the regeneration process, DV muscle fibers reconnect at the wound site, with longitudinal fibers and other muscle types gradually restoring the structure at the anterior tip and later integrating with circular and diagonal fibers through small DV fiber branches (Figure S5O1-O3).”

      (4) The authors show how their automated workflow compares to manual counts using PI-stained specimens (Figure S1T). I may have missed it, but I do not recall seeing a similar ground truth comparison for their muscle fiber counting workflow. I mention this because the segmented image of the posterior muscles in Figure 4I seems to be missing the vast majority of circular fibers visible to the naked eye in the original image.

      Thank you for raising this important point. We have included a ground truth comparison of our automated muscle fiber segmentation with the original image in the revised Figure S6. The original Figure S6 has been changed as Figure S7. Regarding the observation of missing circular fibers in Figure 4I, we agree that the segmentation appears to have missed a significant number of circular fibers in this particular image. This may have been due to limitations in the current parameters of the segmentation algorithm, especially in distinguishing fibers in regions of varying intensity or overlap.

      (5) It is unclear why the abstract says, "We found the rate of neuron cell proliferation tends to lag..." (line 25). The authors did not measure proliferation in this work and neurons do not proliferate in planaria.

      Thank you for pointing out this mistake. What we intended to convey was the increase in neuron number during homeostasis. We have revised the abstract “We found that the increase in neuron cell number tends to lag behind the rapid expansion of somatic cells during the later phase of homeostasis.”

      (6) It is unclear what readers are to make of the measurements of brain lobe angles. Why is this a useful measurement and what does it tell us?

      The measurement of brain lobe angles is intended to provide a quantitative assessment of the growth and morphological changes of the planarian brain during regeneration. Additionally, the relevance of brain lobe angles has been explored in previous studies, such as Arnold et al., Nature, 2016, further supporting its use as a meaningful parameter.

      (7) The authors repeatedly say that this work lets them investigate planarians at the single-cell level, but they don't really make the case that they are seeing things that haven't already been described at the single-cell level using standard confocal microscopy.

      Thank you for your comment. We agree that single-cell level imaging has been previously achieved in planarians using conventional confocal microscopy. However, our goal was to extend the application of expansion microscopy by combining C-MAP with tiling light sheet microscopy (TLSM), which allows for faster and high-resolution 3D imaging of whole-mount planarians. We have added in the discussion section: “This combination offers several key advantages over standard techniques. For example, it enables high-throughput imaging across entire organisms with a level of detail and speed that is not easily achieved using confocal methods. This approach allows us to investigate the planarian nervous system at multiple developmental and regenerative stages in a more comprehensive manner, capturing large-scale structures while preserving fine cellular details. The ability to rapidly image whole planarians in 3D with this resolution provides a more efficient workflow for studying complex biological processes.”

    1. eLife Assessment

      This study makes the fundamental discovery of the first natural animal rhodopsin that uses a chloride ion instead of an amino acid side chain as a counterion. Using a combination of biochemical and spectroscopic experiments together with QM/MM simulations, the authors identify the spectral tuning mechanism in the dark state and in the photoproduct state. The methods are sound and the results are convincing. This work will be of interest to biologists working on visual proteins and it also raises new questions about how environmental factors might affect coral opsins.

    2. Reviewer #1 (Public review):

      The chromophore molecule of animal and microbial rhodopsins is retinal which forms a Schiff base linkage with a lysine in the 7-th transmembrane helix. In most cases, the chromophore is positively charged by protonation of the Schiff base, which is stabilized by a negatively charged counterion. In animal opsins, three sites have been experimentally identified, Glu94 in helix 2, Glu113 in helix 3, and Glu181 in extracellular loop 2, where a glutamate acts as the counterion by deprotonation. In this paper, Sakai et al. investigated molecular properties of anthozoan-specific opsin II (ASO-II opsins), as they lack these glutamates. They found an alternative candidate, Glu292 in helix 7, from the sequences. Interestingly, the experimental data suggested that Glu292 is not the direct counterion in ASO-II opsins. Instead, they found that ASO-II opsins employ a chloride ion as the counterion. In the case of microbial rhodopsin, a chloride ion serves as the counterion of light-driven chloride pumps. This paper reports the first observation of a chloride ion as the counterion in animal rhodopsin. Theoretical calculation using a QM/MM method supports their experimental data. The authors also revealed the role of Glu292, which serves as the counterion in the photoproduct, and is involved in G protein activation.

      The conclusions of this paper are well supported by data, while the following aspects should be considered for the improvement of the manuscript.

      (1) Information on sequence alignment only appears in Figure S2, not in the main figures. Figure S2 is too complicated by so many opsins and residue positions. It will be difficult for general readers to follow the manuscript because of such an organization. I recommend the authors show key residues in Figure 1 by picking up from Figure S2.

      (2) Halide size dependence. The authors observed spectral red-shift for larger halides. Their observation is fully coincident with the chromophore molecule in solution (Blatz et al. Biochemistry 1972), though the isomeric states are different (11-cis vs all-trans). This suggests that a halide ion is the hydrogen-bonding acceptor of the Schiff base N-H group in solution and ASO-II opsins. A halide ion is not the hydrogen-bonding acceptor in the structure of halorhodopsin, whose halide size dependence is not clearly correlated with absorption maxima (Scharf and Engelhard, Biochemistry 1994). These results support their model structure (Figure 4), and help QM/MM calculations.

      (3) QM/MM calculations. According to Materials and Methods, the authors added water molecules to the structure and performed their calculations. However, Figure 4 does not include such water molecules, and no information was given in the manuscript. In addition, no information was given for the chloride binding site (contact residues) in Figure 4. More detailed information should be shown with additional figures in Figure SX.

      (4) Figure 5 clearly shows much lower activity of E292A than that of WT, whose expression levels are unclear. How did the authors normalize (or not normalize) expression levels in this experiment?

      (5) The authors propose the counterion switching from a chloride ion to E292 upon light activation. A schematic drawing on the chromophore, a chloride ion, and E292 (and possible surroundings) in Antho2a and the photoproduct will aid readers' understanding.

    3. Reviewer #2 (Public review):

      Summary:

      This work reports the discovery of a new rhodopsin from reef-building corals that is characterized experimentally, spectroscopically, and by simulation. This rhodopsin lacks a carboxylate-based counterion, which is typical for this family of proteins. Instead, the authors find that a chloride ion stabilizes the protonated Schiff base and thus serves as a counterion.

      Strengths:

      This work focuses on the rhodopsin Antho2a, which absorbs in the visible spectrum with a maximum at 503 nm. Spectroscopic studies under different pH conditions, including the mutant E292A and different chloride concentrations, indicate that chloride acts as a counterion in the dark. In the photoproduct, however, the counterion is identified as E292.

      These results lead to a computational model of Antho2a in which the chloride is modeled in addition to the Schiff base. This model is improved using the hybrid QM/MM simulations. As a validation, the absorption maximum is calculated using the QM/MM approach for the protonated and deprotonated E292 residue as well as the E292A mutant. The results are in good agreement with the experiment. However, there is a larger deviation for ADC(2) than for sTD-DFT. Nevertheless, the trend is robust since the wt and E292A mutant models have similar excitation energies. The calculations are performed at a high level of theory that includes a large QM region.

      Weaknesses:

      I have a couple of questions about this study:

      (1) I find it suspicious that the absorption maximum is so close to that of rhodopsin when the counterion is very different. Is it possible that the chloride creates an environment for the deprotonated E292, which is the actual counterion?

      (2) The computational protocol states that water molecules have been added to the predicted protein structure. Are there water molecules next to the Schiff base, E292, and Cl-? If so, where are they located in the QM region?

      (3) If the E292 residue is the counterion in the photoproduct state, I would expect the retinal Schiff base to rotate toward this side chain upon isomerization. Can this be modeled based on the recent XFEL results on rhodopsin?

    4. Reviewer #3 (Public review):

      Summary:

      The paper by Saito et al. studies the properties of anthozoan-specific opsins (ASO-II) from organisms found in reef-building coral. Their goal was to test if ASO-II opsins can absorb visible light, and if so, what the key factors involved are.

      The most exciting aspect of this work is their discovery that ASO-II opsins do not have a counterion residue (Asp or Glu) located at any of the previously known sites found in other animal opsins.

      This is very surprising. Opsins are only able to absorb visible (long wavelength light) if the retinal Schiff base is protonated, and the latter requires (as the name implies) a "counter ion". However, the authors clearly show that some ASO-II opsins do absorb visible light.

      To address this conundrum, they tested if the counterion could be provided by exogenous chloride ions (Cl-). Their results find compelling evidence supporting this idea, and their studies of ASO-II mutant E292A suggest E292 also plays a role in G protein activation and is a counterion for a protonated Schiff base in the light-activated form.

      Strengths:

      Overall, the methods are well-described and carefully executed, and the results are very compelling.

      Their analysis of seven ASO-II opsin sequences undoubtedly shows they all lack a Glu or Asp residue at "normal" (previously established) counter-ion sites in mammalian opsins (typically found at positions 94, 113, or 181). The experimental studies clearly demonstrate the necessity of Cl- for visible light absorbance, as do their studies of the effect of altering the pH.

      Importantly, the authors also carried out careful QM/MM computational analysis (and corresponding calculation of the expected absorbance effects), thus providing compelling support for the Cl- acting directly as a counterion to the protonated retinal Schiff base, and thus limiting the possibility that the Cl- is simply altering the absorbance of ASO-II opsins through some indirect effect on the protein.

      Altogether, the authors achieved their aims, and the results support their conclusions. The manuscript is carefully written, and refreshingly, the results and conclusions are not overstated.

      This study is impactful for several reasons. There is increasing interest in optogenetic tools, especially those that leverage G protein-coupled receptor systems. Thus, the authors' demonstration that ASO-II opsins could be useful for such studies is of interest.

      Moreover, the finding that visible light absorbance by an opsin does not absolutely require a negatively charged amino acid to be placed at one of the expected sites (94, 113, or 181) typically found in animal opsins is very intriguing and will help future protein engineering efforts. The argument that the Cl- counterion system they discover here might have been a preliminary step in the evolution of amino acid based counterions used in animal opsins is also interesting.

      Finally, given the ongoing degradation of coral reefs worldwide, the focus on these curious opsins is very timely, as is the authors' proposal that the lower Schiff base pKa they discovered here for ASO-II opsins may cause them to change their spectral sensitivity and G protein activation due to changes in their environmental pH.

    1. eLife Assessment

      This paper reports the analysis of coevolutionary patterns and dynamical information for identifying functionally relevant sites. These findings are considered important due to the broad utility of the unified framework and network analysis capable of revealing communities of key residues that go beyond the residue-pair concept. The data are solid and the results are clearly presented.

    2. Reviewer #1 (Public review):

      Summary:

      As reported above, this paper by Xu et al reports on a new method to combine the analysis of coevolutionary patterns with dynamic profiles to identify functionally important residues and reveal correlations between binding sites.

      Strengths:

      In general, coevolutionary analysis and MD analysis are carried out separately and while there have been attempts to compare the information provided by the two, no unified framework exists. Here, the authors convincingly demonstrate that integrating signals from Dynamics and coevolution gives information that substantially overcomes the one provided by either method in isolation. While other methods are useful, they do not capture how dynamics is fundamental to define function and thus sculpts coevolution, via the 3D structure of the protein. At the same time, the authors demonstrate how coevolution in turn also influences internal dynamics. The Networks they rebuild unveil information at an even higher level: the model starts pairwise but through network representation the authors arrive to community analysis, reporting on interaction patterns that are larger than simple couples.

      Comments on latest version:

      I have nothing to add to this revision. The paper looks excellent and very interesting.

    3. Reviewer #2 (Public review):

      Summary:

      The authors introduced a computational framework, DyNoPy, that integrates residue coevolution analysis with molecular dynamics (MD) simulations to identify functionally important residues in proteins. DyNoPy identifies key residues and residue-residue coupling to generate an interaction graph and attempts to validate using two clinically relevant β-lactamases (SHV-1 and PDC-3).

      Strengths:

      DyNoPy could not only show clinically relevance of mutations but also predict new potential evolutionary mutations. Authors have provided biologically relevant insights into protein dynamics which can have potential applications in drug discovery and understanding molecular evolution.

      Comments on latest version:

      I appreciate the efforts of the authors to address my comments.

    4. Reviewer #3 (Public review):

      Summary:

      In this paper, Xu, Dantu and coworkers report a protocol for analyzing coevolutionary and dynamical information to identify a subset of communities that capture functionally relevant sites in beta-lactamases.

      Strengths:

      The combination of coevolutionary information and metrics from MD simulations is interesting for capturing functionally relevant sites, which can have implications in the fields of drug discovery but also in protein design.

      Comments on latest version:

      The authors have successfully addressed all my previous comments/concerns. I am happy with the current version of the manuscript.

    5. Author response:

      The following is the authors’ response to the original reviews.

      Reviewer #1 (Public review):

      Summary:

      As reported above, this paper by Xu et al reports on a new method to combine the analysis of coevolutionary patterns with dynamic profiles to identify functionally important residues and reveal correlations between binding sites.

      Strengths:

      In general, coevolutionary analysis and MD analysis are carried out separately and while there have been attempts to compare the information provided by the two, no unified framework exists. Here, the authors convincingly demonstrate that integrating signals from Dynamics and coevolution gives information that substantially overcomes the one provided by either method in isolation. While other methods are useful, they do not capture how dynamics is fundamental to define function and thus sculpts coevolution, via the 3D structure of the protein. At the same time, the authors demonstrate how coevolution in turn also influences internal dynamics. The Networks they rebuild unveil information at an even higher level: the model starts pairwise but through network representation the authors arrive to community analysis, reporting on interaction patterns that are larger than simple couples.

      Weaknesses:

      The authors should

      - Make an effort in suggesting/commenting the limits of applicability of their method;

      We have added a sentence on Page 17, line 15 that describes the limitation of our method.

      - Expand discussion on how DyNoPy compares to other methods;

      A paragraph has been added to explain the comparison with other models (Page 3, line 18)

      - Dynamic is not essential in all systems (structural proteins): The authors may want to comment on possible strategies they would use for other systems where their framework may not be suitable/applicable.

      We agree with the reviewer that dynamics is not essential in all systems. In systems where there is limited role of dynamics in the function, the analysis done with DyNoPy is equivalent to conventional coevolution analysis, which can be consider one limitation of our method. Conversely, for dynamic proteins, combining functional dynamics descriptors with coevolution analysis using DyNoPy, helps in denoising information by deconvolution of communities. We have included this in the manuscript to highlight the suitability/applicability of the method.

      Further, we have added a paragraph in the Introduction and conclusions highlighting the main difference between DyNoPy and existing computational tools like DCCM, KIN, and SPM and for your convenience it is provided below:

      “Functional sites are often regulated by both, local and global interactions. Changes in these interactions are instrumental for functional events like substrate binding, catalysis, and conformational changes (18). The development of physical models of protein dynamics and the increase in available computational power has stimulated the adoption of computational techniques (19, 20) to investigate the conformational dynamics of proteins, an essential component of the many biological functions (21, 22). Different models have been proposed to describe the interactions between residues during simulations and network models have been particularly popular,  including methods on single structures and MD simulations data built by analysing the response to external forces on residue networks (23), by estimating the prevalence of non-covalent energy interaction networks in homologous proteins (24), or by analysing linear or non-linear correlation in atomic fluctuations (25, 26). These techniques have demonstrated their usefulness in extracting allosteric networks from structural data with applications in enzyme design (26).”

      Reviewer #2 (Public review):

      Summary:

      Authors introduced a computational framework, DyNoPy, that integrates residue coevolution analysis with molecular dynamics (MD) simulations to identify functionally important residues in proteins. DyNoPy identifies key residues and residue-residue coupling to generate an interaction graph and attempts to validate using two clinically relevant β-lactamases (SHV-1 and PDC-3).

      Strengths:

      DyNoPy could not only show clinically relevance of mutations but also predict new potential evolutionary mutations. Authors have provided biologically relevant insights into protein dynamics which can have potential applications in drug discovery and understanding molecular evolution.

      Weaknesses:

      Although DyNoPy could show the relevance of key residues in active and non-active site residues, no experiments have been performed to validate their predictions.

      We thank the reviewer for highlighting this point. We acknowledge that direct experimental validation of our predictions for DyNoPy has not yet been performed. However, we have provided explanations and evidence from experiments conducted on closely related homologs to support the relevance of key residues. These homologs share significant structural and functional similarity, which strengthens the reliability of our predictions.

      In addition, they should compare their method with conventional techniques and show how their method could be different.

      We thank all the reviewers for highlighting this oversight on our behalf. In Introduction and conclusion, we have added the following paragraphs:

      “Functional sites are often regulated by both, local and global interactions. Changes in these interactions are instrumental for functional events like substrate binding, catalysis, and conformational changes (18). The development of physical models of protein dynamics and the increase in available computational power has stimulated the adoption of computational techniques (19, 20) to investigate the conformational dynamics of proteins, an essential component of the many biological functions (21, 22). Different models have been proposed to describe the interactions between residues during simulations and network models have been particularly popular,  including methods on single structures and MD simulations data built by analysing the response to external forces on residue networks (23), by estimating the prevalence of non-covalent energy interaction networks in homologous proteins (24), or by analysing linear or non-linear correlation in atomic fluctuations (25, 26). These techniques have demonstrated their usefulness in extracting allosteric networks from structural data with applications in enzyme design (26). ”

      An explanation of "communities" divided in the work and how these communities are relevant to the article should be provided. In addition, choice of collective variables and their relevance in residue coupling movement is also not very well explained. Dynamics cross correlation map can also be a good method for understanding the residue movements and can explain the residue-residue coupling, it is not explained how DyNoPy is different from the conventional methods or can perform better.

      The following sentences have been included in the manuscript to address the questions raised by the reviewer:

      On Community Definition and relevance

      DyNoPy identified coevolving residue pairs (scaled coevolution score >1) with interactions strongly correlated with protein functional motions (i.e., J values larger than zero). Applying network analysis on the combined dynamics-coevolution matrix helps us extracting higher-order interactions beyond pairwise coupling and detecting critical residues, which show multiple interactions with each other. Moreover, indirect long-range relationships, which would be hard to identify from numerical data, could be detected through community clustering. Community-based analysis offers a more comprehensive understanding of residue relationships and enables the visualization of residue couplings on the protein structure.

      On Choice of collective variables:

      DyNoPy works on the assumption that time-dependent interactions between critical residues, either having significant structural change or not will correlate with functional conformational motions. Since MD simulation data is high-dimensional, a time-dependent dynamic descriptor is required to extract the most relevant information for the process under study. A good collective variable (CV) should appropriately describe protein functional motions. Thus, a CV that detects the highest number of residue couplings is expected to be the most suitable descriptor (Mentioned in Page 22 Line 14). In our study, we tested 12 CVs, either focusing on the entire protein or on selected regions. And the best performed CV (the one identified the most residue couplings) was selected for further analysis. In practical applications, users can decide whether to focus on the most relevant global or local dynamics descriptor  depending on the dynamics of their specific system.

      We have added a paragraph in the Introduction differentiating DyNoPy with other methods including DCCM. DCCM differs from DyNoPy in two aspects 1) it does not account for inter-residue coevolution 2) the correlation matrix captures correlations of atomic/residue movements associated with the whole intrinsic dynamics of the system, without filtering for the contributions to the important motions involved in the biological function. Additionally, any residue pair contributing to functional motion without itself undergoing any structural change will not be visible in this approach.

      In the sentence "DyNoPy identified eight significant communities of strongly coupled residues within SHV-1 (Supporting Fig. S4A)" I could not find a clear description of eight significant communities.

      The following sentences have been included in the results, methods and figure legends that define ‘significant community’:

      ‘DyNoPy identified eight meaningful communities, each consisting of at least three strongly coupled residues within SHV-1 (Supplementary Fig. S4A). All crucial catalytic residues and critical substitution sites previously mentioned participating in one of these communities with the exceptions of R<sub>43</sub>, R<sub>202</sub>, and S<sub>130</sub>.’ (Page 8 Line 28)

      ‘A meaningful community should contain at least three residues.’ (Page 21 Line 2)

      ‘A reasonable residue community should contain at least three residues.’ (SI Page 11)

      Again the description of communities is not clear to me in the following sentence "Detailed description of the other three communities is provided in the supporting information (Fig. S6)."

      This following sentence has been rewritten.

      ‘Detailed description of communities with secondary importance for protein function (community 3, 8, and 9) is provided in the supplementary information (Supplementary Fig. S6).’ (Page 9, line 8)

      In the sentence "N170 acts as an intermediary between N136 and E166". Kindly cite the reference figure to show N179 as intermediate residue.

      This sentence has been rewritten to avoid any confusion.

      ‘Although DyNoPy did not detect this direct interaction between N136 and E166, the established relationship between N136 and N170 highlights the role of N136 in influencing E166.’ (Page 10 Line 8)

      Please be careful with the numbers. In the sentence "These residues not only interact with each other directly but are also indirectly coupled via 21 other residues." I could count 22 other residues and not 21.

      We thank the reviewer for spotting this error. This has now been corrected. All the communities are counted again.

      ‘These residues not only interact with each other directly but are also indirectly coupled via 22 other residues.’ (Page 12 Line 14)

      In the sentence "Unlike other substitution sites that are adjacent to the active site, R<sub>205</sub> is situated more than 16 Å away from catalytic serine S<sub>70</sub>". Please add this label somewhere in the figure.

      The figure legends have been updated to include this. Distances have been added to community 4 Fig. 3 and community 6 Fig. 4. Residue index in the legend of Fig.3 has been included as subscript. Distance in the main text has been changed to be more accurate.

      ‘G<sub>156</sub> and A<sub>146</sub> are two functional important residues distant from the active site. G<sub>156</sub> is 21.3Å away from the catalytic S<sub>70</sub>. A<sub>146</sub> is 16.8Å away from S<sub>70</sub>.’ (Page 12 Line 2)

      ‘R<sub>205</sub> is a functional important residue that is 20.6Å away from the active site S<sub>70</sub>.’ (Page 13 Line 10)

      Please cite a reference in the sentence "This indicates that mutations on G238 would result in an alteration on protein catalytic function, as well as an increased flexibility of the protein, which strongly aligns with previous finding."

      The citation has been added

      ‘This indicates that mutations on G238 would result in an alteration on protein catalytic function, as well as an increased flexibility of the protein, which strongly aligns with previous finding (62).’ (Page 15 Line 2)

      Reviewer #3 (Public review):

      Summary:

      In this paper, Xu, Dantu and coworkers report a protocol for analyzing coevolutionary and dynamical information to identify a subset of communities that capture functionally relevant sites in beta-lactamases.

      Strengths:

      The combination of coevolutionary information and metrics from MD simulations is interesting for capturing functionally relevant sites, which can have implications in the fields of drug discovery but also in protein design.

      Weaknesses:

      The combination of coevolutionary information and metrics from MD simulations is not new as other protocols have been proposed along the years (the current version of the paper neglects some of them, see below), and there are a few parameters of the protocol that, in my opinion, should be better analyzed and discussed.

      (1) As mentioned, the introduction of the paper lacks some important publications in the field of using graph theory to represent important interaction networks extracted from MD simulations (DOI: 10.1002/pro.4911), and also combining MD data with MSA to identify functionally relevant sites for enzyme design (doi: 10.1021/acscatal.4c04587, 10.1093/protein/gzae005).

      We are very grateful for pointing us to these references. We have added a paragraph in the Introduction mentioning these and other computational tools similar to DyNoPy. Further, in conclusion we have highlighted the differences between DyNoPy and existing tools.

      (2) The matrix used to apply graph theory (J_ij) is built from summing the scaled coevolution and degree of correlation values. The alpha and beta weights are defined, and the authors mention that alpha is set to 0.5, thus beta as well to fulfil with the alpha + beta = 1. Why a value of 0.5 has been selected? How this affects the overall results and conclusions extracted? The finding that many catalytically relevant residues are identified in the communities is not surprising given that such sites usually present a high conservation score.

      This is an excellent question. Our present formulation allows the user to easily assess the influence of coevolution and dynamic couplings on the output. Setting alpha to 0.5, weights both evolutionary and dynamics information equally and has shown promising results in SHV-1 and PDC-3. As it has been presented in the manuscript, setting alpha to 1, i.e., purely utilising coevolution data does not let us identify critical residues effectively as all residues are included in the set (Supplementary Fig. S4 and S5). In future work, we would like to investigate the effect of scanning alpha from 0 to 1 on the final residue list, possibly on a larger set of proteins and protein families.

      We would also like to point out that some of the residue pairs with coevolution scores in the top 1% have J-scores set to 0, as they lacked significant coupling to the functional dynamics.

      (3) Another important point that needs further explanation is the selection of the relevant descriptor of protein dynamics. In this study two different strategies have been used (one more global the other more local), but more details should be provided regarding their choice. What is the best strategy according to the authors? Why not using the same strategy for both related systems? The obtained results using one methodology or the other will have a large impact on the dynamical score. Another related point is: what is the impact of the MD simulation length, how the MSA is generated and number of sequences used for MSA construction?

      As in the case of many complex proteins, the flow of information occurs in β-lactamases via structural interactions (https://doi.org/10.7554/eLife.66567). These interactions occur both on a local level, as in the case of binding site residues or residues immediately surrounding the binding site; however, there are interactions far away (>20Å) from the binding site that have the ability to alter function. We have obtained this information from extensive surveys of clinical isolates and experimental data. To account for such interactions, a more global approach has to be taken. To answer the reviewer’s question: each system is unique and there is no one-fixed strategy. In short, the method used should be able to denoise information and the user is advised to fine-tune their findings by corroborating with experimental and clinical information.

      The length of MD simulations is also system specific. Some systems effectively sample the functional dynamics within a shorter simulation time, while others take a long timescale MD simulation to converge. The results won’t change as long as the simulation has effectively sampled the functional dynamics associated with biological function.

      The MSA is generated by the HH-Suite package as mentioned on Page 19 Line 19. More specifically, the MSA is constructed based on the UniRef30 database, where sequences are clustered, and each cluster contains sequences with at least 30% sequence identity. This provides a non-redundant set of protein sequences. Our package allows the automatic generation of MSAs from the database. For SHV-1, the alignment contains 18,175 protein sequences and for PDC-3, the alignment consists of 27,892 protein sequences. Full details of this protocol are published in Bibik et al. (https://doi.org/10.1093/bioinformatics/btae166). We have revised the methods section to include these details.

      Other Minor Alterations

      ‘Fig. S1 and S2’ has been changed to ‘Supplementary Fig. S1 and S2’ for consistency (Page 6 Line 12)

      (1) ‘Figure 5B’ has been changed to ‘Fig. 5B’ for consistency (Page 16 Line 11)

      (2) All the ‘Figure’ has been changed to ‘Fig.’ in the SI for consistency

      (3) Just as the suggestion, an alteration has been made on the Step 1 of Fig.1.

    1. eLife Assessment

      CellDetective is a useful software package for segmentation, tracking, and analysis of time‐lapse microscopy datasets, specifically designed to be accessible to researchers without coding expertise. The authors provide solid evidence of its capabilities through comprehensive validations and well‐executed comparisons across immunological assays. However, the current implementation is limited to 2D widefield imaging and presents technical challenges - including occasional crashes, restricted flexibility in defining multiple cell populations, and some interface issues that hinder the full user experience. Overall, this work will be of significant interest to the bioimaging community, especially those in immunology and cell biology, and promises to evolve into a more robust tool with further development.

    2. Reviewer #1 (Public review):

      Summary:

      In this manuscript, Torro et al. presented CellDetective, an open-source software designed for a user-friendly execution of single-cell segmentation, tracking, and analysis of time-lapse microscopy data. The authors demonstrated the applications of the software by measuring NK cell spreading events acquired with reflection interference contrast microscopy (RICM), as well as detecting target cell death events and their interaction with neighboring NK cells in a multichannel widefield microscopy dataset.

      Strengths:

      The segmentation (StarDist, Cellpose) and tracking (bTrack) modules implemented were based on existing and published software packages. The authors added the event detection, classification, and analysis modules to enable an end-to-end time-lapse microscopy data processing and analysis pipeline, complete with a graphical user interface (GUI). This minimizes the coding experience required from the user. The documentation that accompanies CellDetective is also adequate.

      Weaknesses:

      Given that the software was designed to improve user experience, such an approach also limits its scope and functionality and is currently capable of handling very specific types of experiments. Additionally, this reviewer has also encountered many technical difficulties (see documented bugs/crashes below) that have prevented an extensive exploration of all the functionality of CellDetective.

      Specifics:

      (1) The software can only handle 2D 'widefield' time-lapse imaging datasets. It should be noted that many studies that examine cell-cell interactions in vitro also used confocal microscopy and acquired the time-lapse images in 3D z-stacks to enable the reconstruction of entire cell volumes from multiple optical sections along the z-axis.

      Given that almost all of the implemented segmentation (StarDist, Cellpose) and tracking (bTrack) packages already support the handling of 3D datasets, it is unclear why CellDetective was designed to only work with 2D datasets.

      As noted above, extending the support for 3D images would allow the scope and utility of this software to be further extended for imaging studies acquired in z-stacks. As an example, the dense clustering of effector cells in Figure 4 had prevented accurate segmentation due to the 2D nature of the experimental dataset. More importantly, support for a 3D dataset could also allow for the tracking of fluorescent protein-based sub-cellular as well as membrane protein localization during cell-cell interactions.

      Furthermore, it also widens the potential applicability for analyzing datasets from 3D organoid imaging and perhaps even intravital two-photon microscopy.

      (2) The software in its current form only allows the broad demarcation of the cells examined into two populations: targets and effectors. This limits the number of cell populations that can be examined for their interactions. It might be more useful to just allow multiple user-defined populations instead of restricting the populations to target and effector cells only.

      (3) Similarly, subsetting of each of the populations could be made more intuitive. Although it is possible to define subsets of cells using the "Custom classification" function under the "Measure" module with user-defined parameters, visualization of multiple groups remains unintuitive and it appears that only one custom classified group can be selected and visualized at any given time in the Signal Annotator under Measurement instead of allowing visualization of multiple (custom defined) groups of cells in different colors. It is also unclear how, if possible at all, to visualize a custom group of cells in the Signal Annotator under the Detect Events module.

      Software issues:

      (4) When initially tested on v1.3.9, the Segment module could not be initiated (with the error message AttributeError: 'WindowsPath' object has no attribute 'endswith' when attempting to run segmentation).<br /> Update: this has been fixed in v1.3.9.post4 dated February 7th, 2025.

      (5) Further testing was then performed by downgrading the software to v1.3.1. While testing the ADCC demo experiment (https://celldetective.readthedocs.io/en/latest/adcc-example.html), the workflow was stuck at attempts to initiate the Detect Events step:

      AssertionError: No signal matches with the requirements of the model ['dead_nuclei_channel_mean', 'area']. Please pass the signals manually with the argument selected_signals or add measurements. Abort.

      (Update: fixed in the latest v1.3.9.post4 version dated February 7th, 2025)

      (6) Random bugs causing the software to crash. Example: switching characteristic to 'status_color' in the Signal Annotator under Measurement caused the software to crash (v1.3.9.post4):

      TypeError: ufunc 'isnan' is not supported for the input types, and the inputs could not be safely coerced to any supported types according to the casting rule 'safe'

      (7) Overall, when exploring the functionality of the software, there have been multiple instances of software crashes when clicking/switching around to show different parameters, etc.

      This reviewer understands the difficulties and time involved in bug fixing and hopes that the experience could have been much smoother and that the software behaves much more stably in order to maximize its useability.

    3. Reviewer #2 (Public review):

      Summary:

      Immune assays enable the analysis of immune responses in vitro. These assays generate time series image data across several experimental conditions. The imaging parameters such as the imaging modality and the number of channels can vary across experiments. A challenge in the field is the lack of (open source) tools to process and analyze these data. R. Torro, et. al. developed an open source end-to-end pipeline for the analysis of image data from these immune assays. The pipeline is designed with a GUI and is suited for experimental biologists with no coding experience. The authors have incorporated several existing methods and tools for individual tasks such as for segmentation and cell tracking, and incorporated them with custom methods where necessary such as for tracking cell state transitions.

      Strengths:

      (1) The tool is extremely well-documented and easy to install.

      (2) Applicable to a wide variety of imaging modalities and analysis.

      (3) There are several different options for each step, such as segmentation using traditional methods or deep learning methods, and all the analysis steps are integrated in one place with a GUI. The no-coding requirement makes this a very powerful tool for biologists and has the potential to enable a wide variety of analyses.

      Weakness:

      (1) It would be good to provide documentation on how to make the tool applicable for applications and analysis other than for immune profiling since most methods integrated here are applicable well beyond immune profiling. For example, a user might want to use the tool just for the segmentation of their IF microscopy-images.

      (2) They applied Celldetective to two immune assays. The authors present the results from these assays and use the results to validate their assay. However, they have not included data that demonstrates results obtained via this pipeline are comparable to results obtained with other pipelines and/or if these results are consistent with what is expected in the literature.

    4. Author response:

      In view of the suggestions of the referees, we wish to underline that a user can interact with celldetective at two levels: a non-coder can analyse data and train models without coding, but is necessarily offered pre-determined choices and flexibility. An advanced user however has practically limitless flexibility to extend the fully-open source celldetective, aided by its modularity and detailed manual.

      Public Reviews:

      Reviewer #1 (Public review):

      Summary:

      In this manuscript, Torro et al. presented CellDetective, an open-source software designed for a user-friendly execution of single-cell segmentation, tracking, and analysis of time-lapse microscopy data. The authors demonstrated the applications of the software by measuring NK cell spreading events acquired with reflection interference contrast microscopy (RICM), as well as detecting target cell death events and their interaction with neighboring NK cells in a multichannel widefield microscopy dataset.

      Strengths:

      The segmentation (StarDist, Cellpose) and tracking (bTrack) modules implemented were based on existing and published software packages. The authors added the event detection, classification, and analysis modules to enable an end-to-end time-lapse microscopy data processing and analysis pipeline, complete with a graphical user interface (GUI). This minimizes the coding experience required from the user. The documentation that accompanies CellDetective is also adequate.

      Weaknesses:

      Given that the software was designed to improve user experience, such an approach also limits its scope and functionality and is currently capable of handling very specific types of experiments. Additionally, this reviewer has also encountered many technical difficulties (see documented bugs/crashes below) that have prevented an extensive exploration of all the functionality of CellDetective.

      We apologize for the technical difficulties and bugs; the ones mentioned have been already corrected. New users have also tested the installation and reported it to be bug-free.

      We fully agree on the compromise that has to be found between user experience and versatility. We have already tested celldetective in other biological contexts, such as microbiology, but made a choice to showcase it in the article for immunological applications. We invite the reader to consult the software documentation and online examples to learn about more options.

      Specifics:

      (1) The software can only handle 2D 'widefield' time-lapse imaging datasets. It should be noted that many studies that examine cell-cell interactions in vitro also used confocal microscopy and acquired the time-lapse images in 3D z-stacks to enable the reconstruction of entire cell volumes from multiple optical sections along the z-axis.

      Given that almost all of the implemented segmentation (StarDist, Cellpose) and tracking (bTrack) packages already support the handling of 3D datasets, it is unclear why CellDetective was designed to only work with 2D datasets.

      As noted above, extending the support for 3D images would allow the scope and utility of this software to be further extended for imaging studies acquired in z-stacks. As an example, the dense clustering of effector cells in Figure 4 had prevented accurate segmentation due to the 2D nature of the experimental dataset. More importantly, support for a 3D dataset could also allow for the tracking of fluorescent protein-based sub-cellular as well as membrane protein localization during cell-cell interactions.

      Furthermore, it also widens the potential applicability for analyzing datasets from 3D organoid imaging and perhaps even intravital two-photon microscopy.

      We thank the reviewer for this suggestion. Indeed, extension to 3-dimensions is a natural development, since we have chosen segmentation and tracking methods which are compatible with 3D. However, two important strengths of celldetective are: harnessing statistical power of cell populations together with multiplexing biological conditions, and dynamic analysis of fast events.

      For both, 2D is advantageous. Our own focus is on analyzing cellular events with minute time resolution, relevant in immunology. By our estimate (experience and literature), 3D timelapse acquisition would reduce the time resolution, as well as throughput (in terms of events and conditions) to below acceptable level. While we don’t envisage this upgrade in the immediate future, we encourage advanced users to contribute to further develop the open-source code in this direction. As a mitigation solution, a 2.5D approach on a flat sample by combining two z planes (in order to address issues of cell superposition for example), could be readily implemented with minimal change.

      (2) The software in its current form only allows the broad demarcation of the cells examined into two populations: targets and effectors. This limits the number of cell populations that can be examined for their interactions. It might be more useful to just allow multiple user-defined populations instead of restricting the populations to target and effector cells only.

      We thank the reviewer for this suggestion. There is little architectural limitation to its implementation; this will be proposed in the future version. This updated version will allow more than two user-defined populations, labelled directly by the user, which will also facilitate the natural extension to more varied biological applications. Three-way interactions are much more complex, and, to our knowledge, not currently addressed by biologists. The interactions will for the moment be limited to 2 populations interactions, as multipartite ones involve a higher level of code modifications, not immediately envisaged.

      (3) Similarly, subsetting of each of the populations could be made more intuitive. Although it is possible to define subsets of cells using the "Custom classification" function under the "Measure" module with user-defined parameters, visualization of multiple groups remains unintuitive and it appears that only one custom classified group can be selected and visualized at any given time in the Signal Annotator under Measurement instead of allowing visualization of multiple (custom defined) groups of cells in different colors. It is also unclear how, if possible at all, to visualize a custom group of cells in the Signal Annotator under the Detect Events module.

      The simultaneous visualization of several classes poses problems in the choice of colors and symbols, and may render the tool difficult to use. The time propagation option in the classification tool allows to define event classes as opposed to groups, that are compatible with the Signal Annotator. For more complex classifications, a simple solution is to work with composite classifications, which are already supported by using logical AND/OR operators on the condition defining the class. We believe that this feature is sufficient to address this issue.

      Software issues:

      (4) When initially tested on v1.3.9, the Segment module could not be initiated (with the error message AttributeError: 'WindowsPath' object has no attribute 'endswith' when attempting to run segmentation).

      Update: this has been fixed in v1.3.9.post4 dated February 7th, 2025.

      (5) Further testing was then performed by downgrading the software to v1.3.1. While testing the ADCC demo experiment (https://celldetective.readthedocs.io/en/latest/adcc-example.html), the workflow was stuck at attempts to initiate the Detect Events step:

      AssertionError: No signal matches with the requirements of the model ['dead_nuclei_channel_mean', 'area']. Please pass the signals manually with the argument selected_signals or add measurements. Abort.

      (Update: fixed in the latest v1.3.9.post4 version dated February 7th, 2025)

      (6) Random bugs causing the software to crash. Example: switching characteristic to 'status_color' in the Signal Annotator under Measurement caused the software to crash (v1.3.9.post4):

      TypeError: ufunc 'isnan' is not supported for the input types, and the inputs could not be safely coerced to any supported types according to the casting rule 'safe'

      (7) Overall, when exploring the functionality of the software, there have been multiple instances of software crashes when clicking/switching around to show different parameters, etc.

      This reviewer understands the difficulties and time involved in bug fixing and hopes that the experience could have been much smoother and that the software behaves much more stably in order to maximize its useability.

      We apologize again for the various technical issues encountered during the review process, and thank the reviewer for mentioning that several bugs were already fixed in the last software release. The open source and software maintenance protocol enabled by github should help to resolve any further emerging issue.

      Reviewer #2 (Public review):

      Summary:

      Immune assays enable the analysis of immune responses in vitro. These assays generate time series image data across several experimental conditions. The imaging parameters such as the imaging modality and the number of channels can vary across experiments. A challenge in the field is the lack of (open source) tools to process and analyze these data. R. Torro, et. al. developed an open source end-to-end pipeline for the analysis of image data from these immune assays. The pipeline is designed with a GUI and is suited for experimental biologists with no coding experience. The authors have incorporated several existing methods and tools for individual tasks such as for segmentation and cell tracking, and incorporated them with custom methods where necessary such as for tracking cell state transitions.

      Strengths:

      (1) The tool is extremely well-documented and easy to install.

      (2) Applicable to a wide variety of imaging modalities and analysis.

      (3) There are several different options for each step, such as segmentation using traditional methods or deep learning methods, and all the analysis steps are integrated in one place with a GUI. The no-coding requirement makes this a very powerful tool for biologists and has the potential to enable a wide variety of analyses.

      Weakness:

      (1) It would be good to provide documentation on how to make the tool applicable for applications and analysis other than for immune profiling since most methods integrated here are applicable well beyond immune profiling. For example, a user might want to use the tool just for the segmentation of their IF microscopy-images.

      This is an important suggestion that we will implement as short demonstrations using data from the public domain. These will be proposed as examples in the online documentation.

      (2) They applied Celldetective to two immune assays. The authors present the results from these assays and use the results to validate their assay. However, they have not included data that demonstrates results obtained via this pipeline are comparable to results obtained with other pipelines and/or if these results are consistent with what is expected in the literature.

      In the final version of the article, we shall compare celldetective with existing literature, including our previous work, when possible. However, we emphasize that most of the presented data are original and don’t have any published equivalent in the literature. Concerning the immunotherapy assays, data presented already show expected trends (see for example Fig. 2 and Fig. 5). We reserve for future publications the systematic comparison with traditional (non microscopy-based) methods, as we consider it out-of-scope here. Additionally, there is, to our knowledge no existing open pipeline performing the full end-to-end analysis.

    1. eLife Assessment

      This manuscript describes the characterization of the conformational dynamics of two chemokine receptors at the single-molecule level using FRET. The authors make a convincing case for attributing the distinct interaction and pharmacology of the two receptors to differences in their conformational energy landscape. These important findings will be of interest to scientists working on activation mechanisms of GPCRs and signal transduction.

    2. Joint Public Review:

      Summary

      This manuscript uses single-molecule fluorescence resonance energy transfer (smFRET) to identify differences in the molecular mechanisms of CXCR4 and ACKR3, two 7-transmembrane receptors that both respond to the chemokine CXCL12 but otherwise have very different signaling profiles. CXCR4 is highly selective for CXCL12 and activates heterotrimeric G proteins. In contrast, ACKR3 is quite promiscuous and does not couple to G proteins, but like most G protein-coupled receptors (GPCRs), it is phosphorylated by GPCR kinases and recruits arrestins. By monitoring FRET between two positions on the intracellular face of the receptor (which highlight the movement of transmembrane helix 6 [TM6], a key hallmark of GPCR activation), the authors show that CXCR4 remains mostly in an inactive-like state until CXCL12 binds and stabilizes a single active-like state. ACKR3 rapidly exchanges among four different conformations even in the absence of ligand, and agonists stabilize multiple activated states.

      Strengths

      The core method employed in this paper, smFRET, can reveal dynamic aspects of these receptors (the breadth of conformations explored and the rate of exchange among them) that are not evident from static structures or many other biophysical methods. smFRET has not been broadly employed in studies of GPCRs. Therefore, this manuscript makes important conceptual advances in our understanding of how related GPCRs can vary in their conformational dynamics.

      Weaknesses

      The probes used cannot reveal conformational changes in other positions besides transmembrane helix 6 (TM6). GPCRs are known to exhibit loose allosteric coupling, so the conformational distribution observed at TM6 may not fully reflect the global conformational distribution of receptors. This could mask important differences that determine the ability of intracellular transducers to couple to specific receptor conformations.

      While it is clear that CXCR4 and ACKR3 have very different conformational dynamics, the data do not definitely show that this is the main or only mechanism that contributes to their functional differences.

      The extent to which conformational heterogeneity is a characteristic feature of ACKRs that contributes to their promiscuity and arrestin bias is unclear. The key residue the authors find promotes ACKR3 conformational heterogeneity is not conserved in most other ACKRs, but alternative mechanisms could generate similar heterogeneity.

      An inherent limitation of the approach is that mutagenesis, purification, and labeling of the receptors could affect their conformational distributions. The cysteine mutations in ACKR3 required to site-specifically install fluorophores substantially increase its ligand-induced activity (Fig. S1D). There are no data to confirm that the two receptors retain the same functional profiles observed in cell-based systems following in vitro manipulations (purification, labeling, nanodisc reconstitution).

    3. Author response:

      The following is the authors’ response to the original reviews.

      Reviewer #1 (Public Review):

      Summary:

      This paper uses single-molecule FRET to investigate the molecular basis for the distinct activation mechanisms between 2 GPCR responding to the chemokine CXCL12 : CXCR4, that couples to G-proteins, and ACKR3, which is G-protein independent and displays a higher basal activity.

      Strengths:

      It nicely combines the state-of-the-art techniques used in the studies of the structural dynamics of GPCR. The receptors are produced from eukaryotic cells, mutated, and labeled with single molecule compatible fluorescent dyes. They are reconstituted in nanodiscs, which maintain an environment as close as possible to the cell membrane, and immobilized through the nanodisc MSP protein, to avoid perturbing the receptor's structural dynamics by the use of an antibody for example.

      The smFRET data are analysed using the HHMI technique, and the number of states to be taken into account is evaluated using a Bayesian Information Criterion, which constitutes the state-of-the-art for this task.

      The data show convincingly that the activation of the CXCR4 and ACKR3 by an agonist leads to a shift from an ensemble of high FRET states to an ensemble of lower FRET states, consistent with an increase in distance between the TM4 and TM6. The two receptors also appear to explore a different conformational space. A wider distribution of states is observed for ACKR3 as compared to CXCR4, and it shifts in the presence of agonists toward the active states, which correlates well with ACKR3's tendency to be constitutively active. This interpretation is confirmed by the use of the mutation of Y254 to leucine (the corresponding residue in CXCR4), which leads to a conformational distribution that resembles the one observed with CXCR4. It is correlated with a decrease in constitutive activity of ACKR3.

      Weaknesses:

      Although the data overall support the claims of the authors, there are however some details in the data analysis and interpretation that should be modified, clarified, or discussed in my opinion

      Concerning the amplitude of the changes in FRET efficiency: the authors do not provide any structural information on the amplitude of the FRET changes that are expected. To me, it looks like a FRET change from ~0.9 to ~0.1 is very important, for a distance change that is expected to be only a few angstroms concerning the movement of the TM6. Can the authors give an explanation for that? How does this FRET change relate to those observed with other GPCRs modified at the same or equivalent positions on TM4 and TM6?

      The large FRET change in our system was initially unexpected. However, the reviewer is mistaken that the expected distance change is only a few angstroms. Crystal structures of the homologous beta2 adrenergic receptor (β<sub>2</sub>AR) in inactive and active conformations reveal that the cytoplasmic end of TM6 moves outwards by 16 angstroms during activation (Rasmussen et al., 2011, ref 47).  Consistent with this, smFRET studies of β<sub>2</sub>AR labeled in TM4 and TM6 (as here) showed that the donor-acceptor (D-A) distance was 14 angstroms longer in the active conformation (Gregorio et al., ref 38).  Surprisingly, the apparent distance change in our system (calculated for our FRET probes, A555/Cy5, using FPbase.com) is almost 30 angstroms. A possible explanation is that the fluorophore attached to TM6 interacts with lipids within the nanodisc when TM6 moves outwards, which could stretch the fluorophore linker and thereby increase the D-A distance (lipids were absent in the β<sub>2</sub>AR study). Such an interaction could also constrain the fluorophore in an unfavorable orientation for energy transfer, also leading to lower than expected FRET efficiencies and inflated distance calculations. Regardless, it is important to emphasize that none of the interpretations or conclusions of our study are based on computed D-A distances. Rather, we resolved different receptor conformations and quantified their relative populations based on the measured FRET efficiency distributions.

      Finally, we note that a recent smFRET study of the glucagon receptor (labeled in TM4 and TM6, as here) also revealed a large difference in apparent FRET efficiencies between inactive (E<sub>app</sub> = 0.83) and active (E<sub>app</sub> = 0.32) conformations (Kumar et al., ref. 39). Thus, the large change in FRET efficiency observed in our study is not unprecedented.

      Concerning the intermediate states: the authors observe several intermediate states.

      (1) First I am surprised, looking at the time traces, by the dwell times of the transitions between the states, which often last several seconds. Is such a long transition time compatible with what is known about the kinetic activation of these receptors?

      We too were surprised by the apparent kinetics of the receptors in our system. However, it was previously noted that purified systems, including nanodiscs, lead to slower activation times for GPCRs compared to cellular membrane systems (Lohse et al, Curr. Opin. Cell Biology, 27, 8792, 2014). Indeed, slow transitions among different FRET states (dwell times in the seconds range) were also observed in recent smFRET studies of the mu opioid receptor (Zhao et al., 2024, ref. 41) and the glucagon receptor (Kumar et al., 2023, ref. 39). These studies are consistent with the observed time scale of the FRET transitions reported here.

      (2) Second is it possible that these “intermediate” states correspond to differences in FRET efficiencies, that arise from different photophysical states of the dyes? Alexa555 and Cy5 are Cyanines, that are known to be very sensitive to their local environment. This could lead to different quantum yields and therefore different FRET efficiencies for a similar distance. In addition, the authors use statistical labeling of two cysteines, and have therefore in their experiment a mixture of receptors where the donor and acceptor are switched, and can therefore experience different environments. The authors do not speculate structurally on what these intermediate states could be, which is appreciated, but I think they should nevertheless discuss the potential issue of fluorophore photophysics effects.

      The reviewer is correct that the intermediate FRET states could, in principle, arise from a conformational change of the receptor that alters the local environment of the donor and/or acceptor fluorophores, rather than a change in donor-acceptor distance. This caveat is now included in the discussion on Pg. 10:

      “In principle, the intermediates in CXCR4 and ACKR3 could represent partial movements of TM6 from the inactive to active conformation or more subtle conformational changes altering the photophysical characteristics of the probes without drastically altering the donor-acceptor distance. Either possibility leads to detectable changes in apparent FRET efficiency and reflect discrete conformational steps on the activation pathway; however, it is not possible to resolve specific structural changes from the data.”

      Regarding the second possibility, it is true that our labeling methodology leads to a statistical mixture of labeled species (D on TM6 and A on TM4, D on TM4 and A on TM6). If the photophysical properties of the fluorophores were markedly different for the two labeling orientations, this would produce two different FRET efficiencies for a given receptor conformation. Assuming two receptor conformations, this scenario would produce four distinct FRET states: E<sub>1</sub> (inactive receptor, labeling configuration 1), E<sub>2</sub> (active receptor, labeling configuration 1), E<sub>3</sub> (inactive receptor, labeling configuration 2) and E<sub>4</sub> (active receptor, labeling configuration 2), with two cross peaks in the TDP plots, corresponding to E<sub>1</sub> ↔ E<sub>2</sub> and E<sub>3</sub> ↔ E<sub>4</sub> transitions. Notably, E<sub>2</sub> ↔ E<sub>3</sub> cross peaks would not be present, since states E<sub>2</sub> and E<sub>3</sub> exist on separate molecules. Instead, we see all states inter-connected sequentially, R ↔ R’ ↔ R* in CXCR4 and R ↔ R’ ↔ R*’ ↔ R* in ACKR3 (Fig. 2), suggesting that the resolved FRET states represent interconnected conformational states.

      We added the following text to the Results section on Pg. 6:

      “Two-dimensional transition density probability (TDP) plots revealed that the three FRET states were connected in a sequential fashion (Figs. 2A & B), indicating that the transitions occurred within the same molecules. Notably, these observations exclude the possibility that the midFRET state arises from different local fluorophore environments (hence FRET efficiencies) for the two possible labeling orientations of the introduced cysteines: assuming two receptor conformations, this model would produce four distinct FRET states, but only two cross peaks in the TDP plot.”

      (3) It would also have been nice to discuss whether these types of intermediate states have been observed in other studies by smFRET on GPCR labeled at similar positions.

      Intermediate states have also been reported in previous smFRET studies of other GPCRs. For example, in the glucagon receptor (also labeled in TM4 and TM6), a third FRET state (E<sub>app</sub> =  0.63) was resolved between the inactive (E<sub>app</sub>  = 0.85) and active (E<sub>app</sub>  = 0.32) states (Kumar et al., Ref. 39).  Discrete intermediate receptor conformations were also observed in the A<sub>2A</sub>R labeled in TM4 and TM6 (Fernandes et al., Ref 40). These examples are now cited in the Discussion.

      On line 239: the authors talk about the R↔R' transitions that are more probable. In fact it is more striking that the R'↔R* transition appears in the plot. This transition is a signature of the behavior observed in the presence of an agonist, although IT1t is supposed to be an inverse agonist. This observation is consistent with the unexpected (for an inverse agonist) shift in the FRET histogram distribution. In fact, it appears that all CXCR4 antagonists or inverse agonists have a similar (although smaller) effect than the agonist. Is this related to the fact that these (antagonist or inverse agonist) ligands lead to a conformation that is similar to the agonists, but cannot interact with the G-protein ?? Maybe a very interesting experiment would be here to repeat these measurements in the presence of purified G-protein. G-protein has been shown to lead to a shift of the conformational space explored by GPCR toward the active state (using smFRET on class A and class C GPCR). It would be interesting to explore its role on CXCR4 in the presence of these various ligands. Although I am aware that this experiment might go beyond the scope of this study, I think this point should be discussed nevertheless.

      We thank the reviewer for this observation and the possible explanation offered.  In response, we have added the following text to the Results section on Pg. 7:

      “The small-molecule ligand IT1t is reported to act as an inverse agonist of CXCR4 (54-56). However, the conformational distribution of CXCR4 showed little change to the overall apparent

      FRET profile, although R’ ↔ R* transitions appeared in the TDP plot (Figs. 3A & B, Fig. S8). This suggests that the small molecule does not suppress CXCR4 basal signaling by changing the conformational equilibrium. Instead IT1t appears to increase transition probabilities which may impair G protein coupling by CXCR4.”

      We have also added the following text to the Results on Pg. 8:

      “Despite the ability of CXCL12<sub>P2G</sub> and CXCL12<sub>LRHQ</sub> to stabilize the active R* conformation of CXCR4, both variants are known to act as antagonists (20). This suggests that the CXCL12 mutants inhibit CXCR4 coupling to G proteins not by suppressing the active receptor population but rather by increasing the dynamics of the receptor state transitions. Our results suggest that the helical movements considered classic signatures of the active state may not be sufficient for CXCR4 to engage productively with G proteins.”

      In addition, we have added the following text to the Discussion on Pg. 11:

      “The chemokine variants CXCL12<sub>P2G</sub> and CXCL12<sub>LRHQ</sub> are reported to act as antagonists of CXCR4 (19, 20), and the small molecule IT1t acts as an inverse agonist (54-56). Surprisingly, none of these ligands inhibit formation of the active R* conformation of CXCR4. In fact, the chemokine variants both stabilize and increase this state to some degree, although less effectively than CXCL12<sub>WT</sub>. Thus, the antagonism and inverse agonism of these ligands does not appear to be linked exclusively to receptor conformation, suggesting that the ligands inhibit coupling of G proteins to CXCR4 or disrupt the ligand-receptor-G protein interaction network required for signaling (Fig. S10) (21, 23).  Interestingly, these ligands also increase the probabilities of state-to-state transitions (Figs. 3B & 4B), suggesting that enhanced conformational exchange prevents the receptor from productively engaging G proteins. Similarly, ACKR3 is naturally dynamic and lacks G protein coupling, suggesting a common mechanism of G protein antagonism.”

      Finally, we also agree that experiments with G proteins could be informative. In fact, we initiated such experiments during the course of this study.  However, it soon became apparent that significant optimization would be required to identify fluorophore labeling positions that report receptor conformation without inhibiting G protein coupling. Accordingly, we decided that G protein experiments would be the subject of future studies.

      However, we added the following text to the Discussion on Pg. 12:

      “Future smFRET studies performed in the presence of G proteins should be informative in this regard”.

      The authors also mentioned in Figure 6 that the energetic landscape of the receptors is relatively flat ... I do not really agree with this statement. For me, a flat conformational landscape would be one where the receptors are able to switch very rapidly between the states (typically in the submillisecond timescale, which is the timescale of protein domain dynamics). Here, the authors observed that the transition between states is in the second timescale, which for me implies that the transition barrier between the states is relatively high to preclude the fast transitions.

      We thank the reviewer for the comment. We have modified the description of the energy landscapes of ACKR3 and CXCR4 in the discussion on Pg. 10 as follows:

      “These observations imply that ACKR3 has a relatively flat energy landscape, with similar energy minima for the different conformations, whereas the energy landscape of CXCR4 is more rugged (Fig. 6). For both receptors, the energy barriers between states are sufficiently high that transitions occur relatively slowly with seconds long dwell times (Figs. 1C and S2).”

      Reviewer #2 (Public Review):

      Summary:

      his manuscript uses single-molecule fluorescence resonance energy transfer (smFRET) to identify differences in the molecular mechanisms of CXCR4 and ACKR3, two 7transmembrane receptors that both respond to the chemokine CXCL12 but otherwise have very different signaling profiles. CXCR4 is highly selective for CXCL12 and activates heterotrimeric G proteins. In contrast, ACKR3 is quite promiscuous and does not couple to G proteins, but like most G protein-coupled receptors (GPCRs), it is phosphorylated by GPCR kinases and recruits arrestins. By monitoring FRET between two positions on the intracellular face of the receptor (which highlights the movement of transmembrane helix 6 [TM6], a key hallmark of GPCR activation), the authors show that CXCR4 remains mostly in an inactive-like state until CXCL12 binds and stabilizes a single active-like state. ACKR3 rapidly exchanges among four different conformations even in the absence of ligands, and agonists stabilize multiple activated states.

      Strengths:

      The core method employed in this paper, smFRET, can reveal dynamic aspects of these receptors (the breadth of conformations explored and the rate of exchange among them) that are not evident from static structures or many other biophysical methods. smFRET has not been broadly employed in studies of GPCRs. Therefore, this manuscript makes important conceptual advances in our understanding of how related GPCRs can vary in their conformational dynamics.

      Weaknesses:

      (1) The cysteine mutations in ACKR3 required to site-specifically install fluorophores substantially increase its basal and ligand-induced activity. If, as the authors posit, basal activity correlates with conformational heterogeneity, the smFRET data could greatly overestimate the conformational heterogeneity of ACKR3.

      The change in basal ACKR3 activity with the Cys introductions are modest in comparison and insignificantly different as determined by extra-sum-of-squares F test (P=0.14).

      (2) The probes used cannot reveal conformational changes in other positions besides TM6. GPCRs are known to exhibit loose allosteric coupling, so the conformational distribution observed at TM6 may not fully reflect the global conformational distribution of receptors. This could mask important differences that determine the ability of intracellular transducers to couple to specific receptor conformations.

      We agree that the overall conformational landscape of the receptors has not been investigated and we have added this caveat to the discussion on Pg. 12.

      “An important caveat is that our study does not report on the dynamics of the other TM helices and H8, some of which are known to participate in arrestin interactions.”

      (3) While it is clear that CXCR4 and ACKR3 have very different conformational dynamics, the data do not definitively show that this is the main or only mechanism that contributes to their functional differences. There is little discussion of alternative potential mechanisms.

      The main functional difference between CXCR4 and ACRK3 is their effector coupling: CXCR4 couples to G proteins, whereas ACKR3 only couples to arrestins (following phosphorylation of the C-terminal tail by GRKs). As currently noted in the discussion, ACKR3 has many features that may contribute to its lack of G protein coupling, including lack of a well-ordered intracellular pocket due to conformational dynamics, lack of an N-term-ECL3 disulfide, different chemokine binding mode, and the presence of Y257. Steric interference due to different ICL loop structures may also interfere with G protein activation. No one thing has proven to confer ACKR3 with G protein activity including swapping all of the ICLs to those of canonical chemokine receptor, suggesting it is a combination of these different factors. The following has been added to the discussion on Pg. 13 to clearly note that any one feature is unlikely to drive the atypical behavior of ACKR3:

      “The atypical activation of ACKR3 does not appear to be dependent on any singular receptor feature and is likely a combination of several factors.”

      (4) The extent to which conformational heterogeneity is a characteristic feature of ACKRs that contributes to their promiscuity and arrestin bias is unclear. The key residue the authors find promotes ACKR3 conformational heterogeneity is not conserved in most other ACKRs, but alternative mechanisms could generate similar heterogeneity.

      Despite the commonalities in the roles of the ACKRs, they all appear to have evolved independently. Thus, we do not believe that all features observed and described for one ACKR will explain the behavior of another. We have carefully avoided expanding our observations to other ACKRs to avoid suggesting common mechanisms.

      (5) There are no data to confirm that the two receptors retain the same functional profiles observed in cell-based systems following in vitro manipulations (purification, labeling, nanodisc reconstitution).

      We agree this is an important point. All labeled receptors responded to agonist stimulation as expected. As only properly folded receptors are able to make the extensive interactions with ligands necessary for conformational changes (for instance, CXCL12 interacts with all TMs and ECLs), this suggests that the proteins are folded correctly and functional following all manipulations.

      Reviewer #3 (Public Review):

      Summary:

      This is a well-designed and rigorous comparative study of the conformational dynamics of two chemokine receptors, the canonical CXCR4 and the atypical ACKR3, using single-molecule fluorescence spectroscopy. These receptors play a role in cell migration and may be relevant for developing drugs targeting tumor growth in cancers. The authors use single-molecule FRET to obtain distributions of a specific intermolecular distance that changes upon activation of the receptor and track differences between the two receptors in the apo state, and in response to ligands and mutations. The picture emerging is that more dynamic conformations promote more basal activity and more promiscuous coupling of the receptor to effectors.

      Strengths:

      The study is well designed to test the main hypothesis, the sample preparation and the experiments conducted are sound and the data analysis is rigorous. The technique, smFRET, allows for the detection of several substates, even those that are rarely sampled, and it can provide a "connectivity map" by looking at the transition probabilities between states. The receptors are reconstituted in nanodiscs to create a native-like environment. The examples of raw donor/acceptor intensity traces and FRET traces look convincing and the data analysis is reliable to extract the sub-states of the ensemble. The role of specific residues in creating a more flat conformational landscape in ACKR3 (e.g., Y257 and the C34-C287 bridge) is well documented in the paper.

      Weaknesses:

      The kinetics side of the analysis is mentioned, but not described and discussed. I am not sure why since the data contains that information. For instance, it is not clear if greater conformational flexibility is accompanied by faster transitions between states or not.

      The reviewer is correct that kinetic information is available, in principle, from smFRET experiments. However, a detailed kinetic analysis will require a much larger data set than we currently possess, to adequately sample all possible transitions and the dwell times of each FRET state. We intend to perform such an analysis in the future as more data becomes available. The purpose of this initial study was to explore the conformational landscapes of CXCR4 and ACKR3 and to reveal differences between them. To this end, we have documented major differences in conformational preferences and response to ligands of the two receptors that are likely relevant to their different biological behavior. Future kinetic information will add further detail, but is not expected to alter the conclusions drawn here.

      The method to choose the number of states seems reasonable, but the "similarity" of states argument (Figures S4 and S6) is not that clear.

      We thank the reviewer for noting a need for further clarification. We qualitatively compared the positions of the various FRET peaks across treatments to gain insight into the consistency of the conformations and avoid splitting real states by overfitting the data. For instance, fitting the ACKR3 treatments with three states leads to three distinct FRET populations for the R’ intermediate. Adding a fourth state results in two intermediates that are fairly well overlapping. In contrast, the two-intermediate model for CXCR4 appears to split the R* state of the CXCL12 treated sample and causes a general shift in both intermediate states to lower FRET values when CXCL12 is present. As we assume that the conformations are consistent throughout the treatments, we conclude that this represents an overfitting artifact and not a novel CXCL12CXCR4 R*’ state. Additional sentences have been added to the supplemental figure legend to better describe the comparative analysis.

      “(Top) With the 3-state model, the R’ states for apo-CXCR4 and for CXCL12- and IT1t-bound receptor overlapped well with similar apparent FRET values across all of the tested conditions. In the case of the four-state model, the R*’ (Middle) and R’ (Bottom) states were substantially different across the ligand treatments. In particular, the R*’ state with CXCL12 treatment appears to arise from a splitting of the R* conformation, indicating that the model was overfitting the data.”

      Also, the "dynamics" explanation offered for ACKR3's failure to couple and activate G proteins is not very convincing. In other studies, it was shown that activation of GPCRs by agonists leads to an increase in local dynamics around the TM6 labelling site, but that did not prevent G protein coupling and activation.

      We agree with the reviewer that any single explanation for ACKR3 bias, including the dynamics argument presented here, is insufficient to fully characterize the ACKR3 responses. As noted by the reviewer, the TM6 movement and dynamics is generally correlated with G protein coupling, whereas other dynamics studies (Wingler et al. Cell 2019) have noted that arrestinbiased ligands do not lead to the same degree of TM6 movement. We have added the following statement to the discussion on Pg. 13:

      “The atypical activation of ACKR3 does not appear to be dependent on any singular receptor feature and is likely a combination of several factors.” 

      Recommendations for the authors:  

      Reviewer #1 (Recommendations For The Authors):

      I would like to raise a technical point about the calculation and reporting of the FRET efficiency. The authors report the FRET efficiency as E=IA/(IA+ID). There is now a strong recommendation from the FRET community (https://doi.org/10.1038/s41592-018-0085-0) to use the term “FRET efficiency” only when a proper correction procedure of all correction factors has been applied, which is not the case here (gamma factor has not been calculated). The authors should therefore use the term “Apparent FRET Efficiency” and  E<sub>app</sub> in all the manuscripts.

      Also, it would be nice to indicate directly on the figures whether a ligand that is used is an agonist, antagonist, inverse agonist, etc...

      We thank the reviewer for suggesting this clarification in terminology. We now refer to apparent FRET efficiency (or E<sub>app</sub>) throughout the manuscript and in the figures. In addition, we have added ligand descriptions to the relevant figures.

      Reviewer #2 (Recommendations For The Authors):

      (1) M159(4.40)C/Q245(6.28)C ACKR3 appears to have higher constitutive activity than ACKR3 Wt (Fig. S1). While the vehicle point itself is likely not significant due to the error in the Wt, the overall trend is clear and arguably even stronger than the effect of Y257(6.40)L (Fig. S9). While this is an inherent limitation of the method used, it should be clearly acknowledged; the comment in lines 162-164 seems to skirt the issue by only saying that arrestin recruitment is retained. It would be helpful and more rigorous to report the curve fit parameters (basal, E<sub>max</sub>, EC50) for the arrestin recruitment experiments and the associated errors/significance (see https://www.graphpad.com/guides/prism/latest/statistics/stat_qa_multiple_comparisons_ after_.htm for a discussion).

      The Emin, E<sub>max</sub>, and EC50 for M159<sup>4</sup>.<sup>40</sup>C/Q245<sup>6</sup>.<sup>28</sup>C ACKR3 were compared against the values for WT ACKR3 from Fig. S1 and only the E<sub>max</sub> was determined to be significantly different by the extra sum of squares F test. A note has been added to the text to reflect these results on Pg. 5.

      “Only the E<sub>max</sub> for arrestin recruitment to CXCL12-stimulated ACKR3 was significantly altered by the mutations, while all other pharmacological parameters were the same as for WT receptors.”

      (2) The methods do not specify the reactive group of the dyes used for labeling (i.e., AlexaFluor 555-maleimide and Cy5-maleimide?).

      We regret the omission and have added the necessary details to the materials and methods.

      (3) Were any of the native Cys residues removed from ACKR3 and CXCR4 in the constructs used for smFRET? ACKR3 appears to have two additional Cys residues in the N-terminus besides the one involved in the second disulfide bridge, and these would presumably be solvent-exposed. If so, please specify in the Methods and clarify whether the constructs tested in functional assays included these. (Also, please specify if the human receptors were used.)

      No additional cysteine residues were mutated in either receptor. All exposed cysteines are predicted to form disulfides. The residues in the N-terminus that the reviewer alludes to, C21 and C26, form a disulfide (Gustavsson et al. Nature Communications 2017) and are thus protected from our probes. Consistent with these expectations, neither WT CXCR4 nor ACKR3 exhibited significant fluorophore labeling (now mentioned in the text on Pg. 5). The species of origin has been added to the material and methods.

      (4) There are a few instances where the data seem to slightly diverge from the proposed models that may be helpful to comment on explicitly in the text:

      - Figure 4E (ACKR3/CXCL12(P2G)): As noted in the legend, despite stabilizing R*/R*', CXCL12(P2G) reduces transitions between these states compared to Apo. This is more similar to the effects of VUF16840 (Figure 3D) than the other ACKR3 agonists. The authors note the difference between CXCL12(LHRQ) and CXCL12(P2G) (but not vs Apo) in this regard. There might be some other information here regarding the relative importance of the conformational equilibrium vs transition rates for receptor activity.

      Although the TDPs for CXCL12<sub>P2G</sub> and VUF16840 are similar, as noted by the reviewer, the overall FRET envelopes are drastically different.

      The differences in transition probabilities for R ↔ R’ and R*’ « R* transitions observed in the presence of CXCL12<sub>P2G</sub> or CXCL12<sub>LRHQ</sub> relative to the apo receptor are now explicitly noted in the Results.

      - The conformational distributions of ACKR3 apo and ACKR3 Y257L CXCL12 are very similar (Figure 5A,D). However, there is a substantial difference in the basal activity of WT vs CXCL12stimulated Y257L (Figure S9).

      The mutation Y257L appears to promote the highest and lowest FRET states at the expense of the intermediates. Although the distribution appears similar between Apo-WT and CXCL12Y257L, the depopulation of the R’ state may lead to the observed activation in cells.

      (5) There are inconsistent statements regarding the compatibility of G protein binding to the "active-like" ACKR3 conformation observed in the authors' previous structures (Yen et al, Sci Adv 2022). In the introduction, the authors seem to be making the case that steric clashes cannot account for its lack of coupling; in the discussion, they seem to consider it a possibility.

      The introduction to previous research on the molecular mechanisms governing the lack of ACKR3-G protein coupling was not intended to be all encompassing, but rather to highlight previous efforts to elucidate this process and justify our study of the role  of dynamics. Due to the positions of the probes, we can only comment on the impact on TM6 movements and not other conformational changes. The steric clash reported in Yen et al. was in ICL2 and not directly tested here, so our observations do not preclude changes occurring in this region. We also do not claim that the active-like state resolved in our previous structures matches any specific state isolated here by smFRET.

      (6) Line 83-85: "Having excluded other mechanisms we therefore surmised that the inability of ACKR3 to activate G proteins may be due to differences in receptor dynamics."

      Line 400-402: "It is possible that the active receptor conformation clashes sterically with the G protein as suggested by docking of G proteins to structures of ACKR3."

      As mentioned above, we suspect the mechanisms governing the inability of  ACKR3 to couple to G proteins may be more complex than one particular feature but instead due to a combination of several factors. Accordingly, we have not completely eliminated a contribution of steric hindrance as we described in Yen et al. Sci Adv 2022 and instead include it as a possibility. Following the line highlighted here, we list several alternatives: 

      “Alternatively, the receptor dynamics and conformational transitions revealed here may prevent formation of productive contacts between ACKR3 and G protein that are required for coupling, even though G proteins appear to constitutively associate with the receptor.”

      And, at the end of the paragraph, we have added the following sentence: 

      “The atypical activation of ACKR3 does not appear to be dependent on any singular receptor feature and is likely a combination of several factors.”

      (7) If the authors believe that the various ligands/mutations are only altering the distribution/dynamics of the same 3/4 conformations of CXCR4/ACKR3, respectively, is there a reason each FRET efficiency histogram is fit independently instead of constraining the individual components to Gaussian components with the same centroids, and/or globally fitting all datasets for the same receptor?

      We performed global analysis across all data sets for each sample and condition. Since the peak positions of the various FRET states recovered in this way were consistent across treatments (Fig. S4,S6), we did not feel it was necessary to perform a further global analysis across all samples for a given receptor.

      Reviewer #3 (Recommendations For The Authors):

      The manuscript is well-written, the arguments are easy to follow and the figures are helpful and clear. Here are a few questions/suggestions that the authors might want to address before the paper will be published:

      (1) Include a table with kinetic rates between states in SI and have a brief discussion in the main text to support the trends observed in transition probabilities.

      As noted above, determining rate constants for each of the state-to-state transitions will require a much larger set of experimental smFRET data than is currently available and will be the subject of future studies.

      (2) The argument of state similarity (Figure S4 and S6)... why are the profiles not Gaussian, like in the fits on Figures S3 and S5, repectively? I would also suggest that once the number of states is chosen to do a global fit, where the FRET values of a certain sub-state across different conditions for one receptor are shared.

      The state distributions presented in Figs. S4 and S6 (as well as throughout the rest of the paper) are derived from HMM fitting of the time traces themselves, and are not constrained to be Gaussian, whereas the GMM analysis in Figs. S3 and S5 are Gaussian fits to the final apparent FRET efficiency histograms.

      Similar to our response to Review 2 above, due to the consistency of the fitted peak positions obtained across different conditions for a given sample, we did not feel that further global analysis was necessary.

      (3) It is shown FRET changes from ~0.85 in the inactive (closed) state to ~0.25 in the active (open) state. How do these values match the expectations based on crystal structure and dye properties?

      As noted in our response to Reviewer 1, translating the apparent FRET values using the assumed Förster distances for A555/Cy5 (per FPbase) suggest a change in D-A distance of ~30 angstroms, whereas the expected change from structures is ~16 Å. We suspect this discrepancy is due to the lipids immediately adjacent to the fluorophores, which may lead to the probes being constrained in an extended position when TM6 moves outwards, thus also reporting the linker length in the distance change. Additionally, such interactions may constrain the donor and acceptor in unfavorable orientations for energy transfer, which would also reduce the FRET efficiency in the active state. Since the calculated D-A distance changes appear too large for GPCR activation, we have opted to not make any structural interpretations. Instead, all of our conclusions are based on resolving individual conformational states and quantifying their relative populations, which is based directly on the measured FRET efficiency distributions, not computed distances.

      (4) The results on the effect of CXCL12-P2G on CXCR4 are confusing...despite being an antagonist, this ligand stabilizes the "active state"...I am not sure if the explanation offered is sufficient that the opening of the intracellular cleft is not sufficient to drive the G protein coupling/activation.

      We agree that the explanation related to the opening of the intracellular cleft being insufficient to drive G protein coupling/activation is speculative and we have removed that text. We now simply propose that the CXCL12 variants inhibit coupling of G proteins to CXCR4 or disrupt interactions necessary for signaling, as stated in the following text to the results on Pg. 8:

      “Despite the ability of CXCL12<sub>P2G</sub> and CXCL12<sub>LRHQ</sub> to stabilize the active R* conformation of CXCR4, both variants are known to act as antagonists (20). This suggests that the CXCL12 mutants inhibit CXCR4 coupling to G proteins not by suppressing the active receptor population but rather by increasing the dynamics of the receptor state-to-state transitions. Our results suggest that the helical movements considered classic signatures of the active state may not be sufficient for CXCR4 to engage productively with G proteins.”

    1. eLife Assessment

      This manuscript focuses on understanding if and how the glycosylation of SARS-CoV2 spike protein affects a putative allosteric network of interactions controlled by the binding of a fatty acid. The main conclusion is that glycans do not significantly affect the network of allosteric interactions. This valuable information - albeit mainly consisting of negative results - is based on convincing evidence. It will be of interest to scientists focusing on SARS CoV2 protein structure and dynamics.

    2. Reviewer #1 (Public Review):

      Summary:

      The investigation delves into allosteric modulation within the glycosylated SARS-CoV-2 spike protein, focusing on the fatty acid binding site. This study uncovers intricate networks connecting the fatty acid site to crucial functional regions, potentially paving the way for developing innovative therapeutic strategies.

      Strengths:

      This article's key strength lies in its rigorous use of dynamic nonequilibrium molecular dynamics (D-NEMD) simulations. This approach provides a dynamic perspective on how the fatty acid binding site influences various functional regions of the spike. A comprehensive understanding of these interactions is crucial in deciphering the virus's behavior and identifying potential targets for therapeutic intervention.

    3. Reviewer #2 (Public Review):

      This is a nice paper illustrating the use of equilibrium/non-equilibrium MD simulations to explore allosteric communication in the Spike protein. The results are described in detail and suggest a complex network of signal transmission patterns. The topic is not completely novel as it has been studied before by the same authors and the impact of glycosylation is moderated and localized at the furin site, so not many new conclusions emerge here. It is suggested that mutations are commonly found in the communication pathway which is interesting, but the authors fail to provide evidence that this is related to a positive selection and not simply to a random effect related to mutations at points that are not crucial for stability or function. One interesting point is the connection of the FA site with an additional site binding heme group. It will be interesting to see reversibility, i.e. removal of the ligand at this site is producing perturbation at the FA site?, does it produce other effects suggesting a cascade of allosteric effects? Finally, the paper lacks details to help reproducibility, in particular, I do not see details on D-NEMD calculations. One interesting point is the connection of the FA site with an additional site binding heme group.

    4. Reviewer #3 (Public Review):

      Summary:

      In a previous study, the authors analyzed the dynamics of the SARS-CoV2 spike protein through lengthy MD simulations and an out-of-equilibrium sampling scheme. They identified an allosteric interaction network linking a lipid-binding site to other structurally important regions of the spike. However, this study was conducted without considering the impact of glycans. It is now known that glycans play a crucial role in modulating spike dynamics. This new manuscript investigates how the presence of glycans affects the allosteric network connecting the lipid binding site to the rest of the spike. The authors conducted atomistic equilibrium and out-of-equilibrium MD simulations and found that while the presence of glycans influences the structural responses, it does not fundamentally alter the connectivity between the fatty acid site and the rest of the spike.

      Strengths:

      The manuscript's findings are based on an impressive amount of sampling. The methods and results are clearly outlined, and the analysis is conducted meticulously.

    5. Author response:

      The following is the authors’ response to the original reviews.

      Reviewer #1:

      We thank the Reviewer for being very supportive of the work and acknowledging how important it is to understand allosteric modulation in the spike and the potential of this knowledge to contribute to the design of novel therapeutic strategies (for example, disrupting or altering the allosteric networks within the spike can be a novel strategy for drug development against COVID-19). We address their comments below: 

      (1) The Reviewer states that although the strategy used to extract the responses has been "previously validated", the complexity of the interactions investigated requires "a robust statistical analysis, which is not shown quantitatively". 

      As the Reviewer points out, the D-NEMD approach has been previously validated in various protein systems ranging from soluble enzymes to integral membrane proteins, including the spike (e.g. [Kamsri et al. (2024) Biochem; Beer et al. (2024) Chem Sci; Oliveira et al. (2023) J Mol Cell Biol; Chan et al. (2023) JACS Au; Castelli et al. (2023) JACS; Castelli et al. (2023) Protein Sci; Oliveira et al. (2022) Comput Struct Biotechnol J; Gupta et al. (2022) Nat Comm; Oliveira et al. (2021) JACS; Galdadas et al. (2021) eLife; Abreu et al. (2019) Proteins; Oliveira et al. (2019) JACS; Oliveira et al. (2019) Structure]. The Kubo-Onsager relation is used to extract the evolution of the protein's response to a perturbation by comparing the equilibrium and nonequilibrium trajectories at equivalent points in time. The calculated responses at individual times are then averaged over all the repeats (210 repeats in the current work), and the standard error of the mean (SEM) is used to assess the significance of the average response. The SEM indicates how much the calculated mean deviates from the true population mean. Calculating the SEM allows us to determine how accurate the measured response is as an estimate of the population response and assess the convergence of our calculations. The evolution of the average C<sub>α</sub> displacement and corresponding SEM values for each individual monomer can be visualised in detail in Figures S7-S9. We have added a new sentence to the Materials and Methods section in the Supporting Information, explicitly stating how the convergence and statistical significance of the responses were assessed.

      (2) The Reviewer considers that the evidence presented in the paper "is compelling" but suggests performing a sequence analysis to facilitate the understanding of the results by the scientific community. 

      We thank the Reviewer for their excellent suggestion to perform a sequence analysis of the FA site region and its allosteric connections. Indeed, this analysis (Figure S24) clearly shows that several of the mutations, deletions and insertions in the Alpha, Beta, Gamma, Delta, and Omicron variants are located either in or near the regions of the protein shown to respond to the removal of linoleate from the FA site. These sequence changes affect the protein's responses, and are responsible for the differences in allosteric behaviour observed between variants, as described previously for the non-glycosylated spike [Oliveira et al. (2023) J Mol Cell Biol]. Furthermore, some variants, such as Beta, Gamma, and Omicron, contain residue substitutions at the FA site. For example, the lysine in position 417 in the ancestral spike is mutated to asparagine in Beta and Omicron and threonine in the Gamma variant. Another example is arginine 408 in the original protein, which has been replaced by asparagine in several Omicron sub-variants. 

      To summarise, the sequence analysis (Figure S24) supports our initial 3D analysis (Figure S25), indicating that many of the changes observed in the variants of concern are indeed in or close to the allosteric networks involving the FA site. We have now included the sequence analysis results in the current paper and added a new figure to Supporting Information showing the sequence alignments between the ancestral spike and different variants (Figure S24). 

      (3) The Reviewer also has "minor considerations": first, they point to a discrepancy in the presentation of residue values S325 in the plots of Chains A, B, and C of Figure S3; second, they ask why several regions, such as RBM and Furin Site in figures S6, S7, and S8 show significant changes.

      To answer both points raised by the Reviewer, we need to start by explaining that the spike typically features 22 N-glycosylation and at least two O-glycans sites per monomer. These sites have been found to be heterogeneously populated in different experimental studies (e.g. [Watanabe et al. (2020) Science; Shajahan et al. (2020) Glycobiology; Zhang et al. (2021) Mol Cell Proteomics]). Given this, the spike model used as the starting point for this work reflects this heterogeneity, with asymmetric site-specific glycosylation profiles derived from the glycoanalytic data reported by Watanable et al. for N-glycans [Watanabe et al. (2020) Science] and Shajahan et al. for O-glycans [Shajahan et al. (2020) Glycobiology]. This means that the glycan occupancy and composition for each site differ between the three monomers. For example, while monomer A contains the two O-glycans sites (linked to T323 and S325, respectively) fully occupied, monomers B and C only contain the T323 O-glycan. A detailed description of the glycosylation of the spike model is given in the supporting information of [Casalino et al. (2020) ACS Cent Sci].

      Regarding the Reviewer's first minor point, the discrepancy in behaviour observed in Figure S3 for S325 is related to the fact that this glycosylation site is only occupied in monomer A, with no glycans present in this site in monomers B and C. 

      Regarding the second point, the differences observed in the responses between the three monomers in Figures S7-S9 are probably due to asymmetries in the protein dynamics introduced by the different glycosylation patterns in the monomers. 

      We have now added a new paragraph to the materials and methods section in the Supporting Information describing the asymmetric site-specific glycosylation profiles of the monomers.

      (4) Due to the complexity of the allosteric interactions observed, the Reviewer suggests including in the paper a "diagram showing the flow of allosteric interactions" or a "vector showing how the perturbation done in the FA Active site takes contact with other relevant regions". 

      This is an excellent suggestion to facilitate the visualisation of the allosteric networks. We have added a new figure to Supporting Information highlighting the allosteric pathways identified from the DNEMD simulations and the direction of the propagation of the structural changes (Figure S26).

      Reviewer #2:

      We thank the Reviewer for their time in evaluating our manuscript and providing suggestions for improving it and ideas for further work. We are happy that the Reviewer found this to be a "nice paper" with the calculations "well done" and interesting results. We address their comments below: 

      (1) The Reviewer suggests improving the paper by adding a more detailed explanation of the DNEMD simulations approach, a method that, although proposed decades ago, is still generally unfamiliar to the community. They also asked for "information on the convergence of the observables".

      As stated by the Reviewer, a dynamical approach to nonequilibrium molecular dynamics (D-NEMD) was first proposed in the seventies by Ciccotti et al. [Ciccotti et al. (1975) Phys Rev Lett; Ciccotti et al. (1979) J Stat Phys]. This approach combines MD simulations in equilibrium and nonequilibrium conditions. The rationale for the D-NEMD approach is simple and can be described as follows: if an external perturbation (e.g. binding/unbinding of a ligand) is added to a simulation sampling an equilibrium state and, by doing so, a parallel nonequilibrium simulation is started, the structural response of the protein to the perturbation can be directly measured by comparing the equilibrium and nonequilibrium trajectories at equivalent points in time by using the Kubo-Onsager relation as long as enough sapling is gathered (for more details, please see the reviews [Balega et al. (2024) Mol Phys; Oliveira et al. (2021) Eur Phys J B; Ciccotti et al. (2016) Mol Simul]). This approach, although conceptually simple, is very powerful as it allows for computing the evolution of the dynamic response of the protein to the external perturbation, while assessing the convergence and statistical significance of that response. This approach also has the advantage that the convergence and significance of the response can be easily evaluated, and the associated errors can be computed and made as small as desirable by increasing the number of nonequilibrium trajectories. Determining the statistical errors associated with the responses (through, e.g., the determination of the standard error of the mean, SEM) is essential to test if the sampling gathered is sufficient. In this paper, the SEM was calculated for each average C<sub>α</sub> displacement value at times 0.1, 1 and 10 ns after the removal of linoleate, LA (see Figures S7-S9). The SEM indicates how accurate the measured response is as an estimate of the population response and allows us to assess the convergence of the results. 

      Generally, multiple (tens to hundreds) D-NEMD simulations are needed to achieve statistically significant results for biomolecular systems (for examples, see [Balega et al. (2024) Mol Phys; Oliveira et al. (2021) Eur Phys J B]). As such, the length of the D-NEMD simulations (typically 5 to 10 ns) reflects the balance between the computational resources available and the number of replicates needed to achieve statistically significant responses from the system. Following the Reviewer's suggestion, we have now added a brief description of the D-NEMD approach to the main manuscript and expanded the D-NEMD section in the Supporting Information with a more detailed description of the method, including adding a new figure showing a schematic representation of the D-NEMD approach (Figure S5) as well as explicitly stating the settings used in these simulations and how the statistical significance of the responses was assessed. 

      (2) The Reviewer suggests comparing the D-NEMD results with "more traditional analysis, such as correlation analysis, or community network analysis". 

      We agree with the Reviewer that this is an important comparison, which can provide a broader, more articulate and coherent picture of spike allostery and have, therefore, performed additional analysis. The dynamic cross-correlation analysis suggested by the Reviewer is a valuable tool for identifying the regions in the protein influenced by the FA site in equilibrium conditions. However, such an approach is not straightforwardly applicable to D-NEMD simulations, as these simulations are not in equilibrium. Nevertheless, as suggested by the Reviewer, we have determined the cross-correlation matrices for both the equilibrium and D-NEMD simulations (Figure S22), similar to those in our previous work [Galdadas et al. (2021) eLife] and [Oliveira et al. (2022) J Mol Cell Biol]. The analysis of these matrices can provide information about possible allosteric networks. In Figure S22, the cyan and blue regions represent moderate and high negative correlations between C<sub>α</sub> atoms, while orange and red regions correspond to moderate and high positive correlations. Negative correlations indicate residues moving in opposite directions (moving toward or away from each other). In contrast, positive values imply that the residues are moving in similar directions. We also note that, with collaborators, we have compared D-NEMD and other nonequilibrium and equilibrium MD analysis methods for allostery [Castelli et al.  (2023) JACS].

      The cross-correlation maps depicted in Figure S22 show moderate to high positive correlations between the FA sites and two of the three RBDs in the protein. This happens because each FA site sits at the interface between two neighbouring RBDs. Low to moderate negative and mildly positive correlated motions can also be observed between the FA site and the NTDs and fusion peptide surrounding regions, respectively. To facilitate the visualisation of the above-described motions, we have also mapped the statistical correlations for R408 and K417 (two FA site residues able to directly form salt-bridge interactions with the carboxylate head group of LA) on the protein's three-dimensional structure (Figure S23). Figure S23 highlights the patterns of movement described above and allows us to identify the regions whose motions are coupled to the FA site.

      Interestingly, some segments forming the signal propagation pathways, such as R454-K458 in all three monomers, and C525-K537 in monomers B and C, can also be identified from the cross-correlation matrices, showing moderate to high correlations with the FA site (Figures S22-S23). The crosscorrelation maps computed from the equilibrium trajectories (with FA sites occupied with LA) show a slight increase in the dynamic correlations, mainly for the RBDs, compared to the maps obtained from the nonequilibrium trajectories (Figure S22). This indicates that the presence of LA in the FA strengthens the connections between the FA site and other parts of the protein. 

      We have updated the manuscript to include the cross-correlation analysis, with two new figures added to Supporting Information: one depicting the cross-correlation maps for the D-NEMD and equilibrium simulations (Figure S22), and the other showing the statistical correlations for R408 and K417 (Figure S23). 

      (3) The Reviewer considers the observed connection between the fatty acid site and the heme/biliverdin site "interesting" and suggests "exploring the impact of ligand removal on this secondary site on the protein".

      Similarly to the Reviewer, we find the connection between the FA and the heme/biliverdin site fascinating and worthy of further investigation. The observed connection between these two sites shows the complexity of the allosteric effects in the spike. It would be interesting and informative to perform new equilibrium simulations of the heme/biliverdin spike complex and a new set of D-NEMD simulations in which this site is perturbed (e.g. through the removal of the heme group) to map the networks connecting this allosteric site to other functionally important regions of the spike, including the FA site and potentially other allosteric sites. These new simulations would allow us to assess the reversibility of the connection between the FA and heme/biliverdin sites and enhance our understanding of allosteric modulation in the spike and the role of the heme/biliverdin site in this process. However, due to the large size of the system and the associated computational demands, such simulations are not possible within the timeframe of the revision of this paper. These simulations would take many months to complete using our HPC resources. We also note that an experimental structure of the spike containing both heme and linoleate is not available. Further simulation analysis of the communication pathways involving the heme/biliverdin site is an excellent idea for future work.

      (4) The Reviewer "liked the mapping of existing mutations on the communication pathway" and suggested a more detailed study focusing on the effect of the mutations. 

      We fully agree with the Reviewer and consider that a detailed study focusing on the effect of the mutations, insertions, and deletions in the different glycosylated variants of concern (including new emerging ones) would be of great interest. Our previous work using D-NEMD on the non-glycosylated ancestral, Alpha, Delta, Delta plus and Omicron BA.1 spikes revealed significant differences in the allosteric responses to LA removal, with the changes in the variants affecting both the amplitude of the structural responses and the rates at which these rearrangements propagate within the protein [Oliveira et al. (2023) J Mol Cell Biol]. 

      Using the D-NEMD approach to systematically investigate the impact of each individual mutation and their contribution to the overall allosteric response of the glycosylated variants (similar to what we have done previously for the D614G mutation in the non-glycosylated protein [Oliveira et al. (2021) Comput Struct Biotechnol J]) would provide insights into the functional modulation of the spike. However, as noted above in point 3, spike simulations are highly computationally expensive, both in terms of processing and data storage requirements, because of the large size of the protein and the need for equilibrium and D-NEMD simulations. This makes the suggested mutational study unfeasible within the timeframe of the current revisions. It is, however, an excellent idea for future research.

      Reviewer #3:

      We thank the Reviewer for carefully reading and critically reviewing this work and recognising that the findings reported are "based on an impressive amount of sampling" and "meticulous" analysis. We address their comments below: 

      (1) The Reviewer considers that this work "does not clearly show any new findings" as it shows that the glycans do not significantly impact the internal networks in the protein.

      We respectfully disagree with the Reviewer. This work identifies new allosteric effects in the spike, specifically, the connection of the FA site with the heme binding site. The equilibrium simulations alone provide the first analysis of the effects of linoleate binding in the fully glycosylated spike. The finding that glycosylation does not significantly affect the allosteric pathways in the spike is in itself an important finding. Previous D-NEMD simulations investigated only the non-glycosylated spike ([Oliveira et al. (2021) Comput Struct Biotechnol J; Oliveira et al. (2022) J Mol Cell Biol] ) leading to questions of whether the allosteric effects pathways were changed by glycosylation; our results here show that the main conclusions are reinforced, but glycosylation does have some effect on networks, and also on the speed of the dynamical response. To the best of our knowledge, our work represents the first investigation to analyse the impact of glycosylation on the allosteric networks in the spike. We show that even though the presence of glycans in the exterior of the spike does not significantly alter the internal communication pathways in the protein, in some cases (for example, the glycans linked to N234, T373 and S375), they create direct connections between different regions, which may facilitate the propagation of the structural changes. 

      (2) The Reviewer suggests adding a "clear and concise description" of the D-NEMD approach to the manuscript.

      We appreciate that the use of the D-NEMD method to study biomolecular systems is relatively new, and so may be unfamiliar. As explained above in our response to Reviewer 2 (point 1), a brief description of the D-NEMD approach was now included in the main manuscript. A detailed description of the method was also added to Supporting Information, including a new figure representing the rationale for the approach (Figure S5). The interested reader is directed to previous applications and reviews for more details of the method (e.g. [Balega et al. (2024) Mol Phys; Oliveira et al. (2021) Eur Phys J B; Ciccotti et al. (2016) Mol Simul; Kamsri et al. (2024) Biochem; Beer et al. (2024) Chem Sci; Oliveira et al. (2023) J Mol Cell Biol; Chan et al. (2023) JACS Au; Castelli et al. (2023) JACS; Castelli et al. (2023) Protein Sci; Oliveira et al. (2022) Comput Struct Biotechnol J; Gupta et al. (2022) Nat Comm; Oliveira et al. (2021) JACS; Galdadas et al. (2021) eLife; Abreu et al. (2019) Proteins; Oliveira et al. (2019) JACS; Oliveira et al. (2019) Structure]). 

      (3) The Reviewer invites us to "discuss the robustness of the findings with respect to forcefield choices".

      The Reviewer raises an important but rather complex question, and one which can, of course, be posed for any molecular dynamics simulation study. The short answer is that we have chosen state-of-the-art forcefields, which have been shown to give results for the spike that are in good agreement with experiments; glycosylated spike simulations are rather computationally expensive, and constructing the models also requires significant human time and effort. Thus, while in principle interesting, it is not practical to repeat the current simulations with different forcefields. However, as detailed below, comparison of our simulations of the glycosylated and non-glycosylated [Oliveira et al. (2022) Comput Struct Biotechnol J] spike using different forcefields indicates that our conclusions are robust and are not dependent on the choice of forcefield. 

      Comparing the performance and accuracy of different force fields is not straightforward, as the results depend on the system of interest, properties simulated and sampling. In this work, the CHARMM36m all-atom additive force field was used to describe the protein and glycans. CHARMM36m is a widely used force field that has previously been validated for the simulations of biological systems [Huang et al. (2013) J Comput Chem; Guvench et al. (2009) J Chem Theory Comput], including proteins, lipids and glycans, with many of studies adopting it in the literature. Additionally, the glycosylated models of the spike used in this work have also been successfully applied and tested before (e.g. [Dommer et al. (2023) Int J High Perform Comput Appl; Sztain et al. (2021) Nat Chem; Casalino et al. (2021) Int J High Perform Comput Appl; Casalino et al. (2020) ACS Cent Sci]), with their dynamics shown to correlate well with experimental data.   

      It is also worth pointing out that, despite differences in the amplitude of the responses, the allosteric networks identified using the D-NEMD approach for the non-glycosylated [Oliveira et al. (2022) Comput Struct Biotechnol J] and glycosylated spikes are generally similar (Figure S13). While the responses for the non-glycosylated protein were extracted from simulations using the AMBER99SBILDN forcefield [Oliveira et al. (2022) Comput Struct Biotechnol J], those reported in this work were obtained from trajectories using the CHARMM36m forcefield. The similarity between the responses for the two systems (which were simulated using different forcefields) is a good indication that our findings are forcefield independent. 

      (4) The Reviewer suggests comparing our findings with "alternative methods of analysing allostery". 

      As stated above in our response to Reviewer 2 point 2, we consider the suggested comparison an excellent idea. We have therefore performed a dynamic cross-correlation analysis to identify the regions in the protein coupled to the FA site in both equilibrium and nonequilibrium conditions (see Figures S22-S23). Overall, this analysis shows that the FA site motions are strongly coupled to the RBDs and moderately to weakly connected to the NTDs and fusion peptide surrounding regions (please see a detailed description of the results of the correlation analysis in our response to Reviewer 2 point 2). The cross-correlation analysis performed was added to the manuscript, and two new figures were included in the Supporting Information (Figures S22-S23): the first, showing the cross-correlation maps for the D-NEMD and equilibrium simulations; the second, showing the statistical correlations for R408 and K417 (two residues forming the FA site and that can directly interact with the carboxylate head group of LA). 

      We agree that comparing different allosteric analysis methods is interesting, informative and important. As noted above, we have compared D-NEMD and other nonequilibrium and equilibrium MD analysis methods for allostery in the well-characterised K-Ras system [Castelli et al.  (2023) JACS].

    1. eLife Assessment

      In this manuscript, Abd El Hay and colleagues use an innovative behavioral assay and analysis method, together with standard calcium imaging experiments on cultured dorsal root ganglion (DRG) neurons, to evaluate the consequences of global knockout of TRPV1 and TRPM2, and overexpression of TRPV1, on warmth detection. Compelling evidence is provided for a role of TRPM2 channels in warmth avoidance behavior, but it remains unclear whether this involves channel activity in the periphery or in the brain. In contrast, TRPV1 is clearly implicated at the cellular level in warmth detection. These findings are important because there is substantial ongoing discussion regarding the contribution of TRP channels to different aspects of thermo-sensation.

    2. Reviewer #3 (Public review):

      A central question in the thermal system is which thermally responsive ion channels are responsible for warm evoked behaviors and DRG afferent neuron responses to warming. Recent work has shown evidence for TRPV1, TRPM2 and TRPM8. Here Abd El Hay and colleagues investigate the role of TRPM2 and TRPV1 in a novel warm preference behavior and in the thermal responses of cultured DRG neurons.

      They develop a new thermal preference task, where both the floor and air temperature are controlled, which shows differences to the classic two-plate preference task. This is a central strength of the paper, as it will allow a new method to investigate how animals integrating floor and air temperature. They go on to use knockout mice and confirm a clear role for TRPM2 in warm preference behavior.

      Using a new approach for culturing DRG neurons they investigate the involvement of both channels in warm responsiveness and dynamics. In apparent contrast to the role of TRPM2 on thermal behavior, it does not have a major effect on the responses of cultured DRG neurons to warm stimuli. Eliminating TRPV1 however has a stronger impact on DRG responses, particularly at low stimulus amplitudes. It will be important to discover how TRPM2 influences warm driven behaviors, if it is not via changes in afferent response properties.

      Thanks to the authors for addressing my remaining questions in this updated version of the manuscript.

      This is an interesting study with novel approaches that generates new information on the differing roles of TRPV1 and TRPM2 on thermal behavior.

    3. Author response:

      The following is the authors’ response to the previous reviews.

      Public Reviews:  

      Reviewer # 1 (Public Review): 

      Summary:

      The authors use an innovative behavior assay (chamber preference test) and standard calcium imaging experiments on cultured dorsal root ganglion (DRG) neurons to evaluate the consequences of global knockout of TRPV1 and TRPM2, and overexpression of TRPV1, on warmth detection. They find a profound effect of TRPM2 elimination in the behavioral assay, whereas the elimination of TRPV1 has the largest effect on the neuronal responses. These findings are very important, as there is substantial ongoing discussion in the field regarding the contribution of TRP channels to different aspects of thermosensation.

      Strengths:

      The chamber preference test is an important innovation compared to the standard two-plate test, as it depends on thermal information sampled from the entire skin, as opposed to only the plantar side of the paws. With this assay, and the detailed analysis, the authors provide strong supporting evidence for a role of TRPM2 in warmth avoidance. The conceptual framework using the Drift Diffusion Model provides a first glimpse of how this decision of a mouse to change between temperatures can be interpreted and may form the basis for further analysis of thermosensory behavior.

      Weaknesses:

      The authors juxtapose these behavioral data with calcium imaging data using isolated DRG neurons. As the authors acknowledge, it remains unclear whether the clear behavioral effect seen in the TRPM2 knockout animals is directly related to TRPM2 functioning as a warmth sensor in sensory neurons. The effects of the TRPM2 KO on the proportion of warmth sensing neurons are very subtle, and TRPM2 may also play a role in the behavioral assay through its expression in thermoregulatory processes in the brain. Future behavioral experiments on sensory-neuron specific TRPM2 knockout animals will be required to clarify this important point.

      Reviewer # 1 (Recommendations for the authors):

      (1) I have no further suggestions for the authors, and congratulate them with their excellent study.

      For the authors information, ref. 42 does contain behavioral data from both male (Fig. 4 and Extended Figure 7) and female (Extended Figure 8) mice.

      We thank the referee for pointing out that both males and female mice were tested in the Vandewauw et al. 2018 study. We deliberated whether to include this at the appropriate section of our manuscript (“Limitations of the Study”). But since Vandewauw et al. assessed noxious heat temperatures and we here assess innocuous warmth temperature, we felt that this reference would not add to the clarification whether there are sex differences in Trp channelbased warmth temperature sensing. In particular, we did not want to “use” the argument and to suggest that there are no sex temperature differences in the warmth range just because Vandewauw et al. did not observe major sex differences in the noxious temperature range. 

      Reviewer #3 (Public Review):  

      Summary and strengths:

      In the manuscript, Abd El Hay et al investigate the role of thermally sensitive ion channels TRPM2 and TRPV1 in warm preference and their dynamic response features to thermal stimulation. They develop a novel thermal preference task, where both the floor and air temperature are controlled, and conclude that mice likely integrate floor with air temperature to form a thermal preference. They go on to use knockout mice and show that TRPM2-/- mice play a role in the avoidance of warmer temperatures. Using a new approach for culturing DRG neurons they show the involvement of both channels in warm responsiveness and dynamics. This is an interesting study with novel methods that generate important new information on the different roles of TRPV1 and TRPM2 on thermal behavior.

      Comments on revisions:

      Thanks to the authors for addressing all the points raised. They now include more details about the classifier, better place their work in context of the literature, corrected the FOVs, and explained the model a bit further. The new analysis in Figure 2 has thrown up some surprising results about cellular responses that seem to reduce the connection between the cellular and behavioral data and there are a few things to address because of this:

      (1) TRPM2 deficient responses: The differences in the proportion of TRPM2 deficient responders compared to WT are only observed at one amplitude (39C), and even at this amplitude the effect is subtle. Most surprisingly, TRPM2 deficient cells have an enhanced response to warm compared to WT mice to 33C, but the same response amplitude as WT at 36C and 39C. The authors discuss why this disconnect might be the case, but together with the lack of differences between WT and TRPM2 deficient mice in Fig 3, the data seem in good agreement with ref 7 that there is little effect of TRPM2 on DRG responses to warm in contrast to a larger effect of TRPV1. This doesn't take away from the fact there is a behavioral phenotype in the TRPM2 deficient mice, but the impact of TRPM2 on DRG cellular warm responses is weak and the authors should tone down or remove statements about the strength of TRPM2's impact throughout the manuscript, for example:

      "Trpv1 and Trpm2 knockouts have decreased proportions of WSNs."

      "this is the first cellular evidence for the involvement of TRPM2 on the response of DRG sensory neurons to warm-temperature stimuli"

      "we demonstrate that TRPV1 and TRPM2 channels contribute differently to temperature detection, supported by behavioural and cellular data"

      "TRPV1 and TRPM2 affect the abundance of WSNs, with TRPV1 mediating the rapid, dynamic response to warmth and TRPM2 affecting the population response of WSNs."

      "Lack of TRPV1 or TRPM2 led to a significant reduction in the proportion of WSNs, compared to wildtype cultures".

      We agree with the referee that the somewhat surprising result of the subtle phenotype in Trpm2 knock-out DRG culture experiments, that became detectable in the course of the new analysis, was overemphasized in the previous version of the manuscript. Per suggestion, we have toned down or removed the statements in the revised manuscript (for the referee to find those changes easily, they are indicated in “track-changes mode” in the submitted document).  

      (2) The new analysis also shows that the removal of TRPV1 leads to cellular responses with smaller responses at low stimulus levels but larger responses with longer latencies at higher stimulus levels. Authors should discuss this further and how it fits with the behavioral data.

      Because these changes shown in Fig. 2E are also subtle (similar to the cellular Trpm2 phenotype discussed above), and because both the “% Responders” (Fig 2.D) and The AUC analysis (Fig. 2F) show a reduction in Trpv1 knock out cultures ––both, at lower and at higher stimulus levels–– we did not want to overstate this difference too much and therefore did not further discuss this aspect in the context of the behavioral differences observed in the Trpv1 knock-out animals.  

      (3) Analysis clarification: authors state that TRPM2 deficient WSNs show "Their response to the second and third stimulus, however, are similar to wildtype WSNs, suggesting that tuning of the response magnitude to different warmth stimuli is degraded in Trpm2-/- animals." but is there a graded response in WT mice? It looks like there is in terms of the %responders but not in terms of response amplitude or AUC. Authors could show stats on the figure showing differences in response amplitude/AUC/responders% to different stimulus amplitudes within the WT group.

      We have added the statistics in the main text, you find them on page 7 (also in “track changes mode”).

      (4) New discussion point: sex differences are "similar to what has been shown for an operant-based thermal choice assay (11,56)", but in their rebuttal, they mention that ref 11 did not report sex differences. 56 does. Check this.

      Thank you for pointing out this mishap. We have now corrected this in the “Limitations of the study” section of the discussion and have removed the Paricio-Montesions et al study from that section and slightly revised the text (see “track-changes” on page 16).

      (5) The authors added in new text about the drift diffusion model in the results, however it's still not completely clear whether the "noise" is due to a perceptual deficit or some other underlying cause. Perhaps authors could discuss this further in the discussion.

      We have now included more discussion concerning this (page 14):

      “However, the increased noise in the drift-di3usion model points to a less reliable temperature detection mechanism. Although noise in drift di3usion models can encompass various sources of variability—ranging from peripheral sensory processing to central mechanisms like attention or motor initiation—the most parsimonious interpretation in our study aligns with a perceptual deficit, given the altered temperatureresponsive neuronal populations we observed. This implies that, despite the substantial loss of WSNs, the remaining neuronal population provides su3icient information for the detection of warmer temperatures, albeit with reduced precision”

      Within the limits of the data that is available, we hope the referee agrees with us that we have now adequately discussed this aspect; we feel that any further discussion would be too speculative.

    1. eLife Assessment

      This is an important study that generates an inventory of accessible genomic regions bound by a transcription factor ZFHX3 within the suprachiasmatic nucleus in the hypothalamus and details the impact of its depletion on daily rhythms in behavior and gene expression patterns. Analysis using circadian phase-estimation algorithms makes the argument that gene regulatory networks are at play and changes in gene expression of a few clock genes cannot account for the observed animal behaviour. While the transcriptome analysis is compelling, the data on the activity of the TF in rhythmic gene expression is solid, and interpretations that allow for direct and/or indirect roles have been incorporated.

    2. Reviewer #1 (Public review):

      Summary:

      Authors of this article have previously shown the involvement of the transcription factor Zinc finger homeobox-3 (ZFHX3) in the function of the circadian clock and the development/differentiation of the central circadian clock in the suprachiasmatic nucleus (SCN) of the hypothalamus. Here, they show that ZFHX3 plays a critical role in the transcriptional regulation of numerous genes in the SCN. Using inducible knockout mice, they further demonstrate that the deletion Of Zfhx3 induces a phase advance of the circadian clock, both at the molecular and behavioral levels.

      Strengths:

      - Inducible deletion of Zfhx3 in adults<br /> - Behavioral analysis<br /> - Properly designed and analyzed ChIP-Seq and RNA-Seq supporting the conclusion of the behavioral analysis

      Comments on revisions:

      The authors have properly addressed reviewers' issues.

    3. Reviewer #2 (Public review):

      Summary

      ZFHX3 is a transcription factor expressed in discrete populations of adult SCN and was shown by the authors previously to control circadian behavioral rhythms using either a dominant missense mutation in Zfhx3 or conditional null Zfhx3 mutation using the Ubc-Cre line (Wilcox et al., 2017). In the current manuscript, the authors assess the function of ZFHX3 by using a multi-omics approach including ChIPSeq in wildtype SCNs and RNAseq of SCN tissues from both wildtype and conditional null mice. RNAseq analysis showed a loss of oscillation in Bmal1 and changes in expression levels of other clock output genes. Moreover, a phase advance gene transcriptional profile using the TimeTeller algorithm suggests the presence of a regulatory network that could underlie the observed pattern of advanced activity onset in locomotor behavior in knockout mice.

      In Figure 1, the authors identified the ZFHX3 bound sites using ChIPseq and compared the loci with other histone marks that occur at promoters, TSS, enhancers and intergenic regions. And the analysis broadly points to a role for ZFHX3 in transcriptional regulation. The vast majority of nearly 40000 peaks overlapped H3K4me3 and K27ac marks, active promoters which also included genes falling under the GO category circadian rhythms. However, no significant differential ZFHX3 bound peaks were detected between ZT3 and ZT15. In these experiments, it is not clear if and how the different ChIP samples (ZFHX3 and histone PTM ChIPs) were normalized/downsampled for analysis. Moreover, it seems that ZFHX3 binding or recruitment has little to do with whether the promoters are active.

      Based on an enrichment of ARNT domains next to K4Me3 and K27ac PTMs, the authors propose a model where the core-clock TFs and ZFHX3 interact. If the authors develop other assays beyond just predictions to test their hypothesis, it would strengthen the argument for a role in circadian transcription in the SCN. It would be important in this context to perform a ChIP-seq experiment for ZFHX3 in the knockout animal (described from Figure 2 onwards) to eliminate the possibility of non-specific enrichment of signal from "open chromatin'. Alternatively, a ChIPseq analysis for BMAL1 or CLOCK could also strengthen this argument to identify the sites co-occupied by ZFHX3 and core-clock TFs.

      Next, they compared locomotor activity rhythms in floxed mice with or without tamoxifen treatment. As reported before in Wilcox et al 2017, the loss of ZFHX3 led to a shorter free running period and reduced amplitude and earlier onset of activity. Overall, the behavioral data in Figure 2 and supplementary figure 2 has been reported before and are not novel.

      Next, the authors performed RNAseq at 4hr intervals on wildtype and knockout animals maintained in light/dark cycles to determine the impact of loss of ZFHX3. Overall transcriptomic analysis indicated changes in gene expression in nearly 36% of expressed genes, with nearly half being upregulated while an equal fraction was downregulated. Pathways affected included mostly neureopeptide neurotransmitter pathways. Surprisingly, there was no correlation between the direction in change in expression and TF binding since nearly all the sites were bound by ZFHX3 and the active histone PTMs. The ChIP-seq experiment for ZFHX3 in the UBC-Cre+Tam mice again could help resolve the real targets of ZFHX3 and the transcriptional state in knockout animals.

      To determine the fraction of rhythmic transcripts, Using dryR, the authors categorise the rhythmic transcriptome (about 7% in all) into modules that include genes that lose rhythmicity in the KO, gain rhythmicity in the KO or remain unaffected or partially affected. The analysis indicates that a large fraction of the rhythmic transcriptome is affected in the KO model. However, among core-clock genes only Bmal1 expression is affected showing a complete loss of rhythm. The authors state a decrease in Clock mRNA expression (line 294) but the panel figure 4A does not show this data. Instead it depicts the loss in Avp expression - {{ misstated in line 321 ( we noted severe loss in 24-h rhythm for crucial SCN neuropeptides such as Avp (Fig. 3a).}}

      However, core-clock genes such as Pers and Crys show minor or no change in expression patterns while Per2 and Per3 show a ~2hr phase advance. While these could only weakly account for the behavioral phase advance, the authors used TimeTeller to assess circadian phase in wildtype and ZFHX3 deficient mice. This approach clearly indicated that while the clock is not disrupted in the knockout animals, the phase advance can be correctly predicted from a network of gene expression patterns.

      Strengths

      The authors use a multiomic strategy in order to reveal the role of the ZFHX3 transcription factor with a combination of TF and histone PTM ChIPseq, time-resolved RNAseq from wildtype and knockout mice and modeling the transcriptomic data using TimeTeller. The RNAseq experiments are nicely controlled and the analysis of the data indicates a clear impact on gene-expression levels in the knockout mice and the presence of a regulatory network that could underlie the advanced activity onset behavior.

      Weaknesses

      It is not clear whether ZFHX3 has a direct role in any of the processes and seems to be a general factor that marks H3K4me3 and K27ac marked chromatin. Why it would specifically impact the core-clock TTFL clock gene expression or indeed daily gene expression rhythms is not clear either. Details for treatment of different ChIP samples (ZFHX3 and histone PTM ChIPs) on data normalization for analysis are needed. The loss of complete rhythmicity of Avp and other neuropeptides or indeed other TFs could instead account for the transcriptional deregulation noted in the knockout mice.

      Comments on revisions:

      The authors addressed the majority of my criticisms. They also explained that some requested experiments are beyond the scope of the current manuscript, while others are technically not feasible. I do not have any further concerns.

    4. Author response:

      The following is the authors’ response to the original reviews.

      Public Reviews:

      Reviewer #1 (Public review):

      Summary:

      Authors of this article have previously shown the involvement of the transcription factor Zinc finger homeobox-3 (ZFHX3) in the function of the circadian clock and the development/differentiation of the central circadian clock in the suprachiasmatic nucleus (SCN) of the hypothalamus. Here, they show that ZFHX3 plays a critical role in the transcriptional regulation of numerous genes in the SCN. Using inducible knockout mice, they further demonstrate that the deletion Of Zfhx3 induces a phase advance of the circadian clock, both at the molecular and behavioral levels.

      Strengths:

      - Inducible deletion of Zfhx3 in adults

      - Behavioral analysis

      - Properly designed and analyzed ChIP-Seq and RNA-Seq supporting the conclusion of the behavioral analysis

      Weaknesses:

      - Further characterization of the disruption of the activity of the SCN is required.

      (1) We thank the reviewer for their valuable inputs. Indeed, a comprehensive behavioral assessment of mice of this genotype was executed in Wilcox et al. ;2017 study. In Wilcox et al.; 2017, Figure 4, 6-h phase advance (jetlag) clearly showed faster reentrainment in ZFHX3-KO mice when compared to the controls.

      - The description of the controls needs some clarification.

      (2) We agree with the reviewer and have modified the text at line 211-212 to clearly describe the controls.

      Reviewer #2 (Public review):

      Summary:

      ZFHX3 is a transcription factor expressed in discrete populations of adult SCN and was shown by the authors previously to control circadian behavioral rhythms using either a dominant missense mutation in Zfhx3 or conditional null Zfhx3 mutation using the Ubc-Cre line (Wilcox et al., 2017). In the current manuscript, the authors assess the function of ZFHX3 by using a multi-omics approach including ChIPSeq in wildtype SCNs and RNAseq of SCN tissues from both wildtype and conditional null mice. RNAseq analysis showed a loss of oscillation in Bmal1 and changes in expression levels of other clock output genes. Moreover, a phase advance gene transcriptional profile using the TimeTeller algorithm suggests the presence of a regulatory network that could underlie the observed pattern of advanced activity onset in locomotor behavior in knockout mice.

      In figure1, the authors identified the ZFHX3 bound sites using ChIPseq and compared the loci with other histone marks that occur at promoters, TSS, enhancers and intergenic regions. And the analysis broadly points to a role for ZFHX3 in transcriptional regulation. The vast majority of nearly 40000 peaks overlapped H3K4me3 and K27ac marks, active promoters which also included genes falling under the GO category circadian rhythms. However, no significant differential ZFHX3 bound peaks were detected between ZT3 and ZT15. In these experiments, it is not clear if and how the different ChIP samples (ZFHX3 and histone PTM ChIPs) were normalized/downsampled for analysis. Moreover, it seems that ZFHX3 binding or recruitment has little to do with whether the promoters are active.

      (3) We thank the reviewer for their valuable comment. Different ChIP samples (ZFHX3 and histone PTM ChIPs) were treated in the same manner from preprocessing (quality control by FastQC, adapter trimming, alignment to mm10 genome) and peak calling was performed using respective input samples as control using MACS2 as mentioned in Methods. The data was normalized using bamCoverage tools and bigwig files were generated for visual inspection using UCSC Genome Browser. These additional details are added to Methods at line 592. Finally, BEDTools was employed to study overlapping peaks between ZFHX3 and histone PTMs.

      We agree that, alone, the current data does not make any claim for ZFHX3 being crucial for promoter to be active. Our data clearly suggests that a vast majority of ZFHX3 genomic binding in the SCN was observed at active promoters marked by H3K4me3 and H3K27ac and potentially regulating gene transcription.

      Based on a enrichment of ARNT domains next to K4Me3 and K27ac PTMs, the authors propose a model where the core-clock TFs and ZFHX3 interact. If the authors develop other assays beyond just predictions to test their hypothesis, it would strengthen the argument for role in circadian transcription in the SCN. It would be important in this context to perform a ChIP-seq experiment for ZFHX3 in the knockout animal (described from Figure 2 onwards) to eliminate the possibility of non-specific enrichment of signal from "open chromatin'. Alternatively, a ChIPseq analysis for BMAL1 or CLOCK could also strengthen this argument to identify the sites co-occupied by ZFHX3 and core-clock TFs.

      (4a) We agree that follow-up experiments such as BMAL1/CLOCK ChIPseq suggested by the reviewer will further confirm the proposed interaction of ZFHX3 with core-clock TFs. However, this is beyond the scope of the current study. 

      (4b) Again, conducting complementary ChIPseq in ZFHX3 knockout mice will strengthen the findings, but conducting TF-ChIPseq in a specific brain tissue such as the SCN (unlike peripheral tissues such as liver) does not only warrant use of multiple animals per sample but is also technically challenging and time-consuming to ensure specificity of the sample. For these reasons, datasets such as ours on the SCN are uncommon. Furthermore, in this particular context, we are certain that, based on current dataset, the ZFHX3 peaks (narrow) we observed were well-defined and met the specified statistical criteria mitigating any risk of signal arising from non-specific enrichment from open-chromatin regions.

      Next, they compared locomotor activity rhythms in floxed mice with or without tamoxifen treatment. As reported before in Wilcox et al 2017, the loss of ZFHX3 led to a shorter free running period and reduced amplitude and earlier onset of activity. Overall, the behavioral data in Figure 2 and supplementary figure 2 has been reported before and are not novel.

      (5) We recognise that a detailed circadian behavior assessment from adult mice lacking ZFHX3 has been conducted previously by Nolan lab (Wilcox et al; 2017). In the current study, however, we used a separate cohort of mice, to focus on the behavioral advance noted in 24-h LD cycle and generated a more refined assessment. Importantly, these mice were also used for transcriptomic studies as detailed in Figure 3, which we consider to be a positive feature of our experimental design: behavior and molecular analyses were performed on the same animals.

      Next, the authors performed RNAseq at 4hr intervals on wildtype and knockout animals maintained in light/dark cycles to determine the impact of loss of ZFHX3. Overall transcriptomic analysis indicated changes in gene expression in nearly 36% of expressed genes, with nearly half being upregulated while an equal fraction was downregulated. Pathways affected included mostly neureopeptide neurotransmitter pathways. Surprisingly, there was no correlation between the direction in change in expression and TF binding since nearly all the sites were bound by ZFHX3 and the active histone PTMs. The ChIP-seq experiment for ZFHX3 in the UBC-Cre+Tam mice again could help resolve the real targets of ZFHX3 and the transcriptional state in knockout animals.

      (6) We agree with the reviewer that most of the differentially expressed genes showed ZFHX3 binding at active promoter sites. That said, the current dataset is in line with recently published ZFHX3-CHIPseq data by Baca et al; 2024 [PMID: 38412861] in human neural stem cells and Hu et al; 2024 [PMID: 38871709] in human prostate cancer cells that clearly suggests ZFHX3 binds at active promoters and act as chromatin remodellers/mediators that modulate gene transcription depending on the accessory TFs assembled at target genes. Therefore, finding no correlation in the direction of change in expression is not striking. 

      To determine the fraction of rhythmic transcripts, Using dryR, the authors categorise the rhythmic transcriptome into modules that include genes that lose rhythmicity in the KO, gain rhythmicity in the KO or remain unaffected or partially affected. The analysis indicates that a large fraction of the rhythmic transcriptome is affected in the KO model. However, among core-clock genes only Bmal1 expression is affected showing a complete loss of rhythm. The authors state a decrease in Clock mRNA expression (line 294) but the panel figure 4A does not show this data. Instead it depicts the loss in Avp expression - {{ misstated in line 321 ( we noted severe loss in 24-h rhythm for crucial SCN neuropeptides such as Avp (Fig. 3a).}}

      (7a) Indeed, among the core-clock genes rhythmic expression is lost after ZFHX3 knockout only for Bmal1. However, given the mice were rhythmic (as assessed by wheel-running activity) in LD conditions, the observed 24-h gene expression rhythm in the majority of core-clock genes (Pers and Crys) is consistent with behavior data, and suggests towards an altered molecular clock with plausible scenarios as explained at line 439. That said, the unique and well-defined changes (amplitude and phase) observed as demonstrated in Figure 5 highlights a model in which ZFHX3 exerts differential control, for example in case of Per2 noted advance in molecular rhythm (~2-h), but no such change in Cry, presents an opportunity to delineate further the regulation of TTFL genes.

      (7b) Line 294 revised as – “Bmal1 demonstrating a complete loss of 24-h rhythm (Fig. 4A), and its counterpart Clock mRNA showing overall reduced expression levels (Supplementary Table 3)”.

      7c) Line 321 is referring to loss of Avp expression and the typo has been corrected from “Figure 3a to 4a”. Thank you. 

      However, core-clock genes such as Pers and Crys show minor or no change in expression patterns while Per2 and Per3 show a ~2hr phase advance. While these could only weakly account for the behavioral phase advance, the authors used TimeTeller to assess circadian phase in wildtype and ZFHX3 deficient mice. This approach clearly indicated that while the clock is not disrupted in the knockout animals, the phase advance can be correctly predicted from a network of gene expression patterns.

      Strengths:

      The authors use a multiomic strategy in order to reveal the role of the ZFHX3 transcription factor with a combination of TF and histone PTM ChIPseq, time-resolved RNAseq from wildtype and knockout mice and modeling the transcriptomic data using TimeTeller. The RNAseq experiments are nicely controlled and the analysis of the data indicates a clear impact on gene-expression levels in the knockout mice and the presence of a regulatory network that could underlie the advanced activity onset behavior.

      Weaknesses:

      It is not clear whether ZFHX3 has a direct role in any of the processes and seems to be a general factor that marks H3K4me3 and K27ac marked chromatin. Why it would specifically impact the core-clock TTFL clock gene expression or indeed daily gene expression rhythms is not clear either. Details for treatment of different ChIP samples (ZFHX3 and histone PTM ChIPs) on data normalization for analysis are needed. The loss of complete rhythmicity of Avp and other neuropeptides or indeed other TFs could instead account for the transcriptional deregulation noted in the knockout mice.

      (8) We thank the reviewer for the constructive feedback.  The current data suggests ZFHX3 acts as a mediating factor, occupying targeted active promoter sites and regulating gene expression by partnering with other key TFs in the SCN. Please see point 6 for clarification. The binding sites of ZFHX3 clearly showed enrichment for E-box(CACGTG) motif bound by CLOCK/BMAL1 along with binding sites for key SCN-specific TFs such as RFX (please see Supplementary Fig1). Our data thereby shows that it affects both core-clock and clock output genes (at varied levels) thereby exercising a pervasive control over the SCN transcriptome.

      For treatment of ChIP samples please see point 3. We followed ENCODE guidelines strictly. 

      Recommendations for the authors:

      Reviewer #1 (Recommendations for the authors):

      - The early activity onset associated with a short photoperiod is a phenotype found in mice with a perturbed function of the SCN like Per2 mutant (PMID: 17218255), or Clock KO (PMID: 22431615). Such disruption of the SCN function also leads to a faster synchronization to day feeding (PMID: 23824542) or jetlag (PMID: 25063847; PMID: 24092737). Therefore, authors should study the synchronizing function of these mice to day feeding and/or jetlag.

      (9) Please see our response to point 1.

      - The description of the negative controls needs clarification. While the "Method" suggests that both Cre- and Cre+ mice are treated with Tamoxifen, the text rather suggest that the controls are Cre- and Cre+ animals non-treated by Tamoxifen. Because of the potential effect of Tamoxifen on gene expression, Cre- treated animals are a required control.

      (10) We thank the reviewer. As detailed in Methods, both Cre- and Cre+ mice were treated with Tamoxifen and compared. The text had been revised at line 212. In addition to this, another genetic control (-Tamoxifen) was also used (Figure 2 and 3).

      - On line 486, authors wrote "It is important to note that although in the present study we used adult-specific Zfhx3 null mutants resulting in global loss of ZFHX3, the effects observed both at molecular and behavioural levels are independent of its functional role(s) in other tissues." On what evidence is this statement based? Using global KO rather suggest a potential role of other tissues.

      (11) We agree with the reviewer, but at line 486 we refer to the effects observed at circadian behavior and daily gene expression in the SCN to be independent of pleiotropic roles of ZFHX3 such as involvement in angiogenesis, spinocerebellar ataxia etc. We have revised the text.

      Reviewer #2 (Recommendations for the authors):

      It is not clear whether the behavioral experiments presented in this study were performed on a new set of animals - different from the cohort used in the Wilcox et al 2017 paper. For example, the proportion of total activity graphed in Figure 2C look strikingly similar to activity counts in Figure 3A in the prior publication (doi: 10.1177/0748730417722631)- down to the small burst in activity after ZT20 in the control (-Tam) group.

      (12) The behavioral experiments presented in this study were performed on a completely new cohort of mice to those used in Wilcox et al.; 2017. The mice used for behavioral assessment. In the current study were later used for molecular experiments. Please see point 5.

      Information on ChIP-seq such as read length, PE or SE seq, number of reads/replicate/condition/sample is missing. Versions of the softwares used should be indicated if known.

      (13) The details are added as:

      (13a) “Briefly, SCN punches were pooled from 80 mice at each. designated times (ZT3, ZT15) corresponding to one biological replicate per timepoint” at line 567.

      (13b) “24 ug sheared chromatin sample collected from each time point (ZT3, ZT15)” at line 571.

      (13c) “75-bp single end sequencing : 30 million reads/sample” at line 577.  

      (13d) “At line 584 – MACS algorithm v2.1.0 added”

      Versions of other softwares used were already mentioned.

    1. eLife Assessment

      This work presents a useful resource combining scRNA-seq and spatial transcriptomics studies to map mouse pre-clinical models of colorectal cancer, identifying distinct cellular programs and microenvironments that could enhance patient stratification and therapeutic approaches in colorectal cancer. While the novelty of the biological findings remains limited and incompletely supported by the evidence provided in the manuscript, the data were collected and analyzed using a validated methodology that will be of interest to the community in future studies.

    2. Reviewer #1 (Public review):

      Summary:

      The authors conducted a spatial analysis of dysplastic colon tissue using the Slide-seq method. Their main objective is to build a detailed spatial atlas that identifies distinct cellular programs and microenvironments within dysplastic lesions. Next, they correlated this observation with clinical outcomes in human colorectal cancer.

      Strengths:

      The work is a good example of utilising spatial methods to study different tumour models. The authors identified a unique stem cell program to understand tumours gently and improve patient stratification strategies.

      Weaknesses:

      However, the study's predominantly descriptive nature is a significant limitation. Although the spatial maps and correlations between cell states are interesting observations, the lack of functional validation-primarily through experiments in mouse models-weakens the causal inferences regarding the roles these cellular programs play in tumour progression and therapy resistance.

      The authors also missed an opportunity to link the mutational status of malignant cells with the cellular neighbourhoods.

      Overall, the study contributes to profiling the dysplastic colon landscape. The methodologies and data will benefit the research community, but further functional validation is crucial to validate the biological and clinical implications of the described cellular interactions.

    3. Reviewer #2 (Public review):

      In their study, Avraham-Davidi et al. combined scRNA-seq and spatial mapping studies to profile two preclinical mouse models of colorectal cancer: Apcfl/fl VilincreERT2 (AV) and Apcfl/fl LSL-KrasG12D Trp53fl/fl Rosa26LSL-tdTomato/+ VillinCreERT2 (AKPV). In the first part of the manuscript, the authors describe the analysis of the normal colon and dysplastic lesions induced in these models following tamoxifen injection. They highlight broad variations in immune and stromal cell composition within dysplastic lesions, emphasizing the infiltration of monocytes and granulocytes, the accumulation of IL-17+gdT cells, and the presence of a distinct group of endothelial cells. A major focus of the study is the remodeling of the epithelial compartment, where the most significant changes are observed. Using non-negative matrix factorization, the authors identify molecular programs of epithelial cell functions, emphasizing stemness, Wnt signaling, angiogenesis, and inflammation as major features associated with dysplastic cells. They conclude that findings from scRNA-seq analyses in mouse models are transposable to human CRC. In the second part of the manuscript, the authors aim to provide the spatial context for their scRNA-seq findings using Slide-seq and TACCO. They demonstrate that dysplastic lesions are disorganized and contain tumor-specific regions, which contextualize the spatial proximity between specific cell states and gene programs. Finally, they claim that these spatial organizations are conserved in human tumors and associate region-based gene signatures with patient outcomes in public datasets. Overall, the data were collected and analyzed using solid and validated methodology to offer a useful resource to the community.

      Main comments:

      (1) Clarity<br /> The manuscript would benefit from a substantial reorganization to improve clarity and accessibility for a broad readership. The text could be shortened and the number of figure panels reduced to emphasize the novel contributions of this work while minimizing extensive discussions on general and expected findings, such as tissue disorganization in dysplastic lesions. Additionally, figure panels are not consistently introduced in the correct order, and some are not discussed at all (e.g., Figure S1D; Figure 3C is introduced before Figure 3A; several panels in Figure 4 are not discussed). The annotation of scRNA-seq cell states is insufficiently explained, with no corresponding information about associated genes provided in the figures or tables. Multiple annotations are used to describe cell groups (e.g., TKN01 = γδ T and CD8 T, TKN05 = γδT_IL17+), but these are not jointly accessible in the figures, making the manuscript challenging to follow. It is also not clear what is the respective value of the two mouse models and time points of tissue collection in the analysis.

      (2) Novelty<br /> While the study is of interest, it does not present major findings that significantly advance the field or motivate new directions and hypotheses. Many conclusions related to tissue composition and patient outcomes, such as the epithelial programs of Wnt signaling, angiogenesis, and stem cells, are well-established and not particularly novel. Greater exploration of the scRNA-seq data beyond cell type composition could enhance the novelty of the findings. For instance, several tumor microenvironment clusters uniquely detected in dysplastic lesions (e.g., Mono2, Mono3, Gran01, Gran02) are identified, but no further investigation is conducted to understand their biological programs, such as applying nNMF as was done for epithelial cells. Additional efforts to explore precise tissue localization and cellular interactions within tissue niches would provide deeper insights and go beyond the limited analyses currently displayed in the manuscript.

      (3) Validation<br /> Several statements made by the authors are insufficiently supported by the data presented in the manuscript and should be nuanced in the absence of proper validation. For example:<br /> (a) RNA velocity analyses: The conclusions drawn from these analyses are speculative and need further support.<br /> (b) Annotations of epithelial clusters as dysplastic: These annotations could have been validated through morphological analyses and staining on FFPE slides.<br /> (c) Conservation of mouse epithelial programs in human tumors: The data in Figure S5B does not convincingly demonstrate the enrichment of stem cell program 16 in human samples. This should be more explicitly stated in the text, given the emphasis placed on this program by the authors.<br /> (d) Figure S6E: Cluster Epi06 is significantly overrepresented in spatial data compared to scRNA-seq, yet the authors claim that cell type composition is largely recapitulated without further discussion, which reduces confidence in other conclusions drawn.<br /> Furthermore, stronger validation of key dysplastic regions (regions 6, 8, and 11) in mouse and human tissues using antibody-based imaging with markers identified in the analyses would have considerably strengthened the study. Such validation would better contextualize the distribution, composition, and relative abundance of these regions within human tumors, increasing the significance of the findings and aiding the generation of new pathophysiological hypotheses.

    4. Author response:

      We thank the reviewers for their appreciation of our work and the recommendations to improve the manuscript. We have included a point-by-point response below. To summarize, for revision we plan to:

      • Clarify the manuscript to improve readability and coherence,

      • Ensure that all figures are thoroughly discussed in the text,

      • Tone down biological claims based on RNA velocity where applicable.

      While we agree with the reviewer that functional validation and/or spatial proteomics data accompanying this study could provide additional insights and broader contextualization, this is unfortunately beyond the scope of the study.

      Reviewer #1 (Public review):

      Summary:

      The authors conducted a spatial analysis of dysplastic colon tissue using the Slide-seq method. Their main objective is to build a detailed spatial atlas that identifies distinct cellular programs and microenvironments within dysplastic lesions. Next, they correlated this observation with clinical outcomes in human colorectal cancer.

      Strengths:

      The work is a good example of utilising spatial methods to study different tumour models. The authors identified a unique stem cell program to understand tumours gently and improve patient stratification strategies.

      Weaknesses:

      However, the study's predominantly descriptive nature is a significant limitation. Although the spatial maps and correlations between cell states are interesting observations, the lack of functional validation-primarily through experiments in mouse models-weakens the causal inferences regarding the roles these cellular programs play in tumour progression and therapy resistance.

      We thank the reviewer for this comment. Indeed, functional validation to pin down causal dependencies and a more thorough investigation of tumor progression and therapy resistance both in mouse model as well as human patients and/or patient derived samples would broaden the insights to be gained from this work. Unfortunately, this is beyond the scope of this study.

      The authors also missed an opportunity to link the mutational status of malignant cells with the cellular neighbourhoods.

      The data reported in this study only contains spatial data for one mouse model (AV). As spatial data for the other model (AKPV) is missing, it is not possible to link the mutational type of the model with the cellular neighborhoods. We did investigate whether there is extra "somatic" mutational heterogeneity in the AV data, both regarding single nucleotide variations (SNVs) and copy number variations (CNVs). But at the time when the mice were sacrificed (after 3 weeks) there was no significant mutational heterogeneity discoverable.

      Overall, the study contributes to profiling the dysplastic colon landscape. The methodologies and data will benefit the research community, but further functional validation is crucial to validate the biological and clinical implications of the described cellular interactions.

      Reviewer #2 (Public review):

      In their study, Avraham-Davidi et al. combined scRNA-seq and spatial mapping studies to profile two preclinical mouse models of colorectal cancer: Apcfl/fl VilincreERT2 (AV) and Apcfl/fl LSL-KrasG12D Trp53fl/fl Rosa26LSL-tdTomato/+ VillinCreERT2 (AKPV). In the first part of the manuscript, the authors describe the analysis of the normal colon and dysplastic lesions induced in these models following tamoxifen injection. They highlight broad variations in immune and stromal cell composition within dysplastic lesions, emphasizing the infiltration of monocytes and granulocytes, the accumulation of IL-17+gdT cells, and the presence of a distinct group of endothelial cells. A major focus of the study is the remodeling of the epithelial compartment, where the most significant changes are observed. Using non-negative matrix factorization, the authors identify molecular programs of epithelial cell functions, emphasizing stemness, Wnt signaling, angiogenesis, and inflammation as major features associated with dysplastic cells. They conclude that findings from scRNA-seq analyses in mouse models are transposable to human CRC. In the second part of the manuscript, the authors aim to provide the spatial context for their scRNA-seq findings using Slide-seq and TACCO. They demonstrate that dysplastic lesions are disorganized and contain tumor-specific regions, which contextualize the spatial proximity between specific cell states and gene programs. Finally, they claim that these spatial organizations are conserved in human tumors and associate region-based gene signatures with patient outcomes in public datasets. Overall, the data were collected and analyzed using solid and validated methodology to offer a useful resource to the community.

      Main comments:

      (1) Clarity

      The manuscript would benefit from a substantial reorganization to improve clarity and accessibility for a broad readership. The text could be shortened and the number of figure panels reduced to emphasize the novel contributions of this work while minimizing extensive discussions on general and expected findings, such as tissue disorganization in dysplastic lesions. Additionally, figure panels are not consistently introduced in the correct order, and some are not discussed at all (e.g., Figure S1D; Figure 3C is introduced before Figure 3A; several panels in Figure 4 are not discussed). The annotation of scRNA-seq cell states is insufficiently explained, with no corresponding information about associated genes provided in the figures or tables. Multiple annotations are used to describe cell groups (e.g., TKN01 = γδ T and CD8 T, TKN05 = γδT_IL17+), but these are not jointly accessible in the figures, making the manuscript challenging to follow. It is also not clear what is the respective value of the two mouse models and time points of tissue collection in the analysis.

      We thank the reviewer for this suggestion. For the revision we plan to clarify the manuscript to improve readability and coherence in text and figures, and expand on the cell type nomenclature.

      (2) Novelty

      While the study is of interest, it does not present major findings that significantly advance the field or motivate new directions and hypotheses. Many conclusions related to tissue composition and patient outcomes, such as the epithelial programs of Wnt signaling, angiogenesis, and stem cells, are well-established and not particularly novel. Greater exploration of the scRNA-seq data beyond cell type composition could enhance the novelty of the findings. For instance, several tumor microenvironment clusters uniquely detected in dysplastic lesions (e.g., Mono2, Mono3, Gran01, Gran02) are identified, but no further investigation is conducted to understand their biological programs, such as applying nNMF as was done for epithelial cells. Additional efforts to explore precise tissue localization and cellular interactions within tissue niches would provide deeper insights and go beyond the limited analyses currently displayed in the manuscript.

      We thank the reviewer for this comment. Our study aimed to spatially characterize the tumor microenvironment, with scRNA-seq analysis serving to support this spatial characterization.<br /> Due to technical limitations—such as the number of samples and the limited capture efficiency of Slide-seq—the resolution of immune cell identification in our spatial analysis is constrained. Additionally, while immune and stromal cells formed distinct clusters, epithelial cells exhibited a continuum that was better captured using nNMF.

      Lastly, our manuscript provides a general characterization of monocyte and granulocyte populations in scRNA-seq (line 142) and their spatial microenvironments (line 390). We believe that additional analyses of these populations would be beyond the scope of this study and could place an unnecessary burden on the reader. Instead, we suggest that such analyses be explored in future studies.

      We remark that we analyzed tissue localization for two entirely different spatial transcriptomics assays (Slide-seq and Cartana) to the resolution of cell types and programs, which was feasible within the constraints of the sparsity and gene panel and sample size in the experiments. A path to further increase the resolution of investigation in this dataset is to include other datasets, e.g. by the emerging transformer-based spatial transcriptomics integration methods, which unfortunately is outside the scope of the current study.

      We also remark that the current manuscript already includes an investigation of cellular interactions within tissue niches based on COMMOT (Fig 4k, Fig S8i, Supp Item 4).

      (3) Validation

      Several statements made by the authors are insufficiently supported by the data presented in the manuscript and should be nuanced in the absence of proper validation. For example:<br /> (a) RNA velocity analyses: The conclusions drawn from these analyses are speculative and need further support.

      We thank the reviewer for this comment. We will clarify that our conclusions from the RNA velocity analysis need further support by experimental validation, which is out of the scope of the study.

      (b) Annotations of epithelial clusters as dysplastic: These annotations could have been validated through morphological analyses and staining on FFPE slides.

      We thank the reviewer for this comment. While this could have been a possible approach, our study primarily relies on scRNA-seq, which does not preserve tissue morphology, and Slide-seq of fresh tissue, where such an analysis is particularly challenging.

      (c) Conservation of mouse epithelial programs in human tumors: The data in Figure S5B does not convincingly demonstrate the enrichment of stem cell program 16 in human samples. This should be more explicitly stated in the text, given the emphasis placed on this program by the authors.

      We thank the reviewer for pointing this out. Indeed, Figure S5B does not demonstrate the program 16 enrichment in human samples. We will clarify this in the manuscript.

      (d) Figure S6E: Cluster Epi06 is significantly overrepresented in spatial data compared to scRNA-seq, yet the authors claim that cell type composition is largely recapitulated without further discussion, which reduces confidence in other conclusions drawn.

      We thank the reviewer for this remark. Indeed, Epi06 was a cluster which drew our attention during early analyses for its mixed expression profiles with contributions of vastly different cell types. We concluded that this is best explained by doublets and excluded it from further analysis. In the current manuscript we only briefly hinted at this in figure legend 2A ("Cluster Epi06: doublets (not called by Scrublet)"), and we will expand on this in the revised manuscript. The observation that this cluster is significantly overrepresented in the annotation of the spatial data is not surprising in this context as this annotation comes from the decomposition of compositional data which contains contributions of multiple cells per Slide-seq bead which are structurally very similar to doublets. We will add this point as well to the revised manuscript.

      Furthermore, stronger validation of key dysplastic regions (regions 6, 8, and 11) in mouse and human tissues using antibody-based imaging with markers identified in the analyses would have considerably strengthened the study. Such validation would better contextualize the distribution, composition, and relative abundance of these regions within human tumors, increasing the significance of the findings and aiding the generation of new pathophysiological hypotheses.

      We agree with the reviewer with their assessment that validation by antibody-based imaging (or other spatial proteomics data) would have been useful follow-up experiments to the experiments and results presented in our manuscript, yet these are beyond the scope of the current study.

    1. eLife Assessment

      This is an important study linking olfactory bulb activity not only to sniffing parameters but also to movement and place. The evidence for odor sampling is mostly solid, but the analysis supporting the potentially exciting result on the encoding of place is currently incomplete.

    2. Reviewer #1 (Public review):

      In this manuscript, Sterrett et al. assess whether and how the olfactory system may integrate odor-driven activity with contextual, egocentric variables such as instantaneous location in space and active odor sampling. To address this, they co-record respiration and the spiking activity of principal output neurons of the mouse olfactory bulb (OB), while mice explore a small arena in the absence of any explicit reward or task structure. The authors find that mice exploring the arena breathe in bouts, switching between discrete states of particular breathing rates that persist over varying time scales (seconds to minutes). This state-like activity is also apparent in the OB population activity. Zooming into the activity of individual OB neurons, the authors show that OB activity in this setting is primarily modulated by respiration. In general, while the response times of individual neurons remain tightly locked to the inhalation onset, the overall response amplitude is modulated by the instantaneous sniff frequency. The authors further suggest that a subset of OB neurons appear to show place-selectivity, in a manner that is not explained simply by respiratory or olfactory variables.

      Overall this work addresses an important question regarding the basic temporal structuring of odor sampling behavior and activity patterns in the mouse OB. A good understanding of these features is essential to further investigate how stimulus and/or task-driven activity may add on top of this already ongoing modulation. The authors do a commendable job of analyzing the behavior and neuronal activity using a variety of analysis methods. However, in its current form, the results presented are high-level summary figures that are largely comparative (role of parameter A vs B) and hard to assess quantitatively (how well does a given parameter/model explain the responses to begin with). This makes it hard to build a clear model of the underlying mechanisms and to evaluate alternative hypotheses. These concerns can largely be addressed by some additional analyses and by presenting more intermediate-stage output of their existing analyses. In addition, the authors report that a small fraction of OB neurons show spatially selective firing patterns, akin to those observed in the Hippocampus. While this is a very exciting possibility, in my opinion, the data and analysis presented currently are not sufficient to conclude this and additional experiments would be required to test this rigorously.

      Major concerns:

      A) Regarding the claim about Spatial selectivity in OB neuron responses:

      i) From the data presented, it is very hard to assess whether a simple modulation of sniff rate, selectively in some parts of the arena can explain apparent spatial selectivity. The authors attempt to address this concern with Figure 8 - Figure Supplement 1, but the presented combinatorial color maps are hard to interpret. A simpler format would be to show the sniff-aligned raster of the given unit in question along with a heatmap (location distribution) of the actual sniff rates in the arena (not the behavioral states).

      If the authors allow the mice to explore the arena over large periods, such that the sniff rates are relatively uniform in space, are the place fields still apparent? A complementary control is to compare responses in the 'place field' with other parts in the arena with comparable sniff rate distributions.

      ii) The analysis shown in Figure 8 suggests that sniff parameters are the main predictors of individual neuron responses. The authors point out that there is however a small, but significant fraction of cells that are better predicted by place than by the sniff parameters. It would be useful to provide more raw data to get a better sense of what distinguishes these cells from the rest. Are spatially selective cells typically less sniff-aligned on average? Do they tend to be less or more frequency-modulated?

      iii) The authors compare the decoding performance of OB and hippocampal neurons. While it appears space can indeed be decoded from OB neurons, it would be useful to know how the performance scales with the number of neurons and number of traversals in the arena in the two brain regions. Further, the authors should provide some analysis of the robustness of these apparent 'place fields' within a session.

      iv) The floor rotation control is underwhelming. First, the arena is quite small and one would generally expect this to impact much more so the 'place fields' that are biased towards the corners than in the center. Second, olfactory cues on the walls may be as important - why did the authors not rotate the entire arena?

      Considering the possibility that floor rotation rules out trivial olfactory explanations, what would happen if the authors rotated the entire arena? If these are truly place fields, then one should expect that while they are robust to floor rotation, they should reformat if the distal cues change. Without these additional analyses, I find it hard to conclude the presence of spatial selectivity in the OB.

      Moderate concerns:

      B) Regarding the lack of state-like structure during head-fixation:

      While it is clear that overall sniff rates are lower and that mice do not typically sniff at peak rates during head-fixation, it is unclear if the transitions in breathing rhythm are necessarily less structured, and further whether this can be attributed to head-fixation alone. For example, if the mice are head-fixed but in a floating-platform arena or VR that is non-static - the sniffing distributions may change dramatically.

      i) The breathing patterns shown in Figure 1E, in particular during the second head-fixation phase do not appear fundamentally different from the freely moving stretch (20-30 minute window). If one subsamples the free-moving data to match overall sniff distributions, will the long-timescale autocorrelation still be more apparent in freely moving stretches than in the head-fixation periods?

      ii) Are the mice on a running wheel? How does the overall distribution of sniff rates and temporal structure change if the mice are head-fixed but simply allowed to run?

      Minor concerns:

      C) Regarding the parsing of breathing and movement into 3 distinct behavioral states:<br /> The authors show breathing patterns of freely exploring mice are temporally structured with extended bouts of sniffing at select rates. They use a HMM model to show that this structure can be captured by a 3 state-model wherein each state can be thought of as a joint distribution of movement and sniff rate. While the approach is interesting and the data are well presented, I have some minor concerns regarding the exact interpretation.

      i) While the relationship between movement and sniffing is indeed non-trivial, it is unclear if the statelike partitioning requires the incorporation of the movement variable at all in the HMM model. The state-like patterns are also apparent if one focuses exclusively on the instantaneous sniff rate while ignoring movement velocities (Figure 1 - Figure Supplement 1) or the inferred HMM states (Figure 1E). Have the authors tried modeling the breathing activity alone using an HMM with each state just being a biased distribution of sniff rates, from which the instantaneous sniff rate is drawn? Will the authors' conclusions be fundamentally different from such a model?

      ii) While it is clear that there are at least 2 distinct states a) resting (mice are generally uninterested and sniff at 2-3 Hz) and b) exploration (mice are interested in their local environment and sniff rapidly). It is hard to assess whether there is indeed a third distinct and behaviorally interpretable state that the authors call grooming or are there simply intervening periods where it is unclear what's driving the variability in sniff rates - change in movement speed, moderate curiosity, boredom, etc. From the movement velocities shown in the supplement (Figure 1 - Figure Supplement 1), it appears that the movement speed during this 'grooming' state is significantly higher than at rest. It is not obvious why a mouse should move around more while grooming. It would help if the authors provide supporting data, perhaps from behavioral pose analysis to better justify the classification of this state as grooming or alternatively choose a different name to avoid confusion.

      iii) Insufficient analysis of state transition matrices: The authors do not show the transition matrices for individual sessions and/or mice. This limits what one can learn about the behavior from the 3 state modeling of breathing states. Do individual mice have stereotypical transition patterns across sessions? How well does the model perform: can one predict the expected sniff rate in one part of the session from knowing sniff patterns in another part of the session?

      D) Regarding the dependence of individual neuron responses on sniff and movement parameters:

      i) Could the authors report the relative proportions of sniff frequency insensitive vs. frequency sensitive neurons in their data?

      ii) Could some of the striking frequency modulation the authors show in Figure 3A result from the fact that mice selectively sniffed at high or low rates in different parts of the arena? While it is unlikely that all of the modulation the authors see results from the location/presence of trace odors in different parts of the arena, it would be informative to perform the same analysis on the data recorded during head-fixation where its external environment is less variable.

      iii) Comparison of SnF latency profiles between head-fixed and freely moving conditions:<br /> The SnF latency profiles of a given OB neuron appear strikingly similar during head-fixed and freely moving conditions. It would be useful if the authors could explicitly quantify this.

      iv) Comparison of SnF frequency profiles between head-fixed and freely moving conditions: The authors comment that SnF frequency profiles are different across the head-fixed versus freely moving conditions and that they do not observe the 3 distinct clusters present in the freely moving state in their head-fixed data. If true, this is an interesting observation. Together with the observation of relatively similar SnF latency profiles in both head-fixed and freely moving conditions, this implies that sniff frequency dependence is selectively enhanced during free-moving behavior perhaps through a top-down signal.

      However, this is hard to conclude from the current data as the overall distribution of sniff rates is very different in the two conditions, with a clear underrepresentation of high-frequency sniffs in the head-fixed periods. To enable a fair comparison, the authors should undersample the sniffs in the freely moving period and compare sniff fields constructed from frequency-matched distributions.

      v) The authors suggest that the 2 types of SnF latency profiles may putatively map onto tufted and mitral cells. While this is an interesting possibility, it would be nice to support the claim with auxiliary analysis of other features such as recording depth, baseline firing rates, spike shapes, etc that indicate that these are indeed two different cell types.

    3. Reviewer #2 (Public review):

      In this study, the authors investigate the structure of breathing rhythms in freely moving mice during exploratory behaviour in the absence of explicit cues or tasks. Additionally, they link behavioural states, derived from sniffing frequency and speed movement data, to the neural activity recorded in the olfactory bulb (OB). To further characterize OB neuronal responses, the authors introduce the concept of "sniff fields" which consider the joint distribution of sniff frequency and the latency from inhalation. Lastly, they explore how OB neurons encode spatial information, and they compare this finding with previously known spatially encoding cells in the hippocampus.

      The authors successfully establish that breathing in freely moving mice is structured even in the absence of explicit olfactory cues. By simultaneously recording sniffing and movement data, they find that this structure is associated with movement in a non-linear manner and can be modelled using a Hidden Markov Model (HMM). Interestingly, they demonstrate that neuronal activity in the OB tracks this behavioural structure by showing that HMM states can effectively cluster the neural data. Additionally, they describe OB activity using sniff fields, advancing our understanding of how individual neurons encode sniffing properties such as frequency and phase. Furthermore, they report unprecedented findings showing that some OB neurons encode place independently of the sniffing field contribution. Overall, the authors provide valuable insights regarding the contribution of different behavioural variables to OB activity.

      However, some of the conclusions presented by the authors are not fully supported by the data provided. Quantitative analysis and statistical tests are missing from the description of the breathing structure. Regarding spatial encoding, the authors claim in the abstract that "at the population level, a mouse's location can be decoded from olfactory bulb with similar accuracy to hippocampus". However, they show that place was significantly decoded in only 18/31 sessions from OB activity, and in 12/13 sessions from hippocampal activity. No further comparison of decoding accuracy between OB and HPC is provided. Moreover, it is unclear whether place contributes independently of movement, which was previously shown in this study to influence neuronal activity.

      Additionally, there is a lack of methodological detail regarding the experimental procedures, which could affect the interpretation of the data. Specifically, information is missing on aspects such as head-fixed conditions, the number of mice used per experiment, and the number of sessions per mouse.

      Studying mice behaviour in more naturalistic conditions, without explicit tasks, is a very interesting approach that provides new insights into the structure of sniffing and its neuronal representation. The fact that some OB neurons encode spatial information is highly relevant beyond the field of olfaction, even though this information was not as accessible as in the hippocampus. I believe the manuscript would benefit from a revision to ensure the text aligns more closely with the data presented in the figures.

    4. Author response:

      We thank the editor and reviewers for recognizing the value of studying neural dynamics and behavior in naturalistic, task-free conditions and the importance of linking olfactory bulb activity to movement and place.  We appreciate the suggestions for analyses and edits to further quantify these relationships and clarify our interpretation.

      The primary sticking point regards our result that olfactory bulb neurons are selective for place:

      “analysis supporting the potentially exciting result on the encoding of place is currently incomplete”

      In this paper, we report evidence for spatial selectivity in the olfactory bulb, make relative comparisons with canonical “place cells” in the hippocampus, and control for alternative hypotheses such as odor- or behavior-driven sources, to motivate future experiments which can more precisely identify the mechanistic basis of these responses. Throughout the reviews, our result on the correlation of OB activity with place is not questioned, but rather whether we can better determine how much behavior or odor explain this result. Regarding the concern about behavior, we are confident that the spatial non-uniformities of breathing rhythms do not explain OB spatial selectivity based on the analyses included in the paper. We thank the reviewers for suggestions of additional analyses with which we can further test this claim and will incorporate several, as we will detail below.

      Regarding the points about odor, indeed we do not claim that we have entirely ruled out odors as an explanation of place selectivity in the bulb. Rather, our claim is that our analyses show that scent marks on the floor, the most obvious olfactory place cue, cannot fully explain place selectivity.  We acknowledge that our experiments do not exclude the possibility that other odors in the environment may also contribute. Odors are invisible and difficult to measure, and the odor sensitivity of rodents vastly outstrips that of any device known to humanity. Indeed, no study of which we are aware can fully rule out odor as a cue to the animal’s internal model of place. However, encoding of place, even if explained by odor, is still encoding of place. We will clarify our interpretation of the data, and we thank the reviewers for proposing ideas for further analysis, some of which we are implementing. However, experiments such as effects of distal cues on spatially selective olfactory bulb neurons are beyond the scope of this paper.

      We will further test whether neurons in the olfactory bulb are spatially selective by reporting additional statistical analyses including:

      - More completely quantifying the spatial distribution of sniffing patterns (visualized in Figure 8 - Sup 1) by plotting sniff-frequency distributions across locations in the arena.

      - Demonstrating independent contribution of place over speed in GLMs

      - Characterizing the temporal stability of spatially selective cells across a session (1st half vs second half)

      - reporting mean decoding errors for olfactory bulb and hippocampal decoders (visualized in Fig 7C)

      We will add to the analyses of behavioral state models by:

      - Comparing the performance of hidden Markov models fit to breathing frequency alone with those fit to breathing frequency and movement speed

      - Quantifying individual differences in state-transition matrices

      Further, we address the question around the use of “grooming” as a descriptor of the intermediate sniff frequency state. We used the term ‘grooming’ based on extensive video observation. During this state, ‘Speed’ is significantly non-zero because we defined speed as the movement of the head keypoint which moves substantially during grooming. We will make this point more explicit in the figures and text, and we will provide additional video documentation of these and the other behavioral states.

      Lastly, we will further discuss the fact stated in the first paragraph of the Results section that mice are placed in “head-fixation on a stationary platform” and thus inhibited from running. While different breathing states than those observed in our stationary platform may occur during head-fixation with a treadmill, we believe the differences between head-fixed running and free moving running are beyond the scope of this paper. Nevertheless, it’s an important point that we will more explicitly discuss in our revision.

      We appreciate these constructive comments and hope these additional analyses and textual edits will help clarify our interpretations and motivate future experiments to further test and refine them.

    1. eLife Assessment

      This valuable manuscript presents a potentially novel mechanism by which the phospholipid scramblase, PLSCR1, defends against influenza A virus infection. The paper was based on solid findings involving knockout and lung-specific over-expressing Plscr1 mice, airway tissue expression, and mechanistic studies to show Plscr1 enhances type III interferon-mediated viral clearance. The study is extensive and overall well performed.

    2. Reviewer #1 (Public review):

      This manuscript by Yang et al. presents a potentially novel mechanism by which Plscr1 defends against influenza virus infection. Using a global knockout (KO) and a tissue-specific overexpression mouse model, the authors demonstrate that Plscr1-KO mice exhibit increased susceptibility and inflammation following IAV infection. In contrast, overexpression of Plscr1 in ciliated epithelial cells protects mice from infection. Through transcriptomic analysis in mice and mechanistic studies in cell culture models, the authors reveal that Plscr1 transcriptionally upregulates Ifnlr1 expression and physically interacts with this receptor on the plasma membrane, thereby enhancing IFN-λ-mediated viral clearance.

      Overall, it's a well-performed study, however, causality between Plscr1 and Ifnlr1 expression needs to be more firmly established. This is because two recent studies of PLSCR1 KO cells infected with different viruses found no major differences in gene expression levels compared with their WT controls (Xu et al. Nature, 2023; LePen et al. PLoS Biol, 2024). There were also defects in the expression of other cytokines (type I and II IFNs plus TNF-alpha) so a clear explanation of why Ifnlr1 was chosen should also be given.

      While Plscr1 has long been recognized as a cell-intrinsic antiviral restriction factor, few studies have explored its broader physiological role. This study thus provides interesting insights into a specific function of Plscr1 in IAV-permissive airway epithelial cells and its contribution to whole-body anti-viral immunity. There are three important issues that should be addressed, and several minor points should also be considered.

      (1) The authors propose that Plscr1 restricts IAV infection by regulating the type III IFN signaling pathway. While the data show a positive correlation between Ifnlr1 and Plscr1 levels in both mouse and cell culture models, additional evidence is needed to establish causality between the impaired type III IFN pathway, and the increased susceptibility observed in Plscr1-KO mice. To strengthen this conclusion, the following experiments could be undertaken: (i) Measure IAV titers in WT, Plscr1-KO, Ifnlr1-KO, and Plscr1/ Ifnlr1-double KO cells. If the antiviral activity of Plscr1 is highly dependent on Ifnlr1, there should be no further increase in IAV titers in double KO cells compared to single KO cells; (ii) over-express Plscr1 in Ifnlr1-KO cells to determine if it still inhibits IAV infection. If Plscr1's main action is to upregulate Ifnlr1, then it should not be able to rescue susceptibility since Ifnlr1 cannot be expressed in the KO background. If Plscr1 over-expression rescues viral susceptibility, then there are Ifnlr1-independent mechanisms involved. These experiments should help clarify the relative contribution of the type III IFN pathway to Plscr1-mediated antiviral immunity.

      (2) Transcriptional activation of IFNLR1 by Plscr1 is a central mechanistic conclusion of this manuscript. A ChIP assay was used to demonstrate direct binding between Plscr1 and the Ifnrl1 promoter region. This single evidence does not sufficiently prove the role of Plscr1 in transcriptional activation. Other forms of evidence would help make this mechanistic explanation more compelling. For example, nuclear un-on experiments would demonstrate Ifnrl1 mRNA synthesis in addition to promoter binding.

      (3) In Figure 4, the authors demonstrate the interaction between Plscr1 and Ifnlr1. They suggest that this interaction modulates IFN-λ signaling. However, Figures 5C-E show that the 5CA mutant, which lacks surface localization and the ability to bind Ifnlr1, exhibits similar anti-flu activity to WT Plscr1. Does this mean the interaction between Plscr1 and Ifnlr1 is dispensable for Plscr1-mediated antiviral function? Can the authors compare the activation of IFN-λ signaling pathway in Plscr1-KO cells expressing empty vector, WT Plscr1, and 5CA mutant? This could be done by measuring downstream ISG expression or using an ISRE-luciferase reporter assay upon IFN-λ treatment.

    3. Reviewer #2 (Public review):

      This nice study explores the role of phospholipid scramblase 1 (PLSCR1) in regulating antiviral immunity and host morbidity during influenza A virus (IAV) infection. The authors identify PLSCR1 as a critical regulator of interferon-lambda receptor 1 (IFNLR1) expression, acting through enzymatic-independent mechanisms. Using PLSCR1-deficient and conditional overexpression mouse models, the study demonstrates that PLSCR1 enhances antiviral responses and mitigates inflammation, potentially through modulating type III interferon (IFN-λ) signaling. While the findings underline the importance of PLSCR1 in early viral control and tissue homeostasis, they also highlight its cell-specific functions, particularly in ciliated airway epithelial cells. This work contributes to understanding the interplay between host factors and antiviral pathways, paving the way for novel therapeutic strategies targeting host proteins.

      Specific Comments:

      (1) The statement that type I interferons are expressed by "almost all cells" is inaccurate (line 61). Type I IFN production is also context-dependent and often restricted to specific cell types upon infection or stimulation.

      (2) The antiviral response is assessed solely through flu M gene expression. Incorporating infectious virus titers (e.g., TCID50 or plaque assay) would provide a more robust and direct measure of antiviral activity.

      (3) While mRNA expression of interferons is measured, protein levels (e.g., through ELISA) should also be quantified to establish the functional relevance of IFN expression changes.

      (4) It is unclear whether reduced IFNLR1 expression translates to defective downstream signaling or antiviral responses after IFN-λ treatment in PLSCR1-deficient cells. This is particularly pertinent given the increase in IFN-λ ligand in vivo, which might compensate for receptor downregulation.

      (5) Detailed gating strategies for immune cell subsets are absent and should be included for clarity and reproducibility.

      (6) The study does not definitively establish that reduced IFN-λ signaling causes the observed in vivo phenotype. Increased morbidity and mortality in PLSCR1-deficient mice could also stem from elevated TNF-α levels and lung damage, as proinflammatory cytokines and/or enhanced lung damage are known contributors to influenza morbidity and mortality. This point warrants detailed discussions.

    4. Reviewer #3 (Public review):

      Summary:

      Yang et al. have investigated the role of PLSCR1, an antiviral interferon-stimulated gene (ISG), in host protection against IAV infection. Although some antiviral effects of PLSCR1 have been described, its full activity remains incompletely understood.

      This study now shows that Plscr1 expression is induced by IAV infection in the respiratory epithelium, and Plscr1 acts to increase Ifn-λr1 expression and enhance IFN-λ signaling possibly through protein-protein interactions on the cell membrane.

      Strengths:

      The study sheds light on the way Ifnlr1 expression is regulated, an area of research where little is known. The study is extensive and well-performed with relevant genetically modified mouse models and tools.

      Weaknesses:

      There are some issues that need to be clarified/corrected in the results and figures as presented.

      Also, the study does not provide much information about the role of PLSCR1 in the regulation of Ifn-λr1 expression and function in immune cells. This would have been a plus.

    1. eLife Assessment

      This study investigates the conditions under which abstract knowledge transfers to new learning. It presents convincing evidence across a number of behavioral experiments that when explicit awareness of learned statistical structure is present, knowledge can transfer immediately, but that otherwise similar transfer requires sleep-dependent consolidation. The valuable results provide new constraints on theories of transfer learning and consolidation.

    2. Reviewer #1 (Public review):

      Summary:

      This paper investigates the effects of the explicit recognition of statistical structure and sleep consolidation on the transfer of learned structure to novel stimuli. The results show a striking dissociation in transfer ability between explicit and implicit learning of structure, finding that only explicit learners transfer structure immediately. Implicit learners, on the other hand, show an intriguing immediate structural interference effect (better learning of novel structure) followed by successful transfer only after a period of sleep.

      Strengths:

      This paper is very well written and motivated, and the data are presented clearly with a logical flow. There are several replications and control experiments and analyses that make the pattern of results very compelling. The results are novel and intriguing, providing important constraints on theories of consolidation. The discussion of relevant literature is thorough. In sum, this work makes an exciting and important contribution to the literature.

    3. Reviewer #2 (Public review):

      Summary:

      Sleep has not only been shown to support the strengthening of memory traces but also their transformation. A special form of such transformation is the abstraction of general rules from the presentation of individual exemplars. The current work used large online experiments with hundreds of participants to shed further light on this question. In the training phase participants saw composite items (scenes) that were made up of pairs of spatially coupled (i.e., they were next to each other) abstract shapes. In the initial training, they saw scenes made up of six horizontally structured pairs and in the second training phase, which took place after a retention phase (2 min awake, 12 hour incl. sleep, 12 h only wake, 24 h incl. sleep), they saw pairs that were horizontally or vertically coupled. After the second training phase, a two-alternatives-forced-choice (2-AFC) paradigm, where participants had to identify true pairs versus randomly assembled foils, was used to measure performance on all pairs. Finally, participants were asked five questions to identify, if they had insight into the pair structure and post-hoc groups were assigned based on this. Mainly the authors find that participants in the 2 minute retention experiment without explicit knowledge of the task structure were at chance level performance for the same structure in the second training phase, but had above chance performance for the vertical structure. The opposite was true for both sleep conditions. In the 12 h wake condition these participants showed no ability to discriminate the pairs from the second training phase at all.

      Strengths:

      All in all, the study was performed to a high standard and the sample size in the implicit condition was large enough to draw robust conclusions. The authors make several important statistical comparisons and also report an interesting resampling approach. There is also a lot of supplemental data regarding robustness.

      Weaknesses:

      My main concern regards the small sample size in the explicit group and the lack of experimental control.

    4. Reviewer #3 (Public review):

      In this project, Garber and Fiser examined how the structure of incidentally learned regularities influences subsequent learning of regularities, that either have the same structure or a different one. Over a series of six online experiments, it was found that the structure (spatial arrangement) of the first set of regularities affected learning of the second set, indicating that it has indeed been abstracted away from the specific items that have been learned. The effect was found to depend on the explicitness of the original learning: Participants who noticed regularities in the stimuli were better at learning subsequent regularities of the same structure than of a different one. On the other hand, participants whose learning was only implicit had an opposite pattern: they were better in learning regularities of a novel structure than of the same one. However, when an overnight sleep separated the first and second learning phases, this opposite effect was reversed and came to match the pattern of the explicit group, suggesting that the abstraction and transfer in the implicit case were aided by memory consolidation.

    5. Author response:

      The following is the authors’ response to the previous reviews.

      Public Reviews:

      Reviewer #1 (Public review):

      Summary:

      This paper investigates the effects of the explicit recognition of statistical structure and sleep consolidation on the transfer of learned structure to novel stimuli. The results show a striking dissociation in transfer ability between explicit and implicit learning of structure, finding that only explicit learners transfer structure immediately. Implicit learners, on the other hand, show an intriguing immediate structural interference effect (better learning of novel structure) followed by successful transfer only after a period of sleep.

      Strengths:

      This paper is very well written and motivated, and the data are presented clearly with a logical flow. There are several replications and control experiments and analyses that make the pattern of results very compelling. The results are novel and intriguing, providing important constraints on theories of consolidation. The discussion of relevant literature is thorough. In sum, this work makes an exciting and important contribution to the literature.

      Weaknesses:

      There have been several recent papers which have identified issues with alternative forced choice (AFC) tests as a method of assessing statistical learning (e.g. Isbilen et al. 2020, Cognitive Science). A key argument is that while statistical learning is typically implicit, AFC involves explicit deliberation and therefore does not match the learning process well. The use of AFC in this study thus leaves open the question of whether the AFC measure benefits the explicit learners in particular, given the congruence between knowledge and testing format, and whether, more generally, the results would have been different had the method of assessing generalization been implicit. Prior work has shown that explicit and implicit measures of statistical learning do not always produce the same results (eg. Kiai & Melloni, 2021, bioRxiv; Liu et al. 2023, Cognition).

      The authors argued in their response to this point that this issue could have quantitative but not qualitative impacts on the results, but we see no reason that the impact could not be qualitative. In other words, it should be acknowledged that an implicit test could potentially result in the implicit group exhibiting immediate structure transfer.

      We thank the reviewer for their feedback and added a statement in our discussion section acknowledging the possible effects of alternative measures of learning.

      Given that the explicit/implicit classification was based on an exit survey, it is unclear when participants who are labeled "explicit" gained that explicit knowledge. This might have occurred during or after either of the sessions, which could impact the interpretation of the effects and deserves discussion.

      We agree with the mentioned shortcoming in principle, although there are good methodological reasons for this, as discussed in our previous response. We added a statement on this topic to our discussion to make the potential issues and our reasoning in the design decision more transparent for the reader.

      Reviewer #2 (Public review):

      Summary:

      Sleep has not only been shown to support the strengthening of memory traces, but also their transformation. A special form of such transformation is the abstraction of general rules from the presentation of individual exemplars. The current work used large online experiments with hundreds of participants to shed further light on this question. In the training phase participants saw composite items (scenes) that were made up of pairs of spatially coupled (i.e., they were next to each other) abstract shapes. In the initial training, they saw scenes made up of six horizontally structured pairs and in the second training phase, which took place after a retention phase (2 min awake, 12 hour incl. sleep, 12 h only wake, 24 h incl. sleep), they saw pairs that were horizontally or vertically coupled. After the second training phase, a two-alternativesforced-choice (2-AFC) paradigm, where participants had to identify true pairs versus randomly assembled foils, was used to measure performance on all pairs. Finally, participants were asked five questions to identify, if they had insight into the pair structure and post-hoc groups were assigned based on this. Mainly the authors find that participants in the 2 minute retention experiment without explicit knowledge of the task structure were at chance level performance for the same structure in the second training phase, but had above chance performance for the vertical structure. The opposite was true for both sleep conditions. In the 12 h wake condition these participants showed no ability to discriminate the pairs from the second training phase at all.

      Strengths:

      All in all, the study was performed to a high standard and the sample size in the implicit condition was large enough to draw robust conclusions. The authors make several important statistical comparisons and also report an interesting resampling approach. There is also a lot of supplemental data regarding robustness.

      Weaknesses:

      My main concern regards the small sample size in the explicit group and the lack of experimental control.

      We thank the reviewer for the valuable feedback throughout the review process. The issues mentioned here have been addressed in our previous response.

      Reviewer #3 (Public review):

      In this project, Garber and Fiser examined how the structure of incidentally learned regularities influences subsequent learning of regularities, that either have the same structure or a different one. Over a series of six online experiments, it was found that the structure (spatial arrangement) of the first set of regularities affected learning of the second set, indicating that it has indeed been abstracted away from the specific items that have been learned. The effect was found to depend on the explicitness of the original learning: Participants who noticed regularities in the stimuli were better at learning subsequent regularities of the same structure than of a different one. On the other hand, participants whose learning was only implicit had an opposite pattern: they were better in learning regularities of a novel structure than of the same one. However, when an overnight sleep separated the first and second learning phases, this opposite effect was reversed and came to match the pattern of the explicit group, suggesting that the abstraction and transfer in the implicit case were aided by memory consolidation.

      In their revision the authors addressed my major comments successfully and I commend them for that.

      Recommendations for the authors:

      Reviewer #1 (Recommendations for the authors):

      We would encourage the authors to add text to the manuscript that acknowledges/discusses the two issues pointed out in our review.

      We added relevant passages to the discussion section of the manuscript.

      Reviewer #2 (Recommendations for the authors):

      The authors have improved some sections of the manuscript and this is reflected in my assessment. The major weaknesses remain unchanged. Since my review is published alongside the paper, readers can make up their own mind regarding their severity.

      My only hard ask would be to add that the study was not preregistered into the main manuscript as I asked before! I am surprised that the authors are so reluctant to honestly state this fact....

      We have not stated this fact in our manuscript until now since our understanding is that papers that report preregistered studies state and cite their preregistration in their method section, while any omission of such a statement by default conveys that no preregistration occurred. In fact, we cannot recall encountering papers with statements of no-preregistration in the literature. Nevertheless, we have no issue stating that our study was not preregistered and per the reviewer's request, we have added such an explicit statement in our manuscript.

      Reviewer #3 (Recommendations for the authors):

      *  I strongly urge the authors to remove the Results sub-sections from Methods.

      We thank the reviewer for highlighting this issue arising from our previous layout, which we decided to handle the following way. We re-labeledl the subsections in question as “Additional Analyses” to avoid confusion, we removed any redundant findings already reported in Results of the main text, and we moved a small number of more substantial findings from the Methods Section to the main text Results as requested. We believe that this solution constitutes the most readable option, as we do not clutter the main results with extensive sanity checks and results

      of minor interest, while we also do not need to establish experiment-wise result sections in the Supplementary Materials, which would further disperse information interested readers might look for.

      *  Authors report that in Experiment 4 "Participants with explicit knowledge (n=23) show the same pattern of results as they did in Experiment 1", but that seems inaccurate, as they did learn novel pairs in Exp4 whereas they did not in Exp1. This can be seen in the figure and also in Methods-Results: "performing above chance for ... pairs of a novel structure (M=69.6, SE=5.9, d=0.69, t(22)=3.33 p=0.012, BF=13.6) in the second training phase"

      We thank the reviewer for pointing out this error in our interpretation of the results and adjusted the section in question to better align with what our result actually shows.

    1. eLife Assessment

      This important study demonstrates that screening by artificial intelligence can identify relevant novel compounds for interacting with KATP channels. The experimental work is compelling. The broader significance of this work relates to the possibility that KATP channel mutations linked to congenital hyperinsulinism may be effectively rescued to the cell surface with a drug, which could normalize insulin secretion or enhance the effectiveness of existing KATP channel activators such as diazoxide.

    2. Reviewer #1 (Public review):

      Summary:

      Multiple compounds that inhibit ATP-sensitive potassium (KATP) channels also chaperone channels to the surface membrane. The authors used an artificial intelligence (AI)-based virtual screening (AtomNet) to identify novel compounds that exhibit chaperoning effects on trafficking-deficient disease-causing mutant channels. One compound, which they named Aekatperone, acts as a low affinity, reversible inhibitor and effective chaperone. A cryoEM structure of KATP bound to Aekatperone showed that the molecule binds at the canonical inhibitory site.

      Strengths and weaknesses:

      The details of the AI screening itself are inevitably opaque, but appear to differ from classical virtual screening in not involving any physical docking of test compounds into the target site. The authors mention criteria that were used to limit the number of compounds, so that those with high similarity to known binders and 'sequence identity' (does this mean structural identity) were excluded. The identified molecules contain sulfonylurea-like moieties. How different are they from other sulfonylureas?

      The experimental work confirming that Aekatperone acts to traffic mutant KATP channels to the surface and acts as a low affinity, reversible, inhibitor is comprehensive and clear, with very convincing cell biological and patch-clamp data, as is the cryoEM structural analysis, for which the group are leading experts. In addition to the three positive chaperone-effective molecules, the authors identified a large number of compounds that are predicted binders but apparently have no chaperoning effect.

      The authors suggest that the novel compound may be a promising therapeutic for treatment of congenital hyperinsulinism due to trafficking defective KATP mutations. Because they are low affinity, reversible, inhibitors. This is a very interesting concept, and perhaps a pulsed dosing regimen would allow trafficking without constant channel inhibition (which otherwise defeats the therapeutic purpose), although it is unclear whether the new compound will offer advantages over earlier low-affinity sulfonylurea inhibitor chaperones. These include tolbutamide which has very similar affinity and effect to Aekatperone. As the authors point out this (as well as other sulfonlyureas) are currently out of favor because of potential adverse cardiovascular effects, but again, it is unclear why Aekatperone should not have the same concerns.

      Comments on revised version:

      The authors have been very responsive to the first reviews. No further comments.

    3. Reviewer #2 (Public review):

      Summary:

      In their study 'AI-Based Discovery and CryoEM Structural Elucidation of a KATP Channel Pharmacochaperone', ElSheikh and colleagues undertake a computational screening approach to identify candidate drugs that may bind to an identified binding pocket in the SUR1 subunit of KATP channels. Other KATP channel inhibitors such as glibenclamide have been previously shown to bind in this pocket, and in addition to inhibition KATP channel function, these inhibitors can very effectively rescue cell surface expression of trafficking deficient KATP mutations that cause excessive insulin secretion (Congenital Hyperinsulinism). However, a challenge for their utility for treatment of hyperinsulinism has been that they are powerful inhibitors of the channels that are rescued to the channel surface. In contrast, successful therapeutic pharmacochaperones (eg. CFTR chaperones) permit function of the channels rescued to the cell membrane. Thus, a key criteria for the authors' approach in this case was to identify relatively low affinity compounds that target the glibenclamide binding site (and be washed off) - these could potentially rescue KATP surface expression, but also permit KATP function.

      Strengths:

      The main findings of the manuscript include:

      (1) Computational screening of a large virtual compound library, followed by functional screening of cell surface expression, which identified several potential candidate pharmacochaperones that target the glibenclamide binding site.

      (2) Prioritization and functional characterization of Aekatperone as a low affinity KATP inhibitor which can be readily 'washed off' in patch clamp, and cell based efflux assays. Thus the drug clearly rescues cell surface expression, but can be manipulated experimentally to permit function of rescued channels.

      (3) Determination of the binding site and dynamics of this candidate drug by cryo-EM, and functional validation of several residues involved in drug sensitivity using mutagenesis and patch clamp.

      The experiments are well-conceived and executed, and the study is clearly described. The results of the experiments are very straightforward and clearly support the conclusions drawn by the authors. I found the study to provide important new information about KATP chaperone effects of certain drugs, with interesting considerations in terms of ion channel biology and human disease.

      Context and remaining challenges:

      (1) The chaperones can effectively rescue KATP trafficking mutants, but clearly not as strongly as the higher affinity inhibitor glibenclamide. There is likely a challenging relationship between efficacy of trafficking rescue and channel inhibition (ie. rescued channels are inhibited and therefore non-functional) that will need to be overcome in terms of applying drugs of this class. This is recognized and clarified appropriately by the authors both in their experimental approaches and discussion. In experiments it is straightforward to wash off the chaperone, but this would not be the case in an organism.

      (2) Recent developments with ion channel trafficking correctors in the CFTR field illustrate the importance of investigating underlying mechanisms. Development of pharmacological tools and approaches in other channel types (such as KATP or other transporters and channels) will build our understanding of pathways involved in regulating maturation of membrane proteins, and ways to manipulate them.

      Comments on revised version:

      I have no further suggestions, thank you for the detailed response.

    4. Author response:

      The following is the authors’ response to the original reviews.

      Reviewer #1 (Public review):

      Summary:

      Multiple compounds that inhibit ATP-sensitive potassium (KATP) channels also chaperone channels to the surface membrane. The authors used an artificial intelligence (AI)-based virtual screening (AtomNet) to identify novel compounds that exhibit chaperoning effects on trafficking-deficient disease-causing mutant channels. One compound, which they named Aekatperone, acts as a low affinity, reversible inhibitor and effective chaperone. A cryoEM structure of KATP bound to Aekatperone showed that the molecule binds at the canonical inhibitory site.

      Strengths and weaknesses:

      The details of the AI screening itself are inevitably opaque, but appear to differ from classical virtual screening in not involving any physical docking of test compounds into the target site. The authors mention criteria that were used to limit the number of compounds, so that those with high similarity to known binders and 'sequence identity' (does this mean structural identity) were excluded. The identified molecules contain sulfonylurea-like moieties. How different are they from other sulfonylure4as?

      We thank the reviewers for the questions. As part of the library preparation, molecules with greater than 0.5 Tanimoto similarity in ECFP4 space to any known binders of the target protein and its homologs within 70% sequence identity were excluded to increase the possibility of identifying novel hits. After scoring and ranking the molecules by the AtomNet® technology, a diversity clustering was performed using the Butina algorithm (Butina D. Unsupervised Data Base Clustering Based on Daylight’s Fingerprint and Tanimoto Similarity: A Fast and Automated Way To Cluster Small and Large Data Sets, J. Chem. Inf. Comput. Sci. 1999, 39, 747–750) with a Tanimoto similarity cutoff of 0.35 in ECFP4 space to minimize selection of structurally similar scaffolds for the final compound buy-list. We have revised the results and methods sections to make this clear.

      Sulfonylureas are defined by their core structure comprising a sulfonyl group (–S(=O)<sub>2</sub>) and a urea moiety (–NH–CO–NH–). While some compounds identified in our study contain a sulfonamide group (R-S(=O) <sub>2</sub>-NR<sub>2</sub>), they differ structurally from sulfonylureas by lacking the key urea group and by incorporating unique R-group substitutions (we have now added this to Figure 1A legend). For example, compound C27 (Z2068224500) includes a sulfonamide group but not a urea moiety. Likewise, C45 (Aekatperone, Z1620764636) contains a sulfonamide group along with an aromatic, nitrogen-rich heterocyclic ring, but no urea group. Additionally, the R-groups in these compounds are more complex than the simple aromatic or alkyl chains typical of sulfonylureas. They include heterocyclic aromatic systems and nitrogen-rich structures, which likely influence their binding properties and lipophilicity. These structural differences suggest distinct functional and pharmacological profiles as supported by our biochemical and functional studies.

      The experimental work confirming that Aekatperone acts to traffic mutant KATP channels to the surface and acts as a low affinity, reversible, inhibitor is comprehensive and clear, with very convincing cell biological and patch-clamp data, as is the cryoEM structural analysis, for which the group are leading experts. In addition to the three positive chaperone-effective molecules, the authors identified a large number of compounds that are predicted binders but apparently have no chaperoning effect. Did any of them have inhibitory action on channels? If so, does this give clues to separating chaperoning from inhibitory effects?

      This is an interesting question. Evidence from cryo-EM, biochemical and electrophysiology studies reveal a critical role of Kir6.2 N-terminus in K<sub>ATP</sub> channel assembly and gating, and that pharmacological chaperones like glibenclamide, repaglinide, carbamazepine, and now aekatperone exert their chaperoning and inhibitory effects by stabilizing the interaction between Kir6.2 N-terminus and the SUR1-ABC core. This stabilization, while promoting the assembly of Kir6.2 and SUR1 to “chaperone” trafficking-impaired mutant channels to the cell surface, also inhibits the channel by restricting the Kir6.2 C-terminal domain from rotating to an open state. An additional mechanism by which these compounds inhibit channel activity is by preventing SUR1-NBD dimerization, which mediates physiological activation of the channel by MgADP (see review: Driggers CM, Shyng SL. Mechanistic insights on K<sub>ATP</sub> channel regulation from cryo-EM structures. J Gen Physiol. 2023 Jan 2;155(1): e202113046, PMID: 36441147). From our compound screening, we did find some compounds that showed mild inhibition of the channel by electrophysiology but no obvious chaperone effects by western blots. It is possible that small chaperoning effects of some compounds showing mild channel inhibition effects were missed due to the lower sensitivity of the western blot assay compared to electrophysiology. Alternatively, these compounds could inhibit channels by preventing SUR1NBD dimerization without stabilizing the Kir6.2 N-terminus, which is required for the chaperone effect based on our model. Unfortunately, we did not find any compounds that show chaperone effects but no channel inhibition effects, which is consistent with our understanding of how this type of K<sub>ATP</sub> chaperones work (i.e. by stabilizing Kir6.2 N-terminus interaction with SUR1’s ABC core).

      The authors suggest that the novel compound may be a promising therapeutic for treatment of congenital hyperinsulinism due to trafficking defective KATP mutations. Because they are low affinity, reversible, inhibitors. This is a very interesting concept, and perhaps a pulsed dosing regimen would allow trafficking without constant channel inhibition (which otherwise defeats the therapeutic purpose), although it is unclear whether the new compound will offer advantages over earlier low-affinity sulfonylurea inhibitor chaperones. These include tolbutamide which has very similar affinity and effect to Aekatperone. As the authors point out this (as well as other sulfonlyureas) are currently out of favor because of potential adverse cardiovascular effects, but again, it is unclear why Aekatperone should not have the same concerns.

      We thank the reviewer for the comments. This is clearly an important question to address in the future. While we have not directly tested the effects of Aekatperone on cardiac functions, we did assess its inhibitory effect on cells expressing the cardiac K<sub>ATP</sub> channel isoform (SUR2A/Kir6.2). Our results indicate that Aekatperone exhibits higher sensitivity toward the pancreatic K<sub>ATP</sub> channel isoform (SUR1/Kir6.2) compared to the cardiac isoform. However, we acknowledge that Aekatperone could still have cardiotoxic effects through its potential action on other channels, such as the hERG channel.

      It is worth noting that tolbutamide, despite its known cardiotoxic effects, does not exert these effects through cardiac K<sub>ATP</sub> channel inhibition. This has been demonstrated in studies showing no inhibitory effect of tolbutamide on SUR2A/Kir6.2 channels and on channels formed by Kir6.2 and SUR1 harboring the S1238Y mutation (also shown as S1237Y in some studies using a different SUR1 isoform)--the amino acid substitution found in SUR2A at the corresponding position (Ashfield R, Gribble FM, Ashcroft SJ, Ashcroft FM. Identification of the high-affinity tolbutamide site on the SUR1 subunit of the K<sub>ATP</sub> channel. Diabetes. 1999 Jun;48(6):1341-7, PMID: 10342826). This suggests that tolbutamide’s cardiotoxic effects might involve other targets like the hERG channel. Interestingly, tolbutamide contains a hydrophobic tail and aromatic rings that align well with the structural features for hERG interaction (Garrido A, Lepailleur A, Mignani SM, Dallemagne P, Rochais C. hERG toxicity assessment: Useful guidelines for drug design. Eur J Med Chem. 2020 Jun 1;195:112290, PMID: 32283295). In contrast, highaffinity sulfonylureas such as glibenclamide and glimepiride, which have additional benzamide moieties, are associated with lower cardiovascular risks (Douros A, Yin H, Yu OHY, Filion KB, Azoulay L, Suissa S. Pharmacologic Differences of Sulfonylureas and the Risk of Adverse Cardiovascular and Hypoglycemic Events. Diabetes Care. 2017, 40:1506-1513, PMID:

      28864502). Given these considerations, a comprehensive assessment of Aekatperone’s potential cardiotoxicity is crucial. Future studies involving in silico modeling, in vitro, and in vivo experiments will be essential to evaluate Aekatperone’s interaction with hERG and other offtarget effects. These efforts will help clarify its safety profile. This point has now been added to the Discussion.

      Reviewer #2 (Public review):

      Summary:

      In their study 'AI-Based Discovery and CryoEM Structural Elucidation of a KATP Channel Pharmacochaperone', ElSheikh and colleagues undertake a computational screening approach to identify candidate drugs that may bind to an identified binding pocket in the SUR1 subunit of

      KATP channels. Other KATP channel inhibitors such as glibenclamide have been previously shown to bind in this pocket, and in addition to inhibition KATP channel function, these inhibitors can very effectively rescue cell surface expression of trafficking deficient KATP mutations that cause excessive insulin secretion (Congenital Hyperinsulinism). However, a challenge for their utility for treatment of hyperinsulinism has been that they are powerful inhibitors of the channels that are rescued to the channel surface. In contrast, successful therapeutic pharmacochaperones (eg. CFTR chaperones) permit function of the channels rescued to the cell membrane. Thus, a key criteria for the authors' approach in this case was to identify relatively low affinity compounds that target the glibenclamide binding site (and be washed off) - these could potentially rescue KATP surface expression, but also permit KATP function.

      Strengths:

      The main findings of the manuscript include:

      (1) Computational screening of a large virtual compound library, followed by functional screening of cell surface expression, which identified several potential candidate pharmacochaperones that target the glibenclamide binding site.

      (2) Prioritization and functional characterization of Aekatperone as a low affinity KATP inhibitor which can be readily 'washed off' in patch clamp, and cell based efflux assays. Thus the drug clearly rescues cell surface expression, but can be manipulated experimentally to permit function of rescued channels.

      (3) Determination of the binding site and dynamics of this candidate drug by cryo-EM, and functional validation of several residues involved in drug sensitivity using mutagenesis and patch clamp.

      The experiments are well-conceived and executed, and the study is clearly described. The results of the experiments are very straightforward and clearly support the conclusions drawn by the authors. I found the study to provide important new information about KATP chaperone effects of certain drugs, with interesting considerations in terms of ion channel biology and human disease.

      Weaknesses:

      I don't have any major criticisms of the study as described, but I had some remaining questions that could be addressed in a revision.

      (1) The chaperones can effectively rescue KATP trafficking mutants, but clearly not as strongly as the higher affinity inhibitor glibenclamide. Is this relationship between inhibitory potency, and efficacy of trafficking an intrinsic challenge of the approach? I suspect that it may be an intractable problem in the sense that the inhibitor bound conformation that underlies the chaperone effect cannot be uncoupled from the inhibited gating state. But this might not be true (many partial agonist drugs with low efficacy can be strongly potent, for example). In this case, the approach is really to find a 'happy medium' of a drug that is a weak enough inhibitor to be washed away, but still strong enough to exert some satisfactory chaperone effect. Could some additional clarity be added in the discussion on whether the chaperone and gating effects can be 'uncoupled'.

      Thank you for the suggestion. A similar question was raised by Reviewer 1, which was addressed above (public review, point 2). We have now added more discussion to clarify this point.

      (2) Based on the western blots in Figure 2B, the rescue of cell surface expression appears to require a higher concentration of AKP compared to the concentration response of channel inhibition (~9 microM in Figure 3, perhaps even more potent in patch clamp in Figure 2C). Could the authors clarify/quantify the concentration response for trafficking rescue?

      Thank you for bringing up this observation. Indeed, the pharmacochaperone effects of Aekatperone as well as other previously published K<sub>ATP</sub> pharmacochaperones require higher concentrations compared to their inhibitory effects on surface-expressed channels. This difference likely stems from the necessity for these compounds to cross the cell membrane and interact with newly synthesized channels in the endoplasmic reticulum, where the trafficking rescue occurs. We estimate that effective pharmacochaperone activity for Aekatperone can be achieved at concentrations ranging from 50 to 100 µM in cells expressing trafficking-deficient K<sub>ATP</sub> channel mutants, higher than that required for inhibition of surface-expressed channels (~9 µM IC50). Future work could focus on medicinal chemistry modifications, for example esterification of Aekatperone (Zhou G. Exploring Ester Prodrugs: A Comprehensive Review of Approaches, Applications, and Methods. Pharmacology & Pharmacy, 2024, 15, 269-284). Once inside the cell, the esters would be cleaved by endogenous esterases to release the active compound, ensuring efficient intracellular delivery. This strategy could potentially improve membrane permeability and bioavailability of the compound, which would lower the required concentrations to achieve desired chaperoning effects.

      (3) A future challenge in the application of pharmacochaperones of this type in hyperinsulinism may be the manipulation of chaperone concentration in order to permit function. In experiments it is straightforward to wash off the chaperone, but this would not be the case in an organism. I wondered if the authors had attempted to rescue channel function with diazoxide ine presence of AKP, rather than after washing off (ie. is AKP inhibition insurmountable, or can it be overcome by sufficient diazoxide).

      Thank you for raising this important point. We have previously shown (Martin GM et al. Pharmacological Correction of Trafficking Defects in ATP-sensitive Potassium Channels Caused by Sulfonylurea Receptor 1 Mutations. J Biol Chem. 2016, 291: 21971-21983, PMID: 27573238) that diazoxide, which stabilizes K<sub>ATP</sub> channels in an open conformation, also reduces physical association between Kir6.2 N-terminus and SUR1 as demonstrated by reduced crosslinking of engineered azido-phenylalanine (an unnatural amino acid) at Kir6.2 N-terminal amino acid 12 position to SUR1. Incubating cells with diazoxide did not rescue the trafficking mutants but actually further reduced the maturation efficiency of trafficking mutants. For this reason, we did not include diazoxide during Aekatperone incubation and instead added diazoxide after Aekatperone washout to potentiate the activity of mutant channels rescued to the cell surface. In vivo, we envision testing alternating Aekatperone and diazoxide dosing to maximize functional rescue of K<sub>ATP</sub> trafficking mutants.

      (4) Do the authors have any information about the turnover time of KATP after washoff of the chaperone (how stable are the rescued channels at the cell surface)? This is a difficult question to probe when glibenclamide is used as a chaperone, but maybe much simpler to address with a lower affinity chaperone like AKP.

      Thank you for your thoughtful comment. While we have not yet tested the duration of rescued K<sub>ATP</sub> channels at the cell surface following Aekatperone washout, we have conducted similar studies with carbamazepine (Chen PC et al. Carbamazepine as a novel small molecule corrector of trafficking-impaired ATP-sensitive potassium channels identified in congenital hyperinsulinism. J Biol Chem. 2013, 288: 20942-20954, PMID: 23744072), another compound exhibiting reversible inhibitory and chaperone effects (apparent affinity between glibenclamide and Aekatperone). Our previous findings with carbamazepine showed that in cultured cells its chaperone effects were detectable as early as 1 hour and peaked around 6 hours after treatment. Furthermore, when carbamazepine was removed following a 16-hour treatment, the rescue effect persisted for up to 6 hours post-drug removal. These results provide a potential duration of the surface expression rescue effects of reversible pharmacochaperones.

      Reviewer #1 (Recommendations for the authors):

      The paper is well-written and comprehensive with only very minor essentially copy-editing needed. That said, it would be good if the authors could answer the main points raised above:

      (1) What is the relevant Tanimoto parameters and sequence identity (does this mean structural identity) for the identified compounds?

      As we answered above in response to the overall assessment, to facilitate the identification of novel hits, molecules with greater than 0.5 Tanimoto similarity in ECFP4 space to any known binders of the target protein and its homologs within 70% amino acid sequence identity were excluded from the commercial library. Additionally, after scoring and ranking the molecules by the AtomNet® technology, a diversity clustering was performed on the top 30,000 molecules using the Butina algorithm with a Tanimoto similarity cutoff of 0.35 in ECFP4 space to minimize selection of structurally similar scaffolds for the final compound buy-list.

      (2) Did any of the identified putative binders have inhibitory action on channels? If so, does this give clues to separating chaperoning from inhibitory effects?

      Please see response to the same question in the overall assessment above.

      (3) Acknowledge that the identified compounds contain sulfonylurea-like moieties, and address why Aekatperone should (or perhaps does not) offer anything advantage over low affinity sulfonrylureas such as tolbutamide?

      Please see response to the same question in the overall assessment above.

      Reviewer #2 (Recommendations for the authors):

      Thank you for assembling the interesting study, which I felt was well designed and communicated. The diverse approaches used in the study, with consistent findings, were definitely a strength. The core findings are also well distilled in the main body of the text, and although there is quite a lot of supplementary information, I felt that it was presented appropriately and well selected in terms of what would be important for readers hoping to learn more. In addition to the questions described above, I only had a few minor editorial issues that could be fixed related to presentation.

      (1) Figure 1B. The colours and resolution of the chemical structures are difficult to see clearly and could be improved.

      We have revised the figure accordingly.

      (2) This is a minor wording point... first sentence of the discussion describes the drugs as pancreatic-selective, when it would be more clear to describe them as selective for the pancreatic isoform of KATP (Kir6.2/SUR1), or perhaps better as 'exhibiting ~4-5 fold selective for SUR1-containing KATP channels vs. SUR2A or SUR2B'.

      We have changed the wording as suggested.

      (3) As a curiosity (not necessary to do more experiments), but I am curious if the authors know whether there is any meaningful enhancement of trafficking of WT channels by AKP.

      All pharmacochaperones we have identified to date including Aekatperone also slightly enhance WT channel surface expression (10-20%).

      Reviewing editor recommendations:

      (1) Given the modest resolution of the EM reconstruction, it is perhaps not entirely clear how AKP was assigned to the density observed. Specifically, it would be helpful to include a comparison of an AKP-free map and the current AKP map (filtered to a similar resolution) showing slice views of densities in the region around the inferred binding site. This would be very helpful in ascertaining whether the cryoEM reconstruction is an independent validation of the computational and functional experiments or whether the density inference depends on the additional knowledge.

      We appreciate the editor’s suggestion. We have now added a Supplemental Figure (Supplementary Figure 7 in the revised manuscript) that compares our AKP-free cryoEM density deposited previously to the EMDB (EMD-26320) and the AKP-bound cryoEM density from this study, with cryoEM density (filtered to the same resolution) superimposed on the structural model.

      (2) It could help to mention in brief what is a probable mechanism of AKP inhibition - that is how after binding of AKP, channel opening is restricted. Is it similar to that of other site A ligands?

      Based on the strong Kir6.2 N-terminal cryoEM density observed in our AKP map, AKP most likely inhibits K<sub>ATP</sub> channels by trapping the Kir6.2 N-terminus in the central cavity of SUR1’s ABC core thus preventing Kir6.2-C-terminal domain from rotating to an open conformation, similar to other ligands that stabilize the Kir6.2 N-terminus-SUR1 interface by binding to site A (such as tolbutamide and AKP), site B (such as repaglinide), or both site A and site B (such as glibenclamide). We have now included this in the revised Results and Discussion sections.

      (3) In the context of the MD simulations, do other site A ligands (which from my understanding bind at a similar site) also exhibit similar flexibility as AKP? If there is information available on the flexibility of ligands of varying affinities, bound to the same site, maybe some correlative inferences can be drawn? However, in MD simulation trajectories it is not entirely uncommon for a ligand to simply get trapped in a local energy well. Since the authors have performed significant analysis of their MD results it could be worth mentioning/discussing such phenomena.

      Previously published MD data addressing ligand dynamics, such as glibenclamide in the SUR1 pocket (Walczewska-Szewc K, Nowak W. Photo-Switchable Sulfonylureas Binding to ATPSensitive Potassium Channel Reveal the Mechanism of Light-Controlled Insulin Release. J Phys Chem B. 2021, 125: 13111-13121, PMID: 34825567), indicate a certain degree of flexibility. Unfortunately, we cannot directly compare these results, as the simulations were performed without the KNtp domain in the SUR1 cavity, which partially contributes to ligand stabilization. This is an issue we plan to investigate in the future.

      In this study, we ran five independent MD simulations, each 500 ns long, resulting in a total of 2.5 μs of simulation time. Across all replicates, the ligand stayed in the same position, with variations mainly in the dynamics of the blurred segment. Considering the length of the simulations and the consistency across the runs, we believe this binding pose is stable and represents a global (or at least highly stable) energy minimum, consistent with the cryo-EM data.

      (4) In electrophysiological assays, 10 uM AKP seems to inhibit all currents (Figure 2), but in the Rb+ flux assay ~10 uM appears to be the IC50. The reason for this difference is not entirely clear and it would help to comment on this.

      Thank you for noticing the difference. The initial electrophysiological experiments were conducted using the very small amount of AKP provided to us from Atomwise. We estimated the concentration of the reconstituted AKP the best we could, but the concentration was likely to not be very accurate due to difficulty in handling the very small amount of the AKP powder. Subsequent Rb<sup>+>/sup> efflux experiments were conducted using a different, larger batch of AKP we purchased from Enamine. We have now stated this in the Methods section.

    1. eLife Assessment

      This valuable manuscript uses mathematical modeling to address the synchrony of the vertebrate segmentation clock with the developmental processes. The authors use convincing arguments to support the idea that this would allow the evolution of flexible body plans and a variable number of segments. This manuscript could be of interest to developmental biologists and systems biologists.

      [Editors' note: this paper was reviewed by Review Commons.]

    2. Reviewer #1 (Public review):

      Summary:

      In this manuscript, Hammond et al. study robustness of the vertebrate segmentation clock against morphogenetic processes such as cell ingression, cell movement and cell division to ask whether the segmentation clock and morphogenesis are modular or not. The modularity of these two would be important for evolvability of the segmenting system. The authors adopt a previously proposed 3D model of the presomitic mesoderm (Uriu et al. 2021 eLife) and include new elements; different types of cell ingression, tissue compaction and cell cycles. Based on the results of numerical simulations that synchrony of the segmentation clock is robust, the authors conclude that there is a modularity in the segmentation clock and morphogenetic processes.

      The presented results support the conclusion. The manuscript is clearly written.

      Major comments from the original round of review:

      [Optional] In both the current model and Uriu et al. 2021, coupling delay in phase oscillator model is not considered. Given that several previous studies (e.g. Lewis 2003, Herrgen et al. 2010, Yoshioka-Kobayashi et al. 2020) suggested the presence of coupling delays in Delta-Notch signaling, could the authors analyze the effect of coupling delay on robustness of the segmentation clock against morphogenetic processes?

      Significance:

      Synchronization of the segmentation clock has been studied by mathematical modeling, but most previous studies considered cells in a static tissue without morphogenesis. In the previous study by Uriu et al. 2021, morphogenetic processes such as cell advection due to tissue elongation, tissue shortening, and cell mobility were considered in synchronization. The current manuscript provides methodological advances in this aspect by newly including cell ingression, tissue compaction and cell cycle. In addition, the authors bring a concept of modularity and evolvability to the field of the vertebrate segmentation clock, which is new. On the other hand, the manuscript confirms that the synchronization of the segmentation clock is robust by careful simulations, but it does not propose or reveal new mechanisms for making it robust or modular. The main targets of the manuscript will be researchers working on somitogenesis and evolutionary biologists who are interested in evolution of developmental systems. The manuscript will also be interested by broader audiences, like developmental biologists, biophysicists, and physicists and computer scientists who are working on dynamical systems.

    3. Reviewer #2 (Public review):

      Summary:

      The manuscript from Hammond et al., investigates the modularity of the segmentation clock and morphogenesis in early vertebrate development, focusing on how these processes might independently evolve to influence the diversity of segment numbers across vertebrates.

      Methodology: The study uses a previously published computational model, parameterized for zebrafish, to simulate and analyse the interactions between the segmentation clock and the morphogenesis of the pre-somitic mesoderm (PSM). Their model integrates cell advection, motility, compaction, cell division, and the synchronization of the embryo clock. Three alternative scenarios of PSM morphogenesis were modeled to examine how these changes affect the segmentation clock.

      Model System: The computational model system combines a representation of cell movements and the phase oscillator dynamics of the segmentation clock within a three-dimensional horseshoe-shaped domain mimicking the geometry of the vertebrate embryo PSM. The parameters used for the mathematical model are mostly estimated from previously published experimental findings.

      Key Findings and Conclusions: (1) The segmentation clock was found to be broadly robust against variations in morphogenetic processes such as cell ingression and motility; (2) Changes in the length of the PSM and the strength of phase coupling within the clock significantly influenced the system's robustness; (3) The authors conclude that the segmentation clock and PSM morphogenesis exhibited developmental modularity (i.e. relative independence), allowing these two phenomena to evolve independently, and therefore possibly contributing to the diverse segment numbers observed in vertebrates.

      Major comments from the original round of review:

      (1) The key conclusion drawn by the authors (that there is robustness, and therefore modularity, between the morphogenetic cellular processes modeled and the embryo clock synchronization) stems directly from the modeling results appropriately presented and discussed in the manuscript.

      The model comprises some strong assumptions, however all have been clearly explained and the parameterization choices are supported by experimental findings, providing biological meaning to the model. Estimated parameters are well explained, and seem reasonable assumptions (from the embryology perspective).

      (2) This study, as is, achieves its proposed goal of evaluating the potential robustness of the embryo clock to changes in (some) morphogenetic processes. The authors do not claim that the model used is complete, and they properly identify some limitations, including the lack of cell-cell interactions. Given the recognized importance of cellular physical interactions for successful embryo development, including them in the model would be a significant addition in future studies.

      (3) The authors have deposited all the code used for analysis in a public GitHub repository that is updated and available for the research community.

      (4) In page 6, the authors justify their choice of clock parameters for cells ingressing the PSM: "As ingressing cells do not appear to express segmentation clock genes (Mara et al. (2007)), the position at which cells ingress into the PSM can create challenges for clock patterning, as only in the 'off' phase of the clock will ingressing cells be in-phase with their neighbors."

      However, there are several lines of evidence (in chick and mouse), that some oscillatory clock genes are already being expressed as early as in the gastrulation phase (so prior to PSM ingression) (Feitas et al, 2001 [10.1242/dev.128.24.5139]; Jouve et al, 2002 [10.1242/dev.129.5.1107]; Maia-Fernandes at al, 2024 [10.1371/journal.pone.0297853]).

      Question: Is this also true in zebrafish? (I.e. is there any recent experimental evidence that the clock genes are not expressed at ingression, since the paper cited to support this assumption is from 2007).

      If they are expressed in zebrafish (as they are in mouse and chick), then the cell addition should have random clock gene periods when they enter the PSM and not start all with a constant initial phase of zero. Probably this will not impact the results since the cells will also be out of phase with their neighbors when they "ingress", however, it will model more closely the biological scenario (and avoid such criticism).

      Significance:

      GENERAL ASSESSMENT

      This study uses a previously published model to simulate alternative scenarios of morphogenetic parameters to infer the potential independence (termed here modularity) between the segmentation clock and a set of morphogenetic processes, arguing that such modularity could allow the evolution of more flexible body plans, therefore partially explaining the variability in the number of segments observed in the vertebrates. This question is fundamental and relevant, yet still poorly researched. This work provides a comprehensive simulation with a model that tries to simplify the many morphogenetic processes described in the literature, reducing it to a few core fundamental processes that allow drawing the conclusions sought. It provides theoretical insight to support a conceptual advance in the field of evolutionary vertebrate embryology.

      ADVANCE

      This study builds on a model recently published by Uriu et al. (eLife, 2021) that incorporates quantitative experimental data within a modeling framework including cell and tissue-level parameters, allowing the study of multiscale phenomena active during zebrafish embryo segmentation. Uriu's publication reports many relevant and often non-intuitive insights uncovered by the model, most notably the description of phase vortices formed by the synchronizing genetic oscillators interfering with the traveling-wave front pattern.

      However, this model can be further explored to ask additional questions beyond those described in the original paper. A good example is the present study, which uses this mathematical framework to investigate the potential independence between two of the modeled processes, thereby extracting extra knowledge from it. Accordingly, the present study represents a step forward in the direction of using relevant theoretical frameworks to quantitatively explore the landscape of complex molecular hypotheses in silico, and with it shed some light on fundamental open questions or inform the design of future experiments in the lab.

      The study incorporates a wide range of existing literature on the developmental biology of vertebrates. It comprehensively cites prior work, such as the foundational studies by Cooke and Zeeman on the segmentation clock and the role of FGF signaling in PSM development as discussed by Gomez et al. The literature properly covers the breadth of knowledge in this field.

      AUDIENCE

      Target audience: This study is relevant for fundamental research in developmental biology, specifically targeting researchers who focus on early embryo development and morphogenesis from both experimental and theoretical perspectives. It is also relevant for evolutionary biologists investigating the genetic factors that influence vertebrate evolution, as well as to computational biologists and bioinformatics researchers studying developmental processes and embryology.

      Developmental researchers studying the segmentation clock in other vertebrate model organisms (namely mouse and chick), will find this publication especially valuable since it provides insights that can help them formulate new hypotheses to elucidate the molecular mechanisms of the clock (for example finding a set of evolutionarily divergent genes that might interfere with PSM length).<br /> Additionally, this study provides a set of cellular parameters that have yet to be measured in mouse and chick, therefore guiding the design of future experiments to measure them, allowing the simulation of the same model with sets of parameters from different vertebrate model organisms, therefore testing the robustness of the findings reported for zebrafish.

    4. Reviewer #3 (Public review):

      Summary:

      In this manuscript, Verd and colleagues explored how various biologically relevant factors influence the robustness of clock dynamics synchronization among neighboring cells within the context of somatogenesis, adapting a mathematical model presented by Urio et. al in 2021 in a similar context. Specifically they show that clock dynamics is robust to different biological mechanisms such as cell infusion, cellular motility, compaction-extension and cell-division. On the other hand , the length of Presomitic Mesoderm (PSM) and density of cells in it has a significant role in the robustness of clock dynamics. While the manuscript is well-written and provides clear descriptions of methods and technical details, it tends to be somewhat lengthy.

      Major comments from original round of review:

      (1) The authors mention that "...the model is three dimensional and so can quantitatively recapture the rates of cell mixing that we observe in the PSM". I am not convinced with this justification of using a 3D model. None of the effects the authors explore in this manuscript requires a three dimensional model or full physical description of the cellular mechanics such as excluded volume interaction etc. A one-dimensional model characterized by cell position along the arclength of PSM and somatic region and segmentation clock phase θ can incorporate all the physics authors described in this manuscript as well as significantly computationally cheap allowing the authors to explore the effect of different parameters in greater detail.

      (2) I am not sure about the justification for limiting the quantification of phase synchrony in a very limited (one cell diameter wide) region at one end of the somatic part (Page 33 below Fig. 9). From my understanding of the manuscript, the segments appear in significant length anterior to this region. Wouldn't an ensemble average of multiple such one cell diameter wide regions in the somatic region be a more accurate metric for quantifying synchrony?

      (3) While studying the effect of cellular ingression, the authors study three discrete modes-random, DP and DP+LV and show that in the DP+LV mode the clock synchrony becomes affected. I would like the authors to explore this in a continuous fashion from a pure DP ingression to Pure LV ingression and intermediates.

      (4) While studying the effect of length and density of cells in PSM on cellular synchrony, the authors restrict to 3 values of density and 6 values of PSM length keeping the other parameter constant. I would be interested to see a phase diagram similar to Fig. 7 in the two dimensional parameter space of L and ρ0. I am curious if a scaling relation exists for the parameter values that partition the parameter space with and without synchrony.

      (5) Both in the abstract and introduction, the authors discuss at a great length about the variability in the number of segments. I am curious how the number and width of the segments observed depend on different parameters related to cellular mechanics and the segmentation clock ?

      (6) The authors assume that the phase dynamics of the chemical network may be described by an oscillator with constant frequency. For the completeness of the manuscript, the author should discuss in detail, for which chemical networks this is a good assumption.

      (7) Figure 3 and the associated text shows no effect of the cellular motility profile in the synchrony of the segmentation clock. This may be moved to the supplementary considering the length of this manuscript.

      Significance:

      The manuscript answers some important questions in the synchrony of segmentation clock in the vertebrates utilizing a model published earlier. However, the presented result is incomplete in some aspects (points 2 to 5 of section A) and that could be overcome by a more detailed analysis using a simpler one dimensional (point 1 of section A). I believe this manuscript could be of interest to an intersecting audience of developmental biologists, systems biologists, and physicists/engineers interested in dynamical systems.

      [Editors' note: the authors have responded comprehensively to the reviews from Review Commons.]

    5. Author response:

      Reviewer #1 (Evidence, reproducibility and clarity):

      Summary:

      In this manuscript, Hammond et al. study robustness of the vertebrate segmentation clock against morphogenetic processes such as cell ingression, cell movement and cell division to ask whether the segmentation clock and morphogenesis are modular or not. The modularity of these two would be important for evolvability of the segmenting system. The authors adopt a previously proposed 3D model of the presomitic mesoderm (Uriu et al. 2021 eLife) and include new elements; different types of cell ingression, tissue compaction and cell cycles. Based on the results of numerical simulations that synchrony of the segmentation clock is robust, the authors conclude that there is a modularity in the segmentation clock and morphogenetic processes. The presented results support the conclusion. The manuscript is clearly written. I have several comments that could help the authors further strengthen their arguments.

      Major comment: 

      [Optional] In both the current model and Uriu et al. 2021, coupling delay in phase oscillator model is not considered. Given that several previous studies (e.g. Lewis 2003, Herrgen et al. 2010, Yoshioka-Kobayashi et al. 2020) suggested the presence of coupling delays in DeltaNotch signaling, could the authors analyze the effect of coupling delay on robustness of the segmentation clock against morphogenetic processes?

      We thank the reviewer for the suggestion. Owing to the computational demands of including such a delay in the model, we cannot feasibly repeat every simulation analysed here in the presence of delay, and would like to note that the increased computational demand that delays put on the simulations is also the reason why Uriu et al 2021 did not include it, as stated in their published exchange with reviewers. However, analogous to our analysis in figure 7, we can analyse how varying the position of progenitor cell ingression affects synchrony in the presence of the coupling delay measured in zebrafish by Herrgen et al. (2010). We show this analysis in a new figure 8 (8B, specifically), on page 21, and discuss its implications in the text on pages 2022. Our analysis reveals that the model cannot recover synchrony using the default parameters used by Uriu et al. (2021) and reveal a much stronger dependence on the rate of cell mixing (vs) than shown in the instantaneous coupling case (cf. figure 7). However, by systematically varying the value of the delay we find that a relatively minor increase in the delay is sufficient to recover synchrony using the parameter set of Uriu et al. (see figure 8C). Repeating this across the three scenarios of cell ingression we see that the combination of coupling strength and delay determine the robustness of synchrony to varying position of cell ingression. This suggests that the combination of these two parameters constrain the evolution of morphogenesis.

      Minor comments: 

      -  PSM radius and oscillation synchrony are both denoted by the same alphabet r. The authors should use different alphabets for these two to avoid confusion.

      We thank the reviewer for spotting this. This has now been changed throughout to rT, as shorthand for ‘radius of tissue’.

      -  page 5 Figure 1 caption: (x-x_a/L) should be (x-x_a)/L.

      We thank the reviewer for spotting this. This has now been corrected.

      -  Figure 3C: Description of black crosses in the panels is required in the figure legend.

      Thank you for spotting this. The legend has now been corrected.

      -  Figure 3C another comment: In this panel, synchrony r at the anterior PSM is shown. It is true that synchrony at anterior PSM is most relevant for normal segment formation. However, in this case, the mobility profile is changed, so it may be appropriate to show how synchrony at mid and posterior PSM would depend on changes in mobility profile. Is synchrony improved by cell mobility at the region where cell ingression happens?

      We thank the reviewer for the suggestion. We have now plotted the synchrony along the AP axis for varying motility profiles, and this can be seen in figure 3 supplement 1, and is briefly discussed in the text on page 11. We show that while the synchrony varies with x-position (as already expected, see figure 2), there is no trend associated with the shape of the motility profile.

      -  In page 12, the authors state that "the results for the DP and DP+LV cases are exactly equal for L = 185 um, as .... and the two ingression methods are numerically equivalent in the model". I understood that in this case two ingression methods are equivalent, but I do not understand why the results are "exactly" equal, given the presence of stochasticity in the model.

      These results can be exactly equal despite the simulations being stochastic because they were both initialised using the same ‘seed’ in the source code. However, we now see that this might be confusing to the reader, and we have re-generated this figure but this time initialising the simulations for each ingression scenario using a different seed value. This is now reflected in the text on page 12 and in figure 4.

      -  The authors analyze the effect of cell density on oscillation synchrony in Fig. 4 and they mention that higher density increases robustness of the clock by increasing the average number of interacting neighbours. I think it would be helpful to plot the average number of neighbouring cells in simulations as a function of density to quantitatively support the claim.

      We thank the reviewer for their suggestion. Distributions of neighbour numbers for exemplar simulations with varying density can now be found in  figure 4 supplementary figure 1 and are referred to in the text on page 11.

      -  The authors analyze the effect of PSM length on synchrony in Fig. 4. I think kymographs of synchrony r as shown in Fig. 2D would also be helpful to show that indeed cells get synchronized while advecting through a longer PSM.

      We thank the reviewer for their suggestion and agree that visualising the data in this way is an excellent idea. We have generated the suggested kymographs and added them to figure 4 as supplements 2 and 4, and discussed these results in the text on page 12.

      -  I understand that cells in M phase can interact with neighboring cells with the same coupling strength kappa in the model, although their clocks are arrested. If so, this aspect should be also mentioned in the main text in page 16, as this coupling can be another noise source for synchrony.

      We agree this is an important clarification. We explicitly state this, and briefly justify our choice, in the text on page 16.

      -  Figure 5-figure supplement 2: panel labels A, B, C are missing. 

      Thank you for bringing this to our attention. These have now been added.

      – Figure 5-figure supplement 3: panel labels A, B, C are missing.

      Thank you for bringing this to our attention. These have now been added.

      Reviewer #1 (Significance):

      Synchronization of the segmentation clock has been studied by mathematical modeling, but most previous studies considered cells in a static tissue without morphogenesis. In the previous study by Uriu et al. 2021, morphogenetic processes such as cell advection due to tissue elongation, tissue shortening, and cell mobility were considered in synchronization. The current manuscript provides methodological advances in this aspect by newly including cell ingression, tissue compaction and cell cycle. In addition, the authors bring a concept of modularity and evolvability to the field of the vertebrate segmentation clock, which is new. On the other hand, the manuscript confirms that the synchronization of the segmentation clock is robust by careful simulations, but it does not propose or reveal new mechanisms for making it robust or modular. The main targets of the manuscript will be researchers working on somitogenesis and evolutionary biologists who are interested in evolution of developmental systems. The manuscript will also be interested by broader audiences, like developmental biologists, biophysicists, and physicists and computer scientists who are working on dynamical systems.

      We thank the reviewer for their interest in our manuscript and for acknowledging us as one of the first to address the modularity and evolvability of somitogenesis. We hope that this work will encourage others to think about these concepts in this system too.  

      In the original submission, we identified a high enough coupling strength as the main mechanism underlying the identified modularity in somitogenesis. Since, we have included an analysis of the coupling delay and find that it is the interplay between coupling strength and coupling delay that mediate the identified modularity, allowing PSM morphogenesis and the segmentation clock to evolve independently in regions of parameter space that are constrained and determined by the interplay between these two parameters. We have now added an extra figure (figure 8) where we explore this interplay and have discussed it at length in the last section of the results and in the discussion. We again thank the reviewer for encouraging us to include delays in our analysis.

      Reviewer #2 (Evidence, reproducibility and clarity):

      SUMMARY 

      The manuscript from Hammond et al., investigates the modularity of the segmentation clock and morphogenesis in early vertebrate development, focusing on how these processes might independently evolve to influence the diversity of segment numbers across vertebrates.

      Methodology: The study uses a previously published computational model, parameterized for zebrafish, to simulate and analyse the interactions between the segmentation clock and the morphogenesis of the pre-somitic mesoderm (PSM). Their model integrates cell advection, motility, compaction, cell division, and the synchronization of the embryo clock. Three alternative scenarios of PSM morphogenesis were modeled to examine how these changes affect the segmentation clock.

      Model System: The computational model system combines a representation of cell movements and the phase oscillator dynamics of the segmentation clock within a three-dimensional horseshoe-shaped domain mimicking the geometry of the vertebrate embryo PSM. The parameters used for the mathematical model are mostly estimated from previously published experimental findings.

      Key Findings and Conclusions: (1) The segmentation clock was found to be broadly robust against variations in morphogenetic processes such as cell ingression and motility; (2) Changes in the length of the PSM and the strength of phase coupling within the clock significantly influenced the system's robustness; (3) The authors conclude that the segmentation clock and PSM morphogenesis exhibited developmental modularity (i.e. relative independence), allowing these two phenomena to evolve independently, and therefore possibly contributing to the diverse segment numbers observed in vertebrates.

      MAJOR COMMENTS

      (1) The key conclusion drawn by the authors (that there is robustness, and therefore modularity, between the morphogenetic cellular processes modeled and the embryo clock synchronization) stems directly from the modeling results appropriately presented and discussed in the manuscript. The model comprises some strong assumptions, however all have been clearly explained and the parameterization choices are supported by experimental findings, providing biological meaning to the model. Estimated parameters are well explained and seem reasonable assumptions (from the embryology perspective).

      We thank the reviewer for their positive comments about our work

      (2) This study, as is, achieves its proposed goal of evaluating the potential robustness of the embryo clock to changes in (some) morphogenetic processes. The authors do not claim that the model used is complete, and they properly identify some limitations, including the lack of cellcell interactions. Given the recognized importance of cellular physical interactions for successful embryo development, including them in the model would be a significant addition in future studies.

      We would like to clarify that the model does include cell-cell interactions as cells interact with their neighbours’ clock phase to synchronise and to avoid occupying the same physical space. 

      (3) The authors have deposited all the code used for analysis in a public GitHub repository that is updated and available for the research community.

      We support open source coding practices.

      (4) In page 6, the authors justify their choice of clock parameters for cells ingressing the PSM: "As ingressing cells do not appear to express segmentation clock genes (Mara et al. (2007)), the position at which cells ingress into the PSM can create challenges for clock patterning, as only in the 'off' phase of the clock will ingressing cells be in-phase with their neighbours."  However, there are several lines of evidence (in chick and mouse), that some oscillatory clock genes are already being expressed as early as in the gastrulation phase (so prior to PSM ingression) (Feitas et al, 2001 [10.1242/dev.128.24.5139]; Jouve et al, 2002 [10.1242/dev.129.5.1107]; Maia-Fernandes at al, 2024 [10.1371/journal.pone.0297853]) Question: Is this also true in zebrafish? (I.e. is there any recent experimental evidence that the clock genes are not expressed at ingression, since the paper cited to support this assumption is from 2007). If they are expressed in zebrafish (as they are in mouse and chick), then the cell addition should have random clock gene periods when they enter the PSM and not start all with a constant initial phase of zero. Probably this will not impact the results since the cells will also be out of phase with their neighbours when they "ingress", however, it will model more closely the biological scenario (and avoid such criticism).

      We thank the reviewer for their comments. While it is known that in zebrafish the clock begins oscillating during epiboly and before the onset of segmentation (Riedel-Kruse et al., 2007), to our knowledge no-one has examined whether posteriorly or laterally ingressing progenitor cells express clock genes prior to their ingression into the PSM, which occurs later in development than the first oscillations which give rise to the first somites. We have not found any published evidence of her/hes gene expression in the dorsal donor tissues or lateral tissues surrounding the PSM, however we acknowledge that this has not been actively studied before and our assumption relies on an absence of evidence, rather than evidence of absence. 

      However, we agree with the reviewer that one should include such an analysis for completeness, and we have now generated additional simulations where progenitor cells ingress with a random clock phase. This data is presented in figure 2 supplement 1 and mentioned in the main text on page 9.

      MINOR COMMENTS 

      (1) The citations are appropriate and cover the major labs that have published work related to this study (although with some overrepresentation of the lab that published the model used).

      We have cited the vast literature on somitogenesis to the best of our ability and do recognise that the work of the Oates lab appears prominently, but this is probably because their experimental data were originally used to parametrise the model in Uriu et al. 2021.

      (2) The text is clear, carefully written, and both the methods and the reasoning behind them are clearly explained and supported by proper citations.

      We are very glad to see that the reviewer found that the manuscript was clearly presented.

      (3) The figures are comprehensive, properly annotated, with explanatory self-contained legends. I have no comments regarding the presentation of the results.

      Thank you

      (4) Minor suggestions: 

      a. Page 26: In the Cell addition sub-section of the Methods section, correct all instances where the word domain is used, but subdomain should be used (for clarity and coherence with the description of the model, stated as having a single domain comprising 3 subdomains).

      We thank the reviewer for raising this, this is a good point. We have now corrected to ‘subdomain’ where appropriate.

      b. Page 32: Table 1. Parameter values used in our work, unless otherwise stated -> Suggestion: Add a column with the individual citations used for each parameter (to facilitate the confirmation of each corresponding reference).

      Thank you for the suggstion, we have now done this (see table 1 page 36).

      Reviewer #2 (Significance):

      GENERAL ASSESSMENT 

      This study uses a previously published model to simulate alternative scenarios of morphogenetic parameters to infer the potential independence (termed here modularity) between the segmentation clock and a set of morphogenetic processes, arguing that such modularity could allow the evolution of more flexible body plans, therefore partially explaining the variability in the number of segments observed in the vertebrates. This question is fundamental and relevant, yet still poorly researched. This work provides a comprehensive simulation with a model that tries to simplify the many morphogenetic processes described in the literature, reducing it to a few core fundamental processes that allow drawing the conclusions seeked. It provides theoretical insight to support a conceptual advance in the field of evolutionary vertebrate embryology.

      ADVANCE

      This study builds on a model recently published by Uriu et al. (eLife, 2021) that incorporates quantitative experimental data within a modeling framework including cell and tissue-level parameters, allowing the study of multiscale phenomena active during zebrafish embryo segmentation. Uriu's publication reports many relevant and often non-intuitive insights uncovered by the model, most notably the description of phase vortices formed by the synchronizing genetic oscillators interfering with the traveling-wave front pattern.  However, this model can be further explored to ask additional questions beyond those described in the original paper. A good example is the present study, which uses this mathematical framework to investigate the potential independence between two of the modeled processes, thereby extracting extra knowledge from it. Accordingly, the present study represents a step forward in the direction of using relevant theoretical frameworks to quantitatively explore the landscape of complex molecular hypotheses in silico, and with it shed some light on fundamental open questions or inform the design of future experiments in the lab.

      The study incorporates a wide range of existing literature on the developmental biology of vertebrates. It comprehensively cites prior work, such as the foundational studies by Cooke and Zeeman on the segmentation clock and the role of FGF signaling in PSM development as discussed by Gomez et al. The literature properly covers the breadth of knowledge in this field.

      AUDIENCE

      Target audience | This study is relevant for fundamental research in developmental biology, specifically targeting researchers who focus on early embryo development and morphogenesis from both experimental and theoretical perspectives. It is also relevant for evolutionary biologists investigating the genetic factors that influence vertebrate evolution, as well as to computational biologists and bioinformatics researchers studying developmental processes and embryology.

      Developmental researchers studying the segmentation clock in other vertebrate model organisms (namely mouse and chick), will find this publication especially valuable since it provides insights that can help them formulate new hypotheses to elucidate the molecular mechanisms of the clock (for example finding a set of evolutionarily divergent genes that might interfere with PSM length). Additionally, this study provides a set of cellular parameters that have yet to be measured in mouse and chick, therefore guiding the design of future experiments to measure them, allowing the simulation of the same model with sets of parameters from different vertebrate model organisms, therefore testing the robustness of the findings reported for zebrafish.

      Reviewer #3 (Evidence, reproducibility and clarity): 

      In this manuscript, Verd and colleagues explored how various biologically relevant factors influence the robustness of clock dynamics synchronization among neighboring cells within the context of somatogenesis, adapting a mathematical model presented by Urio et. al in 2021 in a similar context. Specifically they show that clock dynamics is robust to different biological mechanisms such as cell infusion, cellular motility, compaction-extension and cell-division. On the other hand , the length of Presomitic Mesoderm (PSM) and density of cells in it has a significant role in the robustness of clock dynamics. While the manuscript is well-written and provides clear descriptions of methods and technical details, it tends to be somewhat lengthy.

      Below are the comments I would like the authors to address:

      (1) The authors mention that "...the model is three dimensional and so can quantitatively recapture the rates of cell mixing that we observe in the PSM". I am not convinced with this justification of using a 3D model. None of the effects the authors explore in this manuscript requires a three dimensional model or full physical description of the cellular mechanics such as excluded volume interaction etc. A one-dimensional model characterized by cell position along the arclength of PSM and somatic region and segmentation clock phase θ can incorporate all the physics authors described in this manuscript as well as significantly computationally cheap allowing the authors to explore the effect of different parameters in greater detail.

      One of the main objectives of the work we present in this manuscript is to assess how the evolution of PSM morphogenesis affects, or does not affect, segment patterning. The PSM is a three-dimensional tissue with differing cell rearrangement dynamics along its anterior-posterior axis. In addition, PSM dimension, density, the rearrangement rate, and patterns of cell ingression all vary across vertebrate species, and they are functional, especially cell mixing as it promotes synchronisation and drives elongation. In order to answer questions on the modularity of somitogenesis we therefore consider it absolutely necessary to include a three-dimensional representation of the PSM that captures single cells and their movements. In addition, this will allow us, as Reviewer #2 also pointed out, to reparametrize our model using species-specific data as it becomes available. 

      While the reviewer is right in that lower dimensional representations would be computationally more efficient, and are generally more tractable, it would not be possible to represent cell mixing in one dimension, as this happens in three dimensions. One could perhaps encode the synchrony-promoting effect of cell mixing via some coupling function κ(x) that increases towards the posterior, however it is unclear what existing biological data one could use to parameterise this function or determine its form. Cell mixing can be modelled in a two-dimensional framework, however this cannot quantitatively recapture the rate of cell mixing observed in vivo, which is an advantage of this model. 

      Furthermore, it is unclear how one would simulate processes such as compactionextension using a one-dimensional model. The two different scenarios of cell ingression which we consider can also not be replicated in a one-dimensional model, as having a population of cells re-acquiring synchrony on the dorsal surface of the tissue while new material is added to the ventral side, creating asynchrony, is qualitatively different than a one-dimensional scenario where cells are introduced continuously along the spatial axis.

      (2) I am not sure about the justification for limiting the quantification of phase synchrony in a very limited (one cell diameter wide) region at one end of the somatic part (Page 33 below Fig. 9). From my understanding of the manuscript, the segments appear in significant length anterior to this region. Wouldn't an ensemble average of multiple such one cell diameter wide regions in the somatic region be a more accurate metric for quantifying synchrony?

      Indeed, such a metric (e.g. as that used by Uriu et al. to quantify synchrony along the xaxis) would be more accurate for determining synchrony within the PSM. However, as per the clock and wavefront model of somitogenesis, only synchrony at the very anterior of the PSM (or at the wavefront, equivalently) is functional for somitogenesis and thus evolution. Therefore, we restrict our analysis to the anterior-most region of the PSM. We now further justify this in the main text on page 9.

      (3) While studying the effect of cellular ingression, the authors study three discrete modes- random, DP and DP+LV and show that in the DP+LV mode the clock synchrony becomes affected. I would like the authors to explore this in a continuous fashion from a pure DP ingression to Pure LV ingression and intermediates.

      We thank the reviewer for this suggestion; this is a very interesting question. We are currently working on a related computational and experimental project to address the question of how PSM morphogenesis can change over evolutionary time to evolve the different modes that we see across species. As part of this work, we are running precisely the simulations suggested by the reviewer to find regions of parameter space in which all the relevant morphogenetic processes can freely evolve.  While interesting, this work is however outside the scope of the current manuscript.

      (4) While studying the effect of length and density of cells in PSM on cellular synchrony, the authors restrict to 3 values of density and 6 values of PSM length keeping the other parameter constant. I would be interested to see a phase diagram similar to Fig. 7 in the two-dimensional parameter space of L and ρ0. I am curious if a scaling relation exists for the parameter values that partition the parameter space with and without synchrony.

      We thank the reviewer for their suggestion and agree that this would constitute an interesting addition to the manuscript. We have now generated these data, which are shown in figure 4 supplement 5 and mentioned on page 13. We see no clear relationship between these two variables when co-varying in the presence of random ingression. 

      (5) Both in the abstract and introduction, the authors discuss at a great length about the variability in the number of segments. I am curious how the number and width of the segments observed depend on different parameters related to cellular mechanics and the segmentation clock ?

      We thank the reviewer for this question. It was not clear to us if this was something the reviewer wants us to address in the study’s background and introduction, or an analysis we should include in the results. Therefore, we have responded to both comprehensively below:

      The prevailing conceptual framework for understanding this is the clock and wavefront model (Cooke and Zeeman, 1976), which posits that the somite length is inversely proportional to the frequency of the clock relative to the speed of the wavefront, and that the total number of segments is the relative frequency multiplied by the total duration of somitogenesis.

      Experimentally we know that the frequency is determined in part by the coupling strength (Liao, Jorg, and Oates, 2016), and from comparative embryological studies (Gomez et al., 2008; Steventon et al., 2016) we know that changes in the elongation dynamics of the PSM correlate with changes in somite number, presumably by altering the total duration of somitogenesis (Gomez et al., 2009). These changes in elongation are thought to be driven by the changes in cell and tissue mechanics we test in our manuscript. 

      Within our model, we cannot in general predict how the number of segments responds to changes in either clock parameters or cell mechanical parameters, as we lack understanding of what causes somitogenesis to cease; this is thus not encoded in our model and segmentation can in principle proceed indefinitely. Therefore, we have not performed this analysis.

      Similarly, we have not included an analysis of somite length. This is for two reasons: 1) as per the clock and wavefront model, the frequency at the PSM anterior (which we analyse) is equivalent to this measurement, as we assume (in general) the wavefront ($x = x_{a}$) is inertial. 2) the length of the nascent somite is not thought to be of much relevance to the adult phenotype, and by extension evolution. Somites undergo cell division and growth soon after their patterning by the segmentation clock, therefore their final size does not majorly depend on the dynamics of the segmentation clock. Rather, the main function of the clock is to control their number (and polarity).

      (6) The authors assume that the phase dynamics of the chemical network may be described by an oscillator with constant frequency. For the completeness of the manuscript, the author should discuss in detail, for which chemical networks this is a good assumption.

      We thank the reviewer for their suggestion and now justify this assumption in the methods on page 31. 

      Such an assumption is appropriate for the segmentation clock, as the clock in the posterior of the PSM is thought to oscillate with a constant frequency, at least for the majority of somitogenesis although the frequency of somite formation slows towards the end of this process in zebrafish (Giudicelli et al., 2007, PLoS Biol.). In addition, PSM cells isolated and cultured in the presence of FGF (thus replicating the signalling environment of the posterior PSM) will continue to exhibit her1 oscillations with an apparently constant frequency (Webb et al., 2016). 

      We note that such formulations are widely used within the segmentation clock literature (e.g. Riedel-Kruse et al., 2007, Morelli et al., 2009).

      (7) Figure 3 and the associated text shows no effect of the cellular motility profile in the synchrony of the segmentation clock. This may be moved to the supplementary considering the length of this manuscript.

      Thank you for the suggestion. However, we would argue that the lack of effect is a crucial result when discussing modularity. Reviewer #2 agrees with this assessment.

      Reviewer #3 (Significance): 

      The manuscript answers some important questions in the synchrony of segmentation clock in the vertebrates utilizing a model published earlier. However, the presented result is incomplete in some aspects (points 2 to 5 of section A) and that could be overcome by a more detailed analysis using a simpler one dimensional (point 1 of section A). I believe this manuscript could be of interest to an intersecting audience of developmental biologists, systems biologists, and physicists/engineers interested in dynamical systems.

    1. eLife Assessment

      This useful study presents a comparative investigation of category selectivity in dogs and humans. The study compares brain representations of animate and inanimate objects, replicating and extending previous reports in this nascent field of dog FMRI. The methods and results seem to lack sufficient detail, appropriate controls, or statistical evidence, so at this stage of the review process, the strength of evidence is deemed incomplete.

    2. Reviewer #1 (Public review):

      Summary

      Farkas and colleagues conducted a comparative neuroimaging study with domestic dogs and humans to explore whether social perception in both species is underpinned by an analogous distinction between animate and inanimate entities an established functional organizing principle in the primate and human brain. Presenting domestic dogs and humans with clips of three animate classes (dogs, humans, cats) and one inanimate control (cars), the authors also set out to compare how dogs and humans perceive their own vs other species. Both research questions have been previously studied in dogs, but the authors used novel dynamic stimuli and added animate and inanimate classes, which have not been investigated before (i.e., cats and cars). Combining univariate and multivariate analysis approaches, they identified functionally analogous areas in the dog and human occipito-temporal cortex involved in the perception of animate entities, largely replicating previous observations. This further emphasizes a potentially shared functional organizing principle of social perception in the two species. The authors also describe between-species divergencies in the perception of the different animate classes, arguing for a less generalized perception of animate entities in dogs, but this conclusion is not convincingly supported by the applied analyses and reported findings.

      Strengths

      Domestic dogs represent a compelling model species to study the neural bases of social perception and potentially shared functional organizing principles with humans and primates. The field of comparative neuroimaging with dogs is still young, with a growing but still small number of studies, and the present study exemplifies the reproducibility of previous research. Using dynamic instead of static stimuli and adding new stimuli classes, Farkas and colleagues successfully replicated and expanded previous findings, adding to the growing body of evidence that social perception is underpinned by a shared functional organizing principle in the dog and human occipito-temporal cortex.

      Weaknesses

      The study design is imbalanced, with only one category of inanimate objects vs. three animate entities. Moreover, based on the example videos, it appears that the animate stimuli also differed in the complexity of the content from the car stimuli, with often multiple agents interacting or performing goal-directed actions. Moreover, while dogs are familiar with cars, they are definitely of lower relevance and interest to them than the animate stimuli. Thus, to a certain extent, the results might also reflect differences in attention towards/salience of the stimuli.

      The methods section and rationale behind the chosen approaches were often difficult to follow and lacked a lot of information, which makes it difficult to judge the evidence and the drawn conclusions, and it weakens the potential for reproducibility of this work. For example, for many preprocessing and analysis steps, parameters were missing or descriptions of the tools used, no information on anatomical masks and atlas used in humans was provided, and it is often not clear if the authors are referring to the univariate or multivariate analysis.

      In regard to the chosen approaches and rationale, the authors generally binarize a lot of rich information. Instead of directly testing potential differences in the neural representations of the different animate entities, they binarize dissimilarity maps for, e.g. animate entity > inanimate cars and then calculate the overlap between the maps. The comparison of the overlap of these three maps between species is also problematic, considering that the human RSA was constricted to the occipital and temporal cortex (there is now information on how they defined it) vs. whole-brain in dogs. Considering that the stimuli do differ based on low-level visual properties (just not significantly within a run), the RSA would also allow the authors to directly test if some of the (dis)similarities might be driven by low-level visual features like they, e.g. did with the early visual cortex model. I do think RSA is generally an excellent choice to investigate the neural representation of animate (and inanimate) stimuli, but the authors should apply it more appropriately and use its full potential.

      The authors localized some of the "animate areas" also with the early visual cortex model (e.g. ectomarginal gyrus, mid suprasylvian); in humans, it only included the known early visual cortex - what does this mean for the animate areas in dogs?

      The results section also lacks information and statistical evidence; for example, for the univariate region-of-interest (ROI) analysis (called response profiles) comparing activation strength towards each stimulus type, it is not reported if comparisons were significant or not, but the authors state they conducted t-tests. The authors describe that they created spheres on all peaks reported for the contrast animate > inanimate, but they only report results for the mid suprasylvian and occipital gyrus (e.g. caudal suprasylvian gyrus is missing). Furthermore, considering that the ROIs were chosen based on the contrast animate > inanimate stimuli, activation strength should only be compared between animate entities (i.e., dogs, humans, cats), while cars should not be reported (as this would be double dipping, after selecting voxels showing lower activation for that category). The descriptive data in Figure 3B (pending statistical evidence) suggests there were no strong differences in activation for the three species in dog and human animate areas. Thus, the ROI analysis appears to contradict findings from the binary analysis approach to investigate species preference, but the authors only discuss the results of the latter in support of their narrative for conspecific preference in dogs and do not discuss research from other labs investigating own-species preference.

      The authors also unnecessarily exaggerate novelty claims. Animate vs inanimate and own vs other species perceptions have both been investigated before in dogs (and humans), so any claims in that direction seem unsubstantiated - and also not needed, as novelty itself is not a sign of quality; what is novel, and a sign of theoretical advance besides the novelty, are as said the conceptual extension and replication of previous work.

      Overall, more analyses and appropriate tests are needed to support the conclusions drawn by the authors, as well as a more comprehensive discussion of all findings.

    3. Reviewer #2 (Public review):

      Summary:

      The manuscript reports an fMRI study looking at whether there is animacy organization in a non-primate, mammal, the domestic dog, that is similar to that observed in humans and non-human primates (NHPs). A simple experiment was carried out with four kinds of stimulus videos (dogs, humans, cats, and cars), and univariate contrasts and RSA searchlight analysis was performed. Previous studies have looked at this question or closely associated questions (e.g. whether there is face selectivity in dogs). The import of the present study is that it looks at multiple types of animate objects, dogs, humans, and cats, and tests whether there was overlapping/similar topography (or magnitude) of responses when these stimuli were compared to the inanimate reference class of cars. The main finding was of some selectivity for animacy though this was primarily driven by the dog stimuli, which did overlap with the other animate stimulus types, but far less so than in humans.

      Strengths:

      I believe that this is an interesting study in so far as it builds on other recent work looking at category-selectivity in the domestic dog. Given the limited number of such studies, I think it is a natural step to consider a number of different animate stimuli and look at their overlap. While some of the results were not wholly surprising (e.g. dog brains respond more selectively for dogs than humans or cats), that does not take away from their novelty, such as it is. The findings of this study are useful as a point of comparison with other recent work on the organization of high-level visual function in the brain of the domestic dog.

      Weaknesses:

      (1) One challenge for all studies like this is a lack of clarity when we say there is organization for "animacy" in the human and NHP brains. The challenge is by no means unique to the present study, but I do think it brings up two more specific topics.

      First, one property associated with animate things is "capable of self-movement". While cognitively we know that cars require a driver, and are otherwise inanimate, can we really assume that dogs think of cars in the same way? After all, just think of some dogs that chase cars. If dogs represent moving cars as another kind of self-moving thing, then it is not clear we can say from this study that we have a contrast between animate vs inanimate. This would not mean that there are no real differences in neural organization being found. It was unclear whether all or some of the car videos showed them moving. But if many/most do, then I think this is a concern.

      Second, there is quite a lot of potential complexity in the human case that is worth considering when interpreting the results of this study. In the human case, some evidence suggests that animacy may be more of a continuum (Sha et al. 2015), which may reflect taxonomy (Connolly et al. 2012, 2016). However moving videos seem to be dominated more by signals relevant to threat or predation relative to taxonomy (Nastase et al. 2017). Some evidence suggests that this purported taxonomic organization might be driven by gradation in representing faces and bodies of animals based on their relative similarity to humans (Ritchie et al. 2021). Also, it may be that animacy organization reflects a number of (partially correlated) dimensions (Thorat et al. 2019, Jozwik et al. 2022). One may wonder whether the regions of (partial) overlap in animate responses in the dog brain might have some of these properties as well (or not).

      (2) It is stated that previous studies provide evidence that the dog brain shows selectivity to "certain aspects of animacy". One of these already looked at selectivity for dog and human faces and bodies and identified similar regions of activity (Boch et al. 2023). An earlier study by Dilks et al. (2015), not cited in the present work (as far as I can tell), also used dynamic stimuli and did not suffer from the above limitations in choosing inanimate stimuli (e.g. using toy and scene objects for inanimate stimuli). But it only included human faces as the dynamic animate stimulus. So, as far as stimulus design, it seems the import of the present study is that it included a *third* animate stimulus (cats) and that the stimuli were dynamic.

      (3) I am concerned that the univariate results, especially those depicted in Figure 3B, include double dipping (Kriegesorte et al. 2009). The analysis uses the response peak for the A > iA contrast to then look at the magnitude of the D, H, C vs iA contrasts. This means the same data is being used for feature selection and then to estimate the responses. So, the estimates are going to be inflated. For example, the high magnitudes for the three animate stimuli above the inanimate stimuli are going to inherently be inflated by this analysis and cannot be taken at face value. I have the same concern with the selectivity preference results in Figure 3E.

      I think the authors have two options here. Either they drop these analyses entirely (so that the total set of analyses really mirrors those in Figure 4), or they modify them to address this concern. I think this could be done in one of two ways. One would be to do a within-subject standard split-half analysis and use one-half of the data for feature selection and the other for magnitude estimation. The other would be to do a between-subject design of some kind, like using one subject for magnitude estimation based on an ROI defined using the data for the other subjects.

      (4) There are two concerns with how the overlap analyses were carried out. First, as typically carried out to look at overlap in humans, the proportion is of overlapping results of the contrasts of interest, e.g, for face and body selectivity overlap (Schwarlose et al. 2006), hand and tool overlap (Bracci et al. 2012), or more recently, tool and food overlap (Ritchie et al. 2024). There are a number of ways of then calculating the overlap, with their own strengths and weaknesses (see Tarr et al. 2007). Of these, I think the Jaccard index is the most intuitive, which is just the intersection of two sets as a proportion of their union. So, for example, the N of overlapping D > iA and H > iA active voxels is divided by the total number of unique active voxels for the two contrasts. Such an overlap analysis is more standard and interpretable relative to previous findings. I would strongly encourage the authors to carry out such an analysis or use a similar metric of overlap, in place of what they have currently performed (to the extent the analysis makes sense to me).

      Second, the results summarized in Figure 3A suggest multiple distinct regions of animacy selectivity. Other studies have also identified similar networks of regions (e.g. Boch et al. 2023). These regions may serve different functions, but the overlap analysis does not tell us whether there is overlap in some of these portions of the cortex and not in others. The overlap is only looked at in a very general sense. There may be more overlap locally in some portions of the cortex and not in others.

      (5) Two comments about the RSA analyses. First, I am not quite sure why the authors used HMAX rather than layers of a standardly trained ImageNet deep convolutional neural network. This strikes me also as a missed opportunity since many labs have looked at whether later layers of DNNs trained on object categorization show similar dissimilarity structures as category-selective regions in humans and NHPs. In so far as cross-species comparisons are the motivation here, it would be genuinely interesting to see what would happen if one did a correlation searchlight with the dog brain and layers of a DNN, a la Cichy et al. (2016).

      Second, from the text is hard to tell what the models for the class- and category-boundary effects were. Are there RDMs that can be depicted here? I am very familiar with RSA searchlight and I found the description of the methods to be rather opaque. The same point about overlap earlier regarding the univariate results also applies to the RSA results. Also, this is again a reason to potentially compare DNN RDMs to both the categorical models and the brains of both species.

      (6) There has been emphasis of late on the role of face and body selective regions and social cognition (Pitcher and Ungerleider, 2021, Puce, 2024), and also on whether these regions are more specialized for representing whole bodies/persons (Hu et al. 2020, Taubert, et al. 2022). It may be that the supposed animacy organization is more about how we socialize and interact with other organisms than anything about animacy as such (see again the earlier comments about animacy, taxonomy, and threat/predation). The result, of a great deal of selectivity for dogs, some for humans, and little for cats, seems to readily make sense if we assume it is driven by the social value of the three animate objects that are presented. This might be something worth reflecting on in relation to the present findings.

    4. Author response:

      Public Reviews:

      Reviewer #1 (Public review):

      Summary

      Farkas and colleagues conducted a comparative neuroimaging study with domestic dogs and humans to explore whether social perception in both species is underpinned by an analogous distinction between animate and inanimate entities an established functional organizing principle in the primate and human brain. Presenting domestic dogs and humans with clips of three animate classes (dogs, humans, cats) and one inanimate control (cars), the authors also set out to compare how dogs and humans perceive their own vs other species. Both research questions have been previously studied in dogs, but the authors used novel dynamic stimuli and added animate and inanimate classes, which have not been investigated before (i.e., cats and cars). Combining univariate and multivariate analysis approaches, they identified functionally analogous areas in the dog and human occipitotemporal cortex involved in the perception of animate entities, largely replicating previous observations. This further emphasizes a potentially shared functional organizing principle of social perception in the two species. The authors also describe between- species divergencies in the perception of the different animate classes, arguing for a less generalized perception of animate entities in dogs, but this conclusion is not convincingly supported by the applied analyses and reported findings.

      Strengths

      Domestic dogs represent a compelling model species to study the neural bases of social perception and potentially shared functional organizing principles with humans and primates. The field of comparative neuroimaging with dogs is still young, with a growing but still small number of studies, and the present study exemplifies the reproducibility of previous research. Using dynamic instead of static stimuli and adding new stimuli classes, Farkas and colleagues successfully replicated and expanded previous findings, adding to the growing body of evidence that social perception is underpinned by a shared functional organizing principle in the dog and human occipito-temporal cortex.

      Weaknesses

      The study design is imbalanced, with only one category of inanimate objects vs. three animate entities. Moreover, based on the example videos, it appears that the animate stimuli also differed in the complexity of the content from the car stimuli, with often multiple agents interacting or performing goal-directed actions. Moreover, while dogs are familiar with cars, they are definitely of lower relevance and interest to them than the animate stimuli. Thus, to a certain extent, the results might also reflect differences in attention towards/salience of the stimuli.

      We agree with the Reviewer and were aware that using only one class of inanimate objects but three classes of animate entities, along with the differences in complexity and relevance between the animate and the inanimate stimuli potentially elicited more attention to the inanimate condition and may have thus introduced a confound. We are revising the related limitation in the discussion to acknowledge this and to emphasize why we believe these differences do not compromise our main findings.

      The methods section and rationale behind the chosen approaches were often difficult to follow and lacked a lot of information, which makes it difficult to judge the evidence and the drawn conclusions, and it weakens the potential for reproducibility of this work. For example, for many preprocessing and analysis steps, parameters were missing or descriptions of the tools used, no information on anatomical masks and atlas used in humans was provided, and it is often not clear if the authors are referring to the univariate or multivariate analysis.

      We acknowledge the concerns regarding the clarity and completeness of the methods section and are significantly revising the descriptions of the methods. Of note, in humans, the Harvard-Oxford Cortical Structural Atlas (Frazier et al., 2005; Makris et al., 2006; Desikan et al., 2006; Goldstein et al., 2007), implemented within the FSL software package, was used for anatomical masks, while the Automated Anatomical Labeling atlas (Tzourio-Mazoyer et al., 2002) was used for assigning labels.

      In regard to the chosen approaches and rationale, the authors generally binarize a lot of rich information. Instead of directly testing potential differences in the neural representations of the different animate entities, they binarize dissimilarity maps for, e.g. animate entity > inanimate cars and then calculate the overlap between the maps.

      We thank the Reviewer for these comments and ideas. We also appreciate the second Reviewer for their related concerns and suggestions about the overlap calculation. Since the neural processing of different animate entities in the dog brain is largely unexplored, in some of our analyses we aimed to provide a straightforward and directly comparable characterization of animacy perception in the two species. We believe that a measure of how overlapping the neural representations of different animate classes are in the dog vs. the human visual cortex is a simple but meaningful and insightful characterization of how animacy perception is structured in the two species, despite the lack of spatial detail. Our decision to use binarization was based on these considerations. In response to this Reviewer’s request for providing richer information, in our revised manuscript, we will present more details and additional non-binarized calculations. Specifically, we are going to use nonbinarized data to present the response profiles of a broad, anatomically defined set of regions that have been related in other works to visual functions, to thus show where there is significant difference and overlap between the neural responses for the three animate classes in each species.

      The comparison of the overlap of these three maps between species is also problematic, considering that the human RSA was constricted to the occipital and temporal cortex (there is now information on how they defined it) vs. whole-brain in dogs.

      We thank this Reviewer for raising yet another relevant point about overlap calculation. We note that the overlap calculation for univariate results used the visually responsive cortex in both dogs and humans. The decision to restrict the multivariate analysis to the occipital and temporal lobes in humans, where the visual areas are, was to reduce computational load. Since RSA in dogs yielded significant voxels almost exclusively in the occipital and temporal cortices, we believe this decision did not introduce major bias in our results. This concern will also be discussed in our revised submission.

      Of note, in the category- and class-boundary test, as for the other multivariate tests, the occipital and temporal cortex of humans was delineated based on the MNI atlas.

      Considering that the stimuli do differ based on low-level visual properties (just not significantly within a run), the RSA would also allow the authors to directly test if some of the (dis)similarities might be driven by low-level visual features like they, e.g. did with the early visual cortex model. I do think RSA is generally an excellent choice to investigate the neural representation of animate (and inanimate) stimuli, but the authors should apply it more appropriately and use its full potential.

      We thank the Reviewer for this suggestion. While this study did not aim to investigate the correlation between low-level visual features and animacy, the data is available, and the suggested analysis can be conducted in the future. This issue will also be discussed in our revised submission.

      The authors localized some of the "animate areas" also with the early visual cortex model (e.g. ectomarginal gyrus, mid suprasylvian); in humans, it only included the known early visual cortex - what does this mean for the animate areas in dogs?

      We thank the Reviewer for raising this point. Although the labels are the same, both EMG and mSSG are relatively large gyri, and the clusters revealed by each of the two analyses hardly overlap, with peak coordinates more than 12 mm apart for R EMG, and in different hemispheres for mSSG (but more than 11 mm apart even if projected on the same hemisphere). We will detail the differences and the overlaps in the revised submission.

      The results section also lacks information and statistical evidence; for example, for the univariate region-of-interest (ROI) analysis (called response profiles) comparing activation strength towards each stimulus type, it is not reported if comparisons were significant or not, but the authors state they conducted t-tests. The authors describe that they created spheres on all peaks reported for the contrast animate > inanimate, but they only report results for the mid suprasylvian and occipital gyrus (e.g. caudal suprasylvian gyrus is missing).

      We thank this Reviewer for catching these errors. The missing statistics will be provided in the revised manuscript. Also, we mistakenly named the peak in caudal suprasylvian gyrus occipital gyrus on the figure depicting the response profiles. This will also be corrected.

      Furthermore, considering that the ROIs were chosen based on the contrast animate > inanimate stimuli, activation strength should only be compared between animate entities (i.e., dogs, humans, cats), while cars should not be reported (as this would be double dipping, after selecting voxels showing lower activation for that category).

      We thank both Reviewers for raising this relevant point about potential double dipping. The aim of this analysis was to describe the relationship between the neural response elicited by the three animate stimulus classes, to show that the animacy-sensitive peaks are not the results of the standalone greater response to a single animate class. We conducted t-tests only to assess significant difference between these three animate conditions and no stats were performed or reported for any animate class vs. inanimate comparisons in these ROIs. In addition to providing the missing t-tests (comparing animate classes), we will present response profiles and corresponding statistics for a broad set of additional, independent ROIs, defined either anatomically or functionally by other studies in the revised version.

      The descriptive data in Figure 3B (pending statistical evidence) suggests there were no strong differences in activation for the three species in dog and human animate areas. Thus, the ROI analysis appears to contradict findings from the binary analysis approach to investigate species preference, but the authors only discuss the results of the latter in support of their narrative for conspecific preference in dogs and do not discuss research from other labs investigating own-species preference.

      Studying conspecific-preference was not the primary aim of this study. We only used our data to characterize the animate-sensitive regions from this aspect. The species-preference test provides an overall characterization of the entire animate-sensitive region, revealing a higher number of voxels with a maximal response to conspecific than other stimuli in dogs (and a similar tendency in humans), confirming previous evidence on neural conspecific preference in visual areas in both species. The response profiles presented so far describe only the ROIs around the main animate-sensitive peaks and, as the Reviewer points out, in most cases reveal no significant conspecific bias. We believe there is no contradiction here: the entire animate-sensitive region may weakly but still be conspecific-preferring, whereas the main animate-sensitive peaks are not; the centers of conspecific preference may be located elsewhere in the visual cortex and may be supported by mechanisms other than animacy-sensitivity. In the revised manuscript, we will elaborate more on this. Additionally, in response to other comments, and for a better and more coherent characterization of species preference (and animacy sensitivity) across the visual cortex, we will present response profiles for other, independently defined regions and explore conspecific-sensitivity in those additional regions as well. Furthermore, we will discuss related own-species preference literature in greater detail.

      The authors also unnecessarily exaggerate novelty claims. Animate vs inanimate and own vs other species perceptions have both been investigated before in dogs (and humans), so any claims in that direction seem unsubstantiated - and also not needed, as novelty itself is not a sign of quality; what is novel, and a sign of theoretical advance besides the novelty, are as said the conceptual extension and replication of previous work.

      We agree with this Reviewer regarding novelty claims in general, and we confirm that we had no intention to overstate the uniqueness of our results. We also did not mean to imply that this work would be the first one on animacy perception in dogs, which it obviously is not. But we understand that we could have been more explicit presenting our work as a conceptual extension and replication of previous works, and we are revising the wording of the discussion from this aspect.

      Overall, more analyses and appropriate tests are needed to support the conclusions drawn by the authors, as well as a more comprehensive discussion of all findings.

      We are thankful for all comments. We will revise the methods section to provide sufficient detail and ensure replicability; conduct additional analyses as detailed above; and provide a more comprehensive discussion of all findings.

      Reviewer #2 (Public review):

      Summary:

      The manuscript reports an fMRI study looking at whether there is animacy organization in a non-primate, mammal, the domestic dog, that is similar to that observed in humans and non-human primates (NHPs). A simple experiment was carried out with four kinds of stimulus videos (dogs, humans, cats, and cars), and univariate contrasts and RSA searchlight analysis was performed. Previous studies have looked at this question or closely associated questions (e.g. whether there is face selectivity in dogs). The import of the present study is that it looks at multiple types of animate objects, dogs, humans, and cats, and tests whether there was overlapping/similar topography (or magnitude) of responses when these stimuli were compared to the inanimate reference class of cars. The main finding was of some selectivity for animacy though this was primarily driven by the dog stimuli, which did overlap with the other animate stimulus types, but far less so than in humans.

      Strengths:

      I believe that this is an interesting study in so far as it builds on other recent work looking at category-selectivity in the domestic dog. Given the limited number of such studies, I think it is a natural step to consider a number of different animate stimuli and look at their overlap. While some of the results were not wholly surprising (e.g. dog brains respond more selectively for dogs than humans or cats), that does not take away from their novelty, such as it is. The findings of this study are useful as a point of comparison with other recent work on the organization of high-level visual function in the brain of the domestic dog.

      Weaknesses:

      (1) One challenge for all studies like this is a lack of clarity when we say there is organization for "animacy" in the human and NHP brains. The challenge is by no means unique to the present study, but I do think it brings up two more specific topics.

      First, one property associated with animate things is "capable of self-movement". While cognitively we know that cars require a driver, and are otherwise inanimate, can we really assume that dogs think of cars in the same way? After all, just think of some dogs that chase cars. If dogs represent moving cars as another kind of selfmoving thing, then it is not clear we can say from this study that we have a contrast between animate vs inanimate. This would not mean that there are no real differences in neural organization being found.

      It was unclear whether all or some of the car videos showed them moving. But if many/most do, then I think this is a concern.

      We thank this Reviewer for raising this relevant point about the potential animacy of cars for dogs and its implication for our results. Of note, two-thirds of our car stimuli showed a car moving (slow, accelerating, or fast). We acknowledge that these stimuli contained motionbased animacy cues, and in this regard, there was no clear difference between our animate and inanimate conditions, and possibly between some of the representations they elicited. However, our animate and inanimate stimuli differed in other key factors accounting for animacy organization, such as visual features including the presence of faces, bodies, body parts, postures, and certain aspects of biological motion. So we believe that this limitation does not compromise our main conclusions. We will elaborate on this point further in the revised discussion, also considering how dogs’ differential behavioral responses to cars and animate entities may provide additional insights in this regard.

      Second, there is quite a lot of potential complexity in the human case that is worth considering when interpreting the results of this study. In the human case, some evidence suggests that animacy may be more of a continuum (Sha et al. 2015), which may reflect taxonomy (Connolly et al. 2012, 2016). However moving videos seem to be dominated more by signals relevant to threat or predation relative to taxonomy (Nastase et al. 2017). Some evidence suggests that this purported taxonomic organization might be driven by gradation in representing faces and bodies of animals based on their relative similarity to humans (Ritchie et al. 2021). Also, it may be that animacy organization reflects a number of (partially correlated) dimensions (Thorat et al. 2019, Jozwik et al. 2022). One may wonder whether the regions of (partial) overlap in animate responses in the dog brain might have some of these properties as well (or not).

      We agree that it would be interesting to dissect which animacy-related factor(s) contribute to the observed animacy sensitivity in different regions, and although this was not the original aim of the study, we agree that we could have made better use of the variation in our stimuli to discuss this aspect. Specifically, some animacy features are shared by all three animate stimulus classes, namely the presence of biological motions, faces, and bodies. In contrast, animate classes differed in some other aspects, for example in how dogs perceived dogs, humans, and cats as social agents and in their potential behavioral goals towards them. It can therefore be argued that regions with two- and especially three-way overlapping activations are more probably involved in processing biological motion, face and body aspects, and non-overlapping ones the social agency- and behavioural goal-related aspects. In line with this, the shared animacy features are indeed ones that have been reported to be central in human animacy representation and that may have made the overlaps in human brain responses greater. We will provide a more detailed discussion of the results from this viewpoint in the revised manuscript.

      (2) It is stated that previous studies provide evidence that the dog brain shows selectivity to "certain aspects of animacy". One of these already looked at selectivity for dog and human faces and bodies and identified similar regions of activity (Boch et al. 2023). An earlier study by Dilks et al. (2015), not cited in the present work (as far as I can tell), also used dynamic stimuli and did not suffer from the above limitations in choosing inanimate stimuli (e.g. using toy and scene objects for inanimate stimuli). But it only included human faces as the dynamic animate stimulus. So, as far as stimulus design, it seems the import of the present study is that it included a *third* animate stimulus (cats) and that the stimuli were dynamic.

      We agree with this Reviewer that the findings of Dilks et al. (2015) are relevant to our study and have therefore cited them. However, the citation itself was imprecise and will be corrected in the revised manuscript.

      (3) I am concerned that the univariate results, especially those depicted in Figure 3B, include double dipping (Kriegesorte et al. 2009). The analysis uses the response peak for the A > iA contrast to then look at the magnitude of the D, H, C vs iA contrasts. This means the same data is being used for feature selection and then to estimate the responses. So, the estimates are going to be inflated. For example, the high magnitudes for the three animate stimuli above the inanimate stimuli are going to inherently be inflated by this analysis and cannot be taken at face value. I have the same concern with the selectivity preference results in Figure 3E.

      I think the authors have two options here. Either they drop these analyses entirely (so that the total set of analyses really mirrors those in Figure 4), or they modify them to address this concern. I think this could be done in one of two ways. One would be to do a within- subject standard split-half analysis and use one-half of the data for feature selection and the other for magnitude estimation. The other would be to do a between-subject design of some kind, like using one subject for magnitude estimation based on an ROI defined using the data for the other subjects.

      We thank both Reviewers again for raising this important point about potential double dipping. We also thank this Reviewer for specific suggestions for split-half analyses – we agree that, had our original analyses involved double dipping, such a modification would be necessary. But, as we explained in our response above, this was not the case. Indeed, whereas we do visualize all four conditions in Fig. 3B, we only conducted t-tests to assess differences between the three animate conditions (the corresponding stats have been missing from the original manuscript but will be added during revision). So, importantly, we did not evaluate the magnitude of the D, H, C vs iA contrasts in any of the ROIs defined by animate-sensitive peaks; therefore, we believe that these analyses do not involve double dipping. This holds for the species preference results in Fig. 3E as well. We will clarify this in the revised manuscript. Of note, in response to a request by the other reviewer and to provide richer information about the univariate results, we will also provide response profiles and corresponding stats for a broad set of additional ROIs, defined either anatomically or functionally by other studies (e.g., Boch et al., 2023).

      (4) There are two concerns with how the overlap analyses were carried out. First, as typically carried out to look at overlap in humans, the proportion is of overlapping results of the contrasts of interest, e.g, for face and body selectivity overlap (Schwarlose et al. 2006), hand and tool overlap (Bracci et al. 2012), or more recently, tool and food overlap (Ritchie et al. 2024). There are a number of ways of then calculating the overlap, with their own strengths and weaknesses (see Tarr et al. 2007). Of these, I think the Jaccard index is the most intuitive, which is just the intersection of two sets as a proportion of their union. So, for example, the N of overlapping D > iA and H > iA active voxels is divided by the total number of unique active voxels for the two contrasts. Such an overlap analysis is more standard and interpretable relative to previous findings. I would strongly encourage the authors to carry out such an analysis or use a similar metric of overlap, in place of what they have currently performed (to the extent the analysis makes sense to me).

      We agree with this Reviewer that the Jaccard index is an intuitive and straightforward overlap measure. Importantly, for our overlap calculations we already use this measure (and a very similar one) – but we acknowledge that this was not clear from the original description. Specifically, for the multivariate overlap test, we used the Jaccard index exactly as described by this Reviewer. For the univariate overlap test, we use a very similar measure, with the only difference that there, to reference the search space, the intersection of specific animate-inanimate contrasts was divided by the total voxel number of animate-sensitive areas (which is highly similar to the union of the specific animate-inanimate contrasts). In the revised submission we will provide a more detailed explanation of the overlap calculations, making it explicit that we used the Jaccard index (and a variant of it).

      Second, the results summarized in Figure 3A suggest multiple distinct regions of animacy selectivity. Other studies have also identified similar networks of regions (e.g. Boch et al. 2023). These regions may serve different functions, but the overlap analysis does not tell us whether there is overlap in some of these portions of the cortex and not in others. The overlap is only looked at in a very general sense. There may be more overlap locally in some portions of the cortex and not in others.

      We thank this Reviewer for this comment, we agree that adding spatial specificity to these results will improve the manuscript. Therefore, during revision, we will assess the anatomical distribution of the overlap results, making use of a broad set of ROIs potentially relevant for animacy perception, defined either anatomically or functionally by other studies (e.g., Boch et al., 2023 for dogs).

      (5) Two comments about the RSA analyses. First, I am not quite sure why the authors used HMAX rather than layers of a standardly trained ImageNet deep convolutional neural network. This strikes me also as a missed opportunity since many labs have looked at whether later layers of DNNs trained on object categorization show similar dissimilarity structures as category-selective regions in humans and NHPs. In so far as cross-species comparisons are the motivation here, it would be genuinely interesting to see what would happen if one did a correlation searchlight with the dog brain and layers of a DNN, a la Cichy et al. (2016).

      We thank the Reviewer for this comment and suggestion. At the start of the project, HMAX was the most feasible model to implement given our time and expertise constrains. Additionally, the biologically motivated HMAX was also an appropriate choice, as it simulates the selective tuning of neurons in the primary visual cortex (V1) of primates, which is considered homologous with V1 in carnivores (Boch et al., 2024).

      Although we agree that using DNNs have recently been extensively and successfully used to explore object representations and could provide valuable additional insights for dogs’ visual perception as well, we believe that adding a large set of additional analyses would stretch the frames of this manuscript, disproportionately shifting its focus from our original research question. Also, our experiment, designed with a different, more specific aim in mind, did not provide a large enough stimulus variety of animate stimuli for a general comparison of the cortical hierarchy underlying object representations in dog and human brains and thus our data are not an optimal starting point for such extensive explorations. Having said that, we are thankful for this Reviewer for the idea and will consider using a DNN to uncover dog’ visual cortical hierarchy in future studies with a better suited stimulus set. Furthermore, in accordance with eLife’s data-sharing policies, we will make the current dataset publicly available so further hypothesis and models can be tested.

      Second, from the text is hard to tell what the models for the class- and categoryboundary effects were. Are there RDMs that can be depicted here? I am very familiar with RSA searchlight and I found the description of the methods to be rather opaque. The same point about overlap earlier regarding the univariate results also applies to the RSA results. Also, this is again a reason to potentially compare DNN RDMs to both the categorical models and the brains of both species.

      In the revised manuscript we will provide a more detailed explanation of the methods used to determine class- and category-boundary effects. In short, the analysis we performed here followed Kriegeskorte et al. (2008), and the searchlight test looked for regions in which between-class/category differences were greater than within-class/category differences. We will also include RDMs. Additionally, we will provide anatomical details for the overlap results for RSA, just as for the univariate results, using the same independently defined broad set of ROIs, defined either anatomically or functionally by other studies (e.g., Boch et al., 2023 for dogs).

      (6) There has been emphasis of late on the role of face and body selective regions and social cognition (Pitcher and Ungerleider, 2021, Puce, 2024), and also on whether these regions are more specialized for representing whole bodies/persons (Hu et al. 2020, Taubert, et al. 2022). It may be that the supposed animacy organization is more about how we socialize and interact with other organisms than anything about animacy as such (see again the earlier comments about animacy, taxonomy, and threat/predation). The result, of a great deal of selectivity for dogs, some for humans, and little for cats, seems to readily make sense if we assume it is driven by the social value of the three animate objects that are presented. This might be something worth reflecting on in relation to the present findings.

      We thank the Reviewer for this suggestion. The original manuscript already discussed how motion-related animacy cues involved in social cognition may explain that animacysensitive regions reported in our study extend beyond those reported previously and also the role of biological motion in the observed across-species differences. This discussion of the role of visual diagnostic features and features that involved in perceiving social agents will be extended in the revised discussion, also in response to the first comment of this Reviewer, to reflect on how social cognition-related animacy cues may have affected our results in dogs.

    1. eLife Assessment

      This valuable study provides new insights into the synchronization of ripple oscillations in the hippocampus, both within and across hemispheres. Using carefully designed statistical methods, it presents convincing evidence that synchrony is significantly higher within a hemisphere than across. However, further controlling for potential confounds related to differences in animal behavior will help clarify whether this effect is influenced by memory processing. This study will be of interest to neuroscientists studying the hippocampus and memory.

    2. Reviewer #1 (Public review):

      Summary:

      In this manuscript, the authors analyze electrophysiological data recorded bilaterally from the rat hippocampus to investigate the coupling of ripple oscillations across the hemispheres. Commensurate with the majority of previous research, the authors report that ripples tend to co-occur across both hemispheres. Specifically, the amplitude of ripples across hemispheres is correlated but their phase is not. These data corroborate existing models of ripple generation suggesting that CA3 inputs (coordinated across hemispheres via the commisural fibers) drive the sharp-wave component while the individual ripple waves are the result of local interactions between pyramidal cells and interneurons in CA1.

      Strengths:

      The manuscript is well-written, the analyses well-executed and the claims are supported by the data.

      Weaknesses:

      One question left unanswered by this study is whether information encoded by the right and left hippocampi is correlated.

    3. Reviewer #2 (Public review):

      Summary:

      The authors completed a statistically rigorous analysis of the synchronization of sharp-wave ripples in the hippocampal CA1 across and within hemispheres. They used a publicly available dataset (collected in the Buzsaki lab) from 4 rats (8 sessions) recorded with silicon probes in both hemispheres. Each session contained approximately 8 hours of activity recorded during rest. The authors found that the characteristics of ripples did not differ between hemispheres, and that most ripples occurred almost simultaneously on all probe shanks within a hemisphere as well as across hemispheres. The differences in amplitude and exact timing of ripples between recording sites increased slightly with the distance between recording sites. However, the phase coupling of ripples (in the 100-250 Hz range), changed dramatically with the distance between recording sites. Ripples in opposite hemispheres were about 90% less coupled than ripples on nearby tetrodes in the same hemisphere. Phase coupling also decreased with distance within the hemisphere. Finally, pyramidal cell and interneuron spikes were coupled to the local ripple phase and less so to ripples at distant sites or the opposite hemisphere.

      Strengths:

      The analysis was well-designed and rigorous. The authors used statistical tests well suited to the hypotheses being tested, and clearly explained these tests. The paper is very clearly written, making it easy to understand and reproduce the analysis. The authors included an excellent review of the literature to explain the motivation for their study.

      Weaknesses:

      The authors state that their findings (highly coincident ripples between hemispheres), contradict other findings in the literature (in particular the study by Villalobos, Maldonado, and Valdes, 2017), but fail to explain why this large difference exists. They seem to imply that the previous study was flawed, without examining the differences between the studies.

      The paper fails to mention the context in which the data was collected (the behavior the animals performed before and after the analyzed data), which may in fact have a large impact on the results and explain the differences between the current study and that by Villalobos et al. The Buzsaki lab data includes mice running laps in a novel environment in the middle of two rest sessions. Given that ripple occurrence is influenced by behavior, and that the neurons spiking during ripples are highly related to the prior behavioral task, it is likely that exposure to novelty changed the statistics of ripples. Thus, the authors should analyze the pre-behavior rest and post-behavior rest sessions separately. The Villalobos et al. data, in contrast, was collected without any intervening behavioral task or novelty (to my knowledge). Therefore, I predict that the opposing results are a result of the difference in recent experiences of the studied rats, and can actually give us insight into the memory function of ripples.

      In one figure (5), the authors show data separated by session, rather than pooled. They should do this for other figures as well. There is a wide spread between sessions, which further suggests that the results are not as widely applicable as the authors seem to think. Do the sessions with small differences between phase coupling and amplitude coupling have low inter-hemispheric amplitude coupling, or high phase coupling? What is the difference between the sessions with low and high differences in phase vs. amplitude coupling? I noticed that the Buzsaki dataset contains data from rats running either on linear tracks (back and forth), or on circular tracks (unidirectionally). This could create a difference in inter-hemisphere coupling, because rats running on linear tracks would have the same sensory inputs to both hemispheres (when running in opposite directions), while rats running on a circular track would have different sensory inputs coming from the right and left (one side would include stimuli in the middle of the track, and the other would include closer views of the walls of the room). The synchronization between hemispheres might be impacted by how much overlap there was in sensory stimuli processed during the behavior epoch.

      The paper would be a lot stronger if the authors analyzed some of the differences between datasets, sessions, and epochs based on the task design, and wrote more about these issues. There may be more publicly available bi-hemispheric datasets to validate their results.

    1. eLife Assessment

      In this important study, the authors employed state-of-the-art biochemistry, cryo-EM, and HDX mass spec approaches to study the formation of the binary Uba7-UBE2L6 and ternary UBA7-UBE2L6-ISG15 complexes. The results established mechanisms by which UBA7 and UBE2L6 form disulfide bonds, disrupting the ISG15 transfer cascade. While the biochemical and structural experiments are largely convincing, the mechanism under in vivo conditions remains unclear, due to the limited use of a single E2 enzyme. The authors need to repeat their experiments with a representative panel of human E2 enzymes.

    2. Reviewer #1 (Public review):

      Summary:

      Chen and colleagues describe mechanisms by which UBA7 and UBE2L6 form disulfide bonds, disrupting the ISG15 transfer cascade. As other similar structures are currently available, the authors further note that the spontaneous formation of this disulfide suggests that it is a potential regulatory mechanism. Demonstrating that this mechanism occurs and is modulated in cells would greatly improve the impact of their work.

      Strengths:

      The various biochemical and structural experiments are largely convincing.

      Weaknesses:

      (1) The main point of the paper is that this covalent complex could occur and is potentially regulated in cells is limited. The authors even show an experiment in cells where this complex is formed by expressing UBE2L6-V5 and GFP-UBA7, awkwardly referenced in the discussion.

      The authors should consider attempting an experiment with endogenous proteins and either modulate the formation of this complex in different cellular conditions or downplay this part of their story. For example, this sentence, "This redox-sensitive complex implies a link between oxidative stress and regulation of the immune response, highlighting a potential therapeutic target for modulating immune reactions arising from infections and inflammatory conditions." is in the abstract and should be excluded or rephrased considering the lack of cellular data.

      Also, their one-cell-based experiment is shown in the discussion. This should be in the results as is standard practice but also repeated. It appears that the reduced lanes don't seem to have GFP or the GFP-UBA7. Without those controls, this experiment seems incomplete.

      (2) Their intro sets up the paper to explain the disulfide formation they see in Figure 1, but a more fitting experiment would be to look at the disulfide formation between UBA7 and UBE2L6 at different pHs. It would nicely supplement the biochemical pKa data as this reaction is their focal point.

      (3) While the biochemical data is extensive, it is not concise or easily accessible to a broad readership. The authors should try to clarify and simplify the text overall. Furthermore, many figure callouts are missing, interfering with the clarity of the text.

      Minor

      (1) Because the experiments are pKa dependent, knowing what buffers the proteins were finished in (final SEC purification step) is important. Similarly - for all assays, the buffers were not reported (SEC-MALS, biochemical assays).

      (2) While the CBB and fluorescent gel assays look convincing, more controls are needed for their SEC experiments (Figure 1d), particularly because the authors definitively say the binding is because of S-S bonds. Using a reducing buffer like TCEP or DTT or their catalytic mutants to show reduced co-migration would be helpful. This is even more important given the reported high affinities between UBA7/UBE2L6 in Figure 6.

      (3) Based on the data presented, it is unclear that the kinetic values are taken within initial velocity regimes. Some data in the supplement showing that the single time points represent initial velocities would be appreciated.

      (4) As stated, "Previous experiments reveal an intriguing anomaly during the UBA7-UBE2L6-ISG15 thioester transfer reaction. Despite adding more ISG15 and UBE2L6, the level of UBE2L6~ISG15 remained the same." This experiment should be shown or the statement removed.

      (5) Similarly, "Forty human E2 enzymes are classified in the InterProdatabase (https://www.ebi.ac.uk/interpro/), with the majority interacting with UBA1, whereas UBE2L6 and UBE2Z exclusively interact with UBA7 and UBA6, respectively." Is missing a reference.

    3. Reviewer #2 (Public review):

      Summary:

      Chen et al. describe by different techniques that UBA7 and UBE2L6 readily form a complex that is covalently linked by a disulfide bond involving the active site cysteines of UBA6 and UBE2L6. Furthermore, they determined cryo-EM structures of the disulfide-linked UBY7-UBE2L6 complex in the absence and presence of ISG15. They propose that this disulfide-linked complex blocks ISGylation by temporarily rendering UBA7 inactive.

      Strengths:

      The authors employ a wide variety of techniques to study the formation of the binary Uba7-UBE2L6 and ternary UBA7-UBE2L6-ISG15 complexes including the structural characterization of the two complexes by cryo-EM. Despite the shortcomings (see below), the authors provide numerous valuable data that characterize the first steps of the ISGylation pathway, namely the activation of ISG15 and its transfer to UBE2L6.

      Weaknesses:

      (1) The authors correctly state that "Immune responses often entail the generation or reactive oxygen species, antioxidant defense mechanisms, and redox signaling" (1st sentence of 3rd paragraph in the Introduction). Based on the data presented in this study these cellular responses should lead to the formation of the covalent UBA7-UBE2L6. Since this complex renders UBA7 inactive, thus preventing it from initiating the ISGylation cascade in response to viral infections, the underlying cellular logic of complex formation remains a mystery.

      The bulk of their work describes in vitro experiments, which will certainly not reflect the in vivo situation and hence one cannot rule out that this complex will not form inside cells. The authors have also observed this complex in HEK293T cells, however, this involved overexpression of both proteins and one can thus not rule out that the disulfide-linked complex will not form at physiological protein levels. Furthermore, this cellular model appears not to be a suitable system.

      (2) The authors carried out a comparative analysis of E1-E2 disulfide bond formation with UBA1, the major activating enzyme for ubiquitin, and UBE2L3, a ubiquitin-specific E2. The choice of UBE2L3 was motivated by its close relationship to UBE2L6. From these studies, the authors conclude that UBA1 does not form the corresponding complex. Given that there are over 30 ubiquitin-specific E2s this conclusion does not rest on a very solid basis, since, as demonstrated for example in this study (PMID: 22949505), at least yeast Uba1 forms a disulfide-linked complex with Cdc34. Another study documenting the formation of a disulfide-linked complex between Uba1 and an E2 enzyme, in this case, Rad6, (PMID: 35613580) is even cited by the authors. If the authors want to make the argument that Uba1 does not form corresponding E1-E2 complexes, they need to repeat their experiments with a representative panel of human E2 enzymes and the two enzymes employed in the aforementioned studies (Cdc34 and Rad6) or, more precisely, their human counterparts represent obvious starting points. Depending on the outcome of these studies the experiments with the CCL mutants need to be revisited.

    4. Reviewer #3 (Public review):

      Summary:

      In this manuscript, "Elucidating the mechanism underlying UBA7-UBE2L6 disulfide complex formation", Chen et al. describe the mechanism of spontaneous disulfide bond formation between the active site cysteines of UBA7 and UBE2L6. Employing state-of-the-art biochemistry, cryo-EM, and HDX mass spec approaches, the authors provide insights into how this mechanism occurs in UBA7/UBE2L6 but not in related ubiquitin enzymes. A central conclusion of the study is that the length of the catalytic cysteine loop (CCL) in UBA7 is insufficient to block access to the E1's catalytic cysteine, thereby facilitating UBE2L6 disulfide formation. In contrast, the CCL of UBA1 is sufficiently long and shields its catalytic cysteine, preventing access to the Ub E2 enzymes. In addition to the CCL, the authors also show that UBA7's specificity and strong binding affinity for UBE2L6 help promote this disulfide-linked E1-E2 complex.

      Strengths:

      The data within in manuscript is interesting and significantly contributes to our understanding of the mechanisms of the ISG15 conjugation pathway. Moreover, the biochemical and structural experiments were performed at an exceptionally high level and the data throughout the manuscript is convincing.

      Weaknesses:

      It is not clear whether this regulatory mechanism occurs in a biological context (e.g., during IFN signaling or oxidative stress). However, this weakness is somewhat offset by the last experiment of the manuscript which demonstrates the existence of UBA7-UBE2L6 disulfide complex formation in cells under overexpression conditions. If the authors could expand upon this finding, as outlined below it would further improve their study.

    1. eLife Assessment

      In this important quantitative study of HIV-1 evolution in humans and rhesus macaques, selection coefficients are inferred at scale over the HIV genome. Selection coefficients are similar in humans and macaques, providing convincing evidence that these coefficients are representative of the fitness landscapes of these viruses within hosts. This work should be of interest to the community working on quantitative evolution and fitness landscape inference, and the finding that rapid fitness gains in the HIV population predict bNAb emergence has implications for HIV vaccine design.

    2. Reviewer #1 (Public review):

      Summary:

      The present work studies the coevolution of HIV-1 and the immune response in clinical patient data. Using the Marginal Path Likelihood (MPL) framework, they infer selection coefficients for HIV mutations from time-series data of virus sequences as they evolve in a given patient.

      Strengths:

      The authors analyze data from two human patients, consisting of HIV population sequence samples at various points in time during the infection. They infer selection coefficients from the observed changes in sequence abundance using MPL. Most beneficial mutations appear in viral envelop proteins. The authors also analyze SHIV samples in rhesus macaques, and find selection coefficients that are compatible with those found in the corresponding human samples.

      The manuscript is well-written and organized.

      Weaknesses:

      The MPL method used by the authors considers only additive effects of mutations, thus ignoring epistasis.

      Although the evolution of broadly neutralizing antibodies (bnAbs) is a motivating question in the introduction and discussion sections (and the title), the relevance of the analysis and results to better understanding how bnAbs arise is not clear. The only result presented in direct connection to bnAbs is Figure 6.

      Questions or suggestions for further discussion:

      I list here a number of points for which I believe the paper would benefit if additional discussion/results were included.

      The MPL method used by the authors considers only additive effects of mutations, thus ignoring epistasis. In Sohail et al (2022) MBE 39(10), p. msac199 (https://doi.org/10.1093/molbev/msac199) an extension of MPL is developed allowing one to infer epistasis. Can the authors comment on why this was not attempted here?

      I presume one possible reason is that epistasis inference requires considerably more computational effort (and more data). However, since the authors find most beneficial mutations occurring in Env, perhaps restricting the analysis to Env genes only (e.g. the trimer shown in Figure 2) can lead to tractable inference of epistasis within this segment (instead of the full genome).

      Do the authors find correlations in the inferred selection coefficients of the two samples CH505 and CH848? I could not find any discussion of this in the manuscript. Only correlations between Humans and RM are discussed.

    3. Reviewer #2 (Public review):

      Summary

      This paper combines a biological topic of interest with the demonstration of important theoretical/methodological advances. Fitness inference is the foundation of the quantitative analysis of adapting systems. It is a hard and important problem and this paper highlights a compelling approach (MPL) first presented in (1) and refined in (2), roughly summarized in equation 12.

      (1) Sohail, M. S., Louie, R. H., McKay, M. R. & Barton, J. P. Mpl resolves genetic linkage in fitness inference from complex evolutionary histories. Nature biotechnology 39, 472-479 (2021).<br /> (2) Shimagaki, K. & Barton, J. P. Bézier interpolation improves the inference of dynamical models from data. Physical Review E 107, 024116 (2023).

      The authors find that positive selection shapes the variable regions of env in shared patterns across two patient donors. The patterns of positive selection are interesting in and of themselves, they confirm the intuition that hyper-variation in env is the result of immune evasion rather than a broadly neutral landscape (flatness). They show that the immune evasion patterns due to CD8 T and naive B-cell selection are shared across patients. Furthermore, they suggest that a particular evolutionary history (larger flux to high fitness states) is associated with bNAb emergence. Mimicking this evolutionary pattern in vaccine design may help us elicit bNAbs in patients in the future.

      There is a lot of information to be found in the full fitness landscape of env. The enormous strength of reversion-to-consensus in the patterns is a known pattern of HIV post-infection populations but they are nicely quantified here. Agreement between SHIV and HIV evolution is shown. They find selection is larger for autologous antibodies than the bNAbs themselves (perhaps bNAbs are just too small a component of the host response to drive the bulk of selection?), and that big fitness increases precede antibody breadth in rhesus macaques, suggesting that this fitness increase is the immune challenge required to draw forth a bNAb. This is all of high interest to HIV researchers.

      Strength of evidence

      One limitation is, of course, that the fitness model is constant in time when the immune challenge is variable and changing. This simplification may complicate some interpretations.

      Equation 12 in the methods is really a beautiful tool because it is so simple, but accounts for linkage and can be solved precisely even in the presence of detailed mutational and selection models. However, the reliance on incomplete observations of the frequency leads to complications that must be carefully (re)addressed here.

      For instance, the consistent finding of strong selection in hypervariable regions is biologically intuitive but so striking, that I worry that it might be the result of a bias for selection in high entropy regions. Mutational and covariance terms in equation 12 might be underestimated, due to finite sampling effect in highly diverse populations. Sampling effects lead to zeros in x(t) when actual frequency zeros might be rare at the population sizes of HIV viral loads and mutation rates. Both mutational flux and C underestimation will bias selection upward in eq. 12. The prior papers (1) and (2) seem to show robustness to finite sampling effects, but, again, more care needs to be shown that this robustness transfers to the amino acid inference under these conditions. That synonymous sites are rarely selected for in the nucleotide level is a good sign, and it may be a matter of simply fully explaining the amino-acid level model.

      Uncertainty propagates to the later parts of the paper, eg. HIV and SIV shared patterns might be the result of shared biases in the method application. However, this worry does not extend to the apples-to-apples comparison of fitness trajectories across individuals (Figures 5 and 6) which I think are robust (for these sample sizes). The timing evidence is slightly weakened by the fact that bNAb detection is different from bNAb presence and the possibility that fitness increases occurred after the bNAbs appeared remains. Still, their conclusion is plausible and fits in with the other observations which form a coherent and compelling picture.

      Overall this is a convincing paper, part of a larger admirable project of accurately inferring complete fitness landscapes.

    4. Reviewer #3 (Public review):

      Summary:

      Shimagaki et al. investigate the virus-antibody coevolutionary processes that drive the development of broadly neutralizing antibodies (bnAbs). The study's primary goal is to characterize the evolutionary dynamics of HIV-1 within hosts that accompany the emergence of bnAbs, with a particular focus on inferring the landscape of selective pressures shaping viral evolution. To assess the generality of these evolutionary patterns, the study extends its analysis to rhesus macaques (RMs) infected with simian-human immunodeficiency viruses (SHIV) incorporating HIV-1 Env proteins derived from two human individuals.

      Strengths:

      A key strength of the study is its rigorous assessment of the similarity in evolutionary trajectories between humans and macaques. This cross-species comparison is particularly compelling, as it quantitatively establishes a shared pattern of viral evolution using a sophisticated inference method. The finding that similar selective pressures operate in both species adds robustness to the study's conclusions and suggests broader biological relevance.

      Weaknesses:

      However, the study has some limitations. The most significant weakness is that the authors do not sufficiently discuss the implications of the observed similarities. While the identification of shared evolutionary patterns (e.g., Figure 5) is intriguing, the study would benefit from a more explicit discussion of what these findings mean for instance, in the context of HIV vaccine design, immunotherapy, or fundamental viral-host interactions. Even speculative interpretations could provide valuable insights into the broader significance of these results.

      A secondary, albeit less critical, limitation is the placement of methodological details in the Supplementary Information. While it is understandable that the authors focus on results in the main text - especially since the methodology is not novel and has been previously described in earlier publications - some readers might benefit from a more thorough presentation of the method within the main paper.

      Conclusions:

      Overall, the study presents a compelling analysis of HIV-1 evolution and its parallels in SHIV-infected macaques. While the quantitative comparison between species is a notable contribution, a deeper discussion of its broader implications would strengthen the paper's impact.

    1. eLife Assessment

      The authors attempt to identify which patients with benign lesions will progress to cancer using a liquid biomarker. Although the study is valuable, the evidence provided for the liquid biopsy EV miRNA signature developed based on radiomics features is incomplete. This is because the data are derived from discrepant sample sets and the description of the clinical characteristics of the samples enrolled in the study needs to be improved.

    2. Reviewer #1 (Public review):

      Summary:

      The manuscript by Shi et al, has utilized multiple imaging datasets and one set of samples for analyzing serum EV-miRNAs & EV-RNAs to develop an EV miRNA signature associated with disease-relevant radiomics features for early diagnosis of pancreatic cancer. CT imaging features (in two datasets (UMMD & JHC and WUH) were derived from pancreatic benign disease patients vs pancreatic cancer cases), while circulating EV miRNAs were profiled from samples obtained from a different center (DUH). The EV RNA signature from external public datasets (GSE106817, GSE109319, GSE113486, GSE112264) were analyzed for differences in healthy controls vs pancreatic cancer cases. The miRNAs were also analyzed in the TCGA tissue miRNA data from normal adjacent tissue vs pancreatic cancer.

      Strengths:

      The concept of developing EV miRNA signatures associated with disease relevant radiomics features is a strength.

      Weaknesses:

      While the overall concept of developing EV miRNA signature associated with radiomics features is interesting, the findings reported are not convincing for the reasons outlined below:

      (1) Discrepant datasets for analyzing radiomic features with EV-miRNAs: It is not justified how CT images (UMMD & JHC and WUH) and EV-miRNAs (DUH) on different subjects and centers/cohorts shown in Figures 1 &2 were analyzed for association. It is stated that the samples were matched according to age but there is no information provided for the stages of pancreatic cancer and the kind of benign lesions analyzed in each instance.

      (2) The study is focused on low-abundance miRNAs with no adequate explanation of the selection criteria for the miRNAs analyzed.

      (3) While EV-miRNAs were profiled or sequenced (not well described in the Methods section) with two different EV isolation methods, the authors used four public datasets of serum circulating miRNAs to validate the findings. It would be better to show the expression of the three miRNAs in the additional dataset(s) of EV-miRNAs and compare the expressions of the three EV-miRNAs in pancreatic cancer with healthy and benign disease controls.

      (4) It is not clear how the 12 EV-miRNAs in Figure 4C were identified.

      (5) Box plots in Figures 4D-F and G-I of three miRNAs in serum and tissue should show all quantitative data points.

      (6) What is the GBM model in Figure 5?

      (7) What are the AUCs of individual EV-miRNAs integrated as a panel of three EV-miRNAs?

      (8) The authors could have compared the performance of CA19-9 with that of the three EV-miRNAs.

      (9) How was the diagnostic performance of the three EV-miRNAs in the two molecular subtypes identified in Figure 6&7? Do the C1 & C2 clusters correlate with the classical/basal subtypes, staging, and imaging features?

    3. Reviewer #2 (Public review):

      Summary:

      This study investigates a low abundance microRNA signature in extracellular vesicles to subtype pancreatic cancer and for early diagnosis. There are several major questions that need to be addressed. Numerous minor issues are also present.

      Strengths:

      The authors did a comprehensive job with numerous analyses of moderately sized cohorts to describe the clinical and translational significance of their miRNA signature.

      Weaknesses:

      There are multiple weaknesses of this study that should be addressed:

      (1) The description of the datasets in the Materials and Methods lacks details. What were the benign lesions from the various hospital datasets? What were the healthy controls from the public datasets? No pancreatic lesions? No pancreatic cancer? Any cancer history or other comorbid conditions? Please define these better.

      (2) It is unclear how many of the controls and cases had both imaging for radiomics and blood for biomarkers.

      (3) The authors should define the imaging methods and protocols used in more detail. For the CT scans, what slice thickness? Was a pancreatic protocol used? What phase of contrast is used (arterial, portal venous, non-contrast)? Any normalization or pre-processing?

      (4) Who performed the segmentation of the lesions? An experienced pancreatic radiologist? A student? How did the investigators ensure that the definition of the lesions was performed correctly? Raidomics features are often sensitive to the segmentation definitions.

      (5) Figure 1 is full of vague images that do not convey the study design well. Numbers from each of the datasets, a summary of what data was used for training and for validation, definitions of all of the abbreviations, references to the Roman numerals embedded within the figure, and better labeling of the various embedded graphs are needed. It is not clear whether the graphs are real results or just artwork to convey a concept. I suspect that they are just artwork, but this remains unclear.

      (6) The DF selection process lacks important details. Please reference your methods with the Boruta and Lasso models. Please explain what machine learning algorithms were used. There is a reference in the "Feature selection.." section of "the model formula listed below" but I do not see a model formula below this paragraph.

      (7) In Figure 2, more quantitative details are needed. How are patients dichotomized into non-obese and obese? What does alcohol/smoking mean? Is it simply no to both versus one or the other as yes? These two risk factors should be separated and pack years of smoking should be reported. The details of alcohol use should also be provided. Is it an alcohol abuse history? Any alcohol use, including social drinking? Similarly, "diabetes" needs to be better explained. Type I, type II, type 3c? P values should be shown to demonstrate any statistically significant differences in the proportions of the patients from one dataset to another.

      (8) In the section "Different expression radiomic features between pancreatic benign lesions and aggressive tumors", there is a reference to "MUJH" for the first time. What is this? There is also the first reference to "aggressive tumors" in the section. Do the authors just mean the cases? Otherwise there is no clear definition of "aggressive" (vs. indolent) pancreatic cancer. This terminology of tumor "aggressiveness" either needs to be removed or better defined.

      (9) Figure 3 needs to have the specific radiomic features defined and how these features were calculated. Labeling them as just f1, f2, etc is not sufficient for another group to replicate the results independently.

      (10) It is not clear what Figure 4A illustrates as regards model performance. What do the different colors represent, and what are the models used here? This is very confusing.

      (11) Figure 5 shows results for many more model runs than the described 10, please explain what you are trying to convey with each row. What are "Test A" and "Test B"? There is no description in the manuscript of what these represent. In the figure caption, there is a reference to "our center data" which is not clear. Be more specific about what that data is.

      (12) Figure 6 describes the subtypes identified in this study, but the authors do not show a multi-variable cox proportional hazards model to show that this subtype classification independently predicts DFS and OS when incorporating confounding variables. This is essential to show the subtypes are clinically relevant. In particular, the authors need to account for the stage of the patients, and receipt of chemotherapy, surgery, and radiation. If surgery was done, we need to know whether they had R1 or R0 resection. The details about the years in which patients were included is also important.

      (13) How do these subtypes compare to other published subtypes?

    4. Reviewer #3 (Public review):

      Summary:

      The authors appear to be attempting to identify which patients with benign lesions will progress to cancer using a liquid biomarker. They used radiomics and EV miRNAs in order to assess this.

      Strengths:

      It is a strength that there are multiple test datasets. Data is batch-corrected. A relatively large number of patients is included. Only 3 miRNAs are needed to obtain their sensitivity and specificity scores.

      Weaknesses:

      This manuscript is not clearly written, making interpretation of the quality and rigor of the data very difficult. There is no indication from the methods that the patients in their cohorts who are pancreatic cancer patients (from the CT images) had prior benign lesions, limiting the power of their analysis. The data regarding the cluster subtypes is very confusing. There is no discussion or comparison if these two clusters are just representing classical and basal subtypes (which have been well described).

    1. Author response:

      Public Reviews:

      Reviewer #1 (Public review):

      Summary:

      Dad et al. explored the roles of cytosolic carboxypeptidase 5(CCP5)in the development of ependymal multicilia in the brain. CCP family are erasers of polyglutamylation of ciliary-axoneme microtubules. The authors generated a new mutant mouse of Agbl5 gene, which encodes CCP5, with deletion of its N-terminus and partial carboxypeptidase (CP) domain (named AGBL5M1/M1).

      Strengths:

      The mutant mice revealed lethal hydrocephalus due to degeneration of ependymal multicilia. Interestingly, this is in contrast with the phenotype of Agbl5 mutants with disruption solely in the CP domain of CCP5 (named AGBL5M2/M2) that did not develop hydrocephalus despite increased glutamylation levels in ependymal cilia as observed for AGBL5M1/M1 mutants. The study has been well-performed and the findings suggest a unique function of the N-domain of CCP5 in ependymal multicilia stability.

      Weaknesses:

      The content of this article is relatively descriptive and lacks molecular insights.

      We thank the Reviewer’s positive comments. To address the molecular insights of the dysregulated planar cell polarity (PCP) in Agbl5<sup>M1/M1</sup> ependyma, we are planning to further assess the microtubule polarization and the expression/localization of PCP core proteins in ependymal cells. We also plan to quantify the intensity of actin networks around BB patches to better understand to which extent it is affected in the ependyma of the mutants and contributes to the impaired stability of BBs (Please see below).

      We will also assess whether Agbl5 commonly functions in multiciliated cells of other organs.

      Reviewer #2 (Public review):

      Summary:

      This study analyzed the consequences of Agbl5 mutation on ependymal cell development and function. The authors first characterize their mutant mouse line reporting a reduced lifespand and severe hydrocephalus. Next, they report a defect in ependymal cell cilia number and motility. They provide evidence for impaired basal body organisation and cilia glutamylation.

      Strengths:

      Description of a mutant mouse which implicates Cytosolic Carboxypeptidase 5 (the product of Agbl5 gene) for proper ependymal cells.

      Weaknesses:

      Description of phenotype is incomplete:

      We thank the Reviewer’s constructive comments. We agree that more quantitative analysis of the phenotypes in Agbl5<sup>M1/M1</sup> will strengthen this study.

      - Figure 3G - the sequence from the movie is not really informative. Providing beating frequencies as quantification of the data would be more informative.

      We agree that quantification of the cilia beating frequencies and directions in these experiments will be more informative.

      - Figure 3 - the quantification of actin network would strengthen the message.

      We agree with the Reviewers. We will quantify the total intensity of actin around BB patch and the total intensity of actin per BB to determine to which extent the actin networks are affected in Agbl5<sup>M1/M1</sup> ependymal cells.

      - Lines 219 -220 - the authors conclude “Taken together, in Agbl5<sup>M1/M1</sup> ependymal cells, the expression of genes promoting multiciliogenesis were not impaired but certain proteins associated with differentiated ependymal cells are not properly expressed”. However, they do not assess gene but protein expression (IF). In addition, their quantification shows differences in the number of FoxJ1 positive cells which indeed is an impaired expression.

      We will clarify this statement.

      - Microtubules are involved in the local organization of ciliary basal bodies (see Werner et al., Vladar et al.,2011; Boutin et al., 2014). It would be interesting for the authors to check whether the subapical network of microtubules is glutamylated or not during ependymal cell differentiation and how this network is affected in their mutants.

      We thank the Reviewer’s suggestion. We agree this is an interesting point to look at. We will assess the glutamylation status of the subapical microtubule networks in differentiating ependymal cells and whether they are affected in the mutants.

      - Showing the data mentioned in the discussion on Cep110 would be a nice addition to the paper.

      These results will be provided.

      - Line 354: "The latter serves as a component of tissue polarity that is required for asymmetric PCP protein localization in each cell (Boutin et al., 2014; Vladar et al., 2012)." The cited reference did not demonstrate that this microtubule network is required for asymmetric PCP localization.

      We thank the Reviewer for critical reading. We will correct the citation.

      Reviewer #3 (Public review):

      Summary:

      The authors developed a new Agbl5 KO allele, extending the deletion to the N-terminus of CCP5 to explore its function in mouse ependymal cells.

      Strengths:

      They show that the KO mice exhibit severe hydrocephalus due to disorganized and mislocated basal bodies. Additionally, they present evidence of both impaired beating coordination and a reduction in ciliary beating.

      Weaknesses:

      The manuscript is well-written but lacks specific interpretations of the results presented. Further experiments are needed to be fully convincing.

      We thank the Reviewer’s comments. We plan to conduct the following experiments to strengthen this study.

      (1) Quantify the intensity of actin staining around BB patches and its intensity relative to the number of BBs to assess to which extent the actin networks in Agbl5<sup>M1/M1</sup> ependymal cells are affected (please refer to the above response to the comments of Reviewer 2#).

      (2) Co-stain tdTomato with cell specific markers to strengthen the spatial expression of tdTomato.

      (3) Seek proper antibodies to determine the correlation between signals of GT335 and Ac-Tub in ependymal multicilia of Agbl5<sup>M1/M1</sup> mice.

      (4) Quantitatively compare the size of ependymal cells in the wild-type and Agbl5<sup>M1/M1</sup> mice to address whether there is a consequence of possible dysfunction of primary cilia in the precursors of ependymal cells in the mutants. If so, we will further analyze how the primary cilia in the precursors of ependymal cells are affected in the mutants.

      (5) Address whether the rotational polarity is affected in the Agbl5<sup>M1/M1</sup> mutant mice.

    2. eLife Assessment

      This is a valuable study that explores the function of CCP5 in mouse ependymal cells. The methods, data, and analyses broadly support the claims. However, the study is incomplete as it stands. Minor weaknesses remain and the authors may wish to address them.

    3. Reviewer #1 (Public review):

      Summary:

      Dad et al. explored the roles of cytosolic carboxypeptidase 5(CCP5)in the development of ependymal multicilia in the brain. CCP family are erasers of polyglutamylation of ciliary-axoneme microtubules. The authors generated a new mutant mouse of Agbl5 gene, which encodes CCP5, with deletion of its N-terminus and partial carboxypeptidase (CP) domain (named AGBL5M1/M1).

      Strengths:

      The mutant mice revealed lethal hydrocephalus due to degeneration of ependymal multicilia. Interestingly, this is in contrast with the phenotype of Agbl5 mutants with disruption solely in the CP domain of CCP5 (named AGBL5M2/M2) that did not develop hydrocephalus despite increased glutamylation levels in ependymal cilia as observed for AGBL5M1/M1 mutants. The study has been well-performed and the findings suggest a unique function of the N-domain of CCP5 in ependymal multicilia stability.

      Weaknesses:

      The content of this article is relatively descriptive and lacks molecular insights.

    4. Reviewer #2 (Public review):

      Summary:

      This study analyzed the consequences of Agbl5 mutation on ependymal cell development and function. The authors first characterize their mutant mouse line reporting a reduced lifespand and severe hydrocephalus. Next, they report a defect in ependymal cell cilia number and motility. They provide evidence for impaired basal body organisation and cilia glutamylation.

      Strengths:

      Description of a mutant mouse which implicates Cytosolic Carboxypeptidase 5 (the product of Agbl5 gene) for proper ependymal cells.

      Weaknesses:

      Description of phenotype is incomplete:

      - Figure 3G - the sequence from the movie is not really informative. Providing beating frequencies as quantification of the data would be more informative.

      - Figure 3 - the quantification of actin network would strengthen the message.

      - Lines 219 -220 - the authors conclude «Taken together, in Agbl5M1/M1 ependymal cells, the expression of genes promoting multiciliogenesis were not impaired but certain proteins associated with differentiated ependymal cells are not properly expressed». However, they do not assess gene but protein expression (IF). In addition, their quantification shows differences in the number of FoxJ1 positive cells which indeed is an impaired expression.

      - Microtubules are involved in the local organization of ciliary basal bodies (see Werner et al., Vladar et al.,2011; Boutin et al., 2014). It would be interesting for the authors to check whether the subapical network of microtubules is glutamylated or not during ependymal cell differentiation and how this network is affected in their mutants.

      - Showing the data mentioned in the discussion on Cep110 would be a nice addition to the paper.

      - Line 354: "The latter serves as a component of tissue polarity that is required for asymmetric PCP protein localization in each cell (Boutin et al., 2014; Vladar et al., 2012)." The cited reference did not demonstrate that this microtubule network is required for asymmetric PCP localization.

    5. Reviewer #3 (Public review):

      Summary:

      The authors developed a new Agbl5 KO allele, extending the deletion to the N-terminus of CCP5 to explore its function in mouse ependymal cells.

      Strengths:

      They show that the KO mice exhibit severe hydrocephalus due to disorganized and mislocated basal bodies. Additionally, they present evidence of both impaired beating coordination and a reduction in ciliary beating.

      Weaknesses:

      The manuscript is well-written but lacks specific interpretations of the results presented. Further experiments are needed to be fully convincing.

    1. eLife Assessment

      This study provides an important first look at the influence of vertical transmission in the establishment of the amphibian microbiome, with a specific focus on the potential role of parental care. Through a combination of cross-fostering experimental work, comparative analysis across species that show variable levels of care, and developmental time series, the authors provide convincing evidence that vertical transmission through care is possible, but incomplete evidence that it plays a significant role in shaping frog skin microbiomes in nature or across time. This work will be of interest to researchers studying the evolution of parental care and microbiomes in vertebrates.

    2. Reviewer #1 (Public review):

      Summary:

      This manuscript describes a series of lab and field experiments to understand the role of tadpole transport in shaping the microbiome of poison frogs in early life. The authors conducted a cross-foster experiment in which R. variabilis tadpoles were carried by adults of their own species, carried by adults of another frog species, or not carried at all. After being carried for 6 hours, tadpole microbiomes resembled those of their caregiving species. Next, the authors reported higher microbiome diversity in tadpoles of two species that engage in transport-based parental care compared to one species that does not. Finally, they collected tadpoles either from the backs of an adult (i.e., they had recently been transported) or from eggs (i.e., not transported) but did not find significant overlap in microbiome composition between transported tadpoles and their parents.

      Strengths:

      The cross-foster experiment and the field experiment that reared transported and non-transported tadpoles are creative ways to address an important question in animal microbiome research. Together, they imply a small role for parental care in the development of the tadpole microbiome. The manuscript is generally well-written and easy to understand.

      Weaknesses:

      (1) Developmental time series:

      It was not entirely clear how this experiment relates to the rest of the manuscript, as it does not compare any effects of transport within or across species.

      (2) Cross-foster experiment:

      The "heterospecific transport" tadpoles were manually brushed onto the back of the surrogate frog, while the "biological transport" tadpoles were picked up naturally by the parent. It is a little challenging to interpret the effect of caregiver species since it is conflated with the method of attachment to the parent. I noticed that the uptake of Os-associated microbes by Os-transported tadpoles seemed to be higher than the uptake of Rv-associated microbes by Rv-associated tadpoles (comparing the second box from the left to the rightmost boxplot in panel S2C). Perhaps this could be a technical artifact if manual attachment to Os frogs was more efficient than natural attachment to Rv frogs.

      I was also surprised to see so much of the tadpole microbiome attributed to Os in tadpoles that were not transported by Os frogs (25-50% in many cases). It suggests that SourceTracker may not be effectively classifying the taxa.

      (3) Cross-species analysis:

      Like the developmental time series, this analysis doesn't really address the central question of the manuscript. I don't think it is fair for the authors to attribute the difference in diversity to parental care behavior, since the comparison only includes n=2 transporting species and n=1 non-transporting species that differ in many other ways. I would also add that increased diversity is not necessarily an expectation of vertical transmission. The similarity between adults and tadpoles is likely a more relevant outcome for vertical transmission, but the authors did not find any evidence that tadpole-adult similarity was any higher in species with tadpole transport. In fact, tadpoles and adults were more similar in the non-transporting species than in one of the transporting species (lines 296-298), which seems to directly contradict the authors' hypothesis. I don't see this result explained or addressed in the Discussion.

      (4) Field experiment:

      The rationale and interpretation of the genus-level network are not clear, and the figure is not legible. What does it mean to "visualize the microbial interconnectedness" or to be a "central part of the community"? The previous sentences in this paragraph (lines 337-343) seem to imply that transfer is parent-specific, but the genus-level network is based on the current adult frogs, not the previous generation of parents that transported them. So it is not clear that the distribution or co-distribution of these taxa provides any insight into vertical transmission dynamics.

    3. Reviewer #2 (Public review):

      Summary:

      Here, Fischer et al. attempt to understand the role of parental care, specifically the transport of offspring, in the development of the amphibian microbiome. The amphibian microbiome is an important study system due to its association with host health and disease outcomes. This study provides vertical transfer of bacteria through parental transport of tadpoles as a mechanism influencing tadpole microbiome composition. This paper gives insight into the relative roles of the environment, species, and parental care in determining microbiome composition in amphibians.

      The authors determine the time of bacterial colonization during tadpole development using PCR, observing that tadpoles were not colonized by bacteria prior to hatching from the vitelline membrane. By doing this, the impact of transport can be more accurately assessed in their laboratory experiments. The authors found that caregiver species influenced community composition, with transported tadpoles sharing a greater proportion of their skin communities with the transporting species.

      In a comparison of three sympatric amphibian species that vary in their reproductive strategies, the authors found that tadpole community diversity was not reflective of habitat diversity, but may be associated with the different reproductive strategies of each species. Parental care explained some of the variance of tadpole microbiomes between species, however, transportation by conspecific adults did not lead to more similar microbiomes between tadpoles and adults compared to species that do not exhibit parental transport.

      I did not find any major weaknesses in my review of this paper. The work here could potentially benefit from absolute abundance levels for shared ASVs between adults and tadpoles to more thoroughly understand the influences of vertical transmission that might be masked by relative abundance counts. This would only be a minor improvement as I think the conclusions from this work would likely remain the same, however.

    4. Author response:

      To address Reviewer 1’s concerns, we will implement the following changes:

      Comment 1: We will clarify that, even without direct comparisons within or across species, whether vertically transmitted microbes act as pioneering colonizers or integrate into an existing community is an important factor influencing their effect on community composition.

      Comment 2: We will provide additional details on the biology of the surrogate frog Oophaga sylvatica, explain how tadpole manipulation might influence adhesion to the caregiver, and acknowledge that the lack of knowledge on the physiological mechanisms underlying tadpole attachment currently limits our discussion to speculation.

      We will further clarify in the “Methods” section that SourceTracker’s ability to accurately estimate source proportions was assessed by evaluating how well it assigned training samples to their correct source environments. We will provide the predictions for the training set and describe how they informed our data preprocessing and analysis approach.

      Comment 3: While we predicted that community distances between tadpoles and adults would be smaller in species with parental transport, we explicitly state that our results did not confirm this expectation. We thus see no contradiction in our discussion but will ensure that this point is more clearly communicated. In response to the reviewer’s suggestion, we will incorporate additional literature on how tadpoles’ skin microbial communities change over time and adapt to their environment. We will also expand on how the life history of L. longirostris—specifically, the frequent presence of adults in tadpole habitats—may facilitate horizontal microbiota transmission, potentially contributing to shorter community distances.

      Comment 4: We will remove the network visualization to prevent any misinterpretation.

      Additionally, following Reviewer 2’s suggestion, we will include data on the absolute abundance of ASVs shared between parent and offspring after one month of development to further support the manuscript.

    1. eLife Assessment

      The valuable findings in this study reveal an intricate pattern of memory expression following retrieval extinction at different intervals from retrieval-extinction to test. They document that immediately after extinction there is a nonselective impairment in memory, which leads to no impairment at a 6-hour interval. At a 24-hour interval, there is a selective impairment. The evidence supporting the claims is incomplete and there are inconsistencies in the analyses reported that obscure the interpretation.

    2. Reviewer #1 (Public review):

      Summary:

      The novel advance by Wang et al is in the demonstration that, relative to a standard extinction procedure, the retrieval-extinction procedure more effectively suppresses responses to a conditioned threat stimulus when testing occurs just minutes after extinction. The authors provide some solid evidence to show that this "short-term" suppression of responding involves engagement of the dorsolateral prefrontal cortex.

      Strengths:

      Overall, the study is well-designed and the results are potentially interesting. There are, however, a few issues in the way that it is introduced and discussed. Some of the issues concern clarity of expression/communication. However, others relate to a theory that could be used to help the reader understand why the results should have come out the way that they did. More specific comments and questions are presented below.

      Weaknesses:

      INTRODUCTION & THEORY

      (1) It is difficult to appreciate why the first trial of extinction in a standard protocol does NOT produce the retrieval-extinction effect. This applies to the present study as well as others that have purported to show a retrieval-extinction effect. The importance of this point comes through at several places in the paper. E.g., the two groups in Study 1 experienced a different interval between the first and second CS extinction trials; and the results varied with this interval: a longer interval (10 min) ultimately resulted in less reinstatement of fear than a shorter interval. Even if the different pattern of results in these two groups was shown/known to imply two different processes, there is nothing in the present study that addresses what those processes might be. That is, while the authors talk about mechanisms of memory updating, there is little in the present study that permits any clear statement about mechanisms of memory. The references to a "short-term memory update" process do not help the reader to understand what is happening in the protocol.

      In reply to this point, the authors cite evidence to suggest that "an isolated presentation of the CS+ seems to be important in preventing the return of fear expression." They then note the following: "It has also been suggested that only when the old memory and new experience (through extinction) can be inferred to have been generated from the same underlying latent cause, the old memory can be successfully modified(Gershman et al., 2017). On the other hand, if the new experiences are believed to be generated by a different latent cause, then the old memory is less likely to be subject to modification. Therefore, the way the 1stand 2ndCS are temporally organized (retrieval-extinction or standard extinction) might affect how the latent cause is inferred and lead to different levels of fear expression from a theoretical perspective." This merely begs the question: why might an isolated presentation of the CS+ result in the subsequent extinction experiences being allocated to the same memory state as the initial conditioning experiences? This is not yet addressed in any way.

      (2) The discussion of memory suppression is potentially interesting but, in its present form, raises more questions than it answers. That is, memory suppression is invoked to explain a particular pattern of results but I, as the reader, have no sense of why a fear memory would be better suppressed shortly after the retrieval-extinction protocol compared to the standard extinction protocol; and why this suppression is NOT specific to the cue that had been subjected to the retrieval-extinction protocol.

      (3) Relatedly, how does the retrieval-induced forgetting (which is referred to at various points throughout the paper) relate to the retrieval-extinction effect? The appeal to retrieval-induced forgetting as an apparent justification for aspects of the present study reinforces points 2 and 3 above. It is not uninteresting but lacks clarification/elaboration and, therefore, its relevance appears superficial at best.

      (4) I am glad that the authors have acknowledged the papers by Chalkia, van Oudenhove & Beckers (2020) and Chalkia et al (2020), which failed to replicate the effects of retrieval-extinction reported by Schiller et al in Reference 6. The authors have inserted the following text in the revised manuscript: "It should be noted that while our long-term amnesia results were consistent with the fear memory reconsolidation literature, there were also studies that failed to observe fear prevention (Chalkia, Schroyens, et al., 2020; Chalkia, Van Oudenhove, et al., 2020; Schroyens et al., 2023). Although the memory reconsolidation framework provides a viable explanation for the long-term amnesia, more evidence is required to validate the presence of reconsolidation, especially at the neurobiological level (Elsey et al., 2018). While it is beyond the scope of the current study to discuss the discrepancies between these studies, one possibility to reconcile these results concerns the procedure for the retrieval-extinction training. It has been shown that the eligibility for old memory to be updated is contingent on whether the old memory and new observations can be inferred to have been generated by the same latent cause (Gershman et al., 2017; Gershman and Niv, 2012). For example, prevention of the return of fear memory can be achieved through gradual extinction paradigm, which is thought to reduce the size of prediction errors to inhibit the formation of new latent causes (Gershman, Jones, et al., 2013). Therefore, the effectiveness of the retrieval-extinction paradigm might depend on the reliability of such paradigm in inferring the same underlying latent cause." Firstly, if it is beyond the scope of the present study to discuss the discrepancies between the present and past results, it is surely beyond the scope of the study to make any sort of reference to clinical implications!!! Secondly, it is perfectly fine to state that "the effectiveness of the retrieval-extinction paradigm might depend on the reliability of such paradigm in inferring the same underlying latent cause..." This is not uninteresting, but it also isn't saying much. Minimally, I would expect some statement about factors that are likely to determine whether one is or isn't likely to see a retrieval-extinction effect, grounded in terms of this theory.

      CLARIFICATIONS, ELABORATIONS, EDITS

      (5) Some parts of the paper are not easy to follow. Here are a few examples (though there are others):

      (a) In the abstract, the authors ask "whether memory retrieval facilitates update mechanisms other than memory reconsolidation"... but it is never made clear how memory retrieval could or should "facilitate" a memory update mechanism.

      (b) The authors state the following: "Furthermore, memory reactivation also triggers fear memory reconsolidation and produces cue specific amnesia at a longer and separable timescale (Study 2, N = 79 adults)." Importantly, in study 2, the retrieval-extinction protocol produced a cue-specific disruption in responding when testing occurred 24 hours after the end of extinction. This result is interesting but cannot be easily inferred from the statement that begins "Furthermore..." That is, the results should be described in terms of the combined effects of retrieval and extinction, not in terms of memory reactivation alone; and the statement about memory reconsolidation is unnecessary. One can simply state that the retrieval-extinction protocol produced a cue-specific disruption in responding when testing occurred 24 hours after the end of extinction.

      (c) The authors also state that: "The temporal scale and cue-specificity results of the short-term fear amnesia are clearly dissociable from the amnesia related to memory reconsolidation, and suggest that memory retrieval and extinction training trigger distinct underlying memory update mechanisms." ***The pattern of results when testing occurred just minutes after the retrieval-extinction protocol was different to that obtained when testing occurred 24 hours after the protocol. Describing this in terms of temporal scale is unnecessary; and suggesting that memory retrieval and extinction trigger different memory update mechanisms is not obviously warranted. The results of interest are due to the combined effects of retrieval+extinction and there is no sense in which different memory update mechanisms should be identified with the different pattern of results obtained when testing occurred either 30 min or 24 hours after the retrieval-extinction protocol (at least, not the specific pattern of results obtained here).

      (d) The authors state that: "We hypothesize that the labile state triggered by the memory retrieval may facilitate different memory update mechanisms following extinction training, and these mechanisms can be further disentangled through the lens of temporal dynamics and cue-specificities." *** The first part of the sentence is confusing around usage of the term "facilitate"; and the second part of the sentence that references a "lens of temporal dynamics and cue-specificities" is mysterious. Indeed, as all rats received the same retrieval-extinction exposures in Study 2, it is not clear how or why any differences between the groups are attributed to "different memory update mechanisms following extinction".

      DATA

      (6A) The eight participants who were discontinued after Day 1 in Study 1 were all from the no reminder group. The authors should clarify how participants were allocated to the two groups in this experiment so that the reader can better understand why the distribution of non-responders was non-random (as it appears to be).

      (6B) Similarly, in study 2, of the 37 participants that were discontinued after Day 2, 19 were from Group 30 min and 5 were from Group 6 hours. The authors should comment on how likely these numbers are to have been by chance alone. I presume that they reflect something about the way that participants were allocated to groups: e.g., the different groups of participants in studies 1 and 2 could have been run at quite different times (as opposed to concurrently). If this was done, why was it done? I can't see why the study should have been conducted in this fashion - this is for myriad reasons, including the authors' concerns re SCRs and their seasonal variations.

      (6C) In study 2, why is responding to the CS- so high on the first test trial in Group 30 min? Is the change in responding to the CS- from the last extinction trial to the first test trial different across the three groups in this study? Inspection of the figure suggests that it is higher in Group 30 min relative to Groups 6 hours and 24 hours. If this is confirmed by the analysis, it has implications for the fear recovery index which is partly based on responses to the CS-. If not for differences in the CS- responses, Groups 30 min and 6 hours are otherwise identical. That is, the claim of differential recovery to the CS1 and CS2 across time may simply an artefact of the way that the recovery index was calculated. This is unfortunate but also an important feature of the data given the way in which the fear recovery index was calculated.

      (6D) The 6 hour group was clearly tested at a different time of day compared to the 30 min and 24 hour groups. This could have influenced the SCRs in this group and, thereby, contributed to the pattern of results obtained.

      (6E) The authors find different patterns of responses to CS1 and CS2 when they were tested 30 min after extinction versus 24 h after extinction. On this basis, they infer distinct memory update mechanisms. However, I still can't quite see why the different patterns of responses at these two time points after extinction need to be taken to infer different memory update mechanisms. That is, the different patterns of responses at the two time points could be indicative of the same "memory update mechanism" in the sense that the retrieval-extinction procedure induces a short-term memory suppression that serves as the basis for the longer-term memory suppression (i.e., the reconsolidation effect). My pushback on this point is based on the notion of what constitutes a memory update mechanism; and is motivated by what I take to be a rather loose use of language/terminology in the reconsolidation literature and this paper specifically (for examples, see the title of the paper and line 2 of the abstract).

    3. Reviewer #2 (Public review):

      Summary

      The study investigated whether memory retrieval followed soon by extinction training results in a short-term memory deficit when tested - with a reinstatement test that results in recovery from extinction - soon after extinction training. Experiment 1 documents this phenomenon using a between-subjects design. Experiment 2 used a within-subject control and saw that the effect is also observed in a control condition. In addition, it also revealed that if testing is conducted 6 hours after extinction, there is not effect of retrieval prior to extinction as there is recovery from extinction independently of retrieval prior to extinction. A third Group also revealed that retrieval followed by extinction attenuates reinstatement when the test is conducted 24 hours later, consistent with previous literature. Finally, Experiment 3 used continuous theta-burst stimulation of the dorsolateral prefrontal cortex and assessed whether inhibition of that region (vs a control region) reversed the short-term effect revealed in Experiments 1 and 2. The results of control groups in Experiment 3 replicated the previous findings (short-term effect), and the experimental group revealed that these can be reversed by inhibition of the dorsolateral prefrontal cortex.

      Strengths

      The work is performed using standard procedures (fear conditioning and continuous theta-burst stimulation) and there is some justification of the sample sizes. The results replicate previous findings - some of which have been difficult to replicate and this needs to be acknowledged - and suggest that the effect can also be observed in a short-term reinstatement test.

      The study establishes links between the memory reconsolidation and retrieval-induced forgetting (or memory suppression) literatures. The explanations that have been developed for these are distinct and the current results integrate these, by revealing that the DLPFC activity involved in retrieval-extinction short-term effect. There is thus some novelty in the present results, but numerous questions remain unaddressed.

      Weakness

      The fear acquisition data is converted to a differential fear SCR and this is what is analysed (early vs late). However, the figure shows the raw SCR values for CS+ and CS- and therefore it is unclear whether acquisition was successful (despite there being an "early" vs "late" effect - no descriptives are provided).

      In Experiment 1 (Test results) it is unclear whether the main conclusion stems from a comparison of the test data relative to the last extinction trial ("we defined the fear recovery index as the SCR difference between the first test trial and the last extinction trial for a specific CS") or the difference relative to the CS- ("differential fear recovery index between CS+ and CS-"). It would help the reader assess the data if Fig 1e presents all the indexes (both CS+ and CS-). In addition, there is one sentence which I could not understand "there is no statistical difference between the differential fear recovery indexes between CS+ in the reminder and no reminder groups (P=0.048)". The p value suggests that there is a difference, yet it is not clear what is being compared here. Critically, any index taken as a difference relative to the CS- can indicate recovery of fear to the CS+ or absence of discrimination relative to the CS-, so ideally the authors would want to directly compare responses to the CS+ in the reminder and no-reminder groups. In the absence of such comparison, little can be concluded, in particular if SCR CS- data is different between groups. The latter issue is particularly relevant in Experiment 2, in which the CS- seems to vary between groups during the test and this can obscure the interpretation of the result.

      In experiment 1, the findings suggest that there is a benefit of retrieval followed by extinction in a short-term reinstatement test. In Experiment 2, the same effect is observed to a cue which did not undergo retrieval before extinction (CS2+), a result that is interpreted as resulting from cue-independence, rather than a failure to replicate in a within-subjects design the observations of Experiment 1 (between-subjects). Although retrieval-induced forgetting is cue-independent (the effect on items that are suppressed [Rp-] can be observed with an independent probe), it is not clear that the current findings are similar, and thus that the strong parallels made are not warranted. Here, both cues have been extinguished and therefore been equally exposed during the critical stage.

      The findings in Experiment 2 suggest that the amnesia reported in Experiment 1 is transient, in that no effect is observed when the test is delayed by 6 hours. The phenomena whereby reactivated memories transition to extinguished memories as a function of the amount of exposure (or number of trials) is completely different from the phenomena observed here. In the former, the manipulation has to do with the number of trials (or total amount of time) that the cues are exposed. In the current Experiment 2, the authors did not manipulate the number of trials but instead the retention interval between extinction and test. The finding reported here is closer to a "Kamin effect", that is the forgetting of learned information which is observed with intervals of intermediate length (Baum, 1968). Because the Kamin effect has been inferred to result from retrieval failure, it is unclear how this can be explained here. There needs to be much more clarity on the explanations to substantiate the conclusions.<br /> There are many results (Ryan et al., 2015) that challenge the framework that the authors base their predictions on (consolidation and reconsolidation theory), therefore these need to be acknowledged. These studies showed that memory can be expressed in the absence of the biological machinery thought to be needed for memory performance. The authors should be careful about statements such as "eliminate fear memores" for which there is little evidence.

      The parallels between the current findings and the memory suppression literature are speculated in the general discussion, and there is the conclusion that "the retrieval-extinction procedure might facilitate a spontaneous memory suppression process". Because one of the basic tenets of the memory suppression literature is that it reflects an "active suppression" process, there is no reason to believe that in the current paradigm the same phenomenon is in place, but instead it is "automatic". In other words, the conclusions make strong parallels with the memory suppression (and cognitive control) literature, yet the phenomena that they observed is thought to be passive (or spontaneous/automatic). Ultimately, it is unclear why 10 mins between the reminder and extinction learning will "automatically" suppress fear memories. Further down in the discussion it is argued that "For example, in the well-known retrieval-induced forgetting (RIF) phenomenon, the recall of a stored memory can impair the retention of related long-term memory and this forgetting effect emerges as early as 20 minutes after the retrieval procedure, suggesting memory suppression or inhibition can occur in a more spontaneous and automatic manner". I did not follow with the time delay between manipulation and test (20 mins) would speak about whether the process is controlled or automatic. In addition, the links with the "latent cause" theoretical framework are weak if any. There is little reason to believe that one extinction trial, separated by 10 mins from the rest of extinction trials, may lead participants to learn that extinction and acquisition have been generated by the same latent cause.

      Among the many conclusions, one is that the current study uncovers the "mechanism" underlying the short-term effects of retrieval-extinction. There is little in the current report that uncovers the mechanism, even in the most psychological sense of the mechanism, so this needs to be clarified. The same applies to the use of "adaptive".

      Whilst I could access the data in the OFS site, I could not make sense of the Matlab files as there is no signposting indicating what data is being shown in the files. Thus, as it stands, there is no way of independently replicating the analyses reported.

      The supplemental material shows figures with all participants, but only some statistical analyses are provided, and sometimes these are different from those reported in the main manuscript. For example, the test data in Experiment 1 is analysed with a two-way ANOVA with main effects of group (reminder vs no-reminder) and time (last trial of extinction vs first trial of test) in the main report. The analyses with all participants in the sup mat used a mixed two-way ANOVA with group (reminder vs no reminder) and CS (CS+ vs CS-). This makes it difficult to assess the robustness of the results when including all participants. In addition, in the supplementary materials there are no figures and analyses for Experiment 3.

      One of the overarching conclusions is that the "mechanisms" underlying reconsolidation (long term) and memory suppression (short term) phenomena are distinct, but memory suppression phenomena can also be observed after a 7-day retention interval (Storm et al., 2012), which then questions the conclusions achieved by the current study.

      References:

      Baum, M. (1968). Reversal learning of an avoidance response and the Kamin effect. Journal of Comparative and Physiological Psychology, 66(2), 495.<br /> Chalkia, A., Schroyens, N., Leng, L., Vanhasbroeck, N., Zenses, A. K., Van Oudenhove, L., & Beckers, T. (2020). No persistent attenuation of fear memories in humans: A registered replication of the reactivation-extinction effect. Cortex, 129, 496-509.<br /> Ryan, T. J., Roy, D. S., Pignatelli, M., Arons, A., & Tonegawa, S. (2015). Engram cells retain memory under retrograde amnesia. Science, 348(6238), 1007-1013.<br /> Storm, B. C., Bjork, E. L., & Bjork, R. A. (2012). On the durability of retrieval-induced forgetting. Journal of Cognitive Psychology, 24(5), 617-629.

      Comments on revisions:

      The authors have revised the manuscript but most of my concerns have remained unaddressed.

      (1) There are still no descriptive statistics to substantiate learning in Experiment 1.

      (2) In the revised analyses, the authors now show that CS- changes in different groups (for example, Experiment 2) so this means that there is little to conclude from the differential scores because these depend on CS-. It is unclear whether the effects arise from CS+ performance or the differential which is subject to CS- variations.

      (3) The notion that suppression is automatic is speculative at best

      (4) It still struggle with the parallels between these findings and the "limbo" literature. Here you manipulated the retention interval, whereas in the cited studies the number of extinction (exposure) was varied. These are two completely different phenomena.

      (5) My point about the data problematic for the reconsolidation (and consolidation) frameworks is that they observed memory in the absence of the brain substrates that are needed for memory to be observed. The answer did not address this. I do not understand how the latent cause model can explain this, if the only difference is the first ITI. Wouldn't participants fail to integrate extinction with acquisition with a longer ITI?

      (6) The materials in the OSF site are the same as before, they haven't ben updated.

      (7) Concerning supplementary materials, the robustness tests are intended to prove that you 1) can get the same results by varying the statistical models or 2) you can get the same results when you include all participants. Here authors have done both so this does not help. Also, in the rebuttal letter, they stated "Please note we did not include non-learners in these analyses " which contradicts what is stated in the figure captions "(learners + non learners)"

      (8) Finally, the literature suggesting that reconsolidation interference "eliminates" a memory is not substantiated by data nor in line with current theorising, so I invite a revision of these strong claims.

      Overall, I conclude that the revised manuscript did not address my main concerns.

    1. eLife Assessment

      This study provides a valuable structural analysis of the Sedoheptulose-1,7-Bisphosphatase (SBPase) from Chlamydomonas reinhardtii. The data presented are solid and based on X-ray structures of the CrSBPase in an oxidized and reduced state, the authors identify a disulfide bond in close proximity to the dimer interface. They show that the redox-state of the CrSBPase impacts its oligomeric state and might also influence the activity of the protein.

    2. Joint Public Review:

      The central theme of the manuscript is the structure of SBPase - an enzyme central to the photosynthetic Calvin-Benson-Bassham cycle. The authors claim that the structure is first of its kind from a chlorophyte Chlamydomonas reinhardtii, a model unicellular green microalga. The authors use a number of methods like protein expression, purification, enzymatic assays, SAXS, molecular dynamics simulations and xray crystallography to resolve a 3.09 A crystal structure of the oxidized and partially reduced state. The results are supported by the claims made in the manuscript. While the structure is the first from a chlorophyte, it is not unique. Several structures of SBPase are available and a comparison has been made between the structure reported here and others that have been previously published.

    3. Author response:

      The following is the authors’ response to the previous reviews.

      Recommendations for the authors:

      Reviewer #1:

      The authors have thoroughly changed the manuscript and addressed most of my concerns. I appreciate adding the activity assays of the C115/120S mutants, however, I suggest that the authors embed and also discuss these data more clearly. It also escaped my attention earlier that the positioning of the disulfide bond is 117-122 in the deposited PDBs instead of 115-120. The authors should carefully check which positioning is correct here.

      We thank reviewer #1 for his or her careful assessment of our revised manuscript. As suggested, we detailed the results section “CrSBPase enzymatic activity” with additional numerical values, and discussed more clearly the comparisons of results for activity assays of mutants C115S and C120S in the section “Oligomeric states of CrSBPase”. Residues numbering was carefully proof-checked throughout the manuscript for correctness and homogeneity. C115 and C120 are numbered according to best databases consensus, ie. GenBank and Uniprot, and may differ from one database to another (including PDB) due to varying numbering rules. We clarified the chosen nomenclature in methods section “Cloning and mutagenesis of CrSBPase expression plasmids”.

      Line 246-250: I think it is evident that the two SBPase structures superpose well given the sequence identity of more than 70%. However, it would be great to include a superposition of the two structures in Figure 1, especially with regard to the region harboring C115 and C120.

      We added a panel showing superimposition of CrSBPase 7b2o and PpSBPase 5iz3 and made a close-up view around the region C115-C120 in supplementary figure 5. Given the density in information of figure 1 we prefer not to add additional images on it. Supplementary figure 5 was initially intended to illustrate sequence conservation/variation among homologs, thus fitting with the objective to compare past and present XRC results.

      Line 255-266: I am again missing a panel in Figure 1 here, e.g. a side-by-side view of Xray vs AF2/3 structure.

      We added another panel in supplementary figure 5 to visually compare side-by-side SBPase crystallographic structure 7b2o and our AF3 model. Again, for the sake of clarity we prefer not to overload figure 1 with additional panels. This will also enable thorough comparison of past XRC of PpSBPase, present XRC of CrSBPase, and various AF models (see below, oligomer comparisons).

      Line 261-266: Did the authors predict dimers and tetramers using AF3? What are the confidence metrics in this case? Do the authors see differences to the monomer prediction in case a multimer is confidently predicted?

      We modeled dimers and tetramers using AF3 and added them on supplementary figure 5 side by side with protomer of XRC model 7b2o and with monomer predicted by AF3. Color code for supplementary figure 5 panels F-H is according to AF standard representation of plDDT. Confidence metrics per residue correspond to very high reliability (navy blue) or, locally, confident prediction (cyan) and overall prediction scores range from pTM=0.85-0.91, a high-quality prediction. Interface prediction score is high for both dimer (ipTM=0.9) and tetramer (ipTM=0.82). We reported these data in supplementary figure 5 and corresponding updated legend. XRC and AF models all align with RMSD<0.5 Å, indicating a globally unchanged structure of the protomer in the various methods and oligomeric states.

      Line 441: How does the oligomeric equilibrium change in C115/120S mutants? This information should be added for the mutants. Besides, the mAU units in Fig. 6 could be normalized to allow an easier comparison between the chromatograms of wt and mutants.

      Change in oligomeric equilibrium is assessed by size-exclusion chromatography of WT and mutants C115S, C120S as reported in figure 6A. We made quantitative estimation of WT, and C115S and C120S mutants equilibrium by comparing maximal peak intensity and added this information in the text. Briefly, the oligomer ratio on a scale of 100 is 9:48:43 for WT, 42:25:33 for mutant C115S, and 29:17:54 for mutant C120S (ratio expressed as tetramer:dimer:monomer). We prefer not to normalize values of absorbance, but rather keep the actual measurement of absorbance at 280 nm on the chromatogram of figure 6, for the sake of consistency with the added text and for a more transparent report of the experiment.

      Line 447: WT activity is 12.15+-2.15 and both mutants have a higher activity. The authors should check if their values (96% and 107%) are correct. Besides, did the authors check if the increase in C120S is statistically significant? My impression is that both mutants have a higher activity than the wildtype, in both correlating with increased fractions of the tetramer. This would also make sense, as the corresponding region is part of the tetramer interface in the crystal packing.

      The reported activity values were checked for correctness. Wild-type SBPase specific activity at 12.5 ±2.15 µmol(NADPH) min<sup>-1</sup> mg(SBPase)<sup>-1</sup> was obtained by pre-incubating the enzyme with 1 µM CrTRXf2 supplemented with 1 mM DTT and 10 mM Mg<sup>2+</sup>, while the results of supplementary figure 14 reporting the comparison of activation of WT and mutants, with a variation of 107 or 96 %, were obtained with a slightly different protocol for pre-incubation of the enzyme with 10 mM DTT and 10 mM Mg<sup>2+</sup>. Please note that whether WT enzyme was assayed in 10 mM DTT 10 mM Mg or in 1 µM TRX 1 mM DTT 10 mM Mg, its specific activity appears equal within experimental error. Both mutants have nearly the same activity than the WT in the assay reported in supplementary figure 14: we fully agree that 107% (and 96%) variation is indeed not significant considering the uncertainty of the measurement (see error bars representing standard deviations of the mean in supplementary figure 14). We added this important information in the text. Even though both mutations stabilize the most active tetramer in untreated recombinant protein, we think that after reducting treatment both WT and mutants all reach the same maximal activity because they all form an equivalent proportion of the active tetramer versus alternative oligomeric states. We furhter interprete this piece of data as a decoupling of reduction and catalysis: in physiological conditions we assume that SBPase would initiate activation upon the reduction of disulfide bridges, including but not limited to C115-C120 that restricts the entry into fully active tetramer, at which point SBPase in reduced form reaches maximal activity until another post-translational signal eventually changes its conformation and oligomerisation.

      We thank again reviewer 1 for his or her assessment and valuable suggestions.

    1. eLife Assessment

      The study presents important findings on inositol-requiring enzyme (IRE1α) inhibition on diet-induced obesity (overnutrition) and insulin resistance where IRE1α inhibition enhances thermogenesis and reduces the metabolically active and M1-like macrophages in adipose tissue. The evidence supporting the conclusions is convincing. The work will be of interest to cell biologists and biochemists working in metabolism, insulin resistance and inflammation with a broad eLife readership.

    2. Reviewer #1 (Public review):

      First, the authors confirm the up-regulation of the main genes involved in the three branches of the Unfolded Protein Response (UPR) system in diet-induced obese mice in AT, observations that have been extensively reported before. Not surprisingly, IRE1a inhibition with STF led to an amelioration of the obesity and insulin resistance of the animals. Moreover, non-alcoholic fatty liver disease was also improved by the treatment. More novel are their results in terms of thermogenesis and energy expenditure, where IRE1a seems to act via activation of brown AT. Finally, mice treated with STF exhibited significantly fewer metabolically active and M1-like macrophages in the AT compared to those under vehicle conditions. Overall, the authors conclude that targeting IRE1a has therapeutical potential for treating obesity and insulin resistance.

      The study has some strengths, such as the detailed characterization of the effect of STF in different fat depots and a thorough analysis of macrophage populations. However, the lack of novelty in the findings somewhat limits the study´s impact on the field.

    3. Reviewer #3 (Public review):

      Summary:

      The manuscript by Wu D. et al. explores an innovative approach in immunometabolism and obesity by investigating the potential of targeting macrophage Inositol-requiring enzyme 1α (IRE1α) in cases of overnutrition. Their findings suggest that pharmacological inhibition of IRE1α could influence key aspects such as adipose tissue inflammation, insulin resistance, and thermogenesis. Notable discoveries include the identification of High-Fat Diet (HFD)-induced CD9+ Trem2+ macrophages and the reversal of metabolically active macrophages' activity with IRE1α inhibition using STF. These insights could significantly impact future obesity treatments.

      Strengths:

      The study's key strengths lie in its identification of specific macrophage subsets and the demonstration that inhibiting IRE1α can reverse the activity of these macrophages. This provides a potential new avenue for developing obesity treatments and contributes valuable knowledge to the field.

      Weaknesses:

      The research lacks an in-depth exploration of the broader metabolic mechanisms involved in controlling diet-induced obesity (DIO). Addressing this gap would strengthen the understanding of how targeting IRE1α might fit into the larger metabolic landscape.

      Impact and Utility:

      The findings have the potential to advance the field of obesity treatment by offering a novel target for intervention. However, further research is needed to fully elucidate the metabolic pathways involved and to confirm the long-term efficacy and safety of this approach. The methods and data presented are useful, but additional context and exploration are required for broader application and understanding.

      Comments on revisions:

      The authors have satisfactorily addressed all of my previous concerns.

    4. Author response:

      The following is the authors’ response to the previous reviews.

      Public Reviews:

      Reviewer #1 (Public review):

      First, the authors confirm the up-regulation of the main genes involved in the three branches of the Unfolded Protein Response (UPR) system in diet-induced obese mice in AT, observations that have been extensively reported before. Not surprisingly, IRE1a inhibition with STF led to an amelioration of the obesity and insulin resistance of the animals. Moreover, non-alcoholic fatty liver disease was also improved by the treatment. More novel are their results in terms of thermogenesis and energy expenditure, where IRE1a seems to act via activation of brown AT. Finally, mice treated with STF exhibited significantly fewer metabolically active and M1-like macrophages in the AT compared to those under vehicle conditions. Overall, the authors conclude that targeting IRE1a has therapeutical potential for treating obesity and insulin resistance.

      The study has some strengths, such as the detailed characterization of the effect of STF in different fat depots and a thorough analysis of macrophage populations. However, the lack of novelty in the findings somewhat limits the study´s impact on the field.

      We thank the reviewer for the appreciation of our findings. We would use the opportunity to highlight several novelties. First, we characterized the relationship between the newly discovered CD9<sup>+</sup> ATMs and the “M1-like” CD11c+ ATMs. Second, we demonstrated that M2 macrophage population was not reduced but instead increased in adipose tissue in obesity. Third, IRE1 inhibition does not improve thermogenesis by boosting M2 population, but instead, IRE1 inhibition suppresses pro-inflammatory macrophage populations including the M1-like ATMs.

      Reviewer #3 (Public review):

      Summary:

      The manuscript by Wu D. et al. explores an innovative approach in immunometabolism and obesity by investigating the potential of targeting macrophage Inositol-requiring enzyme 1α (IRE1α) in cases of overnutrition. Their findings suggest that pharmacological inhibition of IRE1α could influence key aspects such as adipose tissue inflammation, insulin resistance, and thermogenesis. Notable discoveries include the identification of High-Fat Diet (HFD)-induced CD9<sup>+</sup> Trem2+ macrophages and the reversal of metabolically active macrophages' activity with IRE1α inhibition using STF. These insights could significantly impact future obesity treatments.

      Strengths:

      The study's key strengths lie in its identification of specific macrophage subsets and the demonstration that inhibiting IRE1α can reverse the activity of these macrophages. This provides a potential new avenue for developing obesity treatments and contributes valuable knowledge to the field.

      Weaknesses:

      The research lacks an in-depth exploration of the broader metabolic mechanisms involved in controlling diet-induced obesity (DIO). Addressing this gap would strengthen the understanding of how targeting IRE1α might fit into the larger metabolic landscape.

      We thank the reviewer for the appreciation of strengths in our manuscript. In particular, we appreciate the reviewer’s recommendation on the exploration of broader metabolic landscape, such as the effect of IRE1 inhibition on non-adipose tissue macrophages and metabolism. We agree that achieving these will certainly broaden the therapeutic potential of IRE1 inhibition to larger metabolic disorders and we will pursue these explorations in future studies.

      Impact and Utility:

      The findings have the potential to advance the field of obesity treatment by offering a novel target for intervention. However, further research is needed to fully elucidate the metabolic pathways involved and to confirm the long-term efficacy and safety of this approach. The methods and data presented are useful, but additional context and exploration are required for broader application and understanding.

      Comments on revisions:

      The author has revised the manuscript and addressed the most relevant comments raised by the reviewers. The paper is now significantly improved, though two minor issues remain.

      (1) Studies were limited to male mice; this should be mentioned in the paper's Title.

      Thanks for comment. We have modified the title to reflect the male mice only.

      (2) Please include the sample size (n=) in all provided tables in the main manuscript and supplementary tables.

      We have included the sample size in the main manuscript.

    1. eLife Assessment

      This manuscript presents important findings on a bacterial effector involved in plant symbiotic signaling. The effector proteolytically targets a key receptor while its activity is counteracted by host-mediated phosphorylation, revealing a dynamic interplay that fine-tunes symbiotic interactions. The evidence supporting these claims is solid, and the findings have potential signaling implications beyond bacterial interactions with plants.

    2. Reviewer #1 (Public review):

      Bacterial effectors that interfere with the inner molecular workings of eukaryotic host cells are of great biological significance across disciplines. On the one hand they help us to understand the molecular strategies that bacteria use to manipulate host cells. On the other hand, they can be used as research tools to reveal molecular details of the intricate workings of the host machinery that is relevant for the interaction/defence/symbiosis with bacteria. The authors investigate the function and biological impact of a rhizobial effector that interacts with and modifies, and curiously is modified by, legume receptors essential for symbiosis. The molecular analysis revealed a bacterial effectorthat cleaves a plant symbiosis signaling receptor to inhibit signaling and the host counterplay by phosphorylation via a receptor kinase. These findings have potential implications beyond bacterial interactions with plants. Bao and colleagues investigated how rhizobial effector proteins can regulate the legume root nodule symbiosis.

      Bao and colleagues investigated how rhizobial effector proteins can regulate the legume root nodule symbiosis. A rhizobial effector is described to directly modify symbiosis-related signaling proteins, altering the outcome of the symbiosis. Overall, the paper presents findings that will have a wide appeal beyond its primary field.

      Out of 15 identified effectors from Sinorhizobium fredii, they focus on the effector NopT, which exhibits proteolytic activity and may therefore cleave specific target proteins of the host plant. They focus on two Nod factor receptors of the legume Lotus japonicus, NFR1 and NFR5, both of which were previously found to be essential for the perception of rhizobial nod factor, and the induction of symbiotic responses such as bacterial infection thread formation in root hairs and root nodule development (Madsen et al., 2003, Nature; Tirichine et al., 2003; Nature). The authors present evidence for an interaction of NopT with NFR1 and NFR5. The paper aims to characterize the biochemical and functional consequences of these interactions and the phenotype that arises when the effector is mutated.

      Evidence is presented that in vitro NopT can cleave NFR5 at its juxtamembrane region. NFR5 appears also to be cleaved in vivo, and NFR1 appears to inhibit the proteolytic activity of NopT by phosphorylating NopT. When NFR5 and NFR1 are ectopically over-expressed in leaves of the non-legume Nicotiana benthamiana, they induce cell death (Madsen et al., 2011, Plant Journal). Bao et al. found that this cell death response is inhibited by the coexpression of nopT. Mutation of nopT alters the outcome of rhizobial infection in L. japonicus. These conclusions are well supported by the data.

      The presented data support the interaction of NopT with NFR1 and NFR5. In particular, there is solid support for cleavage of NFR5 by NopT (Figure 3) and the identification of NopT phosphorylation sites that inhibit its proteolytic activity (Figure 4C). Cleavage of NFR5 upon expression in N. benthamiana (Figure 3A) requires appropriate controls (inactive mutant versions), since Agrobacterium as a closely rhizobia related bacterium might increase defense related proteolytic activity in the plant host cells, and these controls are provided.

      Key results from N. benthamiana appear consistent with data from recombinant protein expression in bacteria. For the analysis in the host legume L. japonicus transgenic hairy roots were included. To demonstrate that the cleavage of NFR5 occurs during the interaction in plant cells, the authors build largely on Western blots. Regardless of whether Nicotiana leaf cells or Lotus root cells are used as the test platform, the Western blots indicate that only a small proportion of NFR5 is cleaved when co-expressed with nopT, and most of the NFR5 persists in its full-length form (Figures 3A-D). The authors discuss how the loss of NFR5 function (loss of cell death, impact on symbiosis) can be explained despite this vast excess of intact NFR5, but do not further explore the impact of this ratio on downstream signaling.

    3. Reviewer #2 (Public review):

      Summary:

      This manuscript presents data demonstrating NopT's interaction with Nod Factor Receptors NFR1 and NFR5 and its impact on cell death inhibition and rhizobial infection. The identification of a truncated NopT variant in certain Sinorhizobium species adds an interesting dimension to the study. These data try to bridge the gaps between classical Nod-factor-dependent nodulation and T3SS NopT effector-dependent nodulation in legume-rhizobium symbiosis. Overall, the research provides interesting insights into the molecular mechanisms underlying symbiotic interactions between rhizobia and legumes.

      Strengths:

      The manuscript nicely demonstrates NopT's proteolytic cleavage of NFR5, regulated by NFR1 phosphorylation, promoting rhizobial infection in L. japonicus. Intriguingly, authors also identify a truncated NopT variant in certain Sinorhizobium species, maintaining NFR5 cleavage but lacking NFR1 interaction. These findings bridge the T3SS effector with the classical Nod-factor-dependent nodulation pathway, offering novel insights into symbiotic interactions.

      Weaknesses:

      (1) In the previous study, when transiently expressed NopT alone in Nicotiana tobacco plants, proteolytically active NopT elicited a rapid hypersensitive reaction. However, this phenotype was not observed when expressing the same NopT in Nicotiana benthamiana (Figure 1A). Conversely, cell death and a hypersensitive reaction were observed in Figure S8. This raises questions about the suitability of the exogenous expression system for studying NopT proteolysis specificity.

      (2) NFR5 Loss-of-function mutants do not produce nodules in the presence of rhizobia in lotus roots, and overexpression of NFR1 and NFR5 produces spontaneous nodules. In this regard, if the direct proteolysis target of NopT is NFR5, one could expect the NGR234's infection will not be very successful because of the Native NopT's specific proteolysis function of NFR5 and NFR1. Conversely, in Figure 5, authors observed the different results.

      (3) In Figure 6E, the model illustrates how NopT digests NFR5 to regulate rhizobia infection. However, it raises the question of whether it is reasonable for NGR234 to produce an effector that restricts its own colonization in host plants.

      (4) The failure to generate stable transgenic plants expressing NopT in Lotus japonicus is surprising, considering the manuscript's claim that NopT specifically proteolyzes NFR5, a major player in the response to nodule symbiosis, without being essential for plant development.

      Comments on the revised version:

      My concerns regarding the potential function of NopT during nodule symbiosis have been adequately addressed in the revised manuscript. Therefore, I have no further questions about this version, aside from a few minor suggestions:

      (1) Please carefully check the text formatting throughout the manuscript to ensure consistency with scientific conventions and the journal's standards. For example, Line 105-117 and line119-131.<br /> (2) The term "detrimental" in line 624 may not accurately describe the function of NopT in rhizobial infection. Since the authors propose that NopT proteolytically cleaves NFR5 and suppresses NF signaling as a potential fine-tuning mechanism for legume symbiosis, a more precise term may be needed.<br /> (3) Lines 632-634 are somewhat unclear. If NopT serves as a strategy for rhizobia to evade detection by plant immunity, then knocking out NopT should, in theory, inhibit rhizobial infection. Clarification on this point would be beneficial.

    4. Author response:

      The following is the authors’ response to the previous reviews.

      Public Reviews:

      Reviewer #1 (Public review):

      Bacterial effectors that interfere with the inner molecular workings of eukaryotic host cells are of great biological significance across disciplines. On the one hand they help us to understand the molecular strategies that bacteria use to manipulate host cells. On the other hand they can be used as research tools to reveal molecular details of the intricate workings of the host machinery that is relevant for the interaction/defence/symbiosis with bacteria. The authors investigate the function and biological impact of a rhizobial effector that interacts with and modifies, and curiously is modified by, legume receptors essential for symbiosis. The molecular analysis revealed a bacterial effector that cleaves a plant symbiosis signaling receptor to inhibit signaling and the host counterplay by phosphorylation via a receptor kinase. These findings have potential implications beyond bacterial interactions with plants.

      Bao and colleagues investigated how rhizobial effector proteins can regulate the legume root nodule symbiosis. A rhizobial effector is described to directly modify symbiosis-related signaling proteins, altering the outcome of the symbiosis. Overall, the paper presents findings that will have a wide appeal beyond its primary field.

      Out of 15 identified effectors from Sinorhizobium fredii, they focus on the effector NopT, which exhibits proteolytic activity and may therefore cleave specific target proteins of the host plant. They focus on two Nod factor receptors of the legume Lotus japonicus, NFR1 and NFR5, both of which were previously found to be essential for the perception of rhizobial nod factor, and the induction of symbiotic responses such as bacterial infection thread formation in root hairs and root nodule development (Madsen et al., 2003, Nature; Tirichine et al., 2003; Nature). The authors present evidence for an interaction of NopT with NFR1 and NFR5. The paper aims to characterize the biochemical and functional consequences of these interactions and the phenotype that arises when the effector is mutated.

      Evidence is presented that in vitro NopT can cleave NFR5 at its juxtamembrane region. NFR5 appears also to be cleaved in vivo. and NFR1 appears to inhibit the proteolytic activity of NopT by phosphorylating NopT. When NFR5 and NFR1 are ectopically over-expressed in leaves of the non-legume Nicotiana benthamiana, they induce cell death (Madsen et al., 2011, Plant Journal). Bao et al., found that this cell death response is inhibited by the coexpression of nopT. Mutation of nopT alters the outcome of rhizobial infection in L. japonicus. These conclusions are well supported by the data.

      The authors present evidence supporting the interaction of NopT with NFR1 and NFR5. In particular, there is solid support for cleavage of NFR5 by NopT (Figure 3) and the identification of NopT phosphorylation sites that inhibit its proteolytic activity (Figure 4C). Cleavage of NFR5 upon expression in N. benthamiana (Figure 3A) requires appropriate controls (inactive mutant versions) that have been provided, since Agrobacterium as a closely rhizobia-related bacterium might increase defense related proteolytic activity in the plant host cells.

      We appreciate your recognition of the importance of appropriate controls in our experimental design. In response to your comments, we revised our manuscript to ensure that the figures and legends provide a clear description of the controls used. We also included a more detailed description of our experimental design at several places. In particular, we have highlighted the use of the protease-dead version of NopT as a control (NopT<sup>C93S</sup>). Therefore, NFR5-GFP cleavage in N. benthamiana clearly depended on protease activity of NopT and not on Agrobacterium (Fig. 3A). In the revised text, we carefully revied the conclusion and do not conclude at this stage that NopT proteolyzes NFR5. However, our subsequent experiments, including in vitro experiments, clearly show that NopT is able to proteolyze NFR5.

      Key results from N. benthamiana appear consistent with data from recombinant protein expression in bacteria. For the analysis in the host legume L. japonicus transgenic hairy roots were included. To demonstrate that the cleavage of NFR5 occurs during the interaction in plant cells the authors build largely on western blots. Regardless of whether Nicotiana leaf cells or Lotus root cells are used as the test platform, the Western blots indicate that only a small proportion of NFR5 is cleaved when co-expressed with nopT, and most of the NFR5 persists in its full-length form (Figures 3A-D). It is not quite clear how the authors explain the loss of NFR5 function (loss of cell death, impact on symbiosis), as a vast excess of the tested target remains intact. It is also not clear why a large proportion of NFR5 is unaffected by the proteolytic activity of NopT. This is particularly interesting in Nicotiana in the absence of Nod factor that could trigger NFR1 kinase activity.

      Thank you for your comments regarding the cleavage of NFR5 by NopT and its functional implications. We acknowledge that our immunoblots indicate only a relatively small proportion of the NFR5 cleavage product. Possible explanations could be as follows:

      (1) The presence of full-length NFR5 does not preclude a significant impact of NopT on function of NFR5, as NopT is able to interact with NFR5. In other words, the NopT-NFR5 and NopT-NFR1 interactions at the plasma membrane might influence the function of the NFR1/NFR5 receptor without proteolytic cleavage of NFR5. In fact, protease-dead NopT<sup>C93S</sup> expressed in NGR234ΔnopT showed certain effects in L. japonicus (less infection foci were formed compared to NGR234ΔnopT Fig. 5E). In this context, it is worth mentioning that the non-acylated NopT<sup>C93S</sup> (Fig. 1B) and NopT<sub>USDA257</sub> (Fig. 6B) proteins were unable to suppress NFR1/NFR5-induced cell death in N. benthamina, but this could be explained by the lack of acylation and altered subcellular localization.

      (2) In the cleavage assay, only small portion of NFR5 could be detected for cleavage by NopT. However, this cleavage might be sufficient to suppress signaling pathways, leading to the observed phenotypic changes (loss of cell death in N. benthamiana; altered infection in L. japonicus). We do believe this is a great point, therefore, we carefully revised the conclusion about this point. Throughout the paper, we stated that the cleavage of NFR5 suppresses symbiotic signaling but not disrupt the symbiotic signaling. We also removed the conclusion that cleavage of NFR5 by NopT results in the function loss of NFR5.

      (3) N. benthamiana co-expressing NFR1/NFR5 leads to strong cell death, which suggest that the NFR1 kinase activity might be constitutively active even in the absence of Nod factors. But why co-expression of symbiotic receptor leads to cell death and how kinase activity is active in the absence of Nod factor are not clear, which is of great interest to be studied.

      (4) The proteolytic activity of NopT may be reduced by the interaction of NopT with other proteins such as NFR1, which phosphorylates NopT and inactivates its protease activity.

      In our revised manuscript version, we provide now quantitative data for the efficiency of NFR5 cleavage by NopT in different expression systems used (Figure 3 and Supplemental Fig. 16). We have also improved our Discussion in this context.  

      Comments on latest version:

      The presentation of the figures and the language has greatly improved and the specific mistakes pointed out in the last review have been corrected. I especially appreciate the new images used to illustrate the observed mutant phenotypes, which are much clearer and easier to understand. The pictures used to illustrate the mutant phenotypes seem to be of more comparable root regions than before. Overall, the requested changes have been implemented, with some exceptions described below.

      • Figure 1: New representative images are shown for BAX1 and CERK1. These pictures are more consistent with the phenotype seen in other treatments, but since the data has not changed, I presume the data from leaf discs (where the leaf discs for these treatments looked very different) previously shown is still included. The criteria for what was considered cell death is in my opinion still not described in the legend. The cell death/total ratio has been added for all leaf discs, as requested.

      Thank you so much for carefully pointing out this. Cell death in leaf disc results in the formation of necrotic plaques, which restrains pathogens within deceased cells. These plaques commonly manifest as leaf dehydration, frequently accompanied by a translucent appearance. Brown and shriveled leaf discs serve as indicators of cell death. We have added these descriptions in the figure legend of Figure 1.

      • Figure 2: the discussion of the figure now emphasizes direct protein interaction. There is still no size marker in 2D or a description of size in the figure legend, making it difficult to compare the result to Figure 3. If I understand the rebuttal comments correctly, there are other bands on the blot, including non-specific bands. This does not negate the need to include the full blot as a supplemental figure to show cleaved NFR5 as well as other bands. I do not see any other clarifications on this subject in the manuscript.

      Thank you for your suggestion. In the revised manuscript, we have included the kDa range for all proteins detected in Figure.2D. The full blot of Co-IP assay was shown in Fig S2 (a new supplemental data). Yes, we detected some smaller bands after immunoblot, but we cannot give clear conclusion of what these bands are based on the current study. Interestingly, these smaller bands were immunoprecipitated by anti-FLAG beads, suggesting that these bands are some truncated peptides from NFR5.

      • Figure 5: From the pictures, it is now easier to understand what is meant by "infection foci". Although there is no description in the methods of how these were distinguished from infection threads, I believe the images are clear enough.

      Thank you for your helpful comment. In the revised manuscript, we have added the descriptions about this experiment in the method section and in the legend in Figure 5A.

      • Figure 6: The changes in the discussion are appreciated, but panel E still misrepresents the evidence in the paper, as from the drawing it still seems that the cleaved NFR5 is somehow directly responsible for suppressing infection when this was not shown.

      Thank you for your thoughtful comments. We appreciate your suggestion to the schematic model to illustrate the cleavage of NFR5 to suppressing rhizobia infection. In the revised manuscript, we have changed the model in Figure 6E.

      Reviewer #2 (Public review):

      Summary:

      This manuscript presents data demonstrating NopT's interaction with Nod Factor Receptors NFR1 and NFR5 and its impact on cell death inhibition and rhizobial infection. The identification of a truncated NopT variant in certain Sinorhizobium species adds an interesting dimension to the study. These data try to bridge the gaps between classical Nod-factor-dependent nodulation and T3SS NopT effector-dependent nodulation in legume-rhizobium symbiosis. Overall, the research provides interesting insights into the molecular mechanisms underlying symbiotic interactions between rhizobia and legumes.

      Strengths:

      The manuscript nicely demonstrates NopT's proteolytic cleavage of NFR5, regulated by NFR1 phosphorylation, promoting rhizobial infection in L. japonicus. Intriguingly, authors also identify a truncated NopT variant in certain Sinorhizobium species, maintaining NFR5 cleavage but lacking NFR1 interaction. These findings bridge the T3SS effector with the classical Nod-factor-dependent nodulation pathway, offering novel insights into symbiotic interactions.

      Weaknesses:

      (1) In the previous study, when transiently expressed NopT alone in Nicotiana tobacco plants, proteolytically active NopT elicited a rapid hypersensitive reaction. However, this phenotype was not observed when expressing the same NopT in Nicotiana benthamiana (Figure 1A). Conversely, cell death and a hypersensitive reaction were observed in Figure S8. This raises questions about the suitability of the exogenous expression system for studying NopT proteolysis specificity.

      We appreciate your attention to these plant-specific differences. Previous studies showed that NopT expressed in tobacco (N. tabacum) or in specific Arabidopsis ecotypes (with PBS1/RPS5 genes) causes rapid cell death (Dai et al. 2008; Khan et al. 2022). Khan et al. 2022 reported recently that cell death does not occur in N. benthamiana unless the leaves were transformed with PBS1/RPS5 constructs. Our data shown in Fig. S17 confirm these findings. As cell death is usually associated with induction of plant protease activities, we considered N. tabacum and A. thaliana plants as not suitable for testing NFR5 cleavage by NopT. In fact, no NopT/NFR5 experiments were not performed with these plants in our study. In response to your comment, we now better describe the N. benthamiana expression system and cite the previous articles_. Furthermore, we have revised the Discussion section to better emphasize effector-induced immunity in non-host plants and the negative effect of rhizobial effectors during symbiosis. Our revisions certainly provide a clearer understanding of the advantages and limitations of the _N. benthamiana expression system.

      (2) NFR5 Loss-of-function mutants do not produce nodules in the presence of rhizobia in lotus roots, and overexpression of NFR1 and NFR5 produces spontaneous nodules. In this regard, if the direct proteolysis target of NopT is NFR5, one could expect the NGR234's infection will not be very successful because of the Native NopT's specific proteolysis function of NFR5 and NFR1. Conversely, in Figure 5, authors observed the different results.

      Thank you for this comment, which points out that we did not address this aspect precisely enough in the original manuscript version. We improved our manuscript and now write that nfr1 and nfr5 mutants do not produce nodules (Madsen et al., 2003; Radutoiu et al., 2003) and that over-expression of either NFR1 or NFR5 can activate NF signaling, resulting in formation of spontaneous nodules in the absence of rhizobia (Ried et al., 2014). In fact, compared to the nopT knockout mutant NGR234ΔnopT, wildtype NGR234 (with NopT) is less successful in inducing infection foci in root hairs of L. japonicus (Fig. 5). With respect to formation of nodule primordia, we repeated our inoculation experiments with NGR234ΔnopT and wildtype NGR234 and also included a nopT over-expressing NGR234 strain into the analysis. Our data clearly showed that nodule primordium formation was negatively affected by NopT. The new data are shown in Fig. 5 of our revised version. Our data show that NGR234 infection is not really successful, especially when NopT is over-expressed. This is consistent with our observations that NopT targets Nod factor receptors in L. japonicus and inhibits NF signaling (NIN promoter-GUS experiments). Our findings indicate that NopT might be an “Avr effector” for L. japonicus. However, in other host plants of NGR234, NopT possesses a symbiosis-promoting role (Dai et al. 2008; Kambara et al. 2009). Such differences could be explained by different NopT targets in different plants (in addition to Nod factor receptors), which may influence the outcome of the infection process. Indeed, our work shows that NopT can interact with various kinase-dead LysM domain receptors, suggesting a role of NopT in suppression or activation of plant immunity responses depending on the host plant. We discuss such alternative mechanisms in our revised manuscript version and emphasize the need for further investigation to elucidate the precise mechanisms underlying the observed infection phenotype and the role of NopT in modulating symbiotic signaling pathways. In this context, we would also like to mention the new figures of our manuscript which are showing (i) the efficiency of NFR5 cleavage by NopT in different expression systems (Figure 3), (ii) the interaction between NopT<sup>C93S</sup> and His-SUMO-NFR5JM-GFP (Supplementary Fig. 5), and (iii) cleavage of His-SUMO-NFPJM-GFP by NopT (Supplementary Figs. S8 and S9).

      (3) In Figure 6E, the model illustrates how NopT digests NFR5 to regulate rhizobia infection. However, it raises the question of whether it is reasonable for NGR234 to produce an effector that restricts its own colonization in host plants.

      Thank you for mentioning this point. We are aware of the possible paradox that the broad-host-range strain NGR234 produces an effector that appears to restrict its infection of host plants. As mentioned in our answer to the previous comment, NopT could have additional functions beyond the regulation of Nod factor signaling. In our revised manuscript version, we have modified our text as follows:

      (1) We mention the potential evolutionary aspects of NopT-mediated regulation of rhizobial infection and discuss the possibility that interactions between NopT and Nod factor receptors may have evolved to fine-tune Nod factor signaling to avoid rhizobial hyperinfection in certain host legumes.

      (2) We also emphasize that the presence of NopT may confer selective advantages in other host plants than L. japonicus due to interactions with proteins related to plant immunity. Like other effectors, NopT could suppress activation of immune responses (suppression of PTI) or cause effector-triggered immunity (ETI) responses, thereby modulating rhizobial infection and nodule formation. Interactions between NopT and proteins related to the plant immune system may represent an important evolutionary driving force for host-specific nodulation and explain why the presence of NopT in NGR234 has a negative effect on symbiosis with L. japonicus but a positive one with other legumes.

      (4) The failure to generate stable transgenic plants expressing NopT in Lotus japonicus is surprising, considering the manuscript's claim that NopT specifically proteolyzes NFR5, a major player in the response to nodule symbiosis, without being essential for plant development.

      We also thank for this comment. We have revised the Discussion section of our manuscript and discuss now our failure to generate stable transgenic L. japonicus plants expressing NopT. We observed that the protease activity of NopT in aerial parts of L. japonicus had a negative effect on plant development, whereas NopT expression in hairy roots was possible. Such differences may be explained by different NopT substrates in roots and aerial parts of the plant. In this context, we also discuss our finding that NopT not only cleaves NFR5 but is also able to proteolyze other proteins of L. japonicus such as LjLYS11, suggesting that NopT not only suppresses Nod factor signaling, but may also interfere with signal transduction pathways related to plant immunity. We speculate that, depending on the host legume species, NopT could suppress PTI or induce ETI, thereby modulating rhizobial infection and nodule formation.

      Comments on revised version:

      This version has effectively addressed most of my concerns. However, one key issue remains unresolved regarding the mechanism of NopT in regulating nodule symbiosis. Specifically, the explanation of how NopT catabolizes NFR5 to regulate symbiosis is still not convincing within the current framework of plant-microbe interaction, where plants are understood to genetically control rhizobial colonization.

      While alternative regulatory mechanisms in plant-microbe interactions are plausible, the notion that the NRG234-secreted effector NopT could reduce its own infection by either suppressing plant immunity or degrading the symbiosis receptor remains unsubstantiated. I believe further revisions are needed in the discussion section to more clearly address and clarify these findings and any lingering uncertainties.

      We appreciate your positive comments on the reason why NopT catabolizes NFR5 to regulate symbiosis. NopT belongs to pathogen effecftors YopT family and also cleavage Arabidopsis AtLYK5 and L. japonicus LjLYS11 which trigger immunity responses in plants. NFR5, AtLYK5 and LjLYS11 has the conserved amino acid motif at the juxtamembrane domain, leading to cleaving NFR5 by NopT during symbiosis. Besides, in plant-microbe interaction, effector HopB1 cleaves immune co-receptor BAK1 at the kinase domain to inhibit plant defense. The effect on cleavage of receptor may be positive or negative. NopT suppressing symbiosis may avoid preventing hyperinfection in the specific interaction between rhizobia and legumes. In the revised manuscript, we have emphasized this point more clearly in why NopT could reduce its own infection by either suppressing plant immunity in discussion.

      Recommendations for the authors:

      Reviewer #1 (Recommendations for the authors):

      Evaluation of the author's responses to the reviewer comments during the first review round

      Reviewer's Comment:

      Regardless of whether Nicotiana leaf cells or Lotus root cells are used as the test platform, the Western blots indicate that only a small proportion of NFR5 is cleaved when co-expressed with NopT, and most of the NFR5 persists in its full-length form (Figures 3A-D). It is not quite clear how the authors explain the loss of NFR5 function (loss of cell death, impact on symbiosis), as a vast excess of the tested target remains intact. It is also not clear why a large proportion of NFR5 is unaffected by the proteolytic activity of NopT. This is particularly interesting in Nicotiana in the absence of Nod factor that could trigger NFR1 kinase activity.

      Summary of response:

      • NopT could be interfering with the NFR1/NFR5 complex without proteolytic cleavage

      • The cleaved fraction may still be sufficient to disrupt signaling pathways

      • Elevated abundance of NFR5 relative to WT levels

      • Add quantitative data for efficiency of NFR5 cleavage in different systems

      Evaluation of response:

      • The quantification of NFR5 cleavage efficiency is welcome, and there is some discussion of the possible reasons for the large proportion of uncleaved NFR5. It is clear that there is a large difference in cleavage efficiency between L. japonicus roots and N. benthamiana.

      • The data is shown as a bar plot. Given that only 3 biological replicates are used, the data points should be shown, and there is too little data to provide sensible error bars. It would be better to simply make a dot-plot and indicate the mean for each sample. However, the main aim of the comment is addressed.

      Thank you for your constructive comments regarding Figure S16. In the revised manuscript, we have presented these data into dot-Plot format.

      Reviewer's Comment:

      It is also difficult to evaluate how the ratios of cleaved and full-length protein change when different versions of NopT are present without a quantification of band strengths normalized to loading controls (Figure 3C, 3D, 3F). The same is true for the blots supporting NFR1 phosphorylation of NopT (Figure 4A).

      Summary of response:

      • Quantified proportion of cleaved and full length NFR5 in different systems (S14)

      • Band strengths of immunoblots quantified (4B)

      Evaluation of response:

      • The quantification has been performed as requested and the data is shown as bar plots. This type of data is frequently displayed as part of the blot figure itself, printed under each respective lane, making it easier for the reader to connect the ratios to the band sizes. If data is shown in a plot, the data points should be shown on the plot, as described above.

      Thank you for your constructive comments regarding Figure 3. In the revised manuscript, we have added the cleavage efficiency in the 3A-3D.

      Reviewer's Comment:

      Nodule primordia and infection threads are still formed when L. japonicus plants are inoculated with ∆nopT mutant bacteria, but it is not clear if these primordia are infected or develop into fully functional nodules (Figure 5). A quantification of the ratio of infected and non-infected nodules and primordia would reveal whether NopT is only active at the transition from infection focus to thread or perhaps also later in the bacterial infection process of the developing root nodule.

      Summary of response:

      • Additional experiments with NGR234 or NGR234ΔnopT mutants find no non-infected nodules (fig. 5)

      Evaluation of response:

      • The requested quantification has been done, although the support for the findings would be stronger if also mature nodules per plant were quantified and plotted. If non-infected nodules were neither present in NGR234 or NGR234ΔnopT, it would still be advisable to include images of cross-sections of the fully-developed nodules.

      We appreciate your positive comments on the cross-sections of the fully-developed nodules. In the revised manuscript, we have added the cross-section images of nodules in the Figure S12.

    1. eLife Assessment

      This study leverages an impressive and comprehensive longitudinal 16S rRNA gut microbiome dataset from baboons to provide important insight regarding the use of a microbiome-based clock to predict biological age. The evidence for age-associated microbiome features and environmental and social variables that impact microbiome aging is convincing. This study of microbiomes as markers of host age will fuel inquiries and studies and interest a broad range of researchers, especially those interested in alternatives to measuring biological aging.

    2. Reviewer #1 (Public review):

      Summary:

      The authors used a subset of a very large, previously generated 16S dataset to: 1) assess age-associated features; and 2) develop a fecal microbiome clock, based on extensive longitudinal sampling of wild baboons for which near-exact chronological age is known. They further seek to understand deviation from age-expected patterns and uncover if and why some individuals have an older or younger microbiome than expected, and the health and longevity implications of such variation. Overall, the authors compellingly achieved their goals to discover age-associated microbiome features and develop a fecal microbiome clock. They also showed clear and exciting evidence for sex and rank-associated variation in the pace of gut microbiome aging and impacts of seasonality on microbiome age in females. These data add to a growing understanding of modifiers of the pace of age in primates, and links among different biological indicators of age, with implications for understanding and contextualizing human variation. However, in the current version there are gaps in the analyses with respect to the social environment, and in comparisons with other biological indicators of age. Despite this, I anticipate this work will be impactful, generate new areas of inquiry and fuel additional comparative studies.

      Strengths:

      The major strengths of the paper are the size and sampling depth of the study population, including ability to characterize of the social and physical environments, and the application of recent and exciting methods to characterize the microbiome clock. An additional strength was the ability of the authors to compare and contrast the relative age-predictive power of the fecal microbiome clock to other biological methods of age estimation available for the study population (dental wear, blood cell parameters, methylation data). Furthermore, the writing and support materials are clear and informative and visually appealing.

      Revisions made following initial review have further improved the content and clarity.

      Weaknesses:

      Revisions to the manuscript clarified some of the analysis decisions and limitations regarding drawing comparisons between the microbiome clock and other metrics of biological age, and on the impact of sociality on microbiome metrics. Hopefully these interesting topics will be further addressed in forthcoming publications.

    3. Reviewer #2 (Public review):

      Summary:

      Dasari et al present an interesting study investigating the use of 'microbiota age' as an alternative to other measures of 'biological age'. The study provides several curious insights into biological ageing. Although 'microbiota age' holds potential as a proxy of biological age, it comes with limitations considering the gut microbial community can be influenced various non-age related factors, and various age-related stressors may not manifest in changes in the gut microbiota.

      Strengths:

      The dataset this study is based on is impressive, and can reveal various insights into biological ageing and beyond. The analysis implemented is extensive and of high level.

      Weaknesses:

      The key weakness is the use of microbiota age instead of e.g., DNA-methylation based epigenetic age as a proxy of biological ageing, for reasons stated in the summary. DNA methylation levels can be measured from faecal samples, and as such epigenetic clocks too can be non-invasive.

      In the first round of review, I provided authors a list of minor edits, which they have implemented in the revised version of the manuscript.

    4. Author response:

      The following is the authors’ response to the original reviews.

      Public Reviews:

      Reviewer #1 (Public review):

      Summary:

      The authors used a subset of a very large, previously generated 16S dataset to:<br /> (1) Assess age-associated features; and (2) develop a fecal microbiome clock, based on an extensive longitudinal sampling of wild baboons for which near-exact chronological age is known. They further seek to understand deviation from age-expected patterns and uncover if and why some individuals have an older or younger microbiome than expected, and the health and longevity implications of such variation. Overall, the authors compellingly achieved their goals of discovering age-associated microbiome features and developing a fecal microbiome clock. They also showed clear and exciting evidence for sex and rank-associated variation in the pace of gut microbiome aging and impacts of seasonality on microbiome age in females. These data add to a growing understanding of modifiers of the pace of age in primates, and links among different biological indicators of age, with implications for understanding and contextualizing human variation. However, in the current version, there are gaps in the analyses with respect to the social environment, and in comparisons with other biological indicators of age. Despite this, I anticipate this work will be impactful, generate new areas of inquiry, and fuel additional comparative studies.

      Thank you for the supportive comments and constructive reviews.

      Strengths:

      The major strengths of the paper are the size and sampling depth of the study population, including the ability to characterize the social and physical environments, and the application of recent and exciting methods to characterize the microbiome clock. An additional strength was the ability of the authors to compare and contrast the relative age-predictive power of the fecal microbiome clock to other biological methods of age estimation available for the study population (dental wear, blood cell parameters, methylation data). Furthermore, the writing and support materials are clear, informative and visually appealing.

      Weaknesses:

      It seems clear that more could be done in the area of drawing comparisons among the microbiome clock and other metrics of biological age, given the extensive data available for the study population. It was confusing to see this goal (i.e. "(i) to test whether microbiome age is correlated with other hallmarks of biological age in this population"), listed as a future direction, when the authors began this process here and have the data to do more; it would add to the impact of the paper to see this more extensively developed.

      Comparing the microbiome clock to other metrics of biological age in our population is a high priority (these other metrics of biological age are in Table S5 and include epigenetic age measured in blood, the non-invasive physiology and behavior clock (NPB clock), dentine exposure, body mass index, and blood cell counts (Galbany et al. 2011; Altmann et al. 2010; Jayashankar et al. 2003; Weibel et al. 2024; Anderson et al. 2021)). However, we have opted to test these relationships in a separate manuscript. We made this decision because of the complexity of the analytical task: these metrics were not necessarily collected on the same subjects, and when they were, each metric was often measured at a different age for a given animal. Further, two of the metrics (microbiome clock and NPB clock) are measured longitudinally within subjects but on different time scales (the NPB clock is measured annually while microbiome age is measured in individual samples). The other metrics are cross-sectional. Testing the correlations between them will require exploration of how subject inclusion and time scale affect the relationships between metrics.

      We now explain the complexity of this analysis in the discussion in lines 447-450. In addition, we have added the NPB clock (Weibel et al. 2024) to the text in lines 260-262 and to Table S5.

      An additional weakness of the current set of analyses is that the authors did not explore the impact of current social network connectedness on microbiome parameters, despite the landmark finding from members of this authorship studying the same population that "Social networks predict gut microbiome composition in wild baboons" published here in eLife some years ago. While a mother's social connectedness is included as a parameter of early life adversity, overall the authors focus strongly on social dominance rank, without discussion of that parameter's impact on social network size or directly assessing it.

      Thank you for raising this important point, which was not well explained in our manuscript. We find that the signatures of social group membership and social network proximity are only detectable our population for samples collected close in time. All of the samples analyzed in  Tung et al. 2015 (“Social networks predict gut microbiome composition in wild baboons”) were collected within six weeks of each other. By contrast, the data set analyzed here spans 14 years, with very few samples from close social partners collected close in time. Hence, the effects of social group membership and social proximity are weak or undetectable. We described these findings in Grieneisen et al. 2021 and Bjork et al. 2022, and we now explain this logic on line 530, which states, “We did not model individual social network position because prior analyses of this data set find no evidence that close social partners have more similar gut microbiomes, probably because we lack samples from close social partners sampled close in time (Grieneisen et al. 2021; Björk et al. 2022).”

      We do find small effects of social group membership, which is included as a random effect in our models of how each microbiome feature is associated with host age (line 529) and our models predicting microbiome Dage (line 606; Table S6).

      Reviewer #2 (Public review):

      Summary:

      Dasari et al present an interesting study investigating the use of 'microbiota age' as an alternative to other measures of 'biological age'. The study provides several curious insights into biological aging. Although 'microbiota age' holds potential as a proxy of biological age, it comes with limitations considering the gut microbial community can be influenced by various non-age related factors, and various age-related stressors may not manifest in changes in the gut microbiota. The work would benefit from a more comprehensive discussion, that includes the limitations of the study and what these mean to the interpretation of the results.

      We agree and have text to the discussion that expands on the limitations of this study and what those limitations mean for the interpretation of the results. For instance, lines 395-400 read, “Despite the relative accuracy of the baboon microbiome clock compared to similar clocks in humans, our clock has several limitations. First, the clock’s ability to predict  individual age is lower than for age clocks based on patterns of DNA methylation—both for humans and baboons (Horvath 2013; Marioni et al. 2015; Chen et al. 2016; Binder et al. 2018; Anderson et al. 2021). One reason for this difference may be that gut microbiomes can be influenced by several non-age-related factors, including social group membership, seasonal changes in resource use, and fluctuations in microbial communities in the environment”

      In addition, lines 405-411 now reads, “Third, the relationships between potential socio-environmental drivers of biological aging and the resulting biological age predictions were inconsistent. For instance, some sources of early life adversity were linked to old-for-age gut microbiomes (e.g., males born into large social groups), while others were linked to young-for-age microbiomes (e.g., males who experienced maternal social isolation or early life drought), or were unrelated to gut microbiome age (e.g., males who experienced maternal loss; any source of early life adversity in females).”

      Strengths:

      The dataset this study is based on is impressive, and can reveal various insights into biological ageing and beyond. The analysis implemented is extensive and high-level.

      Weaknesses:

      The key weakness is the use of microbiota age instead of e.g., DNA-methylation-based epigenetic age as a proxy of biological ageing, for reasons stated in the summary. DNA methylation levels can be measured from faecal samples, and as such epigenetic clocks too can be non-invasive. I will provide authors a list of minor edits to improve the read, to provide more details on Methods, and to make sure study limitations are discussed comprehensively.

      Thank you for this point. In response, we have deleted the text from the discussion that stated that non-invasive sampling is an advantage of microbiome clocks. In addition, we now propose a non-invasive epigenetic clock from fecal samples as an important future direction for our population (see line 450).

      Recommendations for the authors:

      Reviewer #1 (Recommendations for the authors):

      Abstract - The opening 2 sentences are not especially original or reflective of the potential value/ premise of the study. Members of this team have themselves measured variation in biological age in many different ways, and the implication that measuring a microbiome clock is easy or straightforward is not compelling. This paper is very interesting and provides unique insight, but I think overall there is a missed opportunity in the abstract to emphasize this, given the innovative science presented here. Furthermore, the last 2 sentences of the abstract are especially interesting - but missing a final statement on the broader significance of research outside of baboons.

      We appreciate these comments and have revised the Abstract accordingly. The introductory sentences now read, “Mammalian gut microbiomes are highly dynamic communities that shape and are shaped by host aging, including age-related changes to host immunity, metabolism, and behavior. As such, gut microbial composition may provide valuable information on host biological age.” (lines 31-34). The last two sentences of the abstract now read, “Hence, in our host population, gut microbiome age largely reflects current, as opposed to past, social and environmental conditions, and does not predict the pace of host development or host mortality risk. We add to a growing understanding of how age is reflected in different host phenotypes and what forces modify biological age in primates.” (lines 40-43).

      If possible, it would be highly useful to present some comments on concordance in patterns at different levels. Are all ASVs assessed at both the family and genus levels? Do they follow similar patterns when assessed at different levels? What can we learn about the system by looking at different levels of taxonomic assignment?

      The section on relationships between host age and individual microbiome features is already lengthy, so we have not added an analysis of concordance between different taxonomic levels. However, we added a justification for why we tested for age signatures in different levels of taxa to line 171, which reads, “We tested these different taxonomic levels in order to learn whether the degree to which coarse and fine-grained designations categories were associated with host age.”

      To calculate the delta age - please clarify if this was done at the level of years, as suggested in Figure 3C, or at the level of months or portion months, etc?

      Delta age is measured in years. This is now clarified in lines 294, 295, and 578.

      Spelling mistake in table S12, cell B4 (Octovber)

      Thank you. This typo has been corrected.

      Given the start intro with vertebrates, the second paragraph needs some tweaking to be appropriate. Perhaps, "At least among mammals, one valuable marker of biological aging may lie in the composition and dynamics of the mammalian gut microbiome (7-10)." Or simply remove "mammalian".

      We have updated this sentence based on your suggestions in line 54. It reads, “In mammals, one valuable marker of biological aging may lie in the composition and dynamics of the gut microbiome (Claesson et al. 2012; Heintz and Mair 2014; O’Toole and Jeffery 2015; Sadoughi et al. 2022).”

      A rewrite at the end of the introduction is needed to avoid the almost direct repetition in lines 115-118 and 129-131 (including lit cited). One potentially effective way to approach this is to keep the predictions in the earlier paragraph and then more clearly center the approach and the overarching results statement in the latter paragraph. (I.e., "we find that season and social rank have stronger effects on microbiome age than early life events. Further, microbiome age does not predict host development or mortality.").

      Thank you for pointing this out. We have re-organized the predictions in the introduction based on your suggestion. The alternative “recency effects” model now appears in the paragraph that starts in line 110. The final paragraph then centers on the overall approach and the results statement (lines 128-140)

      Be clear in each case where taxon-level trends are discussed if it's at Family, Genus, or other level. It's there most, but not all, of the time.

      We have gone through the text and clarified what taxa or microbiome feature was the subject of our analyses in any places where this was not clear.

      In the legend for Figure 2, add clarification for how values to right versus left of the centered value should be interpreted with respect to age (e.g. "values to x of the center are more abundant in older individuals").

      We now clarify in Figure 2C and 2D that “Positive values are more abundant in older hosts”.

      Figure 3 - Are Panels A, B, and C all needed - can the value for all individuals not also be overlaid in the panel showing sex differences and the same point showing individuals with "old" and "young" microbiomes be added in the same plot if it was slightly larger?

      We agree and have simplified Figure 3. We reduced the number of panels from three to two, and we added the information about how to calculate delta age to Panel A. We also moved the equation from the top of Panel C to the bottom right of Panel A.

      Reviewer #2 (Recommendations for the authors):

      Dasari et al present an interesting study investigating the use of 'microbiota age' as an alternative to other measures of 'biological age'. The study provides several curious insights which in principle warrant publication. However, I do think the manuscript should be carefully revised. Below I list some minor revisions that should be implemented. Importantly, the authors should discuss in the Discussion the pros and cons of using 'microbiota age' as a proxy of 'biological age'. Further, the authors should provide more information on Methods, to make sure the study can be replicated.

      Thank you for these important points. Based on your comments and those of the first reviewer, we have expanded our discussion of the limitations of using microbiota age as a proxy for biological age (see edits to the paragraph starting in line 395).

      We have also expanded our methods around sample collection, DNA extraction, and sequencing to describe our sampling methods, strategies to mitigate and address possible contamination, and batch effects. See lines 483-490 and our citations to the original papers where these methods are described in detail.

      (1) Lines 85-99: I think this paragraph could be revisited to make the assumptions clearer. For instance, the last sentence is currently a little confusing: are authors expecting males to exhibit old-for-age microbiomes already during the juvenile period?

      This prediction has been clarified. Line 96 now reads, “Hence, we predicted that adult male baboons would exhibit gut microbiomes that are old-for-age, compared to adult females (by contrast, we expected no sex effects on microbiome age in juvenile baboons).”

      (2) Lines 118-121: Could the authors discuss this assumption in relation to what has been observed e.g., in humans in terms of delays in gut microbiome development? Delayed/accelerated gut microbiome development has been studied before, so this assumption would be stronger if related to what we know from previous studies.

      This comment refers to the sentence which originally stated, “However, we also expected that some sources of early life adversity might be linked to young-for-age gut microbiota. For instance, maternal social isolation might delay gut microbiome development due to less frequent microbial exposures from conspecifics.” We have slightly expanded the text here (line 117) to explain our logic. We now include citations for our predictions. We did not include a detailed discussion of prior literature on microbiome development in the interest of keeping the same level of detail across all sections on our predictions.

      (3) As the authors discuss, various adversities can lead to old-for-age but also young-for-age microbiome composition. This should be discussed in the limitations.

      We agree. This is now discussed in the sentence starting at line 371, which reads, “…deviations from microbiome age predictions are explained by socio-environmental conditions experienced by individual hosts, especially recent conditions, although the effect sizes are small and are not always directionally consistent.” In addition, the text starting at line 405 now reads, “Third, the relationships between potential socio-environmental drivers of biological aging and the resulting biological age predictions were inconsistent. For instance, some sources of early life adversity were linked to old-for-age gut microbiomes (e.g., males born into large social groups), while others were linked to young-for-age microbiomes (e.g., males who experienced maternal social isolation or early life drought), or were unrelated to gut microbiome age (e.g., males who experienced maternal loss; any source of early life adversity in females).”

      (4) In various places, e.g., lines 129-131, it is a little unclear at what chronological age authors are expecting microbiota to appear young/old-for-age.

      This sentence was removed while responding to the comments from the first reviewer.

      (5) Lines 132-133: this statement could be backed by stating that this is because the gut microbiota can change rapidly e.g., when diet changes (or whatever the authors think could be behind this).

      We have added an expository sentence at line 123, including new citations. This sentence reads, “Indeed, gut microbiomes are highly dynamic and can change rapidly in response to host diet or other aspects of host physiology, behavior, or environments”.

      We now cite:

      · Hicks, A.L., et al. (2018). Gut microbiomes of wild great apes fluctuate seasonally in response to diet. Nature Communications 9, 1786.

      · Kolodny, O., et al. (2019). Coordinated change at the colony level in fruit bat fur microbiomes through time. Nature Ecology & Evolution 3, 116-124.

      · Risely, A., et al. (2021) Diurnal oscillations in gut bacterial load and composition eclipse seasonal and lifetime dynamics in wild meerkats. Nat Commun 12, 6017.

      (6) Lines 135-137: current or past season and social rank? This paragraph introduces the idea that it could be past rather than current socio-environmental factors that might predict microbiota age, so the authors should clarify this sentence.

      We have clarified the information in this sentence. line 135 now reads, “In general, our results support the idea that a baboon’s current socio-environmental conditions, especially their current social rank and the season of sampling, have stronger effects on microbiome age than early life events—many of which occurred many years prior to sampling.”

      (7) Lines 136-137: this sentence could include some kind of a conclusion of this finding. What might this mean?

      We have added a sentence at line 138, which speculates that, “…the dynamism of the gut microbiome may often overwhelm and erase early life effects on gut microbiome age.”

      (8) Use 'microbiota' or 'microbiome' across the manuscript; currently, the terms are used interchangeably. I don't have a strong opinion on this, although typically 'microbiota' is used when data comes from 16S rRNA.

      We have updated the text to replace any instance of “microbiota” with “microbiome”. We use the term microbiome in the sense of this definition from the National Human Genome Research Institute, which defines a microbiome as “the community of microorganisms (such as fungi, bacteria and viruses) that exists in a particular environment”.

      (9) Figure 1 legend: make sure to unify formatting; e.g., present sample sizes as N= or n=, rather than both, and either include or do not include commas in 4-digit values (sample sizes).

      We have checked the formatting related to sample sizes and the use of commas in 4-digits in the main text and supplement. The formats are now consistent.

      (10) Line 166: relative abundances surely?

      Following Gloor et al. (2017), our analyses use centered log-ratio (CLR) transformations of read counts, which is the recommended approach for compositional data such as 16S rRNA amplicon read counts. CLR transformations are scale-invariant, so the same ratio is obtained in a sample with few read versus many reads. We now cite Gloor et al. (2017) at line 169 and in the methods in line 517, which reads “centered log ratio (CLR) transformed abundances (i.e., read counts) of each microbial phyla (n=30), family (n=290), genus (n=747), and amplicon sequence variance (ASV) detected in >25% of samples (n=358). CLR transformations are a recommended approach for addressing the compositional nature of 16S rRNA amplicon read count data (Gloor et al. 2017).”  

      (11) Lines 167-172: were technical factors, e.g., read depth or sequencing batch, included as random effects?

      Thank you for catching this oversight in the text. We did model sequencing depth and batch effects. The sentence starting at line 173 now reads, “For each of these 1,440 features, we tested its association with host age by running linear mixed effects models that included linear and quadratic effects of host age and four other fixed effects: sequencing depth, the season of sample collection (wet or dry), the average maximum temperature for the month prior to sample collection, and the total rainfall in the month prior to sample collection (Grieneisen et al. 2021; Björk et al. 2022; Tung et al. 2015). Baboon identity, social group membership, hydrological year of sampling, and sequencing plate (as a batch effect) were modeled as random effects.”

      (12) Lines 175-180: When discussing how these alpha diversity results relate to previous findings, the authors should be clear about whether they talk about weighted or non-weighted measures of alpha diversity. - also maybe this should be included in the discussion rather than the results? Please consider this when revisiting the manuscript (see how it reads after edits).

      Richness is the only unweighted metric, which we now clarify in line 181. We opted to retain the interpretation in the text in its original location to maintain the emphasis in the discussion on the microbiome clock results.

      (13) Table S1 is very hard to interpret in the provided PDF format as columns are not presented side-by-side. It is currently hard to check model output for e.g., specific families. This needs to be revisited.

      We agree. We believe that eLife’s submission portal automatically generates a PDF for any supplementary item. However, we also include the supplementary tables as an Excel workbook which has the columns presented side-by-side.

      (14) Line 184: taxa meaning what? Unclear what authors refer to with this sentence, taxa across taxonomic levels, or ASVs, or what does the 51.6% refer to?

      We have edited line 191 to clarify that this sentence refers to taxa at all taxonomic levels (phyla to ASVs).

      (15) Line 191: a punctuation mark missing after ref (81).

      We have added the missing period at the end of this sentence.

      (16) Lines 189-197: this should go into the discussion in my opinion.

      We have opted to retain this interpretation, now at line 183.

      (17) Lines 215-219: Not sure what this means; do the authors mean features were not restricted to age-associated taxa, ie also e.g., diversity and other taxa-independent patterns were included? If so, the rest of the highlighted lines should be revisited to make this clear, currently to me it is very unclear what 'These could include features that are not strongly age-correlated in isolation' means. Currently, that sounds like some features included were only age-associated in combination with other features, but unclear how this relates to taxa-dependency/taxa-independency.

      We agree this was not clear. We have revised line 224 to read, “We included all 9,575 microbiome features in our age predictions, as opposed to just those that were statistically significantly associated with age because removing these non-significant features could exclude features that contribute to age prediction via interactions with other taxa.”

      (18) Line 403-407: There is now a paper showing epigenetic clocks can be built with faecal samples, so this argument is not valid. Please revisit in light of this publication: https://onlinelibrary.wiley.com/doi/epdf/10.1111/mec.17330

      Thank you for bringing this paper to our attention. We deleted the text that describes epigenetic clocks as invasive, and we now cite this paper in line 450, which reads, “We also hope to measure epigenetic age in fecal samples, leveraging methods developed in Hanski et al. 2024.”

      (19) Line 427: a punctuation mark/semicolon missing before However.

      We have corrected this typo.

      (20) Lines 419-428: I don't quite understand this speculation. Why would the priority of access to food lead to an old-looking gut microbiome? This paragraph needs stronger arguments, currently unclear and also not super convincing.

      We agree this was confusing. We have revised this text to clarify the explanation. The text starting at line 424 now reads, “This outcome points towards a shared driver of high social status in shaping gut microbiome age in both males and females. While it is difficult to identify a plausible shared driver, one benefit shared by both high-ranking males and females is priority of access to food. This access may result in fewer foraging disruptions and a higher quality, more stable diet. At the same time, prior research in Amboseli suggests that as animals age, their diets become more canalized and less variable (Grieneisen et al. 2021). Hence aging and priority of access to food might both be associated with dietary stability and old-for-age microbiomes. However, this explanation is speculative and more work is needed to understand the relationship between rank and microbiome age.”

      (21) Line 434: remove 'be'.

      We have corrected this typo.

      (22) Line 478: add information on how samples were collected; e.g., were samples collected from the ground? How was cross-contamination with soil microbiota minimised? Were samples taken from the inner part of depositions? These factors can influence microbiota samples quite drastically so detailed info is needed. Also what does homogenisation mean in this context? How soon were samples freeze-dried after sample collection?

      We have expanded our methods with respect to sample collection. This text starts in line 483 and reads, “Samples were collected from the ground within 15 minutes of defecation. For each sample, approximately 20 g of feces was collected into a paper cup, homogenized by stirring with a wooden tongue depressor, and a 5 g aliquot of the homogenized sample was transferred to a tube containing 95% ethanol. While a small amount of soil was typically present on the outside of the fecal sample, mammalian feces contains 1000 times the number of microbial cells in a typical soil sample (Sender, Fuchs, and Milo 2016; Raynaud and Nunan 2014), which overwhelms the signal of soil bacteria in our analyses (Grieneisen et al. 2021). Samples were transported from the field in Amboseli to a lab in Nairobi, freeze-dried, and then sifted to remove plant matter prior to long term storage at -80°C.”

      (23) Line 480 onwards: were negative controls included in extraction batches? Were samples randomised into extraction batches?

      Yes, we included extraction blanks. These are now described in lines 495-500. This text reads, “We included one extraction blank per batch, which had significantly lower DNA concentrations than sample wells (t-test; t=-50, p < 2.2x10-16; Grieneisen et al. 2021). We also included technical replicates, which were the same fecal sample sequenced across multiple extraction and library preparation batches. Technical replicates from different batches clustered with each other rather than with their batch, indicating that true biological differences between samples are larger than batch effects.”

      (24) Were extraction, library prep, and sequencing negative controls included? Is data available?

      We included extraction blanks (described above) and technical replicates, which were the same sample sequenced across multiple extraction and library preparation batches. Technical replicates from different batches clustered with each other rather than with their batch, indicating that true biological differences between samples are larger than batch effects.

      We have updated the data availability statement to read, “All data for these analyses are available on Dryad at https://doi.org/10.5061/dryad.b2rbnzspv. The 16S rRNA gene sequencing data are deposited on EBI-ENA (project ERP119849) and Qiita (study 12949). Code is available at the following GitHub repository: https://github.com/maunadasari/Dasari_etal-GutMicrobiomeAge”.

      (25) Line 562: how were corrected microbiome delta ages calculated? Currently, the authors state x, y and z factors were corrected for, but it is unclear how this was done.

      The paragraph starting at line 577 describes how microbiome delta age was calculated. We have made only a few changes to this text because we were not sure which aspects of these methods confused the reviewer. However, briefly, we calculated sample-specific microbiome Dage in years as the difference between a sample’s microbial age estimate, age<sub>m</sub> from the microbiome clock, and the host’s chronological age in years at the time of sample collection, age<sub>c</sub>. Higher microbiome Dages indicate old-for-age microbiomes, as age<sub>m</sub> > age<sub>c</sub>, and lower values (which are often negative) indicate a young-for-age microbiome, where age<sub>c</sub> > age<sub>m</sub> (see Figure 3).

      (26) Line 579: typo 'as'.

      We have corrected this typo.

      Works Cited

      Altmann, Jeanne, Laurence Gesquiere, Jordi Galbany, Patrick O Onyango, and Susan C Alberts. 2010. “Life History Context of Reproductive Aging in a Wild Primate Model.” Annals of the New York Academy of Sciences 1204:127–38. https://doi.org/10.1111/j.1749-6632.2010.05531.x.

      Anderson, Jordan A, Rachel A Johnston, Amanda J Lea, Fernando A Campos, Tawni N Voyles, Mercy Y Akinyi, Susan C Alberts, Elizabeth A Archie, and Jenny Tung. 2021. “High Social Status Males Experience Accelerated Epigenetic Aging in Wild Baboons.” Edited by George H Perry. eLife 10 (April):e66128. https://doi.org/10.7554/eLife.66128.

      Binder, Alexandra M., Camila Corvalan, Verónica Mericq, Ana Pereira, José Luis Santos, Steve Horvath, John Shepherd, and Karin B. Michels. 2018. “Faster Ticking Rate of the Epigenetic Clock Is Associated with Faster Pubertal Development in Girls.” Epigenetics 13 (1): 85–94. https://doi.org/10.1080/15592294.2017.1414127.

      Björk, Johannes R., Mauna R. Dasari, Kim Roche, Laura Grieneisen, Trevor J. Gould, Jean-Christophe Grenier, Vania Yotova, et al. 2022. “Synchrony and Idiosyncrasy in the Gut Microbiome of Wild Baboons.” Nature Ecology & Evolution, June, 1–10. https://doi.org/10.1038/s41559-022-01773-4.

      Chen, Brian H., Riccardo E. Marioni, Elena Colicino, Marjolein J. Peters, Cavin K. Ward-Caviness, Pei-Chien Tsai, Nicholas S. Roetker, et al. 2016. “DNA Methylation-Based Measures of Biological Age: Meta-Analysis Predicting Time to Death.” Aging (Albany NY) 8 (9): 1844–59. https://doi.org/10.18632/aging.101020.

      Claesson, Marcus J., Ian B. Jeffery, Susana Conde, Susan E. Power, Eibhlís M. O’Connor, Siobhán Cusack, Hugh M. B. Harris, et al. 2012. “Gut Microbiota Composition Correlates with Diet and Health in the Elderly.” Nature 488 (7410): 178–84. https://doi.org/10.1038/nature11319.

      Galbany, Jordi, Jeanne Altmann, Alejandro Pérez-Pérez, and Susan C. Alberts. 2011. “Age and Individual Foraging Behavior Predict Tooth Wear in Amboseli Baboons.” American Journal of Physical Anthropology 144 (1): 51–59. https://doi.org/10.1002/ajpa.21368.

      Gloor, Gregory B., Jean M. Macklaim, Vera Pawlowsky-Glahn, and Juan J. Egozcue. 2017. “Microbiome Datasets Are Compositional: And This Is Not Optional.” Frontiers in Microbiology 8. https://doi.org/10.3389/fmicb.2017.02224.

      Grieneisen, Laura E., Mauna Dasari, Trevor J. Gould, Johannes R. Björk, Jean-Christophe Grenier, Vania Yotova, David Jansen, et al. 2021. “Gut Microbiome Heritability Is Nearly Universal but Environmentally Contingent.” Science 373 (6551): 181–86. https://doi.org/10.1126/science.aba5483.

      Hanski, Eveliina, Susan Joseph, Aura Raulo, Klara M. Wanelik, Áine O’Toole, Sarah C. L. Knowles, and Tom J. Little. 2024. “Epigenetic Age Estimation of Wild Mice Using Faecal Samples.” Molecular Ecology 33 (8): e17330. https://doi.org/10.1111/mec.17330.

      Heintz, Caroline, and William Mair. 2014. “You Are What You Host: Microbiome Modulation of the Aging Process.” Cell 156 (3): 408–11. http://dx.doi.org/10.1016/j.cell.2014.01.025.

      Horvath, Steve. 2013. “DNA Methylation Age of Human Tissues and Cell Types.” Genome Biology 14 (10): R115. https://doi.org/10.1186/gb-2013-14-10-r115.

      Jayashankar, Lakshmi, Kathleen M. Brasky, John A. Ward, and Roberta Attanasio. 2003. “Lymphocyte Modulation in a Baboon Model of Immunosenescence.” Clinical and Vaccine Immunology 10 (5): 870–75. https://doi.org/10.1128/CDLI.10.5.870-875.2003.

      Marioni, Riccardo E., Sonia Shah, Allan F. McRae, Brian H. Chen, Elena Colicino, Sarah E. Harris, Jude Gibson, et al. 2015. “DNA Methylation Age of Blood Predicts All-Cause Mortality in Later Life.” Genome Biology 16 (1): 25. https://doi.org/10.1186/s13059-015-0584-6.

      O’Toole, Paul W., and Ian B. Jeffery. 2015. “Gut Microbiota and Aging.” Science 350 (6265): 1214–15. https://doi.org/10.1126/science.aac8469.

      Raynaud, Xavier, and Naoise Nunan. 2014. “Spatial Ecology of Bacteria at the Microscale in Soil.” PLOS ONE 9 (1): e87217. https://doi.org/10.1371/journal.pone.0087217.

      Sadoughi, Baptiste, Dominik Schneider, Rolf Daniel, Oliver Schülke, and Julia Ostner. 2022. “Aging Gut Microbiota of Wild Macaques Are Equally Diverse, Less Stable, but Progressively Personalized.” Microbiome 10 (1): 95. https://doi.org/10.1186/s40168-022-01283-2.

      Sender, Ron, Shai Fuchs, and Ron Milo. 2016. “Revised Estimates for the Number of Human and Bacteria Cells in the Body.” PLoS Biology 14 (8): e1002533. https://doi.org/10.1371/journal.pbio.1002533.

      Tung, J, L B Barreiro, M B Burns, J C Grenier, J Lynch, L E Grieneisen, J Altmann, S C Alberts, R Blekhman, and E A Archie. 2015. “Social Networks Predict Gut Microbiome Composition in Wild Baboons.” Elife 4. https://doi.org/10.7554/eLife.05224.

      Weibel, Chelsea J., Mauna R. Dasari, David A. Jansen, Laurence R. Gesquiere, Raphael S. Mututua, J. Kinyua Warutere, Long’ida I. Siodi, Susan C. Alberts, Jenny Tung, and Elizabeth A. Archie. 2024. “Using Non-Invasive Behavioral and Physiological Data to Measure Biological Age in Wild Baboons.” GeroScience 46 (5): 4059–74. https://doi.org/10.1007/s11357-024-01157-5.

    1. eLife Assessment

      This research is valuable as it investigates metabolic shuttling between photoreceptors and retinal pigment epithelium (RPE) using in vivo infusion techniques and mouse models. The authors find that the retina significantly relies on circulating glucose, with photoreceptors being the primary consumers of glucose, which is convincing. However, the study has incomplete evidence to support the claims that photoreceptors can use lactate as a fuel source, that lactate exported from photoreceptors is utilized by the RPE, and that lactate contributes to the TCA cycle in the RPE. These claims need substantial revision to include potential alternative explanations or perform key experiments.

    2. Reviewer #1 (Public review):

      Summary:

      In this manuscript, the authors sought to build upon their prior work, which suggested the presence of an outer retinal metabolic microenvironment using ex vivo and in vitro systems, by using in vivo methods and a multitude of genetic models. The authors convincingly demonstrate that the retina prefers circulating glucose to some other circulating fuel sources and that photoreceptors are the main consumers of glucose in the retina. However, the claims regarding the ability of photoreceptors to utilize lactate as a fuel source, that lactate exported specifically from photoreceptors is taken up by RPE and further utilized to support the TCA cycle in the RPE are incomplete or inadequate and would benefit from further experimentation to convince the reader of such biological processes. Considering alternative explanations and performing key experiments to confirm or refute these claims would substantially improve the impact of this study.

      Strengths:

      The major strengths of this study are its in vivo infusion methodologies and utilization of mouse models that are devoid of photoreceptors or are photoreceptor-specific conditional knockouts to provide convincing evidence that the retina utilizes circulating glucose to a significant degree and photoreceptors are the main consumers of glucose in the retina. These in vivo studies are complemented by ex vivo experiments in retinal explants.

      Weaknesses:

      While the in vivo infusion methodologies are a clear strength, not utilizing these techniques or other in vivo methodologies with the genetic models that lack photoreceptors or photoreceptor-specific proteins and not providing in vivo metabolomics data from these infusions in the RPE is a major weakness. Also, some circulating fuel sources may not get into the retina in appreciable amounts, impacting some of the authors' claims. Another major weakness is that for many of the claims noted by the authors, alternative explanations have not been considered nor have the proper experiments been conducted to fully support or refute these claims. For example, the authors claim it is photoreceptors that utilize lactate upon knockout of Glut1. However, other cells in the retina, such as Muller glia, may be the ones actually catabolizing lactate based on prior studies and enzyme expression patterns and their kinetics to support photoreceptors via the production of other metabolites from lactate. This alternative has not been considered nor have experiments been conducted to refute this possibility. Additionally, the authors claim lactate exported from photoreceptors is being taken up by RPE. The models used to support this claim lack photoreceptors, or their ability to take up glucose. None of the models specifically address lactate export from photoreceptors. Finally, the authors claim lactate exported from photoreceptors can be oxidized to TCA cycle intermediates in the RPE in vivo. No experiments specifically addressed the downstream path of lactate exported by photoreceptors in RPE TCA cycle metabolism in vivo, so this conclusion is also not well supported. Hence, the claims need to be significantly amended with an acknowledgment of potential alternatives or with some key experiments performed.

    3. Reviewer #2 (Public review):

      Hass et al. use in vivo and ex vivo mouse models to explore and validate the use of glucose and lactate by the outer retina. While the authors' conclusions are not totally novel, their work uses powerful in vivo models to validate, strengthen, and support their conclusions. This data is an important step forward in the field's understanding of retinal metabolism.

      They performed in vivo metabolite tracing with 5 different fuel sources and found that glucose was the primary fuel for TCA in the retina. While performing these experiments they measured the circulating levels of the tracer metabolites to ensure steady-state labeling which aids in the interpretation of the results. Showing the levels of the labeled tracer in the retina would be a nice addition to establishing if the tracer is getting into the target tissue.

      To support their conclusions that the photoreceptors are the primary consumers of glucose in the retina, the authors used multiple mouse models either with photoreceptor degeneration or a retina lacking the primary glucose transporter. While the photoreceptor degeneration mouse model has some caveats that make interpreting the data challenging, the glucose transporter KO models are a powerful tool to show the changes in metabolite levels between the retina and RPE in a retina. These retinas are not degenerated and have more subtle metabolic rearrangements. Therefore decreases in glucose consumption and lactate export can confidently be attributed to the changes in the photoreceptor metabolism. This model also allowed the authors to show that when glucose uptake is limited the photoreceptors can use lactate.

      The authors show in vivo data to support that the RPE uses lactate from the photoreceptors as a fuel source. They do very short-term tracing in vivo to show that the RPE has reduced lactate levels and TCA labeling in a mouse model lacking photoreceptors. There is no deficiency when the RPE is measured ex vivo. These data clearly show that the adjacent photoreceptor activity is impacting RPE metabolism.

      The manuscript is well-written, and thorough and does a very good job detailing and explaining methods and concepts that are not straightforward. The authors address (and do not bury) confusing data that does not necessarily support their conclusions (for example glycolytic intermediates in Figure 3C being elevated. The authors even perform additional experiments to clarify artifacts they observed in the tracing of the degeneration model due to short-term ischemia.

    4. Reviewer #3 (Public review):

      This work addresses the metabolic interplay between photoreceptors and the adjacent supporting layer of the vertebrate retina, the retinal pigment epithelium (RPE). Prior work from the Hurley lab and others provided evidence, mainly in acutely dissected mouse retina and in cell culture, for the idea that although glucose enters the retina via the RPE, the photoreceptors use most of this glucose via glycolysis, producing lactate that is used by other cells such as Müller cells and RPE cells. In the current study, they build on this by showing that these same principles hold true in vivo, using organism-level stable isotope tracing, as well as in intact retina preparations. They also use several mutant mice that lack photoreceptors, or that lack glucose transporters in either rods or the whole retina, to examine the contribution of photoreceptors to retinal glucose uptake. While many of the concepts were introduced in earlier work, it is an important expansion of this work to show these same mechanisms function in vivo. The authors also use other labeled fuels, lactate, and palmitate, to characterize their use in the presence or absence of glucose transport.

      The paper presents a nice combination of in vivo experiments (with a steady infusion of labeled metabolites into the circulation of a living mouse) with ex vivo experiments that allow the monitoring of lactate production and temporal control of labeling.

      Overall, the work provides convincing evidence that in the eye of a living mouse, photoreceptors are the main consumers of glucose in the retina, and the main producers of lactate. It seems less clear that the incorporation of labeled glucose into TCA metabolites in the RPE is dependent on the photoreceptor processing of glucose to lactate. Figure 5D is cited as the evidence that "much less m+3 lactate reaches the RPE-choroid in AIPL-/- mice than in controls," and indeed there is much less labeled lactate; but the downstream labeling of citrate is not substantially affected. It is also hard to discern whether these in vivo experiments provide evidence that photoreceptor-derived lactate suppresses glucose oxidation in RPE cells (as shown in vitro in Kanow et al., 2017).

    5. Author response:

      We thank the reviewers for their thoughtful reading and review of our manuscript. These reviews make clear that, for this work to be complete, we must make progress on the following fronts:

      (1) Expand the discussion to better incorporate alternate explanations of our data

      (2) Improve data visualization and experimental support or an experimental refutation for the following concepts

      a. Photoreceptor-derived lactate exported specifically from photoreceptors is utilized in the RPE TCA cycle

      b. Photoreceptors can utilize lactate as a fuel source when starved of glucose

      To address these concerns, we will focus our efforts on infusing <sup>13</sup>C<sub>6</sub>-glucose into rodΔglut1 mice. Lactate is not made without glucose, so this experiment should indicate whether glucose utilization in photoreceptors provides lactate to the RPE, and whether that lactate is used in the TCA cycle.

      The reviewers also noted that changes in <sup>13</sup>C labeling of RPE TCA cycle intermediates downstream of lactate is not obvious (between C57BL6J mice and AIPL1<sup>-/-</sup>). We think that at least in part, this is a consequence of the way we presented the data. We will improve how we display our data so that the differences of incorporation of <sup>13</sup>C in TCA cycle intermediates in control and AIPL1<sup>-/-</sup> RPE is clearer.

    1. eLife Assessment

      This study presents an important contribution to the understanding of neural speech tracking, demonstrating how minimal background noise can enhance the neural tracking of the amplitude-onset envelope. The evidence, through a well-designed series of EEG experiments, is convincing. This work will be of interest to auditory scientists, particularly those investigating biological markers of speech processing.

    2. Reviewer #1 (Public review):

      This paper presents a comprehensive study of how neural tracking of speech is affected by background noise. Using five EEG experiments and Temporal response function (TRF), it investigates how minimal background noise can enhance speech tracking even when speech intelligibility remains very high. The results suggest that this enhancement is not attention-driven but could be explained by stochastic resonance. These findings generalize across different background noise types, listening conditions, and speech features (envelope onset and envelope), offering insights into speech processing in real-world environments.

      I find this paper well-written, the experiments and results are clearly described.

      Comments on revisions:

      I thank the author for thoughtful revisions and for adequately addressing my comments. The new version is much clearer and improved. I have no further questions.

    3. Reviewer #2 (Public review):

      The author investigates the role of background noise on EEG-assessed speech tracking in a series of five experiments. In the first experiment the influence of different degrees of background noise is investigated and enhanced speech tracking for minimal noise levels is found. The following four experiments explore different potential influences on this effect, such as attentional allocation, different noise types and presentation mode.

      The step-wise exploration of potential contributors to the effect of enhanced speech tracking for minimal background noise is compelling. The motivation and reasoning for the different studies is clear and logical and therefore easy to follow. The results are discussed in a concise and clear way. While I specifically like the conciseness, one inevitable consequence is that not all results are equally discussed in depth.

      Based on the results of the five experiments, the authors conclude that the enhancement of speech tracking for minimal background noise is likely due to stochastic resonance. Given broad conceptualizations of stochasitc resonance as noise benefit this is a reasonable conclusion.

      This study will likely impact the field as it provides compelling support questioning the relationship between speech tracking and speech processing.

      Comments on revisions:

      All my previous comments were addressed nicely. Some of the comments were mere curiosity questions that were nicely entertained, even though they were not of direct relevance to the manuscript. I like the addition of the amplitude envelope analysis to the supplementary material as it offers direct comparison of those different methods. My only tiny tiny critic is (which bears no significance), that due to the many rearrangement changes in the marked changes document, the changes of content get buried and hard to see.

    1. eLife Assessment

      Saijilafu et al. describe valuable findings suggesting that MLCK and MLCP bidirectionally regulate NMII phosphorylation ultimately impinging on axonal growth during regeneration in the central and peripheral nervous systems. Solid evidence is collected from culture and in vivo models, and through pharmacologic and genetic loss-of-function approaches. However, how MLCK and MLCP regulates NMII activity is not fully addressed or discussed. In sum, this knowledge is of potential interest for the field due to the relevance of identifying mechanistic details that regulate axonal regeneration

    2. Joint Public Review:

      This paper examines the role of MLCK (myosin light chain kinase) and MLCP (myosin light chain phosphatase) in axon regeneration. Using loss-of-function approaches based on small molecule inhibitors and siRNA knockdown, the authors explore axon regeneration in cell culture and in animal models from central and peripheral nervous systems. Their evidence shows that MLCK activity facilitates axon extension/regeneration, while MLCP prevents it. Additionally, they show that when the MLCK/MLCP pathway is experimentally intervened, F-actin is redistributed in the growth cone.

      Strengths:

      This manuscript presents a wide range of experimental models to address its hypothesis and biological question. Notably, the use of multiple in vivo models significantly enhances the overall validity of the study.

      What follows is a discussion of the merits and limitations of different claims of the manuscript in light of the evidence presented.

      (1) The authors combine MLCK inhibitors with Bleb (Figure 6), trying to verify if both pairs of inhibitors act on the same target/pathway. MLCK may regulate axon growth independent of NMII activity. However, this has very important implications for the understanding not only on how NMII works and affects axon extension but also in trying to understand what MLCP is doing. One wonders if MLCP actions, which are opposite of MLCK, also independent of NMII activity? The authors try to address this controversial issue in the discussion section. The reviewers consider that it is still an open question, and acknowledge that it would require a significant amount of experimental work to solve the issue, that goes well beyond the main goal of the present study.

      (2) Using western blot and immunohistochemical analyses, authors first show that MLCK expression is increased in DRG sensory neurons following peripheral axotomy, concomitant to an increase in MLC phosphorylation, suggesting a causal effect (Figure 1). The authors claim that it is common that axon growth-promoting genes are upregulated. It would have been interesting at<br /> this point to study in this scenario the regulation of MLCP.

      (3) Using DRG cultures and sciatic nerve crush in the context of MLCK inhibition (ML-7) and down-regulation, authors conclude that MLCK activity is required for mammalian peripheral axon regeneration both in vitro and in vivo (Figure 2). In parallel, the authors show that these treatments affect, as expected, the phosphorylation levels of MLC.

      (4) The authors then examined the role of the phosphatase MLCP in axon growth during regeneration. The authors first use a known MLCP blocker, phorbol 12,13-dibutyrate (PDBu), to show that is able to increase the levels of p-MLC, with a concomitant increase in the extent of axon regrowth of DRG neurons, both in permissive as well as non-permissive substrates. The authors repeat the experiments using the knockdown of MYPT1, a key component of the MLC-phosphatase, and again can observe a growth-promoting effect (Figure 3).

      (5) In the next set of experiments (presented in Figure 4) authors extend the previous observations in primary cultures from the CNS. For that, they use cortical and hippocampal cultures, and pharmacological and genetic loss-of-function using the above-mentioned strategies. The expected results were obtained in both CNS neurons: inhibition or knockdown of the kinase decreases axon growth, whereas inhibition or knockdown of the phosphatase increases growth. A main weakness in this set is that drugs were used from the beginning of the experiment, and hence, they would also affect axon specification. As pointed out in Materials and Method (lines 143-145) authors counted as "axons" neurites longer than twice the diameter of the cell soma, and hence would not affect the variable measured. In any case, to be sure one is only affecting axon extension in these cells, the drugs should have been used after axon specification and maturation, which occurs at least after 3 DIV. Taking this into account, the conclusions with this experimental design are limited.

    3. Author response:

      The following is the authors’ response to the previous reviews.

      Reviewer #1 (Public review):

      This paper examines the role of MLCK (myosin light chain kinase) and MLCP (myosin light chain phosphatase) in axon regeneration. Using loss-of-function approaches based on small molecule inhibitors and siRNA knockdown, the authors explore axon regeneration in cell culture and in animal models from central and peripheral nervous systems. Their evidence shows that MLCK activity facilitates axon extension/regeneration, while MLCP prevents it.

      Major concerns:

      (1) In the title, authors indicate that the observed effects from loss-of-function of MLCK/MLCP take place via F-actin redistribution in the growth cone. However, there are no experiments showing a causal effect between changes in axon growth mediated by MLCK/MLCP and F-actin redistribution.

      Thank you for your comments. We revised the title of our manuscript to “MLCK/MLCP regulates mammalian axon regeneration and redistributes the growth cone F-actin”. (line 3)

      (2) The author combines MLCK inhibitors with Bleb (Figure 6), trying to verify if both pairs of inhibitors act on the same target/pathway. MLCK may regulate axon growth independent of NMII activity. However, this has very important implications for the understanding not only on how NMII works and affects axon extension, but also in trying to understand what MLCP is doing. One wonders if MLCP actions, which are opposite of MLCK, also independent of NMII activity? The authors, in the discussion section, try to find an explanation for this finding, but I consider it fails since the whole rationale of the manuscript is still around how MLCK and MLCP affect NMII phosphorylation.

      Thank you for your comments. Although both MLCK and MLCP regulate the activity of NMII, it has been reported that they also govern domain-specific spatial control of actin-based motility in the growth cone. Specifically, MLCK activity is essential for arc translocation and retrograde flow within the P domain, while MLCP appears to specifically modulate arc movement and associated myosin II contractility in the T zone and C domain (Ref). Therefore, it is proposed that the regulatory mechanisms of MLCK and MLCP are highly complex during the process of axon growth. 

      [Ref]:Xiao-Feng Zhang, Andrew W Schaefer, Dylan T Burnette, Vincent T Schoonderwoert, Paul Forscher. Rho-dependent contractile responses in the neuronal growth cone are independent of classical peripheral retrograde actin flow. Neuron. 2003 Dec 4;40(5):931-44.

      What follows is a discussion of the merits and limitations of different claims of the manuscript in light of the evidence presented.

      (1) Using western blot and immunohistochemical analyses, authors first show that MLCK expression is increased in DRG sensory neurons following peripheral axotomy, concomitant to an increase in MLC phosphorylation, suggesting a causal effect (Figure 1). The authors claim that it is common that axon growth-promoting genes are upregulated. It would have been interesting at this point to study in this scenario the regulation of MLCP.

      We thank Reviewer for the positive comment on our manuscript.

      (2) Using DRG cultures and sciatic nerve crush in the context of MLCK inhibition (ML-7) and down-regulation, authors conclude that MLCK activity is required for mammalian peripheral axon regeneration both in vitro and in vivo (Figure 2). In parallel, the authors show that these treatments affect as expected the phosphorylation levels of MLC.

      The in vitro evidence is of standard methods and convincing. However, here, as well as in all other experiments using siRNAs, no Control siRNAs were used. Authors do show that the target protein is downregulated, and they can follow transfected cells with GFP. Still, it should be noted that the standard control for these experiments has not been done.

      Thank you for your comments. We utilized scrambled siRNA as a control. I sincerely apologize for the oversight in the manuscript; although we mentioned that scrambled siRNA was used as a control in the figure legends, we failed to clearly articulate this important information in the methods section. We have revised the manuscript accordingly. (line 87, line 549, line, line 562, line 568).

      (3) The authors then examined the role of the phosphatase MLCP in axon growth during regeneration. The authors first use a known MLCP blocker, phorbol 12,13-dibutyrate (PDBu), to show that is able to increase the levels of p-MLC, with a concomitant increase in the extent of axon regrowth of DRG neurons, both in permissive as well as non-permissive substrates. The authors repeat the experiments using the knockdown of MYPT1, a key component of the MLC-phosphatase, and again can observe a growth-promoting effect (Figure 3).

      The authors further show evidence for the growth-enhancing effect in vivo, in nerve crush experiments. The evidence in vivo deserves more evidence and experimental details (see comment 2). A key weakness of the data was mentioned previously: no control siARN was used.

      Thank you for your comments. As mentioned above, we used scramble siRNA as control in vivo experiment as well.

      (4) In the next set of experiments (presented in Figure 4) authors extend the previous observations in primary cultures from the CNS. For that, they use cortical and hippocampal cultures, and pharmacological and genetic loss-of-function using the above-mentioned strategies. The expected results were obtained in both CNS neurons: inhibition or knockdown of the kinase decreases axon growth, whereas inhibition or knockdown of the phosphatase increases growth. A main weakness in this set is that drugs were used from the beginning of the experiment, and hence, they would also affect axon specification. As pointed in Materials and Method (lines 143-145) authors counted as "axons" neurites longer than twice the diameter of the cell soma, and hence would not affect the variable measured. In any case, to be sure one is only affecting axon extension in these cells, the drugs should have been used after axon specification and maturation, which occurs at least after 5 DIV.

      Thank you for your comments. We acknowledge that the early administration of drugs can lead to unintended effects on neuronal polarization and axon formation. However, in line with our previous publication, we focused exclusively on measuring the longest length of the axon. To quantify axon length, we selected neurons exhibiting an axonal process exceeding twice the diameter of their cell body and measured the longest axon from 100 neurons for each condition (Ref 1, Ref 2). Consequently, we believe that drug administration at the onset of cell culture influences axon formation; however, it does not significantly affect the drug's impact on axon length.

      [Ref 1]: Chang-Mei Liu, Rui-Ying Wang, Saijilafu, Zhong-Xian Jiao, Bo-Yin Zhang, Feng-Quan Zhou. MicroRNA-138 and SIRT1 form a mutual negative feedback loop to regulate mammalian axon regeneration. Genes Dev. 2013 Jul 1;27(13):1473-83.

      [Ref 2]: Eun-Mi Hur, Saijilafu, Byoung Dae Lee, Seong-Jin Kim, Wen-Lin Xu, Feng-Quan Zhou. GSK3 controls axon growth via CLASP-mediated regulation of growth cone microtubules. Genes Dev. 2011 Sep 15;25(18):1968-81.

      (5) In Figure 7, the authors a local cytoskeletal action of the drug, but the evidence provided does not differentiate between a localized action of the drugs and a localized cell activity.

      We appreciate the reviewer’s insightful comments and have revised our title to “MLCK/MLCP Regulates mammalian axon regeneration and redistributes growth cone F-actin.” Furthermore, we have made corresponding revisions to the manuscript (line31, line 73).

      References:

      (1) Eun-Mi Hur 1, In Hong Yang, Deok-Ho Kim, Justin Byun, Saijilafu, Wen-Lin Xu, Philip R Nicovich, Raymond Cheong, Andre Levchenko, Nitish Thakor, Feng-Quan Zhou. 2011. Engineering neuronal growth cones to promote axon regeneration over inhibitory molecules. Proc Natl Acad Sci U S A. 2011 Mar 22;108(12):5057-62. doi: 10.1073/pnas.1011258108.

      (2) Garrido-Casado M, Asensio-Juárez G, Talayero VC, Vicente-Manzanares M. 2024. Engines of change: Nonmuscle myosin II in mechanobiology. Curr Opin Cell Biol. 2024 Apr;87:102344. doi: 10.1016/j.ceb.2024.102344.

      (3) Karen A Newell-Litwa 1, Rick Horwitz 2, Marcelo L Lamers. 2015. Non-muscle myosin II in disease: mechanisms and therapeutic opportunities. Dis Model Mech. 2015 Dec;8(12):1495-515. doi: 10.1242/dmm.022103.

      Reviewer #2 (Public review):

      Summary:

      Saijilafu et al. demonstrate that MLCK/MLCP proteins promote axonal regeneration in both the central nervous system (CNS) and peripheral nervous system (PNS) using primary cultures of adult DRG neurons, hippocampal and cortical neurons, as well as in vivo experiments involving sciatic nerve injury, spinal cord injury, and optic nerve crush. The authors show that axon regrowth is possible across different contexts through genetic and pharmacological manipulation of these proteins. Additionally, they propose that MLCK/MLCP may regulate F-actin reorganization in the growth cone, which is significant as it suggests a novel strategy for promoting axonal regeneration.

      Strengths:

      This manuscript presents a wide range of experimental models to address its hypothesis and biological question. Notably, the use of multiple in vivo models significantly enhances the overall validity of the study.

      We thank Reviewer for the positive comment on our manuscript.

      Weaknesses:

      - The authors previously published that blocking myosin II activity stimulates axonal growth and that MLCK activates myosin II. The present work shows that inhibiting MLCK blocks axonal regeneration while blocking MLCP (the protein that dephosphorylates myosin II) produces the opposite effect. Although this contradiction is discussed, no new evidence has been added to the manuscript to clarify this mechanism or address the remaining questions. Critical unresolved questions include: what happens to myosin II expression when both MLCK and MLCP are inhibited? If MLCK/MLCP are acting through an independent mechanism, what would that mechanism be?

      - In the discussion, the authors mention the existence of two myosin II isoforms with opposing effects on axonal growth. Still, there is no evidence in the manuscript to support this point.

      - It is also unclear how MLCK/MLCP acts on the actin cytoskeleton. The authors suggest that proteins such as ADF/cofilin, Arp 2/3, Eps8, Profilin, Myosin II, and Myosin V could regulate changes in F-actin dynamics. However, this study provides no experimental evidence to determine which proteins may be involved in the mechanism.

      Thank you for your comments. Axon growth is an exceptionally intricate process, facilitated by the coordinated regulation of gene expression in the soma, axonal transport along the shaft, and the assembly of cytoskeletal elements and membrane proteins at the growth cone. In this paper, our results primarily demonstrate that MLCK/MLCP plays a crucial role in regulating mammalian axon regeneration and redistributing F-actin within the growth cone; however, we did not investigate which specific proteins act downstream of MLCK/MLCP during axon regeneration.

      Recommendations for the authors:

      Reviewer #1 (Recommendations for the authors):

      - A title more suitable for the evidence shown can be: MLCK/MLCP regulates mammalian axon regeneration and redistributes the growth cone F-actin.

      Thank you for your comments. We revised the title of our manuscript to“MLCK/MLCP regulates mammalian axon regeneration and redistributes the growth cone F-actin” (line 3).

      -In figure 3, It would be useful to indicate in the figure legend, that the red arrow is pointing to a suture that was performed during surgery to mark clearly the injury site.

      Thank you for your comments. We revised Figure 3 legend that indicates the red arrow is pointing to a suture that was performed during surgery to mark clearly the injury site (line 571-572).

      - The following is a concern raised in the previous round, and that the response by the authors was so complete and accurate that I consider it would be useful to include it in the discussion section.

      Thank you for your comments. We included those contents in the discussion section of our revised manuscript (line 348-354, line 355-359).

      The author combines MLCK inhibitors with Bleb (Figure 6), trying to verify if both pairs of inhibitors act on the same target/pathway. The rationale is wrong for at least two reasons.

      a- Because both lines of evidence point to contrasting actions of NMII on axon growth, one approach could never "rescue" the other.

      Reply by authors in R1:If MLCK regulates axon growth through the activation of Myosin, the inhibitory effect of ML-7 (an MLCK inhibitor) on axon growth might be influenced by Bleb, a NMII inhibitor. However, our findings reveal that the combination of Bleb and ML-7 does not alter the rate of axon outgrowth compared to ML-7 alone. This suggests that the roles of ML-7 and Bleb in axon growth are independent. It means MLCK may regulate axon growth independent of NMII activity.

      b- Because the approaches target different steps on NMII activation, one could never "prevent" or rescue the other. For example, for Bleb to provide a phenotype, it should find any p-MLC, because it is only that form of MLC that is capable of inhibiting its ATPase site. In light of this, it is not surprising that Bleb is unable to exert any action in a situation where there is no p-MLC (ML-7, which by inhibiting the kinase drives the levels of p-MLC to zero, Figure 4A). Hence, the results are not possible to validate in the current general interpretation of the authors. (See 'major concern').

      Reply by authors in R1: The reported mechanism of blebbistatin is not through competition with the ATP binding site of myosin. Instead, it selectively binds to the ATPase intermediate state associated with ADP and inorganic phosphate, which decelerates the phosphate release. Importantly, blebbistatin does not impede myosin's interaction with actin or the ATP-triggered disassociation of actomyosin. It rather inhibits the myosin head when it forms a product complex with a reduced affinity for actin. This indicates that blebbistatin functions by stabilizing a particular myosin intermediate state that is independent of the phosphorylation status of myosin light chain (MLC).

      [Ref] Kovács M, Tóth J et al. Mechanism of blebbistatin inhibition of myosin II. J Biol Chem. 2004 Aug 20;279(34):35557-63.

    1. eLife Assessment

      This useful study identifies new monoclonal antibodies produced by cystic fibrosis patients against the Pseudomonas aeruginosa type three secretion system. The evidence supporting the authors' claim is solid. However, in the current version of the manuscript, it is unclear what the benefits of the newly isolated antibodies are with respect to antibodies previously identified using a similar approach. The study will be of interest to those working on developing mAbs against Pseudomonas aeruginosa and also against other pathogens that harbor the T3SS.

    2. Reviewer #1 (Public review):

      Summary:

      Desveaux et al. describe human mAbs targeting protein from the Pseudomonas aeruginosa T3SS, discovered by employing single cell B cell sorting from cystic fibrosis patients. The mAbs were directed at the proteins PscF and PcrV. They particularly focused on two mAbs binding the T3SS with the potential of blocking activity. The supplemented biochemical analysis was crystal structures of P3D6 Fab complex. They also compared the blocking activity with mAbs that were described in previous studies, using an assay that evaluated the toxin injection. They conducted mechanistic structure analysis and found that these mAbs might act through different mechanisms by preventing PcrV oligomerization and disrupting PcrVs scaffolding function.

      Strengths:

      The antibiotic resistance crisis requires the development of new solutions to treat infections caused by MDR bacteria. The development of antibacterial mAbs holds great potential. In that context, this report is important as it paves the way for the development of additional mAbs targeting various pathogens that harbor the T3SS. In this report, the authors present a comparative study of their discovered mAbs vs. a commercial mAb currently in clinical testing resulting in valuable data with applicative implications. The authors investigated the mechanism of action of the mAbs using advanced methods and assays for the characterization of antibody and antigen interaction, underlining the effort to determine the discovered mAbs suitability for downstream application.

      Weaknesses:

      Although the information presented in this manuscript is important, previous reports regarding other T3SS structures complexed with antibodies, reduce the novelty of this report. Nevertheless, we provide several comments that may help to improve the report. The structural analysis of the presented mAbs is incomplete and unfortunately, the authors did not address any developability assessment. With such vital information missing, it is unclear if the proposed antibodies are suited for diagnostic or therapeutic usage. This vastly reduces the importance of the possibly great potential of the authors' findings. Moreover, the structural information does not include the interacting regions on the mAb which may impede the optimization of the mAb if it is required to improve its affinity.

    3. Reviewer #2 (Public review):

      Summary:

      Desveaux et al. performed Elisa and translocation assays to identify among 34 cystic fibrosis patients which ones produced antibodies against P. aeruginosa type three secretion system (T3SS). The authors were especially interested in antibodies against PcrV and PcsF, two key components of the T3SS. The authors leveraged their binding assays and flow cytometry to isolate individual B cells from the two most promising sera, and then obtained monoclonal antibodies for the proteins of interest. Among the tested monoclonal antibodies, P3D6 and P5B3 emerged as the best candidates due to their inhibitory effect on the ExoS-Bla translocation marker (with 24% and 94% inhibition, respectively). The authors then showed that P5B3 binds to the five most common variants of PcrV, while P3D6 seems to recognize only one variant. Furthermore, the authors showed that P3D6 inhibits translocon formation, measured as cell death of J774 macrophages. To get insights into the P3D6-PcrV interaction, the authors defined the crystal structure of the P3D6-PcrV complex. Finally, the authors compared their new antibodies with two previous ones (i.e., MEDI3902 and 30-B8).

      Strengths:

      (1) The article is well written.

      (2) The authors used complementary assays to evaluate the protective effect of candidate monoclonal antibodies.

      (3) The authors offered crystal structure with insights into the P3D6 antibody-T3SS interaction (e.g., interactions with monomer vs pentamers).

      (4) The authors put their results in context by comparing their antibodies with respect to previous ones.

      Weaknesses:

      (1) The authors used a similar workflow to the one previously reported in Simonis et al. 2023 (antibodies from cystic fibrosis patients that included B cell isolation, antibody-PcrV interaction modeling, etc.) but the authors do not clearly explain how their work and findings differentiate from previous work.

      (2) Although new antibodies against P. aerugisona T3SS expand the potential space of antibody-based therapies, it is unclear if P3D6 or P5B3 are better than previous antibodies. In fact, in the discussion section authors suggested that the 30-B8 antibody seems to be the most effective of the tested antibodies.

      (3) The authors should explain better which of the two antibodies they have discovered would be better suited for follow-up studies. It is confusing that the authors focused the last sections of the manuscript on P3D6 despite P3D6 having a much lower ExoS-Bla inhibition effect than P5B3 and the limitation in the PcrV variant that P3D6 seems to recognize. A better description of this comparison and the criteria to select among candidate antibodies would help readers identify the main messages of the paper.

      (4) This work could strongly benefit from two additional experiments:<br /> a) In vivo experiments: experiments in animal models could offer a more comprehensive picture of the potential of the identified monoclonal antibodies. Additionally, this could help to answer a naïve question: why do the patients that have the antibodies still have chronic P. aeruginosa infections?<br /> b) Multi-antibody T3SS assays (i.e., a combination of two or more monoclonal antibodies evaluated with the same assays used for characterization of single ones). This could explore the synergistic effects of combinatorial therapies that could address some of the limitations of individual antibodies.

    1. eLife Assessment

      In this valuable study, the authors show the physiological response and molecular pathway mediating the effect of quinofumelin, a developed fungicide with an unknown mechanism. The authors present convincing data suggesting the involvement of the uridine/uracil biosynthesis pathway, by combining in vivo microbiology characterization as well as in vitro biochemical binding results.

    2. Reviewer #1 (Public review):

      Summary:

      Phytophathogens including fungal pathogens such as F. graminearum remain a major threat to agriculture and food security. Several agriculturally relevant fungicides including the potent Quinofumelin have been discovered to date, yet the mechanisms of their action and specific targets within the cell remain unclear. This paper sets out to contribute to addressing these outstanding questions.

      Strengths:

      The paper is generally well-written and provides convincing data to support their claims for the impact of Quinofumelin on fungal growth, the target of the drug, and the potential mechanism. Critically the authors identify an important pyrimidine pathway dihydroorotate dehydrogenase (DHODH) gene FgDHODHII in the pathway or mechanism of the drug from the prominent plant pathogen F. graminearum, confirming it as the target for Quinofumelin. The evidence is supported by transcriptomic, metabolomic as well as MST, SPR, molecular docking/structural biology analyses.

      Weaknesses:

      Whilst the study adds to our knowledge about this drug, it is, however, worth stating that previous reports (although in different organisms) by Higashimura et al., 2022 https://pmc.ncbi.nlm.nih.gov/articles/PMC9716045/ had already identified DHODH as the target for Quinofumelin and hence this knowledge is not new and hence the authors may want to tone down the claim that they discovered this mechanism and also give sufficient credit to the previous authors work at the start of the write-up in the introduction section rather than in passing as they did with reference 25? other specific recommendations to improve the text are provided in the recommendations for authors section below.

    3. Reviewer #2 (Public review):

      Summary:

      In the current study, the authors aim to identify the mode of action/molecular mechanism of characterized a fungicide, quinofumelin, and its biological impact on transcriptomics and metabolomics in Fusarium graminearum and other Fusarium species. Two sets of data were generated between quinofumelin and no treatment group, and differentially abundant transcripts and metabolites were identified. The authors further focused on uridine/uracil biosynthesis pathway, considering the significant up- and down-regulation observed in final metabolites and some of the genes in the pathways. Using a deletion mutant of one of the genes and in vitro biochemical assays, the authors concluded that quinofumelin binds to the dihydroorotate dehydrogenase.

      Strengths:

      Omics datasets were leveraged to understand the physiological impact of quinofumelin, showing the intracellular impact of the fungicide. The characterization of FgDHODHII deletion strains with supplemented metabolites clearly showed the impact of the enzyme on fungal growth.

      Weaknesses:

      Some interpretation of results is not accurate and some experiments lack controls. The comparison between quinofumelin-treated deletion strains, in the presence of different metabolites didn't suggest the fungicide is FgDHODHII specific. A wild type is required in this experiment.

      Potential Impact: Confirming the target of quinofumelin may help understand its resistance mehchanism, and further development of other inhibitory molecules against the target.

      The manuscript would benefit more in explaining the study rationale if more background on previous characterization of this fungicide on Fusarium is given.

    4. Reviewer #3 (Public review):

      Summary:

      The manuscript shows the mechanism of action of quinofumelin, a novel fungicide, against the fungus Fusarium graminearum. Through omics analysis, phenotypic analysis, and in silico approaches, the role of quinofumelin in targeting DHODH is uncovered.

      Strengths:

      The phenotypic analysis and mutant generation are nice data and add to the role of metabolites in bypassing pyrimidine biosynthesis.

      Weaknesses:

      The role of DHODH in this class of fungicides has been known and this data does not add any further significance to the field. The work of Higashimura et al is not appreciated well enough as they already showed the role of quinofumelin upon DHODH II.

      There is no mention of the other fungicide within this class ipflufenoquin, as there is ample data on this molecule.

    1. eLife Assessment

      This valuable manuscript by Jia et al. investigates the role of cartilage intermediate layer protein (CILP) and moderate exercise in maintaining hyaline cartilage integrity following anterior cruciate ligament transection (ACLt) in rats. Solid data support the downregulation of CILP in human OA cartilage and its potential role in regulating Keap1/Nrf2 interaction and chondrocyte ferroptosis. However, the data supporting a role for CILP in exercise-mediated inhibition of hyaline cartilage fibrosis in early OA are incomplete.

    2. Reviewer #1 (Public review):

      Summary:

      In this study, the authors examined the function of CLIP in exercise-mediate inhibition of osteoarthritis using an ACL transection rat model. The authors rely on rigorous experimental design and methods to demonstrate that CLIP is downregulated in osteoarthritic cartilage tissue and that CLIP expression can be rescued by moderate treadmill exercise. They further show that activation of Nrf2 signaling occurs through CLIP inhibition of Keap1-Nrf2. The results are novel as they suggest a new role for CLIP in OA pathogenesis. The following points need to be addressed in order to bring additional clarity to this work.

      Strengths:

      This is an interesting study that addresses an important global health issue. The significance is high and the work is novel and mechanistic.

      Weaknesses:

      A major concern is that a direct link between exercise and CLIP-mediated inhibition of ferroptosis via Keap1-Nrf2 pathway is not supported by the provided data. The ferroptosis studies were performed in vitro, whereas the effect of exercise was demonstrated in an OA animal model. Therefore, the data suggest a potential correlation between CLIP-Keap1-Nrf2 and exercise. This must be described as a limitation in the discussion section. Consequently, the title of the manuscript needs to better reflect the interpretation of these data.

      Figure 1: Radiomics data are not described in the text. OARSI scoring of damaged and undamaged sections is not presented in the figure.

      Figure 2: Data presentation is very dense in this figure. It is recommended that Figure 2 be split into two figures. Also, the histology and IHC images in Figure 2A are of poor resolution. These data do not sufficiently demonstrate early OA pathology. Clearer images to substantiate the authors' statement need to be provided.

      Figure 3: The superficial zone appears to be misrepresented; it should include only the top 2-3 layers of flat chondrocyte cells.

      Figure 4: This Figure should be listed as supplementary data. CTS is not spelled out in the legend. Also, a rationale for using low, medium, and high CTS needs to be provided.

      Figure 5: Please describe positive and negative controls. Please elaborate on the findings of the yeast hybrid experiment in the results. Please expand KD-02 experimental condition in the legend and results.

      Figure 6: Please move Figure S2 into the main Figures and describe the results in section 2.9 which describes ferroptosis.

      In the results section, it is recommended that the authors describe all panels of the figures appropriately in sequential order. The authors are advised to provide publication-quality figures and, in some cases, to split figure panels into new figures as well as to ensure that the fonts and data are legible. Finally, the use of non-conventional abbreviations (such as G3 for passage-3 chondrocytes, CG for the control condition, and OE for overexpression) may confuse the readership, and describing each abbreviation when used for the first time is required.

    3. Reviewer #2 (Public review):

      Summary:

      Recent studies indicate a beneficial role for moderate-intensity exercise in early osteoarthritis (OA). This manuscript by Jia et al. investigates the role of cartilage intermediate layer protein (CILP) and moderate exercise in maintaining hyaline cartilage integrity following anterior cruciate ligament transection (ACLt) in rats. Single-cell RNA-sequencing of OA and OA+ exercise knee joints from rats at 4 weeks post-ACLt revealed the upregulation of CILP and a higher Col2/Col1 ratio in OA knee chondrocytes from ACLt rats that exercised on a treadmill. CILP was downregulated in the damaged portions, compared to healthy regions of knee cartilage of patients undergoing total knee arthroplasty. In the rat ACLt model, CILP is downregulated in the OA cartilage but not in OA + exercise cartilage. Using CLIP1 over-expression and knockdown in passage 3 cultures of primary rat chondrocytes, the authors demonstrate that the loss of CILP is associated with higher ROS, lipid peroxidation, and iron content in chondrocytes whereas its overexpression is protective against these changes. CILP binds to Keap1, and its overexpression disrupts Keap1/Nrf2 interaction and attenuates Nrf2 ubiquitination. The authors conclude that exercise protects the articular cartilage intermediate zone and the associated upregulation of CILP facilitates Keap1-Nrf2 interaction to prevent chondrocyte ferroptosis and hyaline cartilage fibrosis.

      Strengths:

      The study is interesting, and the experiments are conducted well. The methodology is well-described. The data presented strongly support the downregulation of CILP in human OA cartilage and its potential role in regulating Keap1/Nrf2 interaction and chondrocyte ferroptosis.

      Weaknesses:

      The data do not support a role for CILP in exercise-mediated inhibition of hyaline cartilage fibrosis in early OA. The reason for selecting CILP from the ScRNA-seq for further analysis is not clear. The manuscript is put together sloppily. The abstract, introduction, and results were written confusingly, and hard to follow. Some of the figures were confusing as well. Still, the study is interesting.

    1. eLife Assessment

      This study describes a useful technique to improve imaging depth using confocal microscopy for imaging large, cleared samples. The work is supported by solid findings and will be of broad interest to many microscopical researchers in different fields who want a cost effective way to image deep into samples.

    2. Reviewer #2 (Public review):

      Summary:

      Liu et al investigated the performance of a novel imaging technique called RIM-Deep to enhance the imaging depth for cleared samples. Usually, the imaging depth using the classical confocal microscopy sample chamber is limited due to optical aberrations, resulting in loss of resolution and image quality. To overcome this limitation and increase depth, they generated a special imaging chamber, that is affixed to the objective and filled with a solution matching the refractive indices to reduce aberrations. Importantly, the study was conducted using a standard confocal microscope, that has not been modified apart from exchanging the standard sample chamber with the RIM-Deep sample holder. Upon analysing the imaging depth, the authors claim that the RIM-Deep method increased the depth from 2 mm to 5 mm. In summary, RIM-Deep has the potential to significantly enhance imaging quality of thick samples on a low budget, making in-depth measurements possible for a wide range of researchers that have access to an inverted confocal microscope.

      Strengths:

      The authors used different clearing methods to demonstrate the suitability of RIM-Deep for various sample preparation protocols with clearing solutions of different refractive indices. They clearly demonstrate that the RIM-Deep chamber is compatible with all 3 methods. Brain samples are characterized by complex networks of cells and are often hard to visualize. Despite the dense, complex structure of brain tissue, the RIM-Deep method generated high-quality images of all 3 samples given. As the authors already stated, increasing imaging depth often goes hand in hand with purchasing expensive new equipment, exchanging several microscopy parts or purchasing a new microscopy set-up. Innovations, such as the RIM-Deep chamber, hence, might pave the way for cost-effective imaging and expand the applicability of an inverted confocal microscope.

      Weaknesses:

      (1) However, since this study introduces a novel imaging technique, and therefore, aims to revolutionize the way of imaging large samples, additional control experiments would strengthen the data. From the 3 clearing protocol used (CUBIC, MACS and iDISCO), only the brain section from Macaca fascicularis cleared with iDISCO was imaged with the standard chamber and the RIM-Deep method. This comparison indeed shows that the imaging depth thereby increases more than 2-fold, which is a significant enhancement in terms of microscopy. However, it would have been important to evaluate and show the difference of the imaging depth also on the other two samples, since they were cleared with different protocols and, thus, treated with clearing solutions of different refractive indices compared to iDCISCO.

      (2) The description of the figures and figure panels should be improved for a better understanding of the experiments performed and the thus resulting images/data.

      (3) While the authors used a Nikon AX inverted laser scanning confocal microscope, the study would highly benefit from evaluating the performance of the RIM-Deep method using other inverted confocal microscopes or even wide-field microscopes.

      Comments on Revision:

      Regarding point 1)<br /> Within the revised manuscript, Liu et al focussed on a more detailed comparison of the standard vs the RIM-Deep method of samples cleared with the 3 different methods.

      Regarding point 2)<br /> The revised description of the figures results in a better understanding of the data.

      Regarding point 3)<br /> The authors tested their method on different microscopic setups to show the compatibility.

      Summary: the revised manuscript addressed all previously mentioned points.

    3. Author response:

      The following is the authors’ response to the original reviews.

      Public Reviews:

      Reviewer #1 (Public review):

      Summary:

      Liu et al., present an immersion objective adapter design called RIM-Deep, which can be utilized for enhancing axial resolution and reducing spherical aberrations during inverted confocal microscopy of thick cleared tissue.

      Strengths:

      RI mismatches present a significant challenge to deep tissue imaging, and developing a robust immersion method is valuable in preventing losses in resolution. Liu et al., present data showing that RIM-Deep is suitable for tissue cleared with two different clearing techniques, demonstrating the adaptability and versatility of the approach.

      Greetings, we greatly appreciate your feedback. In truth, we have utilized three distinct clearing techniques, including iDISCO, CUBIC, and MACS, to substantiate the adaptability and multifunctionality of the RIM-Deep adapter.

      Weaknesses:

      Liu et al., claim to have developed a useful technique for deep tissue imaging, but in its current form, the paper does not provide sufficient evidence that their technique performs better than existing ones.

      We are in complete agreement with your recommendation, and the additional experiments will conduct a thorough comparison of the efficacy between the RIM-deep adapter and the official adapter in the context of fluorescence bead experiments, along with their performance in cubic and MASC tissue clearing techniques.

      Reviewer #1 (Recommendations for the authors):

      Suggestions for improvement:

      Major revisions:

      (1) For the bead experiment, the comparison was made to a 10X dry objective instead of an immersion objective, please make a comparison to the standard immersion objective.

      Thank you for your suggestion. We fully agree with your suggestion to make a comparison with the standard immersion objective. We plan to conduct this comparison in future experiments and will thoroughly analyze the imaging differences between the official adapter and the RIM-deep adapter.

      (2) It is unclear if an accurate comparison of objectives (same NA etc) is being made in Fig 1G-J, since the official adapter image appears to be of lower resolution even at the surface. At the very least, progressive 2D slices of the reconstruction must be shown for both adapters instead of just the RIM-Deep adapter.

      Thank you for your suggestion. We strictly controlled the numerical aperture (NA) of the objectives in Fig 1G-J to ensure the accuracy of the comparison. However, the imaging resolution of the official adapter is consistent with that of the RIM-deep adapter. We agree that showing progressive 2D slices of the reconstruction would provide a more comprehensive comparison of the two adapters.

      (3) Similarly, since there already exists an official adapter, it would be useful to see that RIM-Deep performs better even in the mouse tissue, since the clearing method was different.

      Thank you for your suggestion. We will investigate the imaging performance of the two additional tissue clearing protocols using both the official adapter and the RIM-deep adapter.

      (4) The movies need legends, as it is unclear if they even show 2-D slices very deep into the tissue.

      Thank you for your suggestion. We will add figure legends to each movie.

      (5) The purpose of Supplementary Figure 3 in its current form is unclear, as is the statement in the text related to it : "The effectiveness and utility of this adapter configuration have been substantiated through a comprehensive series of experimental validations".

      Thank you for your suggestion. We will revise the statement to: "We validated the effectiveness and utility of this adapter configuration through a series of experiments."

      (6) The system is variably referred to as RIM-Deep or DepthView Enhancer in the text and figures, it would be beneficial to the readers if the authors stuck to one name.

      Thank you for your suggestion. We will choose RIM-Deep as the sole name.

      Minor revisions

      Figures

      (1) “Confocal" is incorrectly spelled as "confocol" in Figure 1, "media" is misspelled in multiple places.

      Thank you. We will correct these errors.

      (2) The camera is misplaced in the Figure 1 A drawing

      Thank you. We will fix this issue.

      (3) It would be useful to have actual pictures of the immersion objective setup (both RIM-Deep and the pre-existing adapter) since the diagrams are not very clear.

      Thank you. We will include actual pictures of both the RIM-Deep and the pre-existing adapter in the supplementary materials.

    1. eLife Assessment

      This important study reports a novel function of ATG14 in preventing pyroptosis and inflammation in oviduct cells, thus allowing smooth transport of the early embryo to the uterus and implantation. The data supporting the main conclusion are convincing. This work will be of interest to reproductive biologists and physicians practicing reproductive medicine.

    2. Reviewer #1 (Public review):

      This study by Popli et al. evaluated the function of Atg14, an autophagy protein, in reproductive function using a conditional knockout mouse model. The authors showed that female mice lacking Atg14 were infertile partly due to defective embryo transport function of the oviduct and faulty uterine receptivity and decidualization using PgrCre/+;Atg14f/f mice. The findings from this work are exciting and novel. The authors demonstrated that a loss of Atg14 led to an excessive pyroptosis in the oviductal epithelial cells that compromises cellular integrity and structure, impeding the transport function of the oviduct. In addition, the authors use both genetic and pharmacological approaches to test the hypothesis. Therefore, the findings from this study are high-impact and likely reproducible.

      Comments on revisions: Thank you for your time revising the manuscript. The authors have addressed all of my previous concerns.

    3. Reviewer #2 (Public review):

      In this manuscript, Popli et al investigated the roles of autophagy related gene, Atg14, in the female reproductive tract (FRT) using conditional knockout mouse models. By ablation of Atg14 in both oviduct and uterus with PR-Cre (Atg14 cKO), authors discovered that such females are completely infertile. They went on to show that Atg14 cKO females have impaired embryo implantation as well as embryo transport from oviduct to uterus. Further analysis showed that Atg14 cKO leads to increased pyroptosis in oviduct, which disrupts oviduct epithelial integrity and leads to obstructive oviduct lumen and impaired embryo transport. Authors concluded that Atg14 is critical for maintaining the oviduct homeostasis and keeping the inflammation under check to enable proper embryo transport.

      Comments on revisions: Authors have addressed all my concerns in this revised version, which is substantial improved compared to the original version. I have no further comments.

    4. Reviewer #3 (Public review):

      The manuscript by Pooja Popli and co-authors tested importance of Atg14 in female reproductive tract by conditionally deleting Atg14 use PrCre and also Foxj1cre. The authors showed that loss of Atg14 leads to infertility due to retention of embryos within the oviduct. The authors further concluded that the retention of embryos within the oviduct is due to pyroptosis in oviduct cells leading to defective cellular integrity. This revised version of the manuscript has addressed the remaining concerns that were raised earlier. The manuscript is now a convincing one.

    5. Author response:

      The following is the authors’ response to the previous reviews.

      Public Reviews:

      Reviewer #1 (Public review):

      This study by Popli et al. evaluated the function of Atg14, an autophagy protein, in reproductive function using a conditional knockout mouse model. The authors showed that female mice lacking Atg14 were infertile partly due to defective embryo transport function of the oviduct and faulty uterine receptivity and decidualization using PgrCre/+;Atg14f/f mice. The findings from this work are exciting and novel. The authors demonstrated that a loss of Atg14 led to an excessive pyroptosis in the oviductal epithelial cells that compromises cellular integrity and structure, impeding the transport function of the oviduct. In addition, the authors use both genetic and pharmacological approaches to test the hypothesis. Therefore, the findings from this study are high-impact and likely reproducible. However, there are multiple major concerns that need to be addressed to improve the quality of the work.

      Thank you for the additional data that solidified the conclusion of this study. The authors addressed almost all of my previous concerns in this revised manuscript. However, some key points wording still need to be addressed.

      Comments on revisions:

      In Fig. 2A, please ensure that these are 5.0 dpc samples since implantation has already occurred at this point. However, the embryo appeared free-floating adjacent to the luminal epithelial cells (LE), even in control.

      We understand the reviewer’s concern. We have now replaced the previous H & E image with a clearer, higher-quality section that shows a fully attached embryo within a closed uterine lumen representing a typical implantation morphology at the D5 stage of pregnancy. (Revised Figure 2A)

      Fig. 3A-B: "Approximately 80-90% of blastocysts" contradicts the quantification in Figure 3C, which showed a percentage of blastocysts below 50%. Please clarify and correct as needed.

      In Fig. 3A-B, we mean to say approximately 80-90% embryos. We have now corrected the statement in the revised manuscript (Line no: 349-351).  

      The authors showed that Acetylated a-tubulin was present in the ampulla region of cKO (Fig. 4A). However, the revised manuscript still stated that (lines 397-399) ...there was a substantial loss of the ciliary epithelial cells (indicated by fewer a-tubulin and FOXJ1-positive cells) (Fig. 4B, left panel and Fig. S3)... So, the authors may want to tone down their conclusion regarding a "substantial loss" of ciliated epithelial cells if the quantification of ciliated cell number is not performed.

      We thank the reviewer for this suggestion. To avoid redundancy and ambiguity, we have revised the statement as below (Line no: 391-395):

      “As shown in Fig. 4A, normal ciliary structures were observed in the ampulla of both control and cKO oviducts. However, in the isthmus of cKO oviducts, we observed a reduction in both the FOXJ1- and PAX8-expressing cells (Fig. 4B, and Fig. S3).”

      Fig. 4C - the areas with red inset boxes labeled for isthmus are not really isthmus (in both control and cKO). The zoomed-in images (Fig. 4C - The far-right panel for both control and cKO, images are the transitional zone from the ampulla to the isthmus. The isthmus areas should have a thick muscle layer with almost no ciliated cells - see Fig. 4B cKO - those are true isthmus areas.

      We thank the reviewer for noting this. We have corrected the label accordingly. Since ciliary epithelial cells predominantly reside in the ampulla, we have included high-resolution images specifically for the ampulla regions.

      • Fig. 3A and 3C, it appears that the images were taken at different magnifications, but the scale bars are the same at 200 um. The authors, please double-check the scale bars.

      We thank the reviewer for noting this. We have double-checked all the figures to ensure the scale bars are correctly displayed and aligned with the resolution.

      • Fig. 6D - why polyphillin-treated samples did not sum to 100%? - please double-check.

      Since approximately 50% of the embryos were retained in the oviduct following polyphyllin treatment (Figure 6C, upper bar), the bar in Figure 6D represents this percentage (50% retained) rather than 100%.

      Reviewer #2 (Public review)

      In this manuscript, Popli et al investigated the roles of autophagy-related gene, Atg14, in the female reproductive tract (FRT) using conditional knockout mouse models. By ablation of Atg14 in both oviduct and uterus with PR-Cre (Atg14 cKO), authors discovered that such females are completely infertile. They went on to show that Atg14 cKO females have impaired embryo implantation as well as embryo transport from oviduct to uterus. Further analysis showed that Atg14 cKO leads to increased pyroptosis in oviduct, which disrupts oviduct epithelial integrity and leads to obstructive oviduct lumen and impaired embryo transport. The authors concluded that Atg14 is critical for maintaining the oviduct homeostasis and keeping the inflammation under check to enable proper embryo transport.

      The authors have barely addressed most of my concerns in this revised version with a few minor issues remaining to be addressed:

      (1) The authors tried to address my first concern regarding the statement that "autophagy is critical for maintaining the oviduct homeostasis". The revised statement in Lines 53-54 "we report that Atg14-dependent autophagy plays a crucial role in maintaining..." is still not correct. It should be corrected as " we report that autophagy-related protein Atg14 plays a crucial role in maintaining...".

      We thank the reviewer for this nice suggestion. We have now modified the statement as suggested (Line no: 54).

      (2) Line 349-351 described 80-90% of blastocysts retrieved from oviducts of cKO mice, which is in consistent with Figure 3B (showing more than 98%).

      We thank the reviewer for noting this. We have now corrected the statement as: “Unexpectedly, oviduct flushing from cKO mice resulted in the retrieval of approximately 90% of embryos, suggesting their potential entrapment within the oviducts, impeding their transit to the uterus”. (Line No: 349-351).

      (3) Line 447, "Fig. 5E" should be Fig. 6A. In addition, grammar error in the next sentence.

      We have corrected the figure number and addressed the grammatical error.

      (4) In Figure 6D, why the composition of blastocysts in chemical treated group do not add up to 100%.

      As explained in Reviewer 1 responses, the bar in Figure 6D represents the 50% retained embryos from Figure 6C upper bar the full count.

      Reviewer #3 (Public review):

      Summary:

      The manuscript by Pooja Popli and co-authors tested the importance of Atg14 in the female reproductive tract by conditionally deleting Atg14 use PrCre and also Foxj1cre. The authors showed that loss of Atg14 leads to infertility due to the retention of embryos within the oviduct. The authors further concluded that the retention of embryos within the oviduct is due to pyroptosis in oviduct cells leading to defective cellular integrity. The revised manuscript has included new experimental data (Figs. S2B, 5B, 5C, and S3) that satisfied the concerns of this reviewer. The manuscript should provide important advancement to the field.

      We sincerely thank the reviewer for the thoughtful evaluation of our manuscript and appreciate your constructive feedback.

    1. eLife Assessment

      This fundamental study provides a comprehensive analysis of the EmrE efflux pump and the role of the C-terminal domain in preventing uncoupled proton leak in the absence of substrate. The evidence supporting the conclusions is solid, although incomplete analyses limit some of the conclusions.

    2. Reviewer #1 (Public review):

      Summary:

      Work by Brosseau et. al. combines NMR, biochemical assays, and MD simulations to characterize the influence of the C-terminal tail of EmrE, a model multi-drug efflux pump, on proton leak. The authors compare the WT pump to a C-terminal tail deletion, delta_107, finding that the mutant has increased proton leak in proteoliposome assays, shifted pH dependence with a new titratable residue, faster-alternating access at high pH values, and reduced growth, consistent with proton leak of the PMF.

      Strengths:

      The work combines thorough experimental analysis of structural, dynamic, and electrochemical properties of the mutant relative to WT proteins. The computational work is well aligned in vision and analysis. Although all questions are not answered, the authors lay out a logical exploration of the possible explanations.

      Weaknesses:

      There are a few analyses that are missing and important data left out. For example, the relative rate of drug efflux of the mutant should be reported to justify the focus on proton leak. Additionally, the correlation between structural interactions should be directly analyzed and the mutant PMF also analyzed to justify the claims based on hydration alone. Some aspects of the increased dynamics at high pH due to a potential salt bridge are not clear.

    3. Reviewer #2 (Public review):

      Summary:

      This manuscript explores the role of the C-terminal tail of EmrE in controlling uncoupled proton flux. Leakage occurs in the wild-type transporter under certain conditions but is amplified in the C-terminal truncation mutant D107. The authors use an impressive combination of growth assays, transport assays, NMR on WT and mutants with and without key substrates, classical MD, and reactive MD to address this problem. Overall, I think that the claims are well supported by the data, but I am most concerned about the reproducibility of the MD data, initial structures used for simulations, and the stochasticity of the water wire formation. These can all be addressed in a revision with more simulations as I point out below. I want to point out that the discussion was very nicely written, and I enjoyed reading the summary of the data and the connection to other studies very much.

      Strengths:

      The Henzler-Wildman lab is at the forefront of using quantitative experiments to probe the peculiarities in transporter biophysics, and the MD work from the Voth lab complements the experiments quite well. The sheer number of different types of experimental and computational approaches performed here is impressive.

      Weaknesses:

      The primary weaknesses are related to the reproducibility of the MD results with regard to the formation of water wires in the WT and truncation mutant. This could be resolved with simulations starting from structures built using very different loops and C-terminal tails.

      The water wire gates identified in the MD should be tested experimentally with site-directed mutagenesis to determine if those residues do impact leak.

    4. Author response:

      We appreciate the reviewers thoughtful consideration of our manuscript, and their recognition of the variety of experimental and computational approaches we have brought to bear in probing the very challenging question of uncoupled proton leak through EmrE.

      We did record SSME measurements with MeTPP+, a small molecule substrate at two different protein:lipid ratios. These experiments report the rate of net flux when both proton-coupled substrate antiport and substrate-gated proton leak are possible. We will add this data to the revision, including data acquired with different lipid:protein ratio that confirms we are detecting transport rather than binding. In brief, this data shows that the net flux is highly dependent on both proton concentration (pH) and drug-substrate concentration, as predicted by our mechanistic model. This demonstrates that both types of transport contribute to net flux when small molecule substrates are present.

      In the absence of drug-substrate, proton leak is the only possible transport pathway. The pyranine assay directly assesses proton leak under these conditions and unambiguously shows faster proton entry into proteoliposomes through the ∆107-EmrE mutant than through WT EmrE, with the rate of proton entry into ∆107-EmrE proteoliposomes matching the rate of proton entry achieved by the protonophore CCCP. We have revised the text to more clearly emphasize how this directly measures proton leak independently of any other type of transport activity. The SSME experiments with a proton gradient only (no small molecule substrate present) provide additional data on shorter timescales that is consistent with the pyranine data. The consistency of the data across multiple LPRs and comparison of transport to proton leak in the SSME assays  further strengthens the importance of the C-terminal tail in determining the rate of flux.

      None of the current structural models have good resolution (crystallography, EM) or sufficient restraints (NMR) to define the loop and tail conformations sufficiently for comparison with this work. We are in the process of refining an experimental structure of EmrE with better resolution of the loop and tail regions implicated in proton-entry and leak. Direct assessment of structural interactions via mutagenesis is complicated because of the antiparallel homodimer structure of EmrE. Any point mutation necessarily affects both subunits of the dimer, and mutations designed to probe the hydrophobic gate on the more open face of the transporter also have the potential to disrupt closure on the opposite face, particularly in the absence of sufficient resolution in the available structures. Thus, mutagenesis to test specific predicted structural features is deferred until our structure is complete so that we can appropriately interpret the results.

      In our simulation setup, the MD results can be considered representative and meaningful for two reasons. First, the C-terminal tail, not present in the prior structure and thus modeled by us, is only 4 residues long. We will show in the revision and detailed response that the system will lose memory of its previous conformation very quickly, such that velocity initialization alone is enough for a diverse starting point. Second, our simulation is more like simulated annealing, starting from a high free energy state to show that, given such random initialization, the tail conformation we get in the end is consistent with what we reported. It is also difficult to sample back-and-forth tail motion within a realistic MD timescale. Therefore, it can be unconclusive to causally infer the allosteric motions with unbiased MD of the wildtype alone. The best viable way is to look at the equilibrium statistics of the most stable states between WT- and ∆107-EmrE and compare the differences.

    1. eLife Assessment

      This valuable descriptive manuscript builds on prior research showing that the elimination of Origin Recognition Complex (ORC) subunits does not halt DNA replication. The authors obtain solid data using various methods to genetically remove one or two ORC subunits from specific tissues and still observe replication. The replication appears to be primarily endoreduplication, indicating that ORC-independent replication may promote genome reduplication without mitosis. The mechanism behind this ORC-independent replication remains to be elucidated. The study and mutants described herein lay the groundwork for future research to explore how cells compensate for the absence of ORC and to develop functional approaches to investigate this process. The reviewers suggested the observations could be supported by additional experiments. This work will be of interest to those studying genome duplication and replication.

    2. Reviewer #1 (Public review):

      The origin recognition complex (ORC) is an essential loading factor for the replicative Mcm2-7 helicase complex. Despite ORC's critical role in DNA replication, there have been instances where the loss of specific ORC subunits has still seemingly supported DNA replication in cancer cells, endocycling hepatocytes, and Drosophila polyploid cells. Critically, all tested ORC subunits are essential for development and proliferation in normal cells. This presents a challenge, as conditional knockouts need to be generated, and a skeptic can always claim that there were limiting but sufficient ORC levels for helicase loading and replication in polyploid or transformed cells. That being said, the authors have consistently pushed the system to demonstrate replication in the absence or extreme depletion of ORC subunits.

      Here, the authors generate conditional ORC2 mutants to counter a potential argument with prior conditional ORC1 mutants that Cdc6 may substitute for ORC1 function based on homology. They also generate a double ORC1 and ORC2 mutant, which is still capable of DNA replication in polyploid hepatocytes. While this manuscript provides significantly more support for the ability of select cells to replicate in the absence or near absence of select ORC subunits, it does not shed light on a potential mechanism.

      The strengths of this manuscript are the mouse genetics and the generation of conditional alleles of ORC2 and the rigorous assessment of phenotypes resulting from limiting amounts of specific ORC subunits. It also builds on prior work with ORC1 to rule out Cdc6 complementing the loss of ORC1.

      The weakness is that it is a very hard task to resolve the fundamental question of how much ORC is enough for replication in cancer cells or hepatocytes. Clearly, there is a marked reduction in specific ORC subunits that is sufficient to impact replication during development and in fibroblasts, but the devil's advocate can always claim minimal levels of ORC remaining in these specialized cells.

      The significance of the work is that the authors keep improving their conditional alleles (and combining them), thus making it harder and harder (but not impossible) to invoke limiting but sufficient levels of ORC. This work lays the foundation for future functional screens to identify other factors that may modulate the response to the loss of ORC subunits.

      This work will be of interest to the DNA replication, polyploidy, and genome stability communities.

    3. Reviewer #2 (Public review):

      This manuscript proposes that primary hepatocytes can replicate their DNA without the six-subunit ORC. This follows previous studies that examined mice that did not express ORC1 in the liver. In this study, the authors suppressed expression of ORC2 or ORC1 plus ORC2 in the liver.

      Comments:

      (1) I find the conclusion of the authors somewhat hard to accept. Biochemically, ORC without the ORC1 or ORC2 subunits cannot load the MCM helicase on DNA. The question arises whether the deletion in the ORC1 and ORC2 genes by Cre is not very tight, allowing some cells to replicate their DNA and allow the liver to develop, or whether the replication of DNA proceeds via non-canonical mechanisms, such as break-induced replication. The increase in the number of polyploid cells in the mice expressing Cre supports the first mechanism, because it is consistent with few cells retaining the capacity to replicate their DNA, at least for some time during development.

      (2) Fig 1H shows that 5 days post infection, there is no visible expression of ORC2 in MEFs with the ORC2 flox allele. However, at 15 days post infection, some ORC2 is visible. The authors suggest that a small number of cells that retained expression of ORC2 were selected over the cells not expressing ORC2. Could a similar scenario also happen in vivo?

      (3) Figs 2E-G show decreased body weight, decreased liver weight and decreased liver to body weight in mice with recombination of the ORC2 flox allele. This means that DNA replication is compromised in the ALB-ORC2f/f mice.

      (4) Figs 2I-K do not report the number of hepatocytes, but the percent of hepatocytes with different nuclear sizes. I suspect that the number of hepatocytes is lower in the ALB-ORC2f/f mice than in the ORC2f/f mice. Can the authors report the actual numbers?

      (5) Figs 3B-G do not report the number of nuclei, but percentages, which are plotted separately for the ORC2-f/f and ALB-ORC2-f/f mice. Can the authors report the actual numbers?

      (6) Fig 5 shows the response of ORC2f/f and ALB-ORC2f/f mice after partial hepatectomy. The percent of EdU+ nuclei in the ORC2-f/f (aka ALB-CRE-/-) mice in Fig 5H seems low. Based on other publications in the field it should be about 20-30%. Why is it so low here? The very low nuclear density in the ALB-ORC2-f/f mice (Fig 5F) and the large nuclei (Fig 5I) could indicate that cells fire too few origins, proceed through S phase very slowly and fail to divide.

      (7) Fig 6F shows that ALB-ORC1f/f-ORC2f/f mice have very severe phenotypes in terms of body weight and liver weight (about on third of wild-type!!). Fig 6H and 6I, the actual numbers should be presented, not percentages. The fact that there are EYFP negative cells, implies that CRE was not expressed in all hepatocytes.

      (8) Comparing the EdU+ cells in Fig 7G versus 5G shows very different number of EdU+ cells in the control animals. This means that one of these images is not representative. The higher fraction of EdU+ cells in the double-knockout could mean that the hepatocytes in the double-knockout take longer to complete DNA replication than the control hepatocytes. The control hepatocytes may have already completed DNA replication, which can explain why the fraction of EdU+ cells is so low in the controls. The authors may need to study mice at earlier time points after partial hepatectomy, i.e. sacrifice the mice at 30-32 hours, instead of 40-52 hours.

      (9) Regarding the calculation of the number of cell divisions during development: the authors assume that all the hepatocytes in the adult liver are derived from hepatoblasts that express Alb. Is it possible to exclude the possibility that pre-hepatoblast cells that do not express Alb give rise to hepatocytes? For example, the cells that give rise to hepatoblasts may proliferate more times than normal giving rise to a higher number of hepatoblasts than in wild-type mice.

      (10) My interpretation of the data is that not all hepatocytes have the ORC1 and ORC2 genes deleted (eg EYFP-negative cells) and that these cells allow some proliferation in the livers of these mice.

      My comments regarding the previous version still stand, since the authors did not perform experiments to address them.

    4. Reviewer #3 (Public review):

      Summary:

      The authors address the role of ORC in DNA replication and that this protein complex is not essential for DNA replication in hepatocytes. They provide evidence that ORC subunit levels are substantially reduced in cells that have been induced to delete multiple exons of the corresponding ORC gene(s) in hepatocytes. They evaluate replication both in purified isolated hepatocytes and in mice after hepatectomy. In both cases, there is clear evidence that DNA replication does not decrease at a level that corresponds with the decrease in detectable ORC subunit and that endoreduplication is the primary type of replication observed. It remains possible that small amounts of residual ORC are responsible for the replication observed, although the authors provide arguments against this possibility. The mechanisms responsible for the DNA replication observed in the absence of ORC are not examined, including why such replication would primarily be due to endoreduplication.

      Strengths:

      The authors clearly show that there are dramatic reductions in the amount of the targeted ORC subunits in the cells that have been targeted for deletion. They also provide clear evidence that there is replication in a subset of these cells and that it is likely due to endoreduplication. Although there is no replication in MEFs derived from cells with the deletion, there is clearly DNA replication occurring in hepatocytes (both isolated in culture and in the context of the liver). Interestingly, the cells undergoing replication exhibit enlarged cell sizes and elevated ploidy indicating endoreduplication of the genome. These findings raise the interesting possibility that endoreduplication does not require ORC while normal replication does.

      Weaknesses:

      There remain two significant weaknesses in this manuscript. The first is that although there is clearly robust reduction of the targeted ORC subunit, the authors cannot confirm that it is deleted in all cells. For example, the analysis in Fig. 4B would suggest that a substantial number of cells have not lost the targeted region of ORC2. In their response, the authors suggest that this is due to contaminating non-hepatocyte cells but do not provide evidence that this is the case. Although the western blots show stronger effects, this type of analysis is notorious for non-linear response curves and no standards are not provided. The second weakness is that there is no evaluation of the molecular nature of the replication observed. In response to the initial review the authors point out that a previous publication mapped Mcm2-7 loading in the absence of ORC1, ORC2 and ORC5 and saw no deficit or altered location. Unfortunately, this is not done for the mutants discussed here and this previous data supports a model that limiting residual ORC is responsible for the replication observed rather than some novel mechanism (which would be expected to alter location or amounts of loading). The manuscript provides no exploration of why "ORC-independent" replication would drive endoreduplicaiton (which is the strongest evidence for an alternative mechanism of initiation but is unique to this experiment and not the previously mutants analyzed for Mcm2-7 loading). Most importantly, it remains true that after numerous papers from this lab and others claiming that ORC is not required for eukaryotic DNA replication, we still have no information about an alternative pathway that could explain Mcm2-7 loading in the absence of ORC. Without some insights in this area, studies such as these will remain controversial.

    5. Author response:

      The following is the authors’ response to the original reviews.

      eLife Assessment

      This descriptive manuscript builds on prior research showing that the elimination of Origin Recognition Complex (ORC) subunits does not halt DNA replication. The authors use various methods to genetically remove one or two ORC subunits from specific tissues and observe continued replication, though it may be incomplete. The replication appears to be primarily endoreduplication, indicating that ORC-independent replication may promote genome reduplication without mitosis. Despite similar findings in previous studies, the paper provides convincing genetic evidence in mice that liver cells can replicate and undergo endoreduplication even with severely depleted ORC levels. While the mechanism behind this ORC-independent replication remains unclear, the study lays the groundwork for future research to explore how cells compensate for the absence of ORC and to develop functional approaches to investigate this process. The reviewers agree that this valuable paper would be strengthened significantly if the authors could delve a bit deeper into the nature of replication initiation, potentially using an origin mapping experiment. Such an exciting contribution would help explain the nature of the proposed new type of Mcm loading, thereby increasing the impact of this study for the field at large.

      We appreciate the reviewers’ suggestion. Till now we know of only one paper that mapped origins of replication in regenerating mouse liver, and that was published two months back in Cell (PMID: 39293447).  We want to adopt this method, but we do not need it to answer the question asked.  We have mapped origins of replication in ORC-deleted cancer cell lines and compared to wild-type cells in Shibata et al., BioRXiv (PMID: 39554186) (it is under review).  We report the following:  Mapping of origins in cancer cell lines that are wild type or engineered to delete three of the subunits, ORC1, ORC2 or ORC5 shows that specific origins are still used and are mostly at the same sites in the genome as in wild type cells. Of the 30,197 origins in wild type cells (with ORC), only 2,466 (8%) are not used in any of the three ORC deleted cells and 18,319 (60%) are common between the four cell types. Despite the lack of ORC, excess MCM2-7 is still loaded at comparable rates in G1 phase to license reserve origins and is also repeatedly loaded in the same S phase to permit re-replication. 

      Citation: Specific origin selection and excess functional MCM2-7 loading in ORC-deficient cells. Yoshiyuki Shibata, Mihaela Peycheva, Etsuko Shibata, Daniel Malzl, Rushad Pavri, Anindya Dutta. bioRxiv 2024.10.30.621095; doi: https://doi.org/10.1101/2024.10.30.621095 (PMID: 39554186)

      We have now included this in the discussion.

      Public Reviews:

      Reviewer #1 (Public review):

      The origin recognition complex (ORC) is an essential loading factor for the replicative Mcm2-7 helicase complex. Despite ORC's critical role in DNA replication, there have been instances where the loss of specific ORC subunits has still seemingly supported DNA replication in cancer cells, endocycling hepatocytes, and Drosophila polyploid cells. Critically, all tested ORC subunits are essential for development and proliferation in normal cells. This presents a challenge, as conditional knockouts need to be generated, and a skeptic can always claim that there were limiting but sufficient ORC levels for helicase loading and replication in polyploid or transformed cells. That being said, the authors have consistently pushed the system to demonstrate replication in the absence or extreme depletion of ORC subunits.

      Here, the authors generate conditional ORC2 mutants to counter a potential argument with prior conditional ORC1 mutants that Cdc6 may substitute for ORC1 function based on homology. They also generate a double ORC1 and ORC2 mutant, which is still capable of DNA replication in polyploid hepatocytes. While this manuscript provides significantly more support for the ability of select cells to replicate in the absence or near absence of select ORC subunits, it does not shed light on a potential mechanism.

      The strengths of this manuscript are the mouse genetics and the generation of conditional alleles of ORC2 and the rigorous assessment of phenotypes resulting from limiting amounts of specific ORC subunits. It also builds on prior work with ORC1 to rule out Cdc6 complementing the loss of ORC1.

      The weakness is that it is a very hard task to resolve the fundamental question of how much ORC is enough for replication in cancer cells or hepatocytes. Clearly, there is a marked reduction in specific ORC subunits that is sufficient to impact replication during development and in fibroblasts, but the devil's advocate can always claim minimal levels of ORC remaining in these specialized cells.

      The significance of the work is that the authors keep improving their conditional alleles (and combining them), thus making it harder and harder (but not impossible) to invoke limiting but sufficient levels of ORC. This work lays the foundation for future functional screens to identify other factors that may modulate the response to the loss of ORC subunits.

      This work will be of interest to the DNA replication, polyploidy, and genome stability communities.

      Thank you.

      Reviewer #2 (Public review):

      This manuscript proposes that primary hepatocytes can replicate their DNA without the six-subunit ORC. This follows previous studies that examined mice that did not express ORC1 in the liver. In this study, the authors suppressed expression of ORC2 or ORC1 plus ORC2 in the liver.

      Comments:

      (1) I find the conclusion of the authors somewhat hard to accept. Biochemically, ORC without the ORC1 or ORC2 subunits cannot load the MCM helicase on DNA. The question arises whether the deletion in the ORC1 and ORC2 genes by Cre is not very tight, allowing some cells to replicate their DNA and allow the liver to develop, or whether the replication of DNA proceeds via non-canonical mechanisms, such as break-induced replication. The increase in the number of polyploid cells in the mice expressing Cre supports the first mechanism, because it is consistent with few cells retaining the capacity to replicate their DNA, at least for some time during development.

      In our study, we used EYFP as a marker for Cre recombinase activity. ~98% of the hepatocytes in tissue sections and cells in culture express EYFP, suggesting that the majority of hepatocytes successfully expressed the Cre protein to delete the ORC1 or ORC2 genes. To assess deletion efficiency, we employed sensitive genotyping and Western blotting techniques to confirm the deletion of ORC1 and ORC2 in hepatocytes isolated from Alb-Cre mice. Results in Fig. 2C and Fig. 6D demonstrate the near-complete absence of ORC2 and ORC1 proteins, respectively, in these hepatocytes.

      The mutant hepatocytes underwent at least 15–18 divisions during development. The inherited ORC1 or ORC2 protein present during the initial cell divisions, would be diluted to less than 1.5% of wild-type levels within six divisions, making it highly unlikely to support DNA replication, and yet we observe hepatocyte numbers that suggest there was robust cell division even after that point.

      Furthermore, the EdU incorporation data confirm DNA synthesis in the absence of ORC1 and ORC2. Specifically, immunofluorescence showed that both in vitro and in vivo, EYFP-positive hepatocytes (indicating successful ORC1 and ORC2 deletion) incorporated EdU, demonstrating that DNA synthesis can occur without ORC1 and ORC2.

      Finally, the Alb-ORC2f/f mice have 25-37.5% of the number of hepatocyte nuclei compared to WT mice (Table 2).  If that many cells had an undeleted ORC2 gene, that would have shown up in the genotyping PCR and in the Western blots.

      (2) Fig 1H shows that 5 days post infection, there is no visible expression of ORC2 in MEFs with the ORC2 flox allele. However, at 15 days post infection, some ORC2 is visible. The authors suggest that a small number of cells that retained expression of ORC2 were selected over the cells not expressing ORC2. Could a similar scenario also happen in vivo?

      This would not explain the significant incorporation of EdU in hepatocytes that are EYFP positive and do not have detectable ORC by Western blots.  Also note that for MEFs we are delivering the Cre by Adenovirus infection in vitro, so there is a finite probability that a cell will not receive the virus, the Cre and will not delete ORC2.  However, in vivo, the Alb-Cre will be expressed in every cell that turns on albumin.  There is no escaping the expression of Cre.

      (3) Figs 2E-G shows decreased body weight, decreased liver weight and decreased liver to body weight in mice with recombination of the ORC2 flox allele. This means that DNA replication is compromised in the ALB-ORC2f/f mice.

      It is possible that DNA replication is partially compromised or may slow down in the absence of ORC2. However, it is important to emphasize that livers with ORC2 deletion remain capable of DNA replication, so much so that liver function and life span are near normal. Therefore, some kind of DNA replication has to serve as a compensatory mechanism in the absence of ORC2 to maintain liver function and support regeneration.

      (4) Figs 2I-K do not report the number of hepatocytes, but the percent of hepatocytes with different nuclear sizes. I suspect that the number of hepatocytes is lower in the ALB-ORC2f/f mice than in the ORC2f/f mice. Can the authors report the actual numbers?

      We show in Table 2 that the Alb-Orc2f/f mice have about 25-37.5% of the hepatocytes compared to the WT mice.

      (5) Figs 3B-G do not report the number of nuclei, but percentages, which are plotted separately for the ORC2-f/f and ALB-ORC2-f/f mice. Can the authors report the actual numbers?

      In all the FACS experiments in Fig. 3B-G we collect data for a total of 10,000 nuclei (or cells).  For Fig. 3E-G we divide the 10,000 nuclei into the bottom 40% on the EYFP axis (EYFP low, which is mostly EYFP negative) as the control group, and EYFP high (top 20% on the EYFP axis) test group.  We have described this in the Methods in the revision and labeled EYFP negative and positive as EYFP low and high in the Figures and Figure legends.

      (6) Fig 5 shows the response of ORC2f/f and ALB-ORC2f/f mice after partial hepatectomy. The percent of EdU+ nuclei in the ORC2-f/f (aka ALB-CRE-/-) mice in Fig 5H seems low. Based on other publications in the field it should be about 20-30%. Why is it so low here? The very low nuclear density in the ALB-ORC2-f/f mice (Fig 5F) and the large nuclei (Fig 5I) could indicate that cells fire too few origins, proceed through S phase very slowly and fail to divide.

      The percentage of EdU+ nuclei in the ORC2f/f without Alb-Cre mice is 8%, while in PMID 10623657 ~10% of wild type nuclei incorporate  EdU at 42 hr post partial hepatectomy (mid-point between the 36-48 hr post hepatectomy that was used in our study).  The important result here is that in the ORC2f/f mice with Alb-Cre (+/-) we are seeing significant EdU incorporation. We have also corrected the X-axis labels in 5F, 5I, 7E and 7F to reflect that those measurements were not made at 36 hr post-resection but later (as was indicated in the schematic in Fig. 5A).

      (7) Fig 6F shows that ALB-ORC1f/f-ORC2f/f mice have very severe phenotypes in terms of body weight and liver weight (about on third of wild-type!!). Fig 6H and 6I, the actual numbers should be presented, not percentages. The fact that there are EYFP negative cells, implies that CRE was not expressed in all hepatocytes.

      The liver weight is very dependent on the body weight, and so we have to look at the liver to body weight ratio to determine if it is inordinately small, and the ratio is 70% of the WT.  In females the liver and body weight are low (although in proportion to each other), which maybe is what the reviewer is talking about.  However, the fact that liver weight and body weight are not as low in males, suggest that this is a gender (hormone?) specific effect and not a DNA replication defect.  We had discussed this possibility.  We have another paper also in BioRXiv (Su et al. doi.org/10.1101/2024.12.18.629220) that suggests that ORC subunits have significant effect on gene expression, so it is possible that that is what leads to this sexual dimorphism in phenotype.  We have now added this to the discussion.

      The bottom 40% of nuclei on the EYFP axis in the FACS profiles (what was labeled EYFP negative but will now be called EYFP low) contains mostly non-hepatocytes that are genuinely EYFP negative.   Non-hepatocytes (bile duct cells, endothelial cells, Kupffer cells, blood cells) are a significant part of cells in the dissociated liver (as can be seen in the single cell sequencing results in PMID: 32690901).  Their presence does not mean that hepatocytes are not expressing Cre.  Hepatocytes are nearly 100% EYFP positive, as can be seen in the tissue sections (where the hepatocytes take up most of visual field) and in cells in culture.  Also if there are EYFP negative hepatocyte nuclei in the FACS, that still does not rule out EYFP presence in the cytoplasm.  The important point from the FACS is that the EYFP high nuclei (which have expressed Cre for the longest period) are polyploid relative to the EYFP low nuclei.

      (8) Comparing the EdU+ cells in Fig 7G versus 5G shows very different number of EdU+ cells in the control animals. This means that one of these images is not representative. The higher fraction of EdU+ cells in the double-knockout could mean that the hepatocytes in the double-knockout take longer to complete DNA replication than the control hepatocytes. The control hepatocytes may have already completed DNA replication, which can explain why the fraction of EdU+ cells is so low in the controls. The authors may need to study mice at earlier time points after partial hepatectomy, i.e. sacrifice the mice at 30-32 hours, instead of 40-52 hours.

      The apparent difference that the reviewer comments on stems from differences in nuclear density in the images in Fig. 7G and 5G (also quantitated in Fig. 7F and 5F).  The quantitation in Fig. 7H and 5H show that the % of EdU plus cells are comparable (5-8%). 

      (9) Regarding the calculation of the number of cell divisions during development: the authors assume that all the hepatocytes in the adult liver are derived from hepatoblasts that express Alb. Is it possible to exclude the possibility that pre-hepatoblast cells that do not express Alb give rise to hepatocytes? For example the cells that give rise to hepatoblasts may proliferate more times than normal giving rise to a higher number of hepatoblasts than in wild-type mice.

      Single cell sequencing of mouse liver at e11 shows hepatoblasts expressing hepatocyte specific markers (PMID: 32690901).  All the cells annotated from the single-cell seq analysis are differentiated cells arguing against the possibility that undifferentiated endodermal cells (what the reviewer probably means by pre-hepatoblasts) exist at e11.  We have added this citation to the paper.

      Here is a review that says the hepatoblasts expressing Albumin are present before e13.  (https://www.ncbi.nlm.nih.gov/books/NBK27068/) says: “The differentiation of bi-potential hepatoblasts into hepatocytes or BECs begins around e13 of mouse development. Initially hepatoblasts express genes associated with both adult hepatocytes (Hnf4α, Albumin) ...”  Thus, we can be certain that hepatoblasts before e13 express albumin.  Our calculation of number of cell divisions in Table 2 begins from e12.

      The reviewer may be suggesting that ORC deletion leads to the immediate demise of hepatoblasts (despite having inherited ORC protein from the endodermal cells) causing undifferentiated endodermal cells to persist and proliferate much longer than in normal development.  We consider it unlikely, but if true it will be very unexpected, both by suggesting that deletion of ORC immediately leads to the death of the hepatoblasts (despite a healthy reserve of inherited ORC protein) and by suggesting that there is a novel feedback mechanism from the death/depletion of hepatoblasts leading to the persistence and proliferation of undifferentiated endodermal cells. We have added the reviewer’s suggestion to the discussion.

      (10) My interpretation of the data is that not all hepatocytes have the ORC1 and ORC2 genes deleted (eg EYFP-negative cells) and that these cells allow some proliferation in the livers of these mice.

      Please see the reply in question #1.  Particularly relevant: “Finally, the Alb-ORC2f/f mice have 25-37.5% of the number of hepatocyte nuclei compared to WT mice (Table 2).  If that many cells had an undeleted ORC2 gene, that would have shown up in the genotyping PCR and in the Western blots.

      Reviewer #3 (Public review):

      Summary:

      The authors address the role of ORC in DNA replication and that this protein complex is not essential for DNA replication in hepatocytes. They provide evidence that ORC subunit levels are substantially reduced in cells that have been induced to delete multiple exons of the corresponding ORC gene(s) in hepatocytes. They evaluate replication both in purified isolated hepatocytes and in mice after hepatectomy. In both cases, there is clear evidence that DNA replication does not decrease at a level that corresponds with the decrease in detectable ORC subunit and that endoreduplication is the primary type of replication observed. It remains possible that small amounts of residual ORC are responsible for the replication observed, although the authors provide arguments against this possibility. The mechanisms responsible for DNA replication in the absence of ORC are not examined.

      Strengths:

      The authors clearly show that there are dramatic reductions in the amount of the targeted ORC subunits in the cells that have been targeted for deletion. They also provide clear evidence that there is replication in a subset of these cells and that it is likely due to endoreduplication. Although there is no replication in MEFs derived from cells with the deletion, there is clearly DNA replication occurring in hepatocytes (both isolated in culture and in the context of the liver). Interestingly, the cells undergoing replication exhibit enlarged cell sizes and elevated ploidy indicating endoreduplication of the genome. These findings raise the interesting possibility that endoreduplication does not require ORC while normal replication does.

      Weaknesses:

      There are two significant weaknesses in this manuscript. The first is that although there is clearly robust reduction of the targeted ORC subunit, the authors cannot confirm that it is deleted in all cells. For example, the analysis in Fig. 4B would suggest that a substantial number of cells have not lost the targeted region of ORC2. Although the western blots show stronger effects, this type of analysis is notorious for non-linear response curves and no standards are provided. The second weakness is that there is no evaluation of the molecular nature of the replication observed. Are there changes in the amount of location of Mcm2-7 loading that is usually mediated by ORC? Does an associated change in Mcm2-7 loading lead to the endoreduplication observed? After numerous papers from this lab and others claiming that ORC is not required for eukaryotic DNA replication in a subset of cells, we still have no information about an alternative pathway that could explain this observation.

      We do not see a significant deficit in MCM2-7 loading (amount and rate) in cancer cell lines where we have deleted ORC1, ORC2 or ORC5 genes separately in Shibata et al. bioRxiv 2024.10.30.621095; doi: https://doi.org/10.1101/2024.10.30.621095 (PMID: 39554186).  This is now cited in the discussion.

      The authors frequently use the presence of a Cre-dependent eYFP expression as evidence that the ORC1 or ORC2 genes have been deleted. Although likely the best visual marker for this, it is not demonstrated that the presence of eYFP ensures that ORC2 has been targeted by Cre. For example, based on the data in Fig. 4B, there seems to be a substantial percentage of ORC2 genes that have not been targeted while the authors report that 100% of the cells express eYFP.

      (1) The PCR reactions in Fig. 4B are still contaminated by DNA from non-hepatocyte cells:  bile duct cells, endothelial, Kupfer cells and blood cells.  Microscopy of  cultured cells idnetifies the hepatocytes unequivocally from their morphology. <2% of the hepatocyte cells in culture in Fig. 4C are EYFP-.

      Recommendations for the authors:

      Reviewer #2 (Recommendations for the authors):

      The authors should present the data as suggested in the review and reformulate their conclusions. If possible, mice should be examined 30-32 hours after partial hepatectomy.

      Based on the Literature we chose a time that is consistent with the previous paper from us (Uchida et al., Genes & Dev).

      Reviewer #3 (Recommendations for the authors):

      (1) It would improve the paper to use single-cell methods (e.g. FISH) to assess the deletion of ORC subunits in the targeted cells.

      This is something we will reserve for future studies.

      (2) The importance of the paper would be increased dramatically by showing that the elimination of ORC changed the location of Mcm2-7 loading. This would be highly likely if the authors hypothesis that ORC is not involved is true. On the other hand, given ORC's role in origin selection, an observation that the same sites are used but less frequently would support a hypothesis that residual intact ORC is responsible for the replication observed.

      Shibata et al (PMID: 39554186) has answered this question.  The loss of ORC does not change the locations of origins or even the ability to specify origins.  We argue that this is what is to be expected from our hypothesis, that although ORC is clearly important for MCM loading in yeast and in biochemical experiments, something unexpected is going on in human cells.  Either a vanishingly small amount of ORC (undetectable by commonly used methods) can load the full complement of MCM2-7 at a rate that is comparable to wild type cells, or there is an ORC-independent mechanism of MCM2-7 loading.   This is now added to the discussion.

    1. eLife Assessment

      This valuable study reports the critical role of two cyclin-dependent kinases, CDK8 and CDK19, in spermatogenesis. The data presented are generally supportive of the main conclusion and are considered solid. This work may be of interest to reproductive biologists and physicians working on male fertility.

    2. Reviewer #1 (Public review):

      Summary:

      In this paper, Bruter and colleagues report effects of inducible deletion of the genes encoding the two paralogous kinases of the Mediator complex in adult mice. The physiological roles of these two kinases, CDK8 and CDK19, are currently rather poorly understood; although conserved in all eukaryotes, and among the most highly conserved kinases in vertebrates, individual knockouts of genes encoding CDK8 homologues in different species have revealed generally rather mild and specific effects, in contrast to Mediator itself. Here, the authors provide evidence that neither CDK8 nor CDK19 are required for adult homeostasis but they are functionally redundant for maintenance of reproductive tissue morphology and fertility in males.

      Strengths:

      The morphological data on atrophy of the male reproductive system and arrest of spermatocyte meiosis are solid and are reinforced by single cell transcriptomics data, which is a challenging technique to implement in vivo. The main findings are important and will be of interest to scientists in the fields of transcription and developmental biology.

      Weaknesses:

      There are several weaknesses.

      The first is that data comparing general health of mice with single and double knockouts is not shown, and data on effects in other tissues are sparse and very preliminary. The only strong phenotype of double knockouts that is described is in the male reproductive system. Furthermore, data for the genitourinary system in single knockouts are very sparse; data are described for fertility in figure 1E, ploidy and cell number in figure 3B and C, plasma testosterone and luteinizing hormone levels in figure 6C and 6D and morphology of testis and prostate tissue for single Cdk8 knockout in supplementary figure 1E (although in this case the images do not appear very comparable between control and CDK8 KO), but, for example, there is no analysis of different meiotic stages or of gene expression in single knockouts. Given that the authors have shown that CDK8 and CDK19 expression levels differ widely between different cell types, such an analysis would be interesting. This might have provided insight into the sterility of induced CDK8 knockout.

      The second weakness is that the correlation between double knockout and reduced expression of genes involved in steroid hormone biosynthesis is hypothesized to be a causal mechanism for the phenotypes observed. While this is a possibility, there are no experiments performed to provide evidence that this is the case. Furthermore, there is no evidence shown that CDK8 and/or CDK19 are directly responsible for transcription of the genes concerned.

      Finally, the authors propose that the phenotypes are independent of the kinase activity of CDK8 or CDK19 because treatment of mice for a month with an inhibitor does not recapitulate the effects of the knockout, and nor does expression of two steroidogenic genes change in cultured Leydig cells upon treatment with an inhibitor. However, there are no controls for effective target inhibition shown.

      Comments on revisions:

      This manuscript is slightly improved compared to the previous version, though it still does not address the weaknesses that were highlighted in the first version, which largely remain relevant. Please note the typo in the abstract (line 30) and the absence of response to the query of how many crypts and villi were counted in the experiment shown in Suppl Fig 1D.

    3. Reviewer #2 (Public review):

      Summary:

      The authors tried to test the hypothesis that Cdk8 and Cdk19 stabilize the cytoplasmic CcNC protein, the partner protein of Mediator complex including CDK8/19 and Mediator protein via a kinase-independent function by generating induced double knockout of Cdk8/19. However the evidence presented suffer from a lack of focus and rigor and does not support their claims.

      Strengths:

      This is the first comprehensive report on the effect of a double knockout of CDK8 and CDK19 in mice on male fertility, hormones and single cell testicular cellular expression. The inducible knockout mice led to male sterility with severe spermatogenic defects, and the authors attempted to use this animal model to test the kinase-independent function of CDK8/19, previously reported for human. Single cell RNA-seq of knockout testis presented a high resolution of molecular defects of all the major cell types in the testes of the inducible double knockout mice. The authors also have several interesting findings such as reentry into cell cycles by Sertoli cells, loss of Testosterone in induced dko that could be investigated further.

      Weaknesses:

      The claim of reproductive defects in the induced double knockout of CDK8/19 resulted from the loss of CCNC via a kinase-independent mechanism is interesting but was not supported by the data presented. While the construction and analysis of the systemic induced knockout model of Cdk8 in Cdk19KO mice is not trivial, the analysis and data is weakened by systemic effect of Cdk8 loss, making it difficult to separate the systemic effect from the local testis effect.

      The analysis of male sterile phenotype is also inadequate with poor image quality, especially testis HE sections. Male reproductive tract picture is also small and difficult to evaluate. The mice crossing scheme is unusual as you have three mice to cross to produce genotypes, while we could understand that it is possible to produce pups of desired genotypes with different mating schemes, such vague crossing scheme is not desirable and of poor genetics practice. Also using TAM treated wild type as control is ok, but a better control will be TAM treated ERT2-cre; CDK8f/f or TAM treated ERT2 Cre CDK19/19 KO, so as to minimize the impact from well-recognized effect of TAM.

      While the authors proposed that the inducible loss of CDK8 in the CDK19 knockout background is responsible for spermatogenic defects, it was not clear in which cells CDK8/19 genes are interested and which cell types might have a major role in spermatogenesis. The authors also put forward the evidence that reduction/loss of Testosterone might be the main cause of spermatogenic defects, which is consistent with the expression change in genes involved in steroigenesis pathway in Leydig cells of inducible double knockout. But it is not clear how the loss of Testosterone contributed to the loss of CcnC protein.

      The authors should clarify or present the data on where CDK8 and CDK19 as well as CcnC are expressed so as to help the readers to understand which tissues that both CDK might be functioning and cause the loss of CcnC. It should be easier to test the hypothesis of CDK8/19 stabilize CcnC protein using double knock out primary cells, instead of the whole testis.

      Since CDK8KO and CDK19KO both have significantly reduced fertility in comparison with wildtype, it might be important to measure the sperm quantity and motility among CDK8 KO, CDK19KO and induced DKO to evaluate spermatogenesis based on their sperm production.

      Some data for the inducible knockout efficiency of Cdk8 were presented in Supplemental figure 1, but there is no legend for the supplemental figures, it was not clear which band represented deletion band, which tissues were examined? Tail or testis? It seems that two months after the injection of Tam, all the Cdk8 were completely deleted, indicating extremely efficient deletion of Tam induction by two-month post administration. Were the complete deletion of Cdk8 happening even earlier ? an examination of timepoints of induced loss would be useful and instructional as to when is the best time to examine phenotypes.

      The authors found that Sertoli cells re-entered cell cycle in the inducible double knockout but stop short of careful characterization other than increased expression of cell cycle genes.

      Overall this work suffered from a lack of focus and rigor in the analysis and lack of sufficient evidence to support their main conclusions.

      Comments on revisions:

      This reviewer appreciated the authors' effort in improving the quality of this manuscript during their revision. While some concerns remain, the revision is a much improved work and the authors addressed most of my major concerns.<br /> Figure 2E CDK8 and CDK19 immunofluorescent staining images seem to show CDK8 and CDK19 location are completely distinct and in different cells, the authors need to elaborate on this results and discuss what such a distinct location means in line of their double knockout data.

    1. eLife Assessment

      This manuscript presents a detailed characterization of male and female wildtype and Ctrp10 knockout mice, and reveals that knockout mice develop female-specific obesity that is largely uncoupled from metabolic dysfunction. The data are convincing, and the work will be an important contribution to understanding how obesity is coupled to metabolic dysfunction, and how this can occur in a sex-specific manner.

    2. Reviewer #1 (Public review):

      Summary

      The manuscript by Chen et al. presents a detailed metabolic characterization of male and female WT and Ctrp10 knockout mice. The main finding is that female KO mice become obese on both low-fat and high-fat diets, but without evidence of marked insulin resistance, hepatic steatosis, dyslipidemia, or increased inflammatory markers. The authors performed a detailed transcriptomic analysis and identified differentially-expressed genes that distinguish high-fat diet -fed Ctrp10 KO from WT control mice. They further show that this set of genes exhibits cross correlation in human tissues, and that this is greater in females than in males. The data indicate that the Ctrp10 KO model may be useful to understand how obesity and metabolic dysfuction are coupled to each other, and how this occurs by a sex-biased mechanism.

      Strengths

      The work presents a large amount of data, which has been carefully acquired and is convincing. The transcriptomic analysis will further help to define what pathways are associated with obesity, but not necessarily with metabolic dysfunction. The manuscript will be of interest to investigators studying metabolic diseases, and to those studying sex-specific differences in metabolic physiology. The limitations of the study are acknowledged, including that a whole-body knockout was used. The cause of the increased body weight is not entirely clear, despite the careful and detailed analysis that was performed. Notwithstanding these limitations, the phenotype is interesting, and this work will establish basis for further work to understand the mechanisms that are involved.

      Weaknesses

      The main weaknesses are that no antibody is available to detect Ctrp10, and the knockout is a global knockout since no conditional allele is available. These limitations are discussed in the manuscript. Despite these weaknesses, the current work establishes the intriguing phenotype and its sex-specificity, and will provide a solid foundation for future studies.

    3. Reviewer #2 (Public review):

      Summary:

      Here the authors have shown the role of sex differences in MHO phenotype, which increases the scope for research in this area.

      Strengths:

      The study provides a detailed idea of how the genes are regulated in sex sex-dependent manner.

      Weaknesses:

      The mechanistic details are missing

    4. Reviewer #3 (Public review):

      Summary:

      This study examines the impact of CTRP10/C1QL2 absence on obesity and metabolic health in mice. Female mice lacking CTRP10 tend to develop obesity, particularly on a high-fat diet. Surprisingly, they do not display the typical metabolic traits associated with obesity, like fatty liver or glucose intolerance. This indicates a disconnection between weight gain and metabolic issues in these female mice. The research underscores the need to understand sex-specific factors in how obesity influences metabolic health.

      Strengths:

      The study provides compelling evidence regarding Ctrp10's role in female-specific metabolic regulation in mice, shedding light on its potential significance in metabolically healthy obese (MHO) individuals.

      Weaknesses:

      -The analysis and description of sex-specific human data require more details to highlight the relevance of Ctrp10 mouse data and the analysis of differentially expressed genes in humans.<br /> -There's a lack of analysis regarding secreted Ctrp10 under various dietary conditions.

    5. Author response:

      The following is the authors’ response to the original reviews.

      Reviewer #1 (Recommendations For The Authors):

      Although the scripts are available at the github link that is shown, the Readme file is not available as a text file. Spreadsheets summarizing the RNA-seq data ought to be available for download, but these are not present. Likewise, are spreadsheets available for the data used to generate the plots in Fig. 10, so that the identities of particular, correlated genes can be viewed?

      We have now included the excel sheet with all the DEGs shown in Figure 8-9 (Figure 8 – Source data 1-8). The source data include DEGs that are up- and down-regulated in gWAT, iWAT, liver, and skeletal muscle. The source data files (excel) are the standard output format. We have also updated the github (https://github.com/Leandromvelez/CTRP10-Manuscript-DEG-Sex-specific-connectivities-and-integration) to include a README file and updated the R scripts to annotate steps and processing considerations.  In addition, the README file now contains drive links to the files used the unfiltered kallisto TPM and counts at the transcript-level, as well as resulting Differential Expression results based on genotype.  Obviously, all criteria from aligned transcripts such as gene filtering and normalization are included in the scripts provided.

      Several items would strengthen the work:

      (1) Is a CTRP10 antibody available, and does the protein abundance correlate with the mRNA abundances that were assessed in Fig. 1?

      Unfortunately, no validated antibody currently exists for CTRP10. Consequently, we were not able to assess protein abundance of CTRP10 in our study.

      (2) Were there compensatory changes in the abundance of other CTRP family members? This might be observed at the protein, but not mRNA, level. It might be reasonable to test for the effects of liver, gWAT, skeletal muscle, and iWAT.

      We observed no compensatory changes in other CTRP family members based on our RNA-seq data. Unfortunately, we do not have protein data for other CTRP family members.

      (3) The gene expression changes shown in Fig. 9 are ranked according to z-score, but it is not clear how this is calculated. It would be helpful to indicate the log2 change in each case.

      The z-score is a very commonly used method to show DEGs in studies involving RNA-seq data. We calculate the z-score based on the gene transcript source data (Fig. 8 – Source data 1-8). Z-score is defined as z = (x-μ)/σ, where x is the raw score (gene transcript level), μ is the population mean (mean of gene expression across both WT and KO samples), and σ is the population standard deviation. In essence, the z-score is the raw score minus the population mean, divided by the population standard deviation. We now included this information in Fig. 9 legend.

      (4) In Fig. 6, female HFD-fed KO mice had increased glucose (and insulin) after an overnight fast, but increased glucose was not observed in the GTT data. Possibly, this is because the mice were fasted for only 6h for the GTT. This might be mentioned during the description of these data, on lines 221-224. However, this also raises the question of whether there is a difference in the rate of gluconeogenesis (or possibly glycogenolysis for the 6h data) in the KO compared to the controls. Understanding this would require the use of tracers, and is reasonably beyond the scope of this study, but might be mentioned in the discussion.

      Per reviewer’s suggestion, we have included this in the “limitation section” of the discussion.

      Reduced RER in the HFD-fed female mice might begin to suggest a mechanism since this suggests the mice might have decreased oxidation of carbohydrates and increased oxidation of fat compared to control animals. A glucose tracer might be used to test whether more glucose is stored and, if so, in what tissue this occurs. Possibly, this could be done ex vivo on isolated tissues or cells. Again, this is reasonably beyond the scope of the present study.

      Per reviewer’s suggestion, we have included this in the “limitation section” of the discussion.

      (5) The discussion includes a brief discussion of the role of estrogen and suggests that in CTRP10 KO mice there are differences in other factors that would be needed to explain the phenotype. Although it is agreed that this is likely the case, estrogen levels were not measured in the present study. It seems like this would be important to study, and might shed light on the female-specific phenotype.

      We have now included serum estrogen data. No significant differences in estrogen levels were seen between WT and KO female mice fed either a low-fat diet (Fig. 4 – figure supplement 1) or a high-fat diet (Fig. 5 – figure supplement 2).

      Reviewer #2 (Recommendations For The Authors):

      While the concept is potentially exciting, there are major problems with the current manuscript. It lacks the mechanistic details behind MHO.

      (1) There is a significant gap that was not addressed by the authors. How exactly does CTRP10 lead to the activation of proteins like Fgf1, Fgf21, Il22ra1, Ucp3, and Klf15 in Ctrp10 knockout female mice? Is it likely that CTRP10 regulates these proteins via indirect mechanisms?

      We acknowledge that the lack of mechanistic understanding of how CTRP10 loss-of-function leads to changes in gene expression is a major limitation of the study. We have highlighted this limitation in the discussion section.

      • The author notes that Ctrp10 knockout female mice, particularly those on a high-fat diet lack Nr1d1 and can sustain a relatively healthy metabolic state. This is supported by the demonstrated upregulation of Fgf1, Fgf21, Il22ra1, Ucp3, and Klf15 in Ctrp10 knockout female mice. However, the mechanisms through which Ctrp10 knockout influences the expression of these molecules are not elucidated.

      We acknowledge that this is a major limitation of the study. We have highlighted this limitation in the discussion section. 

      • How do you substantiate the role of age and a high-nutrient diet in the development of obesity in knockout female mice? However, it is still unclear whether administering a high-fat diet in >20 week age of mice can develop insulin resistance where obesity is developing in LFD.

      When fed a low-fat diet, Ctrp10-KO female mice developed obesity with age and yet show little if any glucose intolerance or insulin resistance based on our glucose tolerance and insulin tolerance tests. For the HFD group, we are only comparing WT and KO mice on the same diet (not across diet). While WT mice on HFD gained significant amount of weight over time as expected, Ctrp10-KO female mice gain substantially higher amount of weight relative to WT littermates. Despite this, we did not observe a worsening of glucose tolerance and insulin resistance (based on GTT and ITT) in the KO female mice relative to WT controls as we would expect, since greater adiposity in HFD-fed mice generally correlated with worse metabolic outcomes. 

      (2) The authors should add the NR1D1 dependency study in female mice if possible.

      To address would require the generation of Ctrp10/Nr1d1 double KO mouse model and to carry out the entire study again in these double KO mice. Although this suggestion by the reviewer is a good one, this is beyond the scope of the present study.

      (3) NR1D1 represses the set of genes that promotes lipogenesis (the author should add some data that validates this statement).

      The role of NR1D1 in regulating metabolic genes are extensively documented in the published literature. NR1D1 (also known as REV-ERBα) is a constitutive transcriptional repressor (PMID: 26044300; PMID: 27445394). Many metabolic genes that are normally represses by NR1D1 is de-repressed in mice lacking NR1D1 globally or in the tissue-specific manner (PMID: 26044300; PMID: 34350828; PMID: 22562834). Among the many NR1D1 target genes involved in lipid metabolism include: CD36, Plin2, Elovl5, Acss3 (from: PMID: 26044300); as well as Scd1, Scd2, Pnpla5, Acsl1, Fasn, Hadhb, and Oxsm (from: PMID: 34350828).  We have included this information in the discussion section.

      (4) The authors should study the effect of Ctrp10 overexpression in HFD-fed female mice and also with KO of CTRP10 in adult mice if possible.

      The suggestion by the reviewer is a good one. However, this is beyond the scope of the study. We do not have a Ctrp10 conditional KO mouse model; as such, we could not study the effect of knocking out CTRP10 in adult mice. Overexpression studies are often considered non-physiological these days since the level of the overexpressed protein is generally much higher than the normal physiological level. For this reason, we did not attempt any overexpression study. 

      Reviewer #3 (Recommendations For The Authors):

      Line 114: Could you please provide definitions for "GluK2" and "GluK4" for readers unfamiliar with these terms?

      We have now provided definition for these terms.

      Line 140: It's stated that skeletal muscle and the pancreas express similar levels of Ctrp10 as the brain. Please double-check and clarify this assertion for accuracy.

      In mice, based on our own data (Fig. 1B), Ctrp10 expression in skeletal muscle and pancreas is comparable to that in the whole brain. In human, based on publicly available data (e.g., Genotype-Tissue Expression portal; GTex), brain expresses much higher level of CTRP10 transcript relative to other peripheral tissues.

      Line 141: Have you investigated whether Ctrp10 levels in plasma change after refeeding? If not, consider exploring this aspect to enhance the comprehensiveness of the study.

      No validated antibody currently exists for CTRP10. As such, we could not assess plasma level of CTRP10 after refeeding. We have included this as limitation of our study in the discussion section.  

      Lines 143-144: Clarify the age bracket of the animals used in the study. Additionally, have you observed similar responses, such as downregulation of Ctrp10 in response to refeeding, in both old and young mice in peripheral tissues?

      We have now included the age of the mice (~10 weeks old) for the fasting refeeding study as shown in Fig. 1C in the result and method sections.  

      Lines 135-149: To complement the experiments shown in Fig 1B-D, provide data pertaining to females.

      Ideally, we would like to have this data as well. However, to do this for females would involve 47 mice and the collection of 120 tissues (Fig. 1B; n = 10 per tissue), 390 tissues (Fig. 1C; n = 7-8 per tissue per fast or refed state), and 528 tissues (Fig. 1D; n = 11 per tissue per HFD or LFD). This would be a total of 1038 tissue samples. The main purpose of Fig. 1B-D is to demonstrate that Ctrp10 transcript is widely expressed and that its expression is modulated by nutritional (HFD vs. LFD) and metabolic (fast vs. refeed) states. These data provided a rationale to examine the metabolic phenotype in mice lacking CTRP10.

      To address the reviewer’s point, we looked at the expression levels of CTRP10/C1QL1 between males and females in the Genotype-Tissue Expression (GTEx) database portal and it does not appear that there are sex differences in CTRP10 expression patterns in normal tissues.  

      Line 152: Can you provide evidence supporting the hypothesis that Ctrp10 is secreted into the circulation?

      CTRP10 has a classic signal peptide sequence and the protein is secreted when expressed in HEK 293 cells (PMID: 18783346). We have shown previously that CTRP10 can be found in the FPLC-fractionated mouse serum using a polyclonal rabbit anti-mouse CTRP10 antibody we generated (PMID: 18783346); this antibody, however, does not work on tissue lysates (many non-specific bands). There is evidence in published literature to show that CTRP10/C1QL2 is clearly found circulating in human plasma. Some of the studies include: 1) Human C1QL2/CTRP10 is detected in the human plasma from UK BioBank (PMID: 37794186; C1QL2 is highlighted in page 335) and serum samples from pregnant females (PMID: 39062451; C1QL2 is highlighted in Table 2). We have included this information in the Introduction section.

      Line 178: In Fig 4 D and E (and other figures in the paper), it would be more accurate to express adipocyte size in "μm²" instead of "uM2."

      We have double checked and fixed this issue in the figure 4 and 7.

      Line 259: Please specify the age of the animals used in the study.

      In the method section, we did mention that LFD was provided for the duration of the study, beginning at 5 weeks of age; and that HFD was provided for 14 weeks, beginning at 6-7 weeks of age. Also, in Figure 2A and Figure 4A, the age of the mice is also indicated.

      Lines 275-283 and 288-296: It would be more appropriate to move this content to the Discussion section for better contextualization.

      We feel that the published information on NR1D1 and FGF21 should be mentioned in the result section so that the readers can immediately appreciate the significance of our data shown in Fig. 8 and 9. However, we also included similar information concerning NR1D1 in the discussion section for better contextualization as suggested.  

      Line 301: The section on DEG analysis requires additional details. How was the DEG analysis conducted? Were the DEGs from "wild type and KO mice" compared with "human DEGs regulated by sex"? Also, details about the phenotype of the human subjects and their association with obesity should be included. Additionally, discuss specific genes identified by the analysis and their relevance to the Ctrp10 story and human sex-specific gene connectivity analysis.

      We have updated the section on DEG analysis and, related to reviewer comments above, significantly expanded the github repository, detailing an analytical walkthrough of all computational analyses performed. To clarify the human integration analysis, we have added the following to the methods:

      “To investigate the degree of conservation of CTRP-engaged pathways, we mapped the differentially expressed genes (DEGs) identified from Ctrp10 knockout (KO) versus wild-type (WT) mice to their human orthologs, including human CTRP10, in the GTEx database for transcriptional correlations. Individuals were stratified by sex to examine sex-specific gene connectivity, consisting of 210 males and 100 females to compare gene expression across tissues. Gene-connectivity analyses were performed based on population correlation significances summarized by cumulative -log10(pvalues) as previously described"

      Line 330: In Fig 7L, increased oxidative stress in the liver of KO mice is shown. Please provide an explanation for the claim that Ctrp10-KO female mice resembled the WT controls.

      In Fig. 7L, we did observe a modest, but significant, increase in oxidative stress in the liver based on the quantification of malondialdehyde (MDA) level, a marker of tissue oxidative stress. However, we did not see any significant differences in the expression of oxidative genes in the liver between WT and KO female mice (Fig. 7J); thus, the statement in line 330 (discussion section) that pertains to oxidative gene expression in fat and liver (Fig. 7E and 7J) is correct. 

      Line 375: Could you clarify the term "adipose tissue health" and further discuss or provide evidence demonstrating compromised adipose tissue health in female KO mice following HFD?

      Adipose tissue health refers to the healthy functioning of adipose tissue (based on its functionality, immune cell population and profile, and metabolic gene expression profiles). Adipose tissue releases free fatty acids in response to fasting and takes up lipids in response to refeeding. Both are these functions are preserved in KO mice as we did not observe any significant differences in free fatty acids (NEFA) and triglyceride levels in the fasted and refed states (Fig. 6AB). Also, we did not observe any significant differences in the expression of inflammatory and fibrotic genes in the adipose tissue of WT and KO female mice fed a high-fat diet (Fig. 7E). If anything, we actually observed a modest, but significant, reduction in the expression of some ER and oxidative stress genes in the KO female mice relative to WT controls (Fig. 7E). 

      Line 408: Please provide data regarding estrogen levels in wild-type and KO female mice for comparison.

      We have now included serum estrogen data. No significant differences in estrogen levels were seen between WT and KO female mice fed either a low-fat diet (Fig. 4 – figure supplement 1) or a high-fat diet (Fig. 5 – figure supplement 2).

      Line 587: The GitHub link provided seems to be inactive or incorrect. Please verify and provide the correct link.

      We have also updated the github (https://github.com/Leandromvelez/CTRP10-Manuscript-DEG-Sex-specific-connectivities-and-integration) to include a README file and updated the R scripts to annotate steps and processing considerations. 

      Lines 590-599: Provide additional details about the analysis of human sex-specific genes. Including a table of the top DEGs and pathways differentially regulated by sex would be beneficial for readers' comprehension.

      We have expanded the methods, results and associated github repositories to detail all reproducible parameters used in these analyses.  The new table of DEGs is included in the manuscript and github repositories.

    1. eLife Assessment

      This paper provides a valuable contribution to our understanding of how adenosine acts as a signal of nutrient insufficiency and extends this idea to suggest that adenosine is released by metabolically active cells in proportion to the activity of methylation events. Convincing data supports this idea. The authors use metabolic tracing approaches to identify the biochemical pathways that contribute to the regulation of adenosine levels and the S-adenosylmethionine cycle in Drosophila larval hemocytes in response to wasp egg infection.

    2. Reviewer #1 (Public review):

      Summary:

      In this article, Nedbalova et al. investigate the biochemical pathway that acts in circulating immune cells to generate adenosine, a systemic signal that directs nutrients toward the immune response, and S-adenosylmethionine (SAM), a methyl donor for lipid, DNA, RNA, and protein synthetic reactions. They find that SAM is largely generated through uptake of extracellular methionine, but that recycling of adenosine to form ATP contributes a small but important quantity of SAM in immune cells during the immune response. The authors propose that adenosine serves as a sensor of cell activity and nutrient supply, with adenosine secretion dominating in response to increased cellular activity. Their findings of impaired immune action but rescued larval developmental delay when the enzyme Ahcy is knocked down in hemocytes are interpreted as due to effects on methylation processes in hemocytes and reduced production of adenosine to regulate systemic metabolism and development, respectively. Overall this is a strong paper that uses sophisticated metabolic techniques to map the biochemical regulation of an important systemic mediator, highlighting the importance of maintaining appropriate metabolite levels in driving immune cell biology.

      Strengths:

      The authors deploy metabolic tracing - no easy feat in Drosophila hemocytes - to assess flux into pools of the SAM cycle. This is complemented by mass spectrometry analysis of total levels of SAM cycle metabolites to provide a clear picture of this metabolic pathway in resting and activated immune cells.

      The experiments show that recycling of adenosine to ATP, and ultimately SAM, contributes meaningfully to the ability of immune cells to control infection with wasp eggs.

      This is a well-written paper, with very nice figures showing metabolic pathways under investigation. In particular, the italicized annotations, for example "must be kept low", in Figure 1 illustrate a key point in metabolism - that cells must control levels of various intermediates to keep metabolic pathways moving in a beneficial direction.

      Experiments are conducted and controlled well, reagents are tested, and findings are robust and support most of the authors' claims.

      Weaknesses:

      The authors posit that adenosine acts a sensor of cellular activity, with increased release indicating active cellular metabolism and insufficient nutrient supply. The authors have provided a discussion of how generalizable they think this may be across different cell types or organs, but mechanisms for the role of adenosine in specific cell types, and whether cell autonomous or cell-nonautonomous mechanisms may be employed in sensing, are largely unknown.

    3. Reviewer #2 (Public review):

      Summary:

      In this work, the authors wish to explore the metabolic support mechanisms enabling lamellocyte encapsulation, a critical antiparasitic immune response of insects. They show that S-adenosylmethionine metabolism is specifically important in this process through a combination of measurements of metabolite levels and genetic manipulations of this metabolic process.

      Strengths:

      The metabolite measurements and the functional analyses are generally very strong, and clearly show that the metabolic process under study is important in lamellocyte immune function.

      Previous weaknesses:

      The previous version of the manuscript contained RNAseq data that were inadequately explained. In this version, the treatment and representation of these data are significantly improved, such that they no longer represent a significant weakness. This version also contains increased evidence that SAM transmethylation is directly required for encapsulation.

    4. Reviewer #3 (Public review):

      Summary:

      The authors of this study provides evidence that Drosophila immune cells show upregulated SAM transmethylation pathway and adenosine recycling upon wasp infection. Blocking this pathway compromises the lamellocyte formation, developmental delay and the host survival, suggesting its physiological relevance.

      Strengths:

      Snapshot quantification of the metabolite pool does not provide evidence that the metabolic pathway is active or not. The authors use an ex vivo isotope labelling to precisely monitor the SAM and adenosine metabolism. During infection, the methionine metabolism and adenosine recycling are upregulated, which is necessary to support the immune reaction. By combining the genetic experiment, they successfully show that the pathway is activated in immune cells.

      Weaknesses:

      The authors knocked down Ahcy to prove the importance of SAM methylation pathway. However, Ahcy-RNAi produces massive accumulation of SAH, in addition to block adenosine production. To further validate the phenotypic causality, it is important to manipulate other enzymes in the pathway, such as Sam-S, Cbs, SamDC, etc. The authors do not demonstrate how infection stimulates the metabolic pathway given the gene expression of metabolic enzymes is not upregulated by infection stimulus.

    5. Author response:

      The following is the authors’ response to the original reviews.

      Public Reviews:

      Reviewer #1 (Public review):

      Summary:

      In this article, Nedbalova et al. investigate the biochemical pathway that acts in circulating immune cells to generate adenosine, a systemic signal that directs nutrients toward the immune response, and S-adenosylmethionine (SAM), a methyl donor for lipid, DNA, RNA, and protein synthetic reactions. They find that SAM is largely generated through the uptake of extracellular methionine, but that recycling of adenosine to form ATP contributes a small but important quantity of SAM in immune cells during the immune response. The authors propose that adenosine serves as a sensor of cell activity and nutrient supply, with adenosine secretion dominating in response to increased cellular activity. Their findings of impaired immune action but rescued larval developmental delay when the enzyme Ahcy is knocked down in hemocytes are interpreted as due to effects on methylation processes in hemocytes and reduced production of adenosine to regulate systemic metabolism and development, respectively. Overall this is a strong paper that uses sophisticated metabolic techniques to map the biochemical regulation of an important systemic mediator, highlighting the importance of maintaining appropriate metabolite levels in driving immune cell biology.

      Strengths:

      The authors deploy metabolic tracing - no easy feat in Drosophila hemocytes - to assess flux into pools of the SAM cycle. This is complemented by mass spectrometry analysis of total levels of SAM cycle metabolites to provide a clear picture of this metabolic pathway in resting and activated immune cells.

      The experiments show that the recycling of adenosine to ATP, and ultimately SAM, contributes meaningfully to the ability of immune cells to control infection with wasp eggs.

      This is a well-written paper, with very nice figures showing metabolic pathways under investigation. In particular, the italicized annotations, for example, "must be kept low", in Figure 1 illustrate a key point in metabolism - that cells must control levels of various intermediates to keep metabolic pathways moving in a beneficial direction.

      Experiments are conducted and controlled well, reagents are tested, and findings are robust and support most of the authors' claims.

      Weaknesses:

      The authors posit that adenosine acts as a sensor of cellular activity, with increased release indicating active cellular metabolism and insufficient nutrient supply. It is unclear how generalizable they think this may be across different cell types or organs.

      In the final part of the Discussion, we elaborate slightly more on a possible generalization of our results, while being aware of the limited space in this experimental paper and therefore intend to address this in more detail and comprehensively in a subsequent perspective article.

      The authors extrapolate the findings in Figure 3 of decreased extracellular adenosine in ex vivo cultures of hemocytes with knockdown of Ahcy (panel B) to the in vivo findings of a rescue of larval developmental delay in wasp egg-infected larvae with hemocyte-specific Ahcy RNAi (panel C). This conclusion (discussed in lines 545-547) should be somewhat tempered, as a number of additional metabolic abnormalities characterize Ahcy-knockdown hemocytes, and the in vivo situation may not mimic the ex vivo situation. If adenosine (or inosine) measurements were possible in hemolymph, this would help bolster this idea. However, adenosine at least has a very short half-life.

      We agree with the reviewer, and in the 4th paragraph of the Discussion we now discuss more extensively the limitations of our study in relation to ex vivo adenosine measurements and the importance of the SAM pathway on adenosine production.

      Reviewer #2 (Public review):

      Summary:

      In this work, the authors wish to explore the metabolic support mechanisms enabling lamellocyte encapsulation, a critical antiparasitic immune response of insects. They show that S-adenosylmethionine metabolism is specifically important in this process through a combination of measurements of metabolite levels and genetic manipulations of this metabolic process.

      Strengths:

      The metabolite measurements and the functional analyses are generally very strong and clearly show that the metabolic process under study is important in lamellocyte immune function.

      Weaknesses:

      The gene expression data are a potential weakness. Not enough is explained about how the RNAseq experiments in Figures 2 and 4 were done, and the representation of the data is unclear.

      The RNAseq data have already been described in detail in our previous paper (doi.org/10.1371/journal.pbio.3002299), but we agree with the reviewer that we should describe the necessary details again here. The replicate numbers for RNAseq data were added to figure legends, the TPM values for the selected genes shown in figures are in S1_Data and new S4_Data file with complete RNAseq data (TPM and DESeq2) was added to this revised version.

      The paper would also be strengthened by the inclusion of some measure of encapsulation effectiveness: the authors show that manipulation of the S-adenosylmethionine pathway in lamellocytes affects the ability of the host to survive infection, but they do not show direct effects on the ability of the host to encapsulate wasp eggs.

      The reviewer is correct that wasp egg encapsulation and host survival may be different (the host can encapsulate and kill the wasp egg and still not survive) and we should also include encapsulation efficiency. This is now added to Figure 3D, which shows that encapsulation efficiency is reduced upon Ahcy-RNAi, which is consistent with the reduced number of lamellocytes.

      Reviewer #3 (Public review):

      Summary:

      The authors of this study provide evidence that Drosophila immune cells show upregulated SAM transmethylation pathway and adenosine recycling upon wasp infection. Blocking this pathway compromises the lamellocyte formation, developmental delay, and host survival, suggesting its physiological relevance.

      Strengths:

      Snapshot quantification of the metabolite pool does not provide evidence that the metabolic pathway is active or not. The authors use an ex vivo isotope labelling to precisely monitor the SAM and adenosine metabolism. During infection, the methionine metabolism and adenosine recycling are upregulated, which is necessary to support the immune reaction. By combining the genetic experiment, they successfully show that the pathway is activated in immune cells.

      Weaknesses:

      The authors knocked down Ahcy to prove the importance of SAM methylation pathway. However, Ahcy-RNAi produces a massive accumulation of SAH, in addition to blocking adenosine production. To further validate the phenotypic causality, it is necessary to manipulate other enzymes in the pathway, such as Sam-S, Cbs, SamDC, etc.

      We are aware of this weakness and have addressed it in a much more detailed discussion of the limitations of our study in the 6th paragraph of the Discussion.

      The authors do not demonstrate how infection stimulates the metabolic pathway given the gene expression of metabolic enzymes is not upregulated by infection stimulus.

      Although the goal of this work was to test by 13C tracing whether the SAM pathway activity is upregulated, not to analyze how its activity is regulated, we certainly agree with the reviewer that an explanation of possible regulation, especially in the context of the enzyme expressions we show, should be included in our work. Therefore, we have supplemented the data with methyltransferase expressions (Figure 2-figure supplement 3. And S3_Data) and better describe the changes in expression of some SAM pathway genes, which also support stimulation of this pathway by changes in expression. The enzymes of the SAM transmethylation pathway are highly expressed in hemocytes, and it is known that the activity of this pathway is primarily regulated by (1) increased methionine supply to the cell and (2) the actual utilization of SAM by methyltransferases. Therefore, a possible increase in SAM transmethylation pathway in our work can be suggested (1) by increased expression of 4 transporters capable of transporting methionine, (2) by decreased expression of AhcyL2 (dominant-negative regulator of Ahcy) and (3) by increased expression of 43 out of 200 methyltransferases. This was now added to the first section of Results.

      Recommendations for the authors:

      Reviewing Editor Comments:

      In the discussion with the reviewers, two points were underlined as very important:

      (1) Knocking down Ahyc and other enzymes in the SAM methylation pathway may give very distinct phenotypes. Generalising the importance of "SAM methyaltion" only by Ahcy-RNAi is a bit cautious. The authors should be aware of this issue and probably mention it in the Discussion part.

      We are aware of this weakness and have addressed it in a much more detailed discussion of the limitations of our study in the 6th paragraph of the Discussion.

      (2) Sample sizes should be indicated in the Figure Legends. Replicate numbers on the RNAseq are important - were these expression levels/changes seen more than once?

      Sample sizes are shown as scatter plots with individual values wherever possible and all graphs are supplemented with S1_Data table with raw data. The RNAseq data have already been described in detail in our previous paper (doi.org/10.1371/journal.pbio.3002299), but we agree with the reviewers that we should describe the necessary details again here. The replicate numbers for RNAseq data were added to figure legends, the TPM values for the selected genes shown in figures are in S1_Data and new S4_Data file with complete RNAseq data (TPM and DESeq2) was added to this revised version.

      Reviewer #1 (Recommendations for the authors):

      Major points:

      (1) Please provide sample sizes in the legends rather than in a supplementary table.

      Sample sizes are shown either as scatter plots with individual values or added to figure legends now.

      (2) More details in the methods section are needed:

      For hemocyte counting, are sessile and circulating hemocytes measured?

      We counted circulating hemocytes (upon infection, most sessile hemocytes are released into the circulation). While for metabolomics all hemocyte types were included, for hemocyte counting we were mainly interested in lamellocytes. Therefore, we counted them 20 hours after infection, when most of the lamellocytes from the first wave are fully differentiated but still mostly in circulation, as they are just starting to adhere to the wasp egg. This was added to the Methods section.

      How were levels of methionine and adenosine used in ex vivo cultures selected? This is alluded to in lines 158-159, but no references are provided.

      The concentrations are based on measurements of actual hemolymph concentrations in wild-type larvae in the case of methionine, and in the case of adenosine, we used a slightly higher concentration than measured in the adgf-a mutant to have a sufficiently high concentration to allow adenosine to flow into the hemocytes. This is now added to the Methods section.

      Minor points:

      Response to all minor points:  Thank you, errors has now been fixed.

      (1) Line 186 - spell out MTA - 5-methylthioadenosine.

      (2) Lines 196-212 (and elsewhere) - spelling out cystathione rather than using the abbreviation CTH is recommended because the gene cystathione gamma-lyase (Cth) is also discussed in this paragraph. Using the full name of the metabolite will reduce confusion.

      We rather used cystathionine γ-lyase as a full name since it is used only three times while CTH many more times, including figures.

      (3) Figure 2 - supplement 2: please include scale bars.

      (4) Line 303 - spelling error: "trabsmethylation" should be "transmethylation".

      (5) Line 373 - spelling error: "higer" should be "higher".

      Reviewer #2 (Recommendations for the authors):

      For the RNAseq data, it's unclear whether the gene expression data in Figures 2 and 4 include biological replicates, so it's unclear how much weight we should place on them.

      The replicate numbers for RNAseq data were added to figure legends, the TPM values for the selected genes shown in figures are in S1_Data and new S4_Data file with complete RNAseq data (TPM and DESeq2) was added to this revised version.

      The representation of these data is also a weakness: Figure 2 shows measurements of transcripts per million, but we don't know what would be high or low expression on this scale.

      We have added the actual TPM values for each cell in the RNAseq heatmaps in Figure 2, Figure 2-figure supplement 3, and Figure 4 to make them more readable. Although it is debatable what is high or low expression, to at least have something for comparison, we have added the following information to the figure legends that only 20% of the genes in the presented RNAseq data show expression higher than 15 TPM.

      Figure 4 is intended to show expression changes with treatment, but expression changes should be shown on a log scale (so that increases and decreases in expression are shown symmetrically) and should be normalized to some standard level (such as uninfected lamellocytes).

      The bars in Figure 4C,D show the fold change (this is now stated in the y-axis legend) compared to 0 h (=uninfected) Adk3 samples - the reason for this visualization is that we wanted to show (1) the differences in levels between Adk3 and Adk2 and in levels between Ak1 and Ak2, respectively, and at the same time (2) the differences between uninfected and infected Adk3 and Ak1. In our opinion, these fold change differences are also much more visible in normal rather than log scale.

      Reviewer #3 (Recommendations for the authors):

      (1) It might be interesting to test how general this finding would be. How about Bacterial or fungal infection? The authors may also try genetic activation of immune pathways, e.g. Toll, Imd, JAK/STAT.

      Although we would also like to support our results in different systems, we believe that our results are already strong enough to propose the final hypothesis and publish it as soon as possible so that it can be tested by other researchers in different systems and contexts than the Drosophila immune response.

      (2) How does the metabolic pathway get activated? Enzyme activity? Transporters? Please test or at least discuss the possible mechanism.

      The response is already provided above in the Reviewer #3 (Public review) section.

      (3) The authors might test overexpression or genetic activation of the SAM transmethylation pathway.

      Although we agree that this would potentially strengthen our study, it may not be easy to increase the activity of the SAM transmethylation pathway - simply overexpressing the enzymes may not be enough, the regulation is primarily through the utilization of SAM by methyltransferases and there are hundreds of them and they affect numerous processes. 

      (4) Supplementation of adenosine to the Ahcy-RNAi larvae would also support their conclusion.

      Again, this is not an easy experiment, dietary supplementation would not work, direct injection of adenosine into the hemolymph would not last long enough, adenosine would be quickly removed.

      (5) It is interesting to test genetically the requirement of some transporters, especially for gb, which is upregulated upon infection.

      Although this would be an interesting experiment, it is beyond the scope of this study; we did not aim to study the role of the SAM transmethylation pathway itself or its regulation, only its overall activity and its role in adenosine production.

    1. eLife Assessment

      This is an important study that describes the development of optical biosensors for various Rab GTPases and explores the contributions of Rab10 and Rab4 to structural and functional plasticity at hippocampal synapses during glutamate uncaging. The evidence supporting the conclusions of the paper is solid, and several improvements were noted by the reviewers upon revision, although some persisting inconsistencies would benefit from further clarification.

    2. Reviewer #1 (Public review):

      Summary:

      Wang et al. created a series of specific FLIM-FRET sensors to measure the activity of different Rab proteins in small cellular compartments. They apply the new sensors to monitor Rab activity in dendritic spines during induction of LTP. They find sustained (30 min) inactivation of Rab10 and transient (5 min) activation of Rab4 after glutamate uncaging in zero Mg. NMDAR function and CaMKII activation are required for these effects. Knock-down of Rab4 reduced spine volume change while knock-down of Rab10 boosted it and enhanced functional LTP (in KO mice). To test Rab effects on AMPA receptor exocytosis, the authors performed FRAP of fluorescently labeled GluA1 subunits in the plasma membrane. Within 2-3 min, new AMPARs appear on the surface via exocytosis. This process is accelerated by Rab10 knock-down and slowed by Rab4 knock-down. The authors conclude that CaMKII promotes AMPAR exocytosis by i) activating Rab4, the exocytosis driver and ii) inhibiting Rab10, possibly involved in AMPAR degradation.

      Strengths:

      The work is a technical tour de force, adding fundamental insights to our understanding of the crucial functions of different Rab proteins in promoting/preventing synaptic plasticity. The complexity of compartmentalized Ras signaling is poorly understood and this study makes substantial inroads. The new sensors are thoroughly characterized, seem to work very well and will be quite useful for the neuroscience community and beyond (e.g. cancer research). The use of FLIM for read-out is compelling for precise activity measurements in rapidly expanding compartments (i.e., spines during LTP). In addition to structural changes, evidence for functional LTP is provided, too.

      Weaknesses:

      The interpretation of the FRAP experiments (Fig. 5, Ext. Data Fig. 13) is not straightforward as spine volume and surface area greatly expand during uncaging. I appreciate the correction for added spine membrane shown in Extended Data Fig. 14i.<br /> Pharmacological experiments were not conducted or analyzed blind, risking bias in the selection/exclusion of experiments for analysis.

    3. Reviewer #2 (Public review):

      Summary:

      Wang et al. developed a set of optical sensors to monitor Rab protein activity. Their investigation into Rab activity in dendritic spines during structural long-term plasticity (sLTP) revealed sustained Rab10 inactivation (>30min) and transient Rab4 activation (~5 min). Through pharmacological and genetic manipulation to constitutively activate or inhibit Rab proteins, the authors discovered that Rab10 negatively regulates sLTP and AMPA receptor trafficking, while Rab4 positively influences sLTP but only during the transient phase. These optical sensors provide new tools for studying Rab activity in cell biology and neurobiology. The distinct kinetics and functions of Rab proteins are important for understanding synaptic plasticity. However, there are some concerns regarding result inconsistencies within this manuscript and with prior work.

      Strengths:

      (1) The introduction of a series of novel sensors that can address numerous questions in Rab biology.<br /> (2) The use of multiple methods to manipulate Rab proteins to reveal the roles of Rab10 and Rab4 in LTP.<br /> (3) The discovery of Rab4 activation and Rab10 inhibition with different kinetics during sLTP, correlating with their functional roles in the transient (Rab4) and both transient and sustained (Rab10) phases of sLTP.

      Weaknesses:

      (1) The discrepancy between spine phenotype and sLTP potential with Rab10 perturbation remains unexplained (refer to previous Weakness #4). The basal state is the outcome of many activity-dependent processes that are physiologically relevant. It is also unclear why different preparations would yield different results. These can be experimentally addressed, and it is at least important to highlight and discuss the discrepancies.<br /> (2) In the response, the authors estimated that the bleed-through from mEGFP-Rab is ~3% and the red channel signal from FRET changes is ~20%. The context of these percentages is unclear. Are they percentages of the total signal in the red channel, or does 3% refer to 3% of the green channel signal? Additionally, there is no explanation of how these numbers were estimated.<br /> (3) The changes in the fEPSP slope in response to theta burst stimulation (a decrease followed by a gradual increase) differ from prior publications (e.g. PMID: 1359925, 3967730, 19144965, 20016099). The explanation of these differences due to different conditions in response to Reviewer's recommendation #6 does not seem sufficient.

    4. Reviewer #3 (Public review):

      Summary:

      This study examines the roles of Rab10 and Rab4 proteins in structural long-term potentiation (sLTP) and AMPA receptor (AMPAR) trafficking in hippocampal dendritic spines using various different methods and organotypic slice cultures as the biological model.<br /> The paper shows that Rab10 inactivation enhances AMPAR insertion and dendritic spine head volume increase during sLTP, while Rab4 supports the initial stages of these processes. The key contribution of this study is identifying Rab10 inactivation as a previously unknown facilitator of AMPAR insertion and spine growth, acting as a brake on sLTP when active. Rab4 and Rab10 seems to be playing opposing roles, suggesting a somewhat coordinated mechanism that precisely controls synaptic potentiation, with Rab4 facilitating early changes and Rab10 restricting the extent and timing of synaptic strengthening.

      Strengths:

      The study combines multiple techniques such as FRET/FLIM imaging, pharmacology, genetic manipulations and electrophysiology to dissect the roles of Rab10 and Rab4 in sLTP. The authors developed highly sensitive FRET/FLIM-based sensors to monitor Rab protein activity in single dendritic spines. This allowed them to study the spatiotemporal dynamics of Rab10 and Rab4 activity during glutamate uncaging induced sLTP. They also developed various controls to ensure the specificity of their observations. For example, they used a false acceptor sensor to verify the specificity of the Rab10 sensor response.

      This study reveals previously unknown roles for Rab10 and Rab4 in synaptic plasticity, showing their opposing functions in regulating AMPAR trafficking and spine structural plasticity during LTP.

      Weaknesses:

      In the first round of revision I raised these points:

      (1) In sLTP, the initial volume of stimulated spines is an important determinant of induced plasticity. To address changes in initial volume and those induced by uncaging, the authors present Extended Data Figure 2. In my view, the methods of fitting, sample selection, or both may pose significant limitations for interpreting the overall results. While the initial spine size distribution for Rab10 experiments spans ~0.1-0.4 fL (with an unusually large single spine at the upper end), Rab4 spine distribution spans a broader range of ~0.1-0.9 fL. If the authors applied initial size-matched data selection or used polynomial rather than linear fitting, panels a, b, e, f, and g might display a different pattern. In that case, clustering analysis based on initial size may be necessary to enable a fair comparison between groups-not only for this figure but also for main Figures 2 and 3.

      - The authors responded to this point as follows: For sensor uncaging experiments, we usually uncaged glutamate at large mushroom spines because we need to have a good signal-to-noise ratio. We just happen to choose these spines with different initial sizes for Rab4 sensor and Rab10 sensor uncaging experiments.

      Even if they happen to choose these spine sizes, it is possible to compare only those that match in size. This does not require any additional experiments. Because of this, I do not find this response satisfactory.

      (2) Another limitation is the absence of in vivo validation, as the experiments were performed in organotypic hippocampal slices, which may not fully replicate the complexity of synaptic plasticity in an intact brain, where excitatory and inhibitory processes occur concurrently. High concentrations of MNI-glutamate (4 mM in this study) are known to block GABAergic responses due to its antagonistic effect on GABA-A receptors, thereby precluding the study of inhibitory network activity or connectivity, which is already known to be altered in organotypic slice cultures.

      - I found the Authors following response reasonable and useful:

      We appreciate the reviewer's comments and would like to clarify that we have conducted experiments in acute slices for LTP using conditional Rab10 knockout (Fig. 4k, 4l), and we obtained similar results. Additionally, we have recently published findings on the behavioral deficits observed in heterozygous Rab10 knockout mice (PubMed 37156612). These studies further support our conclusions and provide additional context for our findings.

    5. Author response:

      The following is the authors’ response to the original reviews.

      Reviewer #1 (Public review):

      Summary:

      Wang et al. created a series of specific FLIM-FRET sensors to measure the activity of different Rab proteins in small cellular compartments. They apply the new sensors to monitor Rab activity in dendritic spines during induction of LTP. They find sustained (30 min) inactivation of Rab10 and transient (5 min) activation of Rab4 after glutamate uncaging in zero Mg. NMDAR function and CaMKII activation are required for these effects. Knockdown of Rab4 reduced spine volume change while knockdown of Rab10 boosted it and enhanced functional LTP (in KO mice). To test Rab effects on AMPA receptor exocytosis, the authors performed FRAP of fluorescently labeled GluA1 subunits in the plasma membrane. Within 2-3 min, new AMPARs appear on the surface via exocytosis. This process is accelerated by Rab10 knock-down and slowed by Rab4 knock-down. The authors conclude that CaMKII promotes AMPAR exocytosis by i) activating Rab4, the exocytosis driver and ii) inhibiting Rab10, possibly involved in AMPAR degradation.

      Strengths:

      The work is a technical tour de force, adding fundamental insights to our understanding of the crucial functions of different Rab proteins in promoting/preventing synaptic plasticity. The complexity of compartmentalized Ras signaling is poorly understood and this study makes substantial inroads. The new sensors are thoroughly characterized, seem to work very well, and will be quite useful for the neuroscience community and beyond (e.g. cancer research). The use of FLIM for read-out is compelling for precise activity measurements in rapidly expanding compartments (i.e., spines during LTP).

      Thank you for the evaluation.

      Weaknesses:

      The interpretation of the FRAP experiments (Figure 5, Ext. Data Figure 13) is not straightforward as spine volume and surface area greatly expand during uncaging. I appreciate the correction for the added spine membrane shown in Extended Data Figure 14i, but shouldn't this be a correction factor (multiplication) derived from the volume increase instead of a subtraction?

      We thank the reviewer for this question. The fluorescence change should reflect a subtraction of surface area, as SEP-GluA1 is only fluorescent on the cell surface, unlike cytosolic mCherry, whose fluorescence intensity is proportional to spine volume. Therefore, the overall fluorescence change (ΔF) should be the addition of the contribution from AMPAR trafficking (ΔF<sub>t</sub>) and the change in surface area (ΔS) multiplied by the remaining SEP-GluA1 fluorescence per unit area (f):

      ΔF = ΔF<sub>t</sub> + fΔS

      Since fluorescence immediately after photobleaching (before AMPAR trafficking happens), F<sub>o</sub>, is given by fS (S is the surface area of the spine):

      ΔF/F<sub>o</sub> = ΔF<sub>t</sub>/ F<sub>o</sub> + fΔS / fS

      \= ΔF<sub>t</sub>/fS + ΔS/S

      Assuming that the surface area change (ΔS/S) is the volume change (ΔV/V) to the power of 2/3, the contribution of the AMPAR trafficking can be calculated as:

      ΔF<sub>t</sub>/F = ΔF/F – (Δ<sup>V/V)<sup>2/3</sup>

      This is the reason that we subtracted the contribution of the spine surface area. We have discussed this in the updated method section.

      Also, experiments were not conducted or analyzed blind, risking bias in the selection/exclusion of experiments for analysis. This reduces my confidence in the results.

      We acknowledge the reviewer's concern regarding the lack of blinding in our experiments. However, it is challenging to conduct blinded experiments for certain types of studies, such as sensor screening for a protein family, where we do not have expected results or a specific hypothesis prior to the experiments. In these cases, our primary readout is whether the sensor indicates any activity change upon stimulation.

      To address this concern, after identifying that Rab10 is inactivated during structural LTP (sLTP) and is likely important for inhibiting spine structural LTP, we performed blinded electrophysiology experiments and obtained similar results (deletion of Rab10 from Camk2a-positive neurons leads to enhanced LTP; Fig. 4k, 4l).

      Reviewer #2 (Public review):

      Summary:

      Wang et al. developed a set of optical sensors to monitor Rab protein activity. Their investigation into Rab activity in dendritic spines during structural long-term plasticity (sLTP) revealed sustained Rab10 inactivation (>30min) and transient Rab4 activation (~5 min). Through pharmacological and genetic manipulation to constitutively activate or inhibit Rab proteins, they found that Rab10 negatively regulates sLTP and AMPA receptor insertion, while Rab4 positively influences sLTP but only in the transient phase. The optical sensors provide new tools for studying Rab activity in cells and neurobiology. However, a full understanding of the timing of Rab activity will require a detailed characterization of sensor kinetics.

      Strengths:

      (1) Introduction of a series of novel sensors that can address numerous questions in Rab biology.

      (2) Multiple methods to manipulate Rab proteins to reveal the roles of Rab10 and rab4 in LTP.

      (3) Discovery of Rab4 activation and Rab10 inhibition with different kinetics during sLTP, correlating with their functional roles in the transient (Rab4) and both transient and sustained (Rab10) phases of sLTP.

      Thank you for the positive evaluation.

      Weaknesses:

      (1) Lack of characterization of sensor kinetics, making it difficult to determine if the observed Rab kinetics during sLTP were due to sensor behavior or actual Rab activity.

      We estimated that the kinetics of the sensors for Rab4 and Rab10 are within a few minutes. For Rab4, we observed rapid increase and decrease of the activation in response to glutamate uncaging. Thus, this would be the upper limit of the ON/OFF time constants of Rab4. For Rab10, we observed a rapid dissociation of the sensor in response to sLTP induction within ~1 min. This means that the donor and acceptor molecules are quickly dissociated during the process. Thus, the off kinetics of the sensor is within the range of minute. Meanwhile, we have the on-kinetics from Rab10 activation (donor/accepter association) in response to NMDA application and again this is within a few minutes. Given these rapid sensor kinetics in neurons, our observation of the sustained inactivation of Rab10 should reflect the true behavior of Rab10, rather than just the sensor’s response.

      We revised our manuscript discussion session as follows:

      “Understanding the kinetics of Rab4 and Rab10 sensors is essential for interpreting their actual activity during sLTP. The Rab4 sensor exhibits a rapid rise and fall in activation (Fig. 3), indicating ON/OFF times of less than a few minutes. In contrast, the Rab10 sensor rapidly dissociates during sLTP induction (Fig. 2), with OFF kinetics occurring within one minute and fast ON kinetics in response to NMDA (Fig. 1j). Given these rapid kinetics, the observed sustained inactivation of Rab10 likely reflects its true behavior rather than sensor dynamics.”

      (2) It is crucial to assess whether the overexpression of Rab proteins as reporters, affects Rab activity and cellular structure and physiology (e.g. spine number and size).

      While we did not measure the effects of Rab sensor overexpression on Rab activity or cellular structure and physiology, we showed that sLTP is similar in neurons expressing sensors. This suggests that the overexpression of Rab sensors does not significantly disrupt signaling required for sLTP.

      (3) The paper does not explain the apparently different results between NMDA receptor activation and glutamate uncaging. NMDA receptor activation increased Rab10 activity, while glutamate uncaging decreased it. NMDA receptor activation resulted in sustained Rab4 activation, whereas glutamate uncaging caused only brief activation of about 5 minutes. A potential explanation, ideally supported by data, is needed.

      It is a long-standing question in the field why simple NMDA receptor activation by bath application of NMDA does not induce LTP, but instead induce LTD. Rab proteins are regulated by many GEFs and GAPs and identifying different mechanisms requires completely different techniques, such as molecular screening. While our manuscript provides some insights into this question by showing that they provide opposing signals for Rab10, we believe that identifying exact mechanisms would be out of the scope of this manuscript.

      (4) There is a discrepancy between spine phenotype and sLTP potential with Rab10 perturbation. Rab10 perturbation affected spine density but not size, suggesting a role in spinogenesis rather than sLTP. However, glutamate uncaging affected sLTP, and spinogenesis was not examined. Explaining the discrepancy between spine size and sLTP potential is necessary. Exploring spinogenesis with glutamate uncaging would strengthen these results. Additionally, Figure 4j shows no change in synaptic transmission with Rab10 knockout, despite an increase in spine density. An explanation, ideally supported by data, is needed for the unchanged fEPSP slope despite an increase in spine density.

      We thank the reviewer for raising these important questions. In our findings, shRNA-mediated knockdown of Rab10 did not alter spine size but did increase spine density in the basal state (Extended Data Fig. 11i). This suggests that Rab10 may restrict spinogenesis without affecting spine size. Conversely, sLTP induction via glutamate uncaging is an activity-dependent process that may involve different molecular mechanisms. The signal interplay between spinogenesis and sLTP and how the exact roles of Rab signaling in different modalities of plasticity would remain elusive for the future study.

      The lack of change in synaptic transmission with Rab10 knockout, despite the increase in spine density from Rab10 shRNA knockdown, may be due to different preparation and developmental stages: spine density measurements were conducted with shRNA knockdown in organotypic slices (sliced at P6-8, DIV 9-13), while electrophysiological recordings were performed in knockout mice in acute slices from adult animals (P30-60).

      (5) Spine volume was imaged using acceptor fluorophores (mCherry, or mCherry/Venus) at 920nm, where the two-photon cross-section of mCherry is minimal. 920nm was also used to excite the donor fluorophore, hence the spine volume measurement based on total red channel fluorescence is the sum of minimal mCherry fluorescence from direct 920nm excitation, bleed-through from the green channel, and FRET. This confounded measurement requires correction and clarification.

      We assumed that the most of fluorescence is from direct excitation of mCherry at 920 nm. The contribution from the bleed-through from mEGFP-Rab (~3%) and from FRET changes (~20%) may influence the volume measurements. However, since we observed similar fluorescence changes in the green and red channels, these factors would have only a minor impact on our results (Extended Data Fig. 6a, 6d). Also, please note that the volume change in neurons expressing sensors is just to check if the volume change is normal, and not a major point of this manuscript.  We clarified this in the method section as:

      “For the sensor experiments, we used mCherry as a volume indicator. We acknowledge that contributions from bleed-through from mEGFP-Rab (approximately 3%) and FRET changes (around 20%) could affect the volume measurements. However, since we observed similar fluorescence changes in both the green and red channels, we believe these factors have a minimal impact on our results (Extended Data Fig. 6a, 6d).”

      Reviewer #3 (Public review):

      Summary:

      This study examines the roles of Rab10 and Rab4 proteins in structural long-term potentiation (sLTP) and AMPA receptor (AMPAR) trafficking in hippocampal dendritic spines using various different methods and organotypic slice cultures as the biological model.

      The paper shows that Rab10 inactivation enhances AMPAR insertion and dendritic spine head volume increase during sLTP, while Rab4 supports the initial stages of these processes. The key contribution of this study is identifying Rab10 inactivation as a previously unknown facilitator of AMPAR insertion and spine growth, acting as a brake on sLTP when active. Rab4 and Rab10 seem to be playing opposing roles, suggesting a somewhat coordinated mechanism that precisely controls synaptic potentiation, with Rab4 facilitating early changes and Rab10 restricting the extent and timing of synaptic strengthening.

      Strengths:

      The study combines multiple techniques such as FRET/FLIM imaging, pharmacology, genetic manipulations, and electrophysiology to dissect the roles of Rab10 and Rab4 in sLTP. The authors developed highly sensitive FRET/FLIM-based sensors to monitor Rab protein activity in single dendritic spines. This allowed them to study the spatiotemporal dynamics of Rab10 and Rab4 activity during glutamate uncaging-induced sLTP. They also developed various controls to ensure the specificity of their observations. For example, they used a false acceptor sensor to verify the specificity of the Rab10 sensor response.

      This study reveals previously unknown roles for Rab10 and Rab4 in synaptic plasticity, showing their opposing functions in regulating AMPAR trafficking and spine structural plasticity during LTP.

      Thank you for the positive evaluation.

      Weaknesses:

      In sLTP, the initial volume of stimulated spines is an important determinant of induced plasticity. To address changes in initial volume and those induced by uncaging, the authors present Extended Data Figure 2. In my view, the methods of fitting, sample selection, or both may pose significant limitations for interpreting the overall results. While the initial spine size distribution for Rab10 experiments spans ~0.1-0.4 fL (with an unusually large single spine at the upper end), Rab4 spine distribution spans a broader range of ~0.1-0.9 fL. If the authors applied initial size-matched data selection or used polynomials rather than linear fitting, panels a, b, e, f, and g might display a different pattern. In that case, clustering analysis based on initial size may be necessary to enable a fair comparison between groups not only for this figure but also for main Figures 2 and 3.

      We thank the reviewer for these questions. For sensor uncaging experiments, we usually uncaged glutamate at large mushroom spines because we need to have a good signal-to-noise ratio. We just happen to choose these spines with different initial sizes for Rab4 sensor and Rab10 sensor uncaging experiments.

      Another limitation is the absence of in vivo validation, as the experiments were performed in organotypic hippocampal slices, which may not fully replicate the complexity of synaptic plasticity in an intact brain, where excitatory and inhibitory processes occur concurrently. High concentrations of MNI-glutamate (4 mM in this study) are known to block GABAergic responses due to its antagonistic effect on GABA-A receptors, thereby precluding the study of inhibitory network activity or connectivity [1], which is already known to be altered in organotypic slice cultures.

      (1) https://www.frontiersin.org/journals/neural-circuits/articles/10.3389/neuro.04.002.2009/full

      We appreciate the reviewer's comments and would like to clarify that we have conducted experiments in acute slices for LTP using conditional Rab10 knockout (Fig. 4k, 4l), and we obtained similar results. Additionally, we have recently published findings on the behavioral deficits observed in heterozygous Rab10 knockout mice (PubMed 37156612). These studies further support our conclusions and provide additional context for our findings.

      Recommendations for the authors:

      From the Senior/Reviewing Editor:

      I apologize that this took longer than intended. As you will see from the reviews there was some disagreement on several points. There was some disagreement among reviewers as to the strength of the evidence with some characterizing it as "compelling," "convincing," or "solid" while others felt the characterization of the sensors was "incomplete" and that this could have affected some of the conclusions. After extensive discussion, reviewers agreed that there was a valid concern that the conclusion that Rab10 activation is sustained could reflect a feature of the sensor. If Rab10/RBD dissociation rate were very low, and the affinity of binding were very high, this could lead to an incorrect estimate of the sustained binding due to sensor kinetics, not Rab10 activation. It was noted that this has been seen in other sensors previously (e.g. first generation PKA activity sensors), which the developers altered in later generations to increase reversibility and off kinetics of the sensor.

      There was also discussion of how this might be addressed and we would be interested in your comments on this issue. It was suggested that it might be helpful to revise Figure 2b to show binding fraction dynamics separately for each spine (to determine whether any actually return to baseline). Subsequently, clustering of these binding dynamics into two groups could be summarized in a version of Fig. 2e for each cluster. Differences in spine volume dynamics between these clusters would provide a measure of how strongly Rab10 binding correlates with spine volume. If they never go back to baseline, some extra experiments with longer post-plasticity induction (150mins instead of 35), might show if any reversible Rab10 binding exists post-LTP induction.

      An alternative suggestion was to measure the time course in the presence of a GAP or GEF, which should alter the kinetics.

      Thanks for the comments. It is important that the inactivation is observed as the dissociation of the donor and acceptor of the sensor.  Thus, the fact that the sensor rapidly decreases in response to uncaging means that they have rapid off kinetics. In addition, we provide evidence of a rapid increase of Rab10 in response to NMDA application, suggesting that kinetics is also rapid. We added discussion about this in the revised manuscript as:

      “Understanding the kinetics of Rab4 and Rab10 sensors is essential for interpreting their actual activity during sLTP. The Rab4 sensor exhibits a rapid rise and fall in activation (Fig. 3), indicating ON/OFF times of just a few minutes. In contrast, the Rab10 sensor rapidly dissociates during sLTP induction (Fig. 2), with OFF kinetics occurring within one minute and fast ON kinetics in response to NMDA (Fig. 1j). Given these rapid kinetics, the observed sustained inactivation of Rab10 likely reflects its true behavior rather than sensor dynamics.”

      There was also further discussion of the nature of the "spine volume" signal, given the fact that the two-photon cross-section of mCherry is minimal at 920nm. It was suggested that this could be due to direct acceptor excitation rather than FRET, but there was agreement that further clarity on this issue would be valuable.

      We assumed that the most of fluorescence is from direct excitation of mCherry at 920 nm. The contribution from the bleed-through from mEGFP-Rab (~3%) and from FRET changes (~20%) may influence the volume measurements. However, since we observed similar fluorescence changes in the green and red channels, these factors would have only a minor impact on our results (Extended Data Fig. 6a, 6d). Also, please note that the volume change in neurons expressing sensors is just to check if the volume change is normal, and not a major point of this manuscript.  We clarified this in the method section as:

      “For the sensor experiments, we used mCherry as a volume indicator. We acknowledge that contributions from bleed-through from mEGFP-Rab (approximately 3%) and FRET changes (around 20%) could affect the volume measurements. However, since we observed similar fluorescence changes in both the green and red channels, we believe these factors have a minimal impact on our results (Extended Data Fig. 6a, 6d).”

      The equations in the methods section differ from other papers by the same lab (e.g. Laviv et al, Neuron 2020, Tu et al. Sci Adv. 2023, Jain et al. Nature 2024). Please clarify which equations are correct.

      Thanks for pointing this out. In fact, some of the equations in this manuscript were wrong, and we have corrected them in the method session.

      Reviewer #1 (Recommendations for the authors):

      The effects of Rab knockdown affect both spine volume expansion and AMPAR recovery in a very similar fashion. To explain this tight coupling, the authors suggest that the availability of membrane could be a limiting factor for spine enlargement. However, some Rabs are known to affect actin dynamics, which could also explain the dual effects on AMPAR exocytosis and spine enlargement. It is not easy to come up with an experiment to differentiate between these alternative explanations, as blocking actin polymerization would likely affect exocytosis, too. The authors should consider/discuss the possibility that all of the observed Ras effects result from altered actin dynamics and that the lipid bilayer is sufficiently fluid to form a minimal surface around the expanding cytoskeleton.

      Thanks for the suggestions. We included the discussion about the potential impact on the actin cytoskeleton by Rab10.

      Typos: heterougenous, compartmantalization, chemaical, ballistically/biolistically (chose one).

      Thanks for pointing out these typos. We have corrected them in the revised manuscript.

      Reviewer #2 (Recommendations for the authors):

      (1) Venus shows pH sensitivity, which can be significant at synapses due to pH changes. Characterizing the pH sensitivity of the sensors is essential.

      Thanks for the suggestions. We did not measure pH dependence, but the PKa of these fluorophores has already been published. PKa for EGFP and Venus are both 6.0, and it is unlikely that it influenced our measurements.

      (2) Presenting individual data points within all bar graphs (e.g. Fig. 2c, 2d) would enhance data transparency.

      Thanks for the suggestions. We now provide individual data points in the revised main figures.

      (3) In Figure 1f: Rab5 GAP expression increased the binding fraction against expectations. In addition, clarifying the color scheme in Figure 1 is needed. Are GAPs supposed to be blue/green, and GEFs red/orange? Figure 1f seems to contradict this color scheme.

      Thanks for the suggestions. We clarified these issues.

      (4) Quantification of the point spread function of the uncaging laser, response/settle time of the scan mirror during uncaging, and reason for changes in neighboring spines in many example images (e.g. Figure 2a, especially at 240 s; Figure 4a) would be important.

      The laser is controlled by Pockels cells, which changes the laser intensity with microsecond resolution. The laser is parked for milliseconds during uncaging, much longer than the settling time of the mirror (~0.1 milliseconds). The point spread function of the uncaging laser is limited by the diffraction (~0.5 um). The uncaging spot size is mostly limited by the diffusion of uncaged glutamate, but our calcium imaging and CaMKII imaging show that the signaling is induced mostly in the stimulated spines (Lee et al., 2009; Chang et al., 2017, 2019).

      (5) Please include traces for "false" sensors in stimulated spines in Figures 2b, 2e, 3b, and 3e.

      The traces for the false sensors have been presented in Extended Data Fig. 3 and Extended Data Fig. 8.

      (6) The traces in Figure 4k (fEPSP slope in response to theta burst stimulation, where there is a decrease in fEPSP slope followed by a gradual increase) differ from prior publications (e.g. PMID: 1359925, 3967730, 19144965, 20016099). An investigation and explanation for these differences are necessary.

      We appreciate the reviewer’s comments. We performed the experiments blindly and did not try to find a condition providing control data similar to previous publications. The variations in fEPSP responses compared to prior publications may be attributed to several factors, including differences in experimental conditions such as the genetic background of the animals used, the specific protocols for theta burst stimulation, and variations in the preparation of the hippocampal slices.

      (7) The title and text state that Rab10 inactivation promotes AMPAR insertion. It is unclear if this is a direct effect on AMPAR insertion or an indirect effect through membrane remodeling. Providing data to distinguish these possibilities or adjusting the title/text to reflect alternative interpretations would be beneficial.  

      We appreciate the reviewer's feedback. To clarify, we have revised our terminology to use "AMPAR trafficking" instead of "AMPAR insertion", as it includes both insertion and other mechanisms of AMPAR movement within the cell.

      (8) Please provide an explanation for the initial Rab10 inactivation observed in Figure 1j upon NMDA application.

      The application of NMDA in Fig. 1j is similar to the commonly used chemical LTD induction protocol. We used this broad stimulation approach to test whether our sensors could report Rab activity changes in neurons upon strong stimulation. However, it is an entirely different stimulation approach from the sLTP induction protocol, thus resulting in different sensor activity changes.  We describe the phenomenon in the revised manuscript, but we believe that detailed analyses of Rab10 activation in response to NMDA application are beyond the scope of this manuscript.

      (9) Please explain why the study focuses on Rab4 and Rab10 instead of other Rab proteins.

      During our initial screening of sensors for various Rab proteins, we observed significant activity changes in the sensors for Rab4 and Rab10 upon sLTP induction. This suggested their potential relevance in synaptic processes, leading us to focus on understanding their specific roles in structural long-term potentiation.

      Reviewer #3 (Recommendations for the authors):

      (1) Although it might seem trivial, the definition of adjacent spine has not been made in the text. It would be nice to have it in the Methods section.

      We included it in the Methods section as follows:

      "The adjacent spine refers to the first or second spine located next to the stimulated spine, typically positioned opposite the stimulated spine. Additionally, the size of the adjacent spine must be sufficiently large for imaging."

      (2) The transfection method has been mentioned as "ballistic" and "biolistic" transfection. You might want to use only one term. Additionally, you can add the equipment used (Bio-rad?) and pressure (psi) in the Methods section.

      We use “biolistic” throughout the manuscript now. We also added the equipment and conditions used.

    1. eLife Assessment

      This study presents important findings on the role of pyramidal cells driving vasoconstriction in brain arteries through a COX-2/PGE2 pathway, with additional contributions from NPY (interneurons) and 20-HETE (astrocytes). Optogenetic stimulation of cortical pyramidal neurons induces vasoconstriction, potentially leading to oxygen and nutrient undersupply in regions with sustained activation - a mechanism potentially relevant under pathological conditions. The authors provide convincing evidence from brain slice experiments and some in vivo data from anesthetized animals, carefully discussing the strengths and limitations of both approaches.

    2. Reviewer #1 (Public review):

      SNeuronal activity spatiotemporal fine-tuning of cerebral blood flow balances metabolic demands of changing neuronal activity with blood supply. Several 'feed-forward' mechanisms have been described that contribute to activity-dependent vasodilation as well as vasoconstriction leading to a reduction in perfusion. Involved messengers are ionic (K+), gaseous (NO), peptides (e.g., NPY, VIP) and other messengers (PGE2, GABA, glutamate, norepinephrine) that target endothelial cells, smooth muscle cells, or pericytes. Contributions of the respective signaling pathways likely vary across brain regions or even within specific brain regions (e.g., across cortex) and are likely influenced by the brain's physiological state (resting, active, sleeping) or pathological departures from normal physiology.

      The manuscript "Elevated pyramidal cell firing orchestrates arteriolar vasoconstriction through COX-2-derived prostaglandin E2 signaling" by B. Le Gac, et al. investigates mechanisms leading to activity-dependent arteriole constriction. Here, mainly working in brain slices from mice expressing channelrhodopsin 2 (ChR2) in all excitatory neurons (Emx1-Cre; Ai32 mice), the authors show that strong optogenetic stimulation of cortical pyramidal neurons is leading to constriction that is mediated through the cyclooxygenase-2 / prostaglandin E2 / EP1 and EP3 receptor pathway with contribution of NPY-releasing interneurons and astrocytes releasing 20-HETE. Specifically, using patch clamp, the authors show that 10-s optogenetic stimulation at 10 and 20 Hz leads to vasoconstriction (Figure 1), in line with a stimulation frequency-dependent increase in somatic calcium (Figure 2). The vascular effects were abolished in presence in TTX and significantly reduced in presence of glutamate receptor antagonists (Figure 3). The authors further show with RT-PCR on RNA isolated from patched cells that ~50% of analyzed cells express COX-1 or -2 and other enzymes required to produce PGE2 or PGF2a (Figure 4). Further, blockade of COX-1 and -2 (indomethacin), or COX-2 (NS-398) abolishes constriction. In animals with chronic cranial window that were anesthetized with ketamine and medetomidine, 10-s long optogenetic stimulation at 10 Hz leads to considerable constriction, which is reduced in presence of indomethacin. Blockade of EP1 and EP3 receptors leads to significant reduction of the constriction in slices (Figure 5). Finally, the authors show that blockade of 20-HETE synthesis caused moderate and NPY Y1 receptor blockade a complete reduction of constriction.

      The mechanistic analysis of neurovascular coupling mechanisms as exemplified here will guide further in-vivo studies and has important implications for human neuroimaging in health and disease. Most of the data in this manuscript uses brain slices as experimental model which contrasts with neurovascular imaging studies performed in awake (headfixed) animals. However, the slice preparation allows for patch clamp as well as easy drug application and removal. Further, the authors discuss their results in view of differences between brain slices and in vivo observations experiments, including the absence of vascular tone as well as blood perfusion required for metabolite (e.g., PGE2) removal, and the presence of network effects in the intact brain. The manuscript and figures present the data clearly; regarding the presented mechanism, the data supports the authors conclusions. Some of the data was generated in vivo in head-fixed animals under anesthesia; in this regard, the authors should revise introduction and discussion to include the important distinction between studies performed in slices, or in acute or chronic in-vivo preparations under anesthesia (reduced network activity and reduced or blockade of neuromodulation, or in awake animals (virtually undisturbed network and neuromodulatory activity). Further, while discussed to some extent, the authors could improve their manuscript by more clearly stating if they expect the described mechanism to contribute to CBF regulation under 'resting state conditions' (i.e., in absence of any stimulus), during short or sustained (e.g., visual, tactile) stimulation, or if this mechanism is mainly relevant under pathological conditions; especially in context of the optogenetic stimulation paradigm being used (10-s long stimulation of many pyramidal neurons at moderate-high frequencies) and the fact that constriction leading to undersupply in response to strongly increased neuronal activity seems counterintuitive?

      The authors have addressed all comments, and I appreciate their insightful discussion and revision of the manuscript.

    3. Reviewer #2 (Public review):

      Summary:

      The present study by Le Gac et al. investigates the vasoconstriction of cerebral arteries during neurovascular coupling. It proposes that pyramidal neurons firing at high frequency lead to prostaglandin E2 (PGE2) release and activation of arteriolar EP1 and EP3 receptors, causing smooth muscle cell contraction. The authors further claim that interneurons and astrocytes also contribute to the vasoconstriction via neuropeptide Y (NPY) and 20-hydroxyeicosatetraenoic acid (20-HETE) release, respectively. The study mainly uses brain slices and pharmacological tools in combination with Emx1-Cre;Ai32 transgenic mice expressing the H134R variant of channelrhodopsin-2 (ChR2) in the cortical glutamatergic neurons for precise photoactivation. Stimulation with 470 nm light using 10-second trains of 5-ms pulses at frequencies from 1-20 Hz revealed small constrictions at 10 Hz and robust constrictions at 20 Hz, which were abolished by TTX and partially inhibited by a cocktail of glutamate receptor antagonists. Inhibition of cyclooxygenase-1 (COX-1) or -2 (COX-2) by indomethacin blocked the constriction both ex vivo (slices) and in vivo (pial artery), and inhibition of EP1 and EP3 showed the same effect ex vivo. Single-cell RT-PCR from patched neurons confirmed the presence of the PGE2 synthesis pathway. While the data are convincing, the overall experimental setting presents some limitations. How is the activation protocol comparable to physiological firing frequency? The delay (minutes) between the stimulation and the constriction appears contradictory to the proposed pathway, which would be expected to occur rapidly. The experiments are conducted in the absence of vascular "tone," which further questions the significance of the findings. Some of the targets investigated are expressed by multiple cell types, which makes the interpretation difficult; for example, cyclooxygenases are also expressed by endothelial cells. Finally, how is the complete inhibition of the constriction by the NPY Y1 receptor antagonist BIBP3226 consistent with a direct effect of PGE2 and 20-HETE in arterioles? Overall, the manuscript is well-written with clear data, but the interpretation and physiological relevance have some limitations. However, vasoconstriction is a rather understudied phenomenon in neurovascular coupling, and the present findings may be of significance in the context of pathological brain hypoperfusion.

    1. eLife Assessment

      This study presents a useful demonstration that a specific protein fragment may induce the loss of synapses in Alzheimer's disease. The evidence supporting the data is solid but only partially supports the conclusion and would benefit from additional discussion indicated by the literature from reviewer #1. The application of the findings is limited because blocking the formation of the protein fragment has not benefited patients in several clinical trials.

    2. Reviewer #1 (Public review):

      Summary of what the authors were trying to achieve:

      In this manuscript, the authors investigated the role of β-CTF on synaptic function and memory. They report that β-CTF can trigger the loss of synapses in neurons that were transiently transfected in cultured hippocampal slices and that this synapse loss occurs independently of Aβ. They confirmed previous research (Kim et al, Molecular Psychiatry, 2016) that β-CTF-induced cellular toxicity occurs through a mechanism involving a hexapeptide domain (YENPTY) in β-CTF that induces endosomal dysfunction. Although the current study also explores the role of β-CTF in synaptic and memory function in the brain using mice chronically expressing β-CTF, the studies are inconclusive because potential effects of Aβ generated by γ-secretase cleavage of β-CTF were not considered. Based on their findings, the authors suggest developing therapies to treat Alzheimer's disease by targeting β-CTF. While they acknowledge that clinical trials of potent BACE1 inhibitors - which also target β-CTF - have failed to show clinical improvement, their study lacks in vivo evidence directly linking β-CTF to brain function, which weakens its significance.

      Major strengths and weaknesses of the methods and results:

      The conclusions of the in vitro experiments using cultured hippocampal slices were well supported by the data, but aspects of the in vivo experiments need additional clarification.<br /> In contrast to the in vitro experiments in which a γ-secretase inhibitor was used to exclude possible effects of Aβ, this possibility was not examined in in vivo experiments assessing synapse loss and function (Fig. 3) and cognitive function (Fig. 4). The absence of plaque formation (Fig. 4C) is not sufficient to exclude the possibility that Aβ is involved. The potential involvement of Aβ is an important consideration given the 4-month duration of protein expression in the in vivo studies. This issue could be addressed using γ-secretase modulators to avoid the off-target effects of inhibitors. Evidence that the detrimental effects in mice are directly caused by β-CTF rather than indirectly via Aβ is critical to support the authors' conclusion.

      Appraisal of whether the authors achieved their aims, and whether the results support their conclusion:

      See above

      Discussion of likely impact of the work on the field, and the utility of the methods and data to the community:

      The authors' use of sparse expression to examine the role of β-CTF on spine loss could be a useful general tool for examining synapses in brain tissue.

      Any additional context that might help readers interpret or understand the significance of the work:

      The discovery of BACE1 stimulated an international effort to develop BACE1 inhibitors to treat Alzheimer's disease. BACE1 inhibitors block the formation of β-CTF which, in turn, prevents the formation of Aβ and other fragments. Unfortunately, BACE1 inhibitors not only did not improve cognition in patients with Alzheimer's disease, they appeared to worsen it, suggesting that β-CTF could facilitate learning and memory. Therefore, it seems unlikely that the disruptive effects of β-CTF on endosomes plays a significant role in the human disease.

      Comments on revisions:

      The authors may be interested in the study by Ma et al., PNAS 2007 titled "Involvement of β-site APP cleaving enzyme 1 (BACE1) in amyloid precursor protein-mediated enhancement of memory and activity-dependent synaptic plasticity," which provides significant insights into the physiological role of BACE1 in synaptic function. The researchers demonstrated that BACE1-mediated cleavage of amyloid precursor protein (APP) is essential for enhancing learning, memory, and synaptic plasticity in vivo. They observed that overexpression of APP in transgenic mice led to improved spatial memory retention and potentiation of synaptic plasticity, effects that were abolished when one or both copies of the BACE1 gene were eliminated. This suggests that BACE1's cleavage of APP facilitates activity-dependent synaptic modifications, potentially through the production of APP intracellular domain (AICD) via β-CTF, rather than amyloid-β (Aβ) or soluble APPα (sAPPα). These findings highlight a physiological mechanism where BACE1-mediated APP processing leading to β-CTF supports cognitive functions, potentially explaining the detrimental effects of BACE1 inhibitors on cognitive function in clinical trials.

    3. Reviewer #3 (Public review):

      Summary:

      Most previous studies have focused on the contributions of Abeta and amyloid plaques in the neuronal degeneration associated with Alzheimer's disease, especially in the context of impaired synaptic transmission and plasticity which underlies the impaired cognitive functions, a hallmark in AD. But processes independent of Abeta and plaques are much less explored, and to some extent, the contributions of these processes are less well understood. Luo et all addressed this important question with an array of approaches, and their findings generally support the contribution of beta-CTF-dependent but non-Abeta dependent process to the impaired synaptic properties in the neurons. Interestingly, the above process appears to operate in a cell-autonomous manner. This cell-autonomous effect of beta-CTF as reported here may facilitate our understanding of some potential important cellular processes related to neurodegeneration. Although these findings are valuable, it is key to understand the probability of this process occurring in a more natural condition, such as when this process occurring in many neurons at the same time. This will put the authors' findings into a context for a better understanding of their contribution to either physiological or pathological processes, such as Alzheimer's. The experiments and results using cell system are quite solid, but the in vivo results are incomplete and hence less convincing (see below). The mechanistic analysis is interesting but primitive, and does not add much more weight to the significance. Hence, further efforts from the authors are required to clarify, and solidify their results, in order to provide a complete picture and support for the authors' conclusions.

      Strengths:

      (1) The authors have addressed an interesting and potentially important question<br /> (2) The analysis using the cell system are solid and provides strong support for the authors' major conclusions. This analysis has used various technical approaches to support the authors' conclusions from different aspects and most of these results are consistent with each other.

      Weaknesses:

      (1) The relevance of the authors' major findings to the pathology, especially the Abeta-dependent processes is less clear, and hence the importance of these findings may be limited.<br /> (2) In vivo analysis is incomplete, with certain caveats in the experimental procedures and some of the results need to be further explored to confirm the findings.<br /> (3) The mechanistic analysis is rather primitive and does not add further significance.

      Comments on revisions:

      The authors have satisfactorily addressed my main questions.

    4. Author response:

      The following is the authors’ response to the original reviews.

      Public Reviews:

      Reviewer #1 (Public Review):

      Summary of what the authors were trying to achieve:

      In this manuscript, the authors investigated the role of β-CTF on synaptic function and memory. They report that β-CTF can trigger the loss of synapses in neurons that were transiently transfected in cultured hippocampal slices and that this synapse loss occurs independently of Aβ. They confirmed previous research (Kim et al, Molecular Psychiatry, 2016) that β-CTF-induced cellular toxicity occurs through a mechanism involving a hexapeptide domain (YENPTY) in β-CTF that induces endosomal dysfunction. Although the current study also explores the role of β-CTF in synaptic and memory function in the brain using mice chronically expressing β-CTF, the studies are inconclusive because potential effects of Aβ generated by γ-secretase cleavage of β-CTF were not considered. Based on their findings, the authors suggest developing therapies to treat Alzheimer's disease by targeting β-CTF, but did not address the lack of clinical improvement in trials of several different BACE1 inhibitors, which target β-CTF by preventing its formation.

      We would like to thank the reviewer for his/her suggestions. We have addressed the specific comments in following sections.

      Major strengths and weaknesses of the methods and results:

      The conclusions of the in vitro experiments using cultured hippocampal slices were well supported by the data, but aspects of the in vivo experiments and proteomic studies need additional clarification.

      (1) In contrast to the in vitro experiments in which a γ-secretase inhibitor was used to exclude possible effects of Aβ, this possibility was not examined in in-vivo experiments assessing synapse loss and function (Figure 3) and cognitive function (Figure 4). The absence of plaque formation (Figure 4B) is not sufficient to exclude the possibility that Aβ is involved. The potential involvement of Aβ is an important consideration given the 4-month duration of protein expression in the in vivo studies.

      We appreciate the reviewer for raising this question. While our current data did not exclude the potential involvement of Aβ-induced toxicity in the synaptic and cognitive dysfunction observed in mice overexpressing β-CTF, addressing this directly remains challenging. Treatment with γ-secretase inhibitors could potentially shed light on this issue. However, treatments with γ-secretase inhibitors are known to lead to brain dysfunction by itself likely due to its blockade of the γ-cleavage of other essential molecules, such as Notch[1, 2]. Therefore, this approach is unlikely to provide a clear answer, which prevents us from pursuing it further experimentally in vivo. We hope the reviewer understands this limitation. We have included additional discussion (page 14 of the revised manuscript) to highlight this question.

      (2) The possibility that the results of the proteomic studies conducted in primary cultured hippocampal neurons depend in part on Aβ was also not taken into consideration.

      We thank the reviewer for raising this question. In the revised manuscript, we examined the protein levels of synaptic proteins after treatment with γ-secretase inhibitors and found that the levels of certain synaptic proteins were further reduced in neurons expressing β-CTF (Supplementary figure 5A-B). These results do not support Aβ as a major contributor of the proteomic changes induced by β-CTF.

      Likely impact of the work on the field, and the utility of the methods and data to the community:

      The authors' use of sparse expression to examine the role of β-CTF on spine loss could be a useful general tool for examining synapses in brain tissue.

      We thank the reviewer for these comments.

      Additional context that might help readers interpret or understand the significance of the work:

      The discovery of BACE1 stimulated an international effort to develop BACE1 inhibitors to treat Alzheimer's disease. BACE1 inhibitors block the formation of β-CTF which, in turn, prevents the formation of Aβ and other fragments. Unfortunately, BACE1 inhibitors not only did not improve cognition in patients with Alzheimer's disease, they appeared to worsen it, suggesting that producing β-CTF actually facilitates learning and memory. Therefore, it seems unlikely that the disruptive effects of β-CTF on endosomes plays a significant role in human disease. Insights from the authors that shed further light on this issue would be welcome.

      Response: We would like to express our gratitude to the reviewer for raising this question. It remains puzzling why BACE1 inhibition has failed to yield benefits in AD patients, while amyloid clearance via Aβ antibodies are able to slow down disease progression. One possible explanation is that pharmacological inhibition of BACE1 may not be as effective as its genetic removal. Indeed, genetic depletion of BACE1 leads to the clearance of existing amyloid plaques[3], whereas its pharmacological inhibition prevents the formation of new plaques but does not deplete the existing ones[4]. We think the negative results of BACE1 inhibitors in clinical trials may not be sufficient to rule out the potential contribution of β-CTF to AD pathogenesis. Given that cognitive function continues to deteriorate rapidly in plaque-free patients after 1.5 years of treatment with Aβ antibodies in phase three clinical studies[5], it is important to consider the potential role of other Aβ-related fragments in AD pathogenesis, such as β-CTF. We included further discussion in the revised manuscript (page 15 of the revised manuscript) to discusss this question.

      Reviewer #2 (Public Review):

      Summary:

      In this study, the authors investigate the potential role of other cleavage products of amyloid precursor protein (APP) in neurodegeneration. They combine in vitro and in vivo experiments, revealing that β-CTF, a product cleaved by BACE1, promotes synaptic loss independently of Aβ. Furthermore, they suggest that β-CTF may interact with Rab5, leading to endosomal dysfunction and contributing to the loss of synaptic proteins.

      We would like to thank the reviewer for his/her suggestions. We have addressed the specific comments in following sections.

      Weaknesses:

      Most experiments were conducted in vitro using overexpressed β-CTF. Additionally, the study does not elucidate the mechanisms by which β-CTF disrupts endosomal function and induces synaptic degeneration.

      We would like to thank the reviewer for this comment. While a significant portion of our experiments were conducted in vitro, the main findings were also confirmed in vivo (Figure 3 and 4). Repeating all the experiments in vivo would be challenging and may not be possible because of technical difficulties. Regarding the use of overexpressed β-CTF, we acknowledge that this represents a common limitation in neurodegenerative disease studies. These diseases progress slowly over decades in patients. To model this progression in cell or mouse models within a time frame feasible for research, overexpression of certain proteins is often inevitable. Since β-CTF levels are elevated in AD patients[6], its overexpression is not a irrelevant approach to investigate its potential effects.

      We did not further investigate the mechanisms by which β-CTF disrupted endosomal function because our preliminary results align with previous findings that could explain its mechanism. Kim et al. demonstrated that β-CTF recruits APPL1 (a Rab5 effector) via the YENPTY motif to Rab5 endosomes, where it stabilizes active GTP-Rab5, leading to pathologically accelerated endocytosis, endosome swelling and selectively impaired transport of Rab5 endosomes[6]. However, this paper did not show whether this Rab5 overactivation-induced endosomal dysfunction leads to any damages in synapses. In our study, we observed that co-expression of Rab5<sub>S34N</sub> with β-CTF effectively mitigated β-CTF-induced spine loss in hippocampal slice cultures (Figures 6L-M), indicating that Rab5 overactivation-induced endosomal dysfunction contributed to β-CTF-induced spine loss. We included further discussion in the revised manuscript to clarify this (page 15 of the revised manuscript).

      Reviewer #3 (Public Review):

      Summary:

      Most previous studies have focused on the contributions of Abeta and amyloid plaques in the neuronal degeneration associated with Alzheimer's disease, especially in the context of impaired synaptic transmission and plasticity which underlies the impaired cognitive functions, a hallmark in AD. But processes independent of Abeta and plaques are much less explored, and to some extent, the contributions of these processes are less well understood. Luo et all addressed this important question with an array of approaches, and their findings generally support the contribution of beta-CTF-dependent but non-Abeta-dependent process to the impaired synaptic properties in the neurons. Interestingly, the above process appears to operate in a cell-autonomous manner. This cell-autonomous effect of beta-CTF as reported here may facilitate our understanding of some potentially important cellular processes related to neurodegeneration. Although these findings are valuable, it is key to understand the probability of this process occurring in a more natural condition, such as when this process occurs in many neurons at the same time. This will put the authors' findings into a context for a better understanding of their contribution to either physiological or pathological processes, such as Alzheimer's. The experiments and results using the cell system are quite solid, but the in vivo results are incomplete and hence less convincing (see below). The mechanistic analysis is interesting but primitive and does not add much more weight to the significance. Hence, further efforts from the authors are required to clarify and solidify their results, in order to provide a complete picture and support for the authors' conclusions.

      We would like to thank the reviewer for the suggestions. We have addressed the specific comments in following sections.

      Strengths:

      (1) The authors have addressed an interesting and potentially important question

      (2) The analysis using the cell system is solid and provides strong support for the authors' major conclusions. This analysis has used various technical approaches to support the authors' conclusions from different aspects and most of these results are consistent with each other.

      We would like to thank the reviewer for these comments.

      Weaknesses:

      (1) The relevance of the authors' major findings to the pathology, especially the Abeta-dependent processes is less clear, and hence the importance of these findings may be limited.

      We would like to thank the reviewer for this question. Phase 3 clinical trial data from Aβ antibodies show that cognitive function continues to decline rapidly, even in plaque-free patients, after 1.5 years of treatment[5]. This suggests that plaque-independent mechanisms may drive AD progression. Therefore, it is crucial to consider the potential contributions of other Aβ species or related fragments, such as alternative forms of Aβ and β-CTF. While it is early to predict how much β-CTF contributes to AD progression, it is notable that β-CTF induced synaptic deficits in mice, which recapitulates a key pathological feature of AD. Ultimately, the contribution of β-CTF in AD pathogenesis can only be tested through clinical studies in the future.

      (2) In vivo analysis is incomplete, with certain caveats in the experimental procedures and some of the results need to be further explored to confirm the findings.

      We would like to thank the reviewer for this suggestion. We have corrected these caveats in the revised manuscript.

      (3) The mechanistic analysis is rather primitive and does not add further significance.

      We would like to thank the reviewer for this comment. We did not delve further into the underlying mechanisms because our analysis indicates that Rab5 overactivation-induced endosomal dysfunction underlies β-CTF-induced synaptic dysfunction, which is consistent with another study and has been addressed in our study[6]. We hope the reviewer could understand that our focus in this paper is on how β-CTF triggers synaptic deficits, which is why we did not investigate the mechanisms of β-CTF-induced endosomal dysfunction further.

      Recommendations for the authors:

      Reviewer #1 (Recommendations For The Authors):

      Suggestions for improved or additional experiments, data, or analyses:

      (1) In Figures 4H, 4J, 4K and Supplemental Figures 3C, 3E, and 3G, it was unclear whether a repeated measures 2-way ANOVA, rather than a 2-way ANOVA, followed by appropriate post-hoc analyses was used to strengthen the conclusion that there were significant effects in the behavioral tests.

      We appreciate the reviewer for raising this point and apologize for the lack of clear description in the manuscript. In those figures mentioned above, we use a repeated measures 2-way ANOVA to analyze the data by Graphpad Prism. In Figure 4H, fear conditioning tests were conducted. The same cohort of mice were used in the baseline, contextual and cued tests. Firstly, baseline freezing was tested; then these mice underwent tone and foot shock training, followed by contextual test and cued test. So, a repeated measures 2-way ANOVA is more appropriate for the experiment.

      In water T maze tests (Figure 4J and K), the same cohort of mice were trained and tested each day. So, it’s also appropriate to use a repeated measures 2-way ANOVA.

      In Supplementary figure 3C, 3E and 3G, OFT was conducted. In this experiment, the locomotion of the same cohort of mice were recorded. Also, it’s appropriate to use a repeated measures 2-way ANOVA.

      Clearer description for these experiments has been provided in the revised manuscript.

      (2) Including gender analyses would be helpful.

      The mice we used in this study were all males.

      Minor corrections to text and figures:

      (1) Quantitative analyses in Figures 5A-C, 5H, 6G, 6H, and Supplementary Figures 4 and 5C would be helpful.

      We have provided quantitative analysis of these results (Figure 5D, 5J, 6K, Supplementary figure 4D, 5F) mentioned above in the revised manuscript.

      (2) Percent correct (%) in Figures 4J and 4K should be labeled as 0, 50, and 100 instead of 0.0, 0.5, and 1.0.

      We would like to thank the reviewer for pointing out this. We have made corrections in the revised manuscript.

      Reviewer #2 (Recommendations For The Authors):

      In the study conducted by Luo et al, it was observed that the fragment of amyloid precursor protein (APP) cleaved by beta-site amyloid precursor protein cleaving enzyme 1 (BACE1), known as β-CTF, plays a crucial role in synaptic damage. The study found increasing expression of β-CTF in neurons could induce synapse loss both in vitro and in vivo, independent of Aβ. Mechanistically, they explored how β-CTF could interfere with the endosome system by interacting with RAB5. While this study is intriguing, there are several points that warrant further investigation:

      (1) The study involved overexpressing β-CTF in neurons. It would be valuable to know if the levels of β-CTF are similarly increased in Alzheimer's disease (AD) patients or AD mouse models.

      We would like to thank the reviewer for the suggestion. It’s reported β-CTF levels were significantly elevated in the AD cerebral cortex[6]. Most AD mouse models are human APP transgenic mouse models with elevated β-CTF levels[7].

      (2) The study noted that β-CTF in neurons is a membranal fragment, but the overexpressed β-CTF was not located in the membrane. It is important to ascertain whether the membranal β-CTF and cytoplasmic β-CTF lead to synapse loss in a similar manner.

      We apologize for not clearly explaining the localization of β-CTF in the original manuscript. β-CTF is produced from APP through β-cleavage, a process that occurs in organelles such as endo-lysosomes[8]. The overexpressed β-CTF is also primarily localized in the endo-lysosomal systems (Figure 5C and Supplementary figure 4C), similar to those generated by APP cleavage.

      (3) The study found a significant decrease in GluA1, a subunit of AMPA receptors, due to β-CTF. It would be beneficial to investigate whether there are systematic alterations in NMDA receptors, including GluN2A and GluN2B.

      We would like to express our gratitude to the reviewer for bringing up this question. The protein levels of GluN2A and GluN2B are also reduced in neurons expressing β-CTF (Figure 6E-F)

      (4) The study showed a significant decrease in the frequency of miniature excitatory postsynaptic currents (mEPSC), indicating disrupted presynaptic vesicle neurotransmitter release. It would be pertinent to test whether the expression level of the presynaptic SNARE complex, which is required for vesicle release, is altered by β-CTF.

      We would like to express our gratitude to the reviewer for bringing up this question. The protein level of the presynaptic SNARE complex, such as VAMP2, is also reduced in neurons expressing β-CTF (Figure 6E, G).

      (5) Since AMPA receptors are glutamate receptors, it is important to determine whether the ability of glutamate release is altered by β-CTF. In vivo studies using a glutamate sensor should be conducted to examine glutamate release.

      We would like to express our gratitude to the reviewer for this suggestion. It will be interesting to use glutamate sensors to assess the ability of glutamate release in the future.

      (6) The quality of immunostaining associated with Figures 4B and 4C was noted to be suboptimal.

      We apologize for the suboptimal quality of these images. The immunostaining in Figures 4B and 4C were captured using the stitching function of a confocal microscope to display larger areas, including the entire hemisphere and hippocampus. We have reprocessed the images to obtain higher-quality versions.

      (7) It would be insightful to investigate whether treatment with a BACE1 inhibitor in the study could reverse synaptic deficits mediated by β-CTF.

      We would like to thank the reviewer for this sggestion. In Figure 1I-M, we constructed an APP mutant (APP<sub>MV</sub>), which cannot be cleaved by BACE1 to produce β-CTF and Aβ but has no impact on β’-cleavage. When co-expressed with BACE1, APP<sub>MV</sub> failed to induce spine loss, supporting the effect of β-CTF. We think these results domonstrate that β-CTF underlies the synaptic deficits. It would be interesting to test the effects of BACE1 inhibition in the future.

      (8) Considering the potential implications for therapeutics, it is worth exploring whether extremely low levels of β-CTF have beneficial effects in regulating synaptic function or promoting synaptogenesis at a physiological level.

      We would like to thank the reviewer for raising this question. We found that when the plasmid amount was reduced to 1/8 of the original dose, β-CTF no longer induced a decrease in dendritic spine density (Supplementary figure 2E-F). It’s reported APP-Swedish mutation in familial AD increased synapse numbers and synaptic transmission, whereas inhibition of BACE1 lowered synapse numbers, suppressed synaptic transmission in wild type neurons, suggesting that at physiological level, β-CTF might be synaptogenic[9].

      (9) The molecular mechanism through which β-CTF interferes with Rab5 function should be elucidated.

      We would like to thank the reviewer for raising this question. Kim et al have elucidated the mechanism through which β-CTF interferes with Rab5 function. β-CTF recruited APPL1 (a Rab5 effector) via YENPTY motif to Rab5 endosomes, where it stabilizes active GTP-Rab5, leading to pathologically accelerated endocytosis, endosome swelling and selectively impaired transport of Rab5 endosomes[6]. We have included additional discussion for this question in the revised manuscript (page 15 of the revised manuscript).

      (10) The study could compare the role of β-CTF and Aβ in neurodegeneration in AD mouse models.

      We would like to thank the reviewer for raising this point. While it is easier to dissect the role of Aβ and β-CTF in vitro, some of the critical tools are not applicabe in vivo, such as γ-secretase inhibitors, which lead to severe side effects because of their inhibition on other γ substrates[1, 2]. Therefore it will be difficult to deomonstrate their different roles in vivo. There are studies showing that β-CTF accumulation precedes Aβ deposition in model mice and mediates Aβ independent intracellular pathologies[10, 11], consistent with our results.

      (11) Based on the findings, it would be valuable to discuss possible explanations for the failure of most BACE1 inhibitors in recent clinical trials for humans.

      Response: We would like to express our gratitude to the reviewer for raising this recommendation. It is a big puzzle why BACE1 inhibition failed to provide beneficial effects in AD patients whereas clearance of amyloid by Aβ antibodies could slow down the AD progress. One potential answer is that pharmacological inhibition of BACE1 might be not as effective as its genetic removal. Indeed, genetic depletion of BACE1 leads to clearance of existing amyloid plaques[3], whereas pharmacological inhibition of BACE1 could not stop growth of existing plaques, although it prevents formation of new plaques[4]. The negative result of BACE1 inhibitors might not be sufficient to exclude the possibility that β-CTF could also contribute to the AD pathogenesis. We have included additional discussion for this question in the revised manuscript (page 15 of the revised manuscript).

      Reviewer #3 (Recommendations For The Authors):

      Major:

      (1) The cell experiments were performed at DIV 9, do the authors know whether at this age, the neurons are still developing and spine density has not reached a pleated yet? If so, the observed effect may reflect the impact on development and/or maturation, rather than on the mature neurons. The authors should be more specific about this issue.

      We would like to thank the reviewer for pointing out this question. These slice cultures were made from 1-week-old rats. DIV 9 is about two weeks old. These neurons are still developing and spine density has not reached a plateau yet[12]. In addition, we also investigated the effects of β-CTF on the synapses of mature neurons in two-month-old mice (Figure 3). So we think the observed effect reflects the impact on both immature and mature neurons.

      (2) mEPSCs shown in Figure 3D were of small amplitudes, perhaps also indicating that these synapses are not yet mature.

      In Figure 3D, the mEPSC results were obtained from pyramidal neurons in the CA1 region of two-month-old mice. At the age of two months, neurotransmitter levels and synaptic density have reached adult levels[13].

      (3) There was no data on the spine density or mEPSCs in the mice OE b-CTF, hence it is unclear whether a primary impact of this manipulation (b-CTF effect) on the synaptic transmission still occurs in vivo.

      In Figure 3, we examined the density of dendritic spines and mEPSCs from CA1 pyramidal neurons infected with lentivirus expressing β-CTF in mice and showed that those neurons expressing additional amount of β-CTF exhibited lower spine density and less mEPSCs, supporting that β-CTF also damaged synaptic transmission in vivo.

      (4) OE of b-CTF should lead to the production of Abeta, although this may not lead to the formation of significant plaques. How do the authors know whether their findings on behavioral and cognitive impairments were not largely mediated by Abeta, which has been widely reported by previous studies?

      We would like to thank the reviewer for pointing out this question. Indeed, our in vivo data could not exclude the potential involvement of Aβ in the pathology, despite the absence of amyloid plaque formation. It will be difficult to demonstrate this question in vivo because of the severe side effects from γ inhibition.

      (5) Figure 4H, the freezing level in the cued fear conditioning was very high, likely saturated; this may mask a potential reduction in the b-CTF OE mice (there is a hint for that in the results). The authors should repeat the experiments using less strong footshock strength (hence resulting in less freezing, <70%).

      We would like to express our gratitude to the reviewer for bringing up this question. The contextual fear conditioning test assesses hippocampal function, while the cued fear conditioning test assesses amygdala function. We hope the reviewer understands that our primary goal is to assess hippocampus-related functions in this experiment and we did see a significant difference between GFP and β-CTF groups. Therefore, we think the intensity of footshock we used was suitable to serve the primary purpose of this experiment.

      (6) Why was the deficit in the Morris water maze in the b-CTF OE mice only significant in the training phase?

      We would like to thank the reviewer for rasing this question and apologize for not describing the test clearly. This is a water T maze test, not Morris water maze test.

      To make the behavioral paradigm of the water T maze test easier to understand, we have provided a more detailed description of the methods in the new version of the manuscript.

      The acquisition phase of the Water T Maze (WTM) evaluates spatial learning and memory, where mice use spatial cues in the environment to navigate to a hidden platform and escape from water, while the reversal learning measures cognitive flexibility in which mice must learn a new location of the hidden platform[14]. In reversal learning task (Figure 4J-K), the learning curves of the two groups of mice did not show any significant differences, indicating that the expression of β-CTF only damages spatial learning and memory but not cognitive flexibility. This is consistent with a previous report using APP/PS1 mice[15].

      (7) Will the altered Rab5 in the b-CTF OE condition also affect the level of other proteins?

      We would like to express our gratitude to the reviewer for raising this interesting question.  Expression of Rab5<sub>S34N</sub> in β-CTF-expressing neurons did not alter the levels of synapse-related proteins that were reduced in these neurons (Supplementary figure 5G-H), suggesting Rab5 overactivation did not contribute to these protein expression changes induced by β-CTF.

      (8) How do the authors reconcile their findings with the well-established findings that Abeta affects synaptic transmission and spine density? Do they think these two processes may occur simultaneously in the neurons, or, one process may dominate in the other?

      APP, Aβ, and presenilins have been extensively studied in mouse models, providing convincing evidence that high Aβ concentrations are toxic to synapses[16]. Moreover, addition of Aβ to murine cultured neurons or brain slices is toxic to synapses[17]. However, Aβ-induced synaptotoxicity was not observed in our study. A major difference between our study and others is that our study used a isolated expression system that apply Aβ only to individual neurons surrounded by neurons without excessive amount of Aβ, whereas the rest studies generally apply Aβ to all the neurons. Therefore, we predict that Aβ does not lead to synaptic deficits from individual neurons in cell autonomous manners, whereas β-CTF does. Aβ and β-CTF represent two parallel pathways of action. Additional discussion for this question has been included in the revised manuscript (page 14 of the revised manuscript).

      Minor:

      Fig 2F-G, "prevent" rather than "reverse"?

      We would like to thank the reviewer for pointing this out. We have made corrections in the revised manuscript.

      Reference:

      (1) GüNER G, LICHTENTHALER S F. The substrate repertoire of γ-secretase/presenilin [J]. Seminars in cell & developmental biology, 2020, 105: 27-42.

      (2) DOODY R S, RAMAN R, FARLOW M, et al. A phase 3 trial of semagacestat for treatment of Alzheimer's disease [J]. The New England journal of medicine, 2013, 369(4): 341-50.

      (3) HU X, DAS B, HOU H, et al. BACE1 deletion in the adult mouse reverses preformed amyloid deposition and improves cognitive functions [J]. The Journal of experimental medicine, 2018, 215(3): 927-40.

      (4) PETERS F, SALIHOGLU H, RODRIGUES E, et al. BACE1 inhibition more effectively suppresses initiation than progression of β-amyloid pathology [J]. Acta neuropathologica, 2018, 135(5): 695-710.

      (5) SIMS J R, ZIMMER J A, EVANS C D, et al. Donanemab in Early Symptomatic Alzheimer Disease: The TRAILBLAZER-ALZ 2 Randomized Clinical Trial [J]. Jama, 2023, 330(6): 512-27.

      (6) KIM S, SATO Y, MOHAN P S, et al. Evidence that the rab5 effector APPL1 mediates APP-βCTF-induced dysfunction of endosomes in Down syndrome and Alzheimer's disease [J]. Molecular psychiatry, 2016, 21(5): 707-16.

      (7) MONDRAGóN-RODRíGUEZ S, GU N, MANSEAU F, et al. Alzheimer's Transgenic Model Is Characterized by Very Early Brain Network Alterations and β-CTF Fragment Accumulation: Reversal by β-Secretase Inhibition [J]. Frontiers in cellular neuroscience, 2018, 12: 121.

      (8) ZHANG X, SONG W. The role of APP and BACE1 trafficking in APP processing and amyloid-β generation [J]. Alzheimer's research & therapy, 2013, 5(5): 46.

      (9) ZHOU B, LU J G, SIDDU A, et al. Synaptogenic effect of APP-Swedish mutation in familial Alzheimer's disease [J]. Science translational medicine, 2022, 14(667): eabn9380.

      (10) LAURITZEN I, PARDOSSI-PIQUARD R, BAUER C, et al. The β-secretase-derived C-terminal fragment of βAPP, C99, but not Aβ, is a key contributor to early intraneuronal lesions in triple-transgenic mouse hippocampus [J]. The Journal of neuroscience : the official journal of the Society for Neuroscience, 2012, 32(46): 16243-1655a.

      (11) KAUR G, PAWLIK M, GANDY S E, et al. Lysosomal dysfunction in the brain of a mouse model with intraneuronal accumulation of carboxyl terminal fragments of the amyloid precursor protein [J]. Molecular psychiatry, 2017, 22(7): 981-9.

      (12) HARRIS K M, JENSEN F E, TSAO B. Three-dimensional structure of dendritic spines and synapses in rat hippocampus (CA1) at postnatal day 15 and adult ages: implications for the maturation of synaptic physiology and long-term potentiation [J]. The Journal of neuroscience : the official journal of the Society for Neuroscience, 1992, 12(7): 2685-705.

      (13) SEMPLE B D, BLOMGREN K, GIMLIN K, et al. Brain development in rodents and humans: Identifying benchmarks of maturation and vulnerability to injury across species [J]. Progress in neurobiology, 2013, 106-107: 1-16.

      (14) GUARIGLIA S R, CHADMAN K K. Water T-maze: a useful assay for determination of repetitive behaviors in mice [J]. Journal of neuroscience methods, 2013, 220(1): 24-9.

      (15) ZOU C, MIFFLIN L, HU Z, et al. Reduction of mNAT1/hNAT2 Contributes to Cerebral Endothelial Necroptosis and Aβ Accumulation in Alzheimer's Disease [J]. Cell reports, 2020, 33(10): 108447.

      (16) CHAPMAN P F, WHITE G L, JONES M W, et al. Impaired synaptic plasticity and learning in aged amyloid precursor protein transgenic mice [J]. Nature neuroscience, 1999, 2(3): 271-6.

      (17) WANG Z, JACKSON R J, HONG W, et al. Human Brain-Derived Aβ Oligomers Bind to Synapses and Disrupt Synaptic Activity in a Manner That Requires APP [J]. The Journal of neuroscience : the official journal of the Society for Neuroscience, 2017, 37(49): 11947-66.

    1. eLife Assessment

      NAD deficiency perturbs embryonic development resulting in multiple congenital malformations, collectively termed Congenital NAD Deficiency Disorder (CNDD). The authors report fundamental findings demonstrating that extra-embryonic visceral yolk sac endoderm is critical for NAD de novo synthesis during early organogenesis and perturbations of this pathway may underlie CNDD. The authors combine gene expression with metabolic assays to provide solid evidence of an essential role of the extra-embryonic visceral yolk sac in both mouse and human embryos.

    2. Reviewer #1 (Public review):

      Summary:

      This study investigated the mechanism underlying Congenital NAD Deficiency Disorder (CNDD) using a mouse model with loss of function of the HAAO enzyme which mediates a key step in the NAD de novo synthesis pathway. This study builds on the observation that the kynurenine pathway is required in the conceptus, as HAAO null embryos are sensitive to maternal deficiency of NAD precursors (vitamin B3) and tryptophan, and narrows the window of sensitivity to a 3 day period.

      An important finding is that de novo NAD synthesis occurs in an extra-embryonic tissue, the visceral yolk sac, before the liver develops in the embryo. It is suggested that lack of this yolk sac activity leads to impaired NAD supply in the embryo leading to structural abnormalities found later in development.

      Strengths:

      Previous studies show a requirement for HAOO activity for normal development of the embryos develop abnormalities under conditions of maternal vitamin B3 deficiency, indicating a requirement for NAD synthesis in the conceptus. Analysis of scRNA-seq datasets combined with metabolite analysis of yolk sac tissue shows that the NAD synthesis pathway is expressed and functional in the yolk sac from E10.5 onwards (prior to liver development).

      HAOO enzyme assay enabled quantification of enzyme activity in relevant tissues including liver (from E12.5), embryo, placenta and yolk sac (from E11.5).<br /> Comprehensive metabolite analysis of the NAD synthesis pathway supports the predicted effects of HAOO knockout and provides analysis of yolk sac, placenta and embryo at a series of stages.

      The dietary study (with lower vitamin B3 in maternal diet from E7.5-10.5) is an incremental addition to previous studies which imposed similar restrictions from E7.5-12.5. Nevertheless, this emphasises the importance of the synthesis pathway on the conceptus at stages before liver activity is prominent.

      Weaknesses:

      The current dietary study narrows the period when deficiency can cause malformations (analysed at E18.5), and altered metabolite profiles (eg, increased 3HAA, lower NAD) are detected in yolk sac and embryo at E10.5.

      More importantly, there is still a question of whether in addition to the yolks sac, there is HAAO activity within the embryo itself has been assayed as early as E11.5, with minimal activity prior to E12.5 (when it is assayed in liver). These findings support the hypothesis that within the conceptus (embryo, chorioallantoic placenta and visceral yok sac) the embryo is unlikely to be the site of NAD synthesis prior to liver development.

      Evidence for lack of function of the NAD synthesis pathway in the embryos itself from kynurenine at E7.5-10.5 comes from reanalysis of scRNA-seq. This suggests low or absent expression of HAAO in the embryo prior to E10.5 (corresponding to the period when the authors have demonstrated that de novo NAD synthesis in the conceptus is needed). The caveat to this conclusion is that additional analysis of RNA and/or protein expression in the embryos at E7.5-10.5 has not been performed to validate the scRNA-seq data.

    3. Reviewer #2 (Public review):

      Summary:

      Disruption of the nicotinamide adenine dinucleotide (NAD) de novo Synthesis Pathway, by which L-tryptophan is converted to NAD results in multi-organ malformations which collectively has been termed Congenital NAD Deficiency Disorder (CNDD).

      While NAD de novo synthesis is primarily active in the liver postnatally, the site of activity prior to and during organogenesis is unknown. However, mouse embryos are susceptible to CNDD between E7.5-E12.5, before the embryo has developed a functional liver. Therefore, NAD de novo synthesis is likely active in another cell or tissue during this time window of susceptibility.

      The body of work presented in this paper continues the corresponding author's labs investigation of the cause and effects of NAD Deficiency and the primary goal was to determine the cell or tissue responsible for NAD de novo synthesis during early embryogenesis.

      The authors conclude that visceral yolk sac endoderm is the source of NAD de novo synthesis, which is essential for mouse embryonic development, and furthermore that the dynamics of NAD synthesis are conserved in human equivalent cells and tissues, the perturbation of which results in CNDD.

      Strengths:

      Overall, the primary findings regarding the source of NAD synthesis, the temporal requirement and conservation between rodent and human species is quite novel and important for our understanding of NAD synthesis and function and role in CNDD.

      The authors used UHPLC-MS/MS to quantify NAD+ and NAD-related metabolites and showed convincingly that the NAD salvage pathway can compensate for the loss of NAD synthesis in Haao-/- embryos, then determined that Haao activity was present in the yolk sac prior to hepatic development identifying this organ as the site of de novo NAD synthesis. Dietary modulation between E7.5-10.5 was sufficient to induce CNDD phenotypes, narrowing the window of susceptibility, and then re-analysis of RNA-seq datasets suggested the endoderm was the cell source of NAD synthesis.

      Weaknesses:

      Page 4 and Table S4. The descriptors for malformations of organs such as the kidney and vertebrae are quite vague and uninformative. More specific details are required to convey the type and range of anomalies observed as a consequence of NAD deficiency.

      Can the authors define whether the role for the NAD pathway in a couple of tissue or organ systems is the same. By this I mean is the molecular or cellular effect of NAD deficiency the same in the vertebrae and organs such as the kidney. What unifies the effects on these specific tissues and organs and are all tissues and organs affected. If some are not, can the authors explain why they escape the need for the NAD pathway.

      Page 5 and Figure 6C. The expectation and conclusion for whether specific genes are expressed in particular cell types in scRNA-seq datasets depends on number of cells sequenced, the technology (methodology) used, the depth of sequencing and also the resolution of the analysis. It is therefore essential to perform secondary validation of the analysis of scRNA-seq data. At a minimum, the authors should perform in situ hybridization or immunostaining for Tdo2, Afmid, Kmo, Kynu, Haao, Qprt and Nadsyn1 or some combination thereof at multiple time points during early mouse embryogenesis to truly understand the spatiotemporal dynamics of expression and NAD synthesis.

      Absolute functional proof of the yolk sac endoderm as being essential and required for NAD synthesis in the context of CNDD might require conditional deletion of Haao in the yolk sac versus embryo using appropriate Cre driver lines or in the absence of a conditional allele, could be performed by tetraploid embryo-ES cell complementation approaches. But temporal dietary intervention can also approximate the same thing by perturbing NAD synthesis then the yolk sac is the primary source versus when the liver becomes the primary source in the embryo.

      In further revisions, the authors have added data to Supp Table 4 and Supplemental Figures 1 and 2

      Although the authors did not perform in situ hybridization for some of the genes requested to define the critical cell type of expression, available scRNA-sequencing suggests the yolk sac endoderm are the only likely source of NAD synthesis prior to its synthesis in the liver. Absolute functional proof of the yolk sac endoderm as being essential and required for NAD synthesis in the context of CNDD still requires validation but nonetheless it seems likely given the absence of a functional liver in embryos prior to E12.5. The authors provided some additional data pertaining to the type of kidney and vertebral anomalies observed which makes this data more complete.

    4. Author response:

      The following is the authors’ response to the current reviews.

      Recommendations for the authors:

      Reviewer #1 (Recommendations for the authors):

      A number of modifications/additions have been made to the text which help to clarify the background and details of the study and I feel have improved the study.

      NAD deficiency induced using the dietary/Haao null model showed a window of susceptibility at E7.5-10.5. Further, HAAO enymze activity data has been added at E11.5 and the minimal HAAO activity in the embryo act E11.5 supports the hypothesis that the NAD synthesis pathway from kynurenine is not functional until the liver starts to develop.

      The caveat to this is that absence of expression/activity in embryonic cells at E7.5-10/5 relies on previous scRNA-seq data. Both reviewers commented that analysis of RNA and/or protein expression at these stages (E7.5-10.5) would be necessary to rule this out, and would strongly support the conclusions regarding the necessity for yolk sac activity.

      There are a number of antibodies for HAAO, KNYU etc so it is surprising if none of these are specific for the mouse proteins, while an alternative approach in situ hydridisation would also be possible.

      We have tested 2 anti-HAAO antibodies, 2 anti-KYNU antibodies and 1 anti-QPRT antibody on adult liver and various embryonic tissues.

      Given that all tested antibodies only detected a specific band in tissues with very high expression and abundant target protein levels (adult liver), they were determined to be unsuitable to conclusively prove that these proteins of the NAD _de novo_synthesis pathway are absent in embryos prior to the development of a functional liver. They were also unsuitable for IHC experiments to determine which cell types (if any) have these proteins.

      The antibodies, tested assays and samples, and the results obtained were as follows:

      Anti-HAAO antibody (ab106436, Abcam, UK) 

      • Was tested in western blots of liver, E11.5-E14.5 yolk sac, E14.5 placenta, and E14.5 and E16.5 embryonic liver lysates from wild-type (WT) and Haao-/- mice. The target band (32.5 KD) was visible in the WT liver samples and absent in_Haao_-/- livers, and faintly visible in E11.5-E14.5 WT yolk sac, with intensity gradually increasing in E12.5 and E13.5 WT yolk sac. Multiple strong non-specific bands occurred in all samples, requiring cutting off the >50 KD area of the blots.

      • Was re-tested in western blots comparing WT, Haao-/-, and Kynu-/- E9.5-E11.5 embryo, E9.5 yolk sac, and adult liver tissues. It detected the target band faintly only in WT and Kynu-/- liver lysates. No target band could be resolved in E9.5 yolk sac or embryo lysates. Due to the low sensitivity of the antibody, it is unsuitable to conclusively determine whether HAAO is present or absent in E9.5 yolk sacs and E9.5-E11.5 embryos.

      • Was tested in IHC with DAB and IF, producing non-specific staining on both WT and Haao-/- liver and kidney tissue. 

      Anti-HAAO antibody (NBP1-77361, Novus Biologicals, LLC, CO, USA)

      • Was tested in western blots and detected a very faint target band in WT liver lysate that was absent in Haao-/- lysate, with stronger non-specific bands occurring in both genotypes.

      • Was tested in IHC with DAB, producing non-specific staining on both WT and Haao-/- liver and kidney tissue 

      Anti-L-Kynurenine Hydrolase antibody (11796-1-AP, Proteintech Group, IL, USA)

      • Was tested in western blots and detected a faint target band (52 KD) in E11.5, E12.5 E13.5, and E14.5 yolk sac lysates. Detected a weak band in E14.5 liver, a stronger band in E16.5 liver, but not in E14.5 placenta. The target band was only resolved with normal ECL substrate and extended exposure when the >75 KD part of the blot was cut off. 

      • Was re-tested in western blots comparing WT, Haao-/-, and Kynu-/- E9.5-E11.5 embryo, E9.5 yolk sac, and adult liver tissues. It detected the target band only in WT and Haao-/- liver lysates, requiring Ultra Sensitive Substrate. No target band could be resolved in yolk sac or embryo lysates of any genotype.

      Anti-L-Kynurenine Hydrolase antibody (ab236980, Abcam, UK)

      • Was tested in western blots and detected a very faint target band (52 KD) in WT liver lysates and no band in Kynu-/- liver lysates. Multiple non-specific bands occurred irrespective of the Kynu genotype of the lysate.

      • Was tested in IHC with DAB and IF, producing non-specific staining on both WT and Kynu-/- liver and kidney tissue 

      Anti-QPRT (orb317756, Biorbyt, NC, USA)

      • Was tested in western blots and detected a faint target band (31 KD) with multiple other bands between 25-75 KD and an extremely strong band around 150 KD on WT liver lysates.

      The following is the authors’ response to the original reviews.

      Reviewer 1 Public Review:

      The current dietary study narrows the period when deficiency can cause malformations (analysed at E18.5), and altered metabolite profiles (eg, increased 3HAA, lower NAD) are detected in the yolk sac and embryo at E10.5. However, without analysis of embryos at later stages in this experiment it is not known how long is needed for NAD synthesis to be recovered - and therefore until when the period of exposure to insufficient NAD lasts. This information would inform the understanding of the developmental origin of the observed defects.

      Our previous published work (Cuny et al 2023 https://doi.org/10.1242/dmm.049647) indicates that the timing of NAD de novo synthesis pathway precursor availability and consequently the timing of NAD deficiency during organogenesis drives which organs are affected in their development. Furthermore, experimental data of another project (manuscript submitted) shows that mouse embryos (from mothers on an NAD precursor restricted diet that induces CNDD) were NAD deficient at E9.5 and E11.5, but embryo NAD levels were fully recovered at E14.5 when compared to same-stage embryos from mothers on precursor-sufficient diet. This was observed irrespective of the embryos’ Haao genotype. In the current study, NAD precursor provision was only restricted until E10.5. Thus, we expect that our embryos phenotyped at E18.5 had recovered their NAD levels back to normal by E14.5 at the latest.  More research, beyond the scope of the current manuscript, is required to spatio-temporally link embryonic NAD deficiency to the occurrence of specific defect types and elucidate the mechanistic origin of the defects. To acknowledge this, we updated the respective Discussion paragraph on page 7 and added the following statement: “This observation supports our hypothesis that the timing of NAD deficiency during organogenesis determines which organs/tissues are affected (Cuny et al., 2023), but more research is needed to fully characterise the onset and duration of embryonic NAD deficiency in dietary NAD precursor restriction mouse models.”

      More importantly, there is still a question of whether in addition to the yolk sac, there is HAAO activity within the embryo itself prior to E12.5 (when it has first been assayed in the liver - Figure 1C). The prediction is that within the conceptus (embryo, chorioallantoic placenta, and visceral yok sac) the embryo is unlikely to be the site of NAD synthesis prior to liver development. Reanalysis of scRNA-seq (Fig 1B) shows expression of all the enzymes of the kynurenine pathway from E9.5 onwards. However, the expression of another available dataset at E10.5 (Fig S3) suggested that expression is 'negligible'. While the expression in Figure 1B, Figure S1 is weak this creates a lack of clarity about the possible expression of HAAO in the hepatocyte lineage, or especially elsewhere in the embryo prior to E10.5 (corresponding to the period when the authors have demonstrated that de novo NAD synthesis in the conceptus is needed). Given these questions, a direct analysis of RNA and/or protein expression in the embryos at E7.5-10.5 would be helpful. 

      We now have included additional data showing that whole embryos at E11.5 and embryos with their livers removed at E14.5 have negligible HAAO enzyme activity. The observed lack of HAAO activity in the embryo at E11.5 is consistent with the absence of a functional embryonic liver at that stage. Thus, it confirms that the embryo is dependent of extraembryonic tissues (the yolk sac) for NAD de novo synthesis prior to E12.5. The additional datasets are now included in Supplementary Table S1 and as Supplementary Figure 2. The Results section on page 2 has been updated to refer to these datasets.

      Reviewer #2 (Public Review): 

      Page 4 and Table S4. The descriptors for malformations of organs such as the kidney and vertebrae are quite vague and uninformative. More specific details are required to convey the type and range of anomalies observed as a consequence of NAD deficiency. 

      We now provide more information about the malformation types in the Results on page 4. Also, Table S4 now defines the missing vertebral, sternum, and kidney descriptors.

      Can the authors define whether the role of the NAD pathway in a couple of tissue or organ systems is the same? By this I mean is the molecular or cellular effect of NAD deficiency is the same in the vertebrae and organs such as the kidney. What unifies the effects on these specific tissues and organs and are all tissues and organs affected? If some are not, can the authors explain why they escape the need for the NAD pathway? 

      This is a good comment, highlighting that further research, beyond the scope of this manuscript, is needed to better understand the underlying mechanisms of CNDD causation. We have expanded the Discussion paragraph “NAD deficiency in early organogenesis is sufficient to cause CNDD” to indicate that while the timing of NAD deficiency during embryogenesis explains variability in phenotypes among the CNDD spectrum, it is unknown why other organs/tissues are seemingly not affected by NAD deficiency.

      To answer the reviewer’s questions and elucidate the underlying cellular and molecular processes in individual organs affected by NAD deficiency, a multiomic approach is required. This is because NAD is involved in hundreds of molecular and cellular processes affecting gene expression, protein levels, metabolism, etc. For details of NAD functions that have relevance to embryogenesis, the reviewer may refer to our recent review article (Dunwoodie et al 2023 https://doi.org/10.1089/ars.2023.0349). 

      Page 5 and Figure 6C. The expectation and conclusion for whether specific genes are expressed in particular cell types in scRNA-seq datasets depend on the number of cells sequenced, the technology (methodology) used, the depth of sequencing, and also the resolution of the analysis. It is therefore essential to perform secondary validation of the analysis of scRNA-seq data. At a minimum, the authors should perform in situ hybridization or immunostaining for Tdo2, Afmid, Kmo, Kynu, Haao, Qprt, and Nadsyn1 or some combination thereof at multiple time points during early mouse embryogenesis to truly understand the spatiotemporal dynamics of expression and NAD synthesis. 

      We have tested antibodies against HAAO, KYNU, and QPRT in adult mouse liver samples (the main site of NAD de novo synthesis) but these produced non-specific bands in western blotting experiments. Therefore, immunostaining studies on embryonic tissues were not feasible. 

      However, we agree that histological methods such as in situ hybridisation would provide secondary validation of the exact cell types that express these genes. To acknowledge this, we have updated a sentence on page 5 referring to the data shown in Figure 6C as follows: “While histological methods such as in situ hybridisation would be required to confirm the exact cell types expressing these genes, the available expression data indicates that the genes encoding those enzymes required to convert L-kynurenine to NAD (kynurenine pathway) are exclusively expressed in the yolk sac endoderm lineage from the onset of organogenesis (E8.0-8.5).”

      Absolute functional proof of the yolk sac endoderm as being essential and required for NAD synthesis in the context of CNDD might require conditional deletion of Haao in the yolk sac versus embryo using appropriate Cre driver lines or in the absence of a conditional allele, could be performed by tetraploid embryo-ES cell complementation approaches. But temporal dietary intervention can also approximate the same thing by perturbing NAD synthesis Shen the yolk sac is the primary source versus when the liver becomes the primary source in the embryo. 

      Reviewer 1 has made a similar comment about confirming that indeed NAD de novo synthesis activity is limited to extraembryonic tissues (=yolk sacs) and absent in the embryo prior to development of an embryonic liver. We now have included additional data showing that whole embryos at E11.5 and embryos with their livers removed at E14.5 have negligible HAAO enzyme activity. The observed lack of HAAO activity in the embryo at E11.5 is consistent with the absence of a functional embryonic liver at that stage. We think this provides enough proof that the embryo is dependent of extraembryonic tissues (the yolk sac) for NAD de novo synthesis prior to E12.5. The additional datasets are now included in Supplementary Table S1 and as Supplementary Figure 2. The Results section on page 2 has been updated to refer to these data.

      Reviewer #1 (Recommendations For The Authors): 

      (1) Introduction (page 1) introduces mouse models with defects in the kynurenine pathway "confirming that NAD de novo synthesis is required during embryogenesis ...". This requirement is revealed by the imposition of maternal dietary deficiency and more detail (or a more clear link to the following sentences) here would help the reader who is not familiar with the previous papers using the HAAO mice and dietary modulation.

      We have updated this paragraph in the Introduction to better indicate that the requirement of NAD de novo synthesis for embryogenesis was confirmed in mouse models by modulating the maternal dietary NAD precursor provision during pregnancy.

      (2) Discussion - throughout the introduction and results the authors refer to the NAD de novo synthesis pathway, with the study focussing on the effects of HAAO loss of function. Data implies that the kynurenine pathway is active in the yolk sac but whether de novo synthesis from L-tryptophan occurs has not been addressed. The first sub-heading of the discussion could be more accurate referring to the kynurenine pathway, or synthesis from kynurenine. 

      We agree that our manuscript needed to make better distinction between NAD de novo synthesis starting from kynurenine and starting from tryptophan. We removed “from Ltryptophan” from the sub-heading in the Discussion and clarified in this paragraph which genes are required to convert tryptophan to kynurenine and which genes to convert kynurenine to NAD. We also updated two Results paragraphs (page 2, 2nd paragraph; page 5, 5th paragraph) to improve clarity.

      It is worth noting that our statement in the Discussion “this is the first demonstration of NAD de novo synthesis occurring in a tissue outside of the liver and kidney.” is valid because vascular smooth muscle cells express Tdo2 and in combination with the other requisite genes expressed in endoderm cells, the yolk sac has the capability to synthesise NAD de novo from L-tryptophan.

      (3) Outlook - While this section is designed to be looking ahead to the potential implications of the work, the last section on gene therapy of the yolk sac seems far removed from the paper content and highly speculative. I feel this could detract from the main points of the study and could be removed. 

      We have updated the Outlook paragraph and shortened the final part to “Further research is required to better understand the mechanisms of CNDD causation and of other causes of adverse pregnancy outcomes involving the yolk sac.”

      (4) In Figure 2D it would be useful to label the clusters as the colours in the legend are difficult to match to the heatmap. 

      We now have labelled the clusters with lowercase letters above the heatmap to make it easier to match the clusters in Figure 2D to the colours used for designating tissues and genotypes. These labels are described in the figure’s key and the figure legend.  

      Reviewer #2 (Recommendations For The Authors): 

      Page 4 and Table S4. The descriptors for malformations of organs such as the kidney and vertebrae are quite vague and uninformative. More specific details are required to convey the type and range of anomalies observed as a consequence of NAD deficiency. 

      We now provide more information about the malformation types in the Results on page 4. Also, Table S4 now defines the missing vertebral, sternum, and kidney descriptors.

      Can the authors define whether the role of the NAD pathway in a couple of tissue or organ systems is the same? By this I mean is the molecular or cellular effect of NAD deficiency is the same in the vertebrae and organs such as the kidney. What unifies the effects on these specific tissues and organs and are all tissues and organs affected? If some are not, can the authors explain why they escape the need for the NAD pathway? 

      This is a good comment, highlighting that further research, beyond the scope of this manuscript, is needed to better understand the underlying mechanisms of CNDD causation. We have expanded the Discussion paragraph “NAD deficiency in early organogenesis is sufficient to cause CNDD” to indicate that while the timing of NAD deficiency during embryogenesis explains variability in phenotypes among the CNDD spectrum, it is unknown why other organs/tissues are seemingly not affected by NAD deficiency.

      To answer the reviewer’s questions and elucidate the underlying cellular and molecular processes in individual organs affected by NAD deficiency, a multiomic approach is required. This is because NAD is involved in hundreds of molecular and cellular processes affecting gene expression, protein levels, metabolism, etc. For details of NAD functions that have relevance to embryogenesis, the reviewer may refer to our recent review article (Dunwoodie et al 2023 https://doi.org/10.1089/ars.2023.0349). 

      Page 5 and Figure 6C. The expectation and conclusion for whether specific genes are expressed in particular cell types in scRNA-seq datasets depend on the number of cells sequenced, the technology (methodology) used, the depth of sequencing, and also the resolution of the analysis. It is therefore essential to perform secondary validation of the analysis of scRNA-seq data. At a minimum, the authors should perform in situ hybridization or immunostaining for Tdo2, Afmid, Kmo, Kynu, Haao, Qprt, and Nadsyn1 or some combination thereof at multiple time points during early mouse embryogenesis to truly understand the spatiotemporal dynamics of expression and NAD synthesis. 

      We have tested antibodies against HAAO, KYNU, and QPRT in adult mouse liver samples (the main site of NAD de novo synthesis) but these produced non-specific bands in western blotting experiments. Therefore, immunostaining studies on embryonic tissues were not feasible. 

      However, we agree that histological methods such as in situ hybridisation would provide secondary validation of the exact cell types that express these genes. To acknowledge this, we have updated a sentence on page 5 referring to the data shown in Figure 6C as follows: “While histological methods such as in situ hybridisation would be required to confirm the exact cell types expressing these genes, the available expression data indicates that the genes encoding those enzymes required to convert L-kynurenine to NAD (kynurenine pathway) are exclusively expressed in the yolk sac endoderm lineage from the onset of organogenesis (E8.0-8.5).”

    1. eLife Assessment

      This manuscript addresses the role of alpha oscillations in sensory gain control. The authors use an attention-cuing task in an initial EEG study followed by a separate MEG replication study to demonstrate that whilst (occipital) alpha oscillations are increased when anticipating an auditory target, so is visual responsiveness as assessed with frequency tagging. The authors propose their results demonstrate a general vigilance effect on sensory processing and offer a re-interpretation of the inhibitory role of the alpha rhythm. While these results are valuable, the provided evidence is incomplete.

    2. Reviewer #1 (Public review):

      In this paper by Brickwedde et al., the authors observe an increase in posterior alpha when anticipating auditory as opposed to visual targets. The authors also observe an enhancement in both visual and auditory steady-state sensory evoked potentials in anticipation of auditory targets, in correlation with enhanced occipital alpha. The authors conclude that alpha does not reflect inhibition of early sensory processing, but rather orchestrates signal transmission to later stages of the sensory processing stream. However, there are several major concerns that need to be addressed in order to draw this conclusion.

      First, I am not convinced that the frequency tagging method and the associated analyses are adequate for dissociating visual vs auditory steady-state sensory evoked potentials.

      Second, if the authors want to propose a general revision for the function of alpha, it would be important to show that alpha effects in the visual cortex for visual perception are analogous to alpha effects in the auditory cortex for auditory perception.

      Third, the authors propose an alternative function for alpha - that alpha orchestrates signal transmission to later stages of the sensory processing stream. However, the supporting evidence for this alternative function is lacking. I will elaborate on these major concerns below.

      (1) Potential bleed-over across frequencies in the spectral domain is a major concern for all of the results in this paper. The fact that alpha power, 36Hz and 40Hz frequency-tagged amplitude and 4Hz intermodulation frequency power is generally correlated with one another amplifies this concern. The authors are attaching specific meaning to each of these frequencies, but perhaps there is simply a broadband increase in neural activity when anticipating an auditory target compared to a visual target?

      (2) Moreover, 36Hz visual and 40Hz auditory signals are expected to be filtered in the neocortex. Applying standard filters and Hilbert transform to estimate sensory evoked potentials appears to rely on huge assumptions that are not fully substantiated in this paper. In Figure 4, 36Hz "visual" and 40Hz "auditory" signals seem largely indistinguishable from one another, suggesting that the analysis failed to fully demix these signals.

      (3) The asymmetric results in the visual and auditory modalities preclude a modality-general conclusion about the function of alpha. However, much of the language seems to generalize across sensory modalities (e.g., use of the term 'sensory' rather than 'visual').

      (4) In this vein, some of the conclusions would be far more convincing if there was at least a trend towards symmetry in source-localized analyses of MEG signals. For example, how does alpha power in the primary auditory cortex (A1) compare when anticipating auditory vs visual target? What do the frequency-tagged visual and auditory responses look like when just looking at the primary visual cortex (V1) or A1?

      (5) Blinking would have a huge impact on the subject's ability to ignore the visual distractor. The best thing to do would be to exclude from analysis all trials where the subjects blinked during the cue-to-target interval. The authors mention that in the MEG experiment, "To remove blinks, trials with very large eye-movements (> 10 degrees of visual angle) were removed from the data (See supplement Fig. 5)." This sentence needs to be clarified since eye-movements cannot be measured during blinking. In addition, it seems possible to remove putative blink trials from EEG experiments as well, since blinks can be detected in the EEG signals.

      (6) It would be interesting to examine the neutral cue trials in this task. For example, comparing auditory vs visual vs neutral cue conditions would be indicative of whether alpha was actively recruited or actively suppressed. In addition, comparing spectral activity during cue-to-target period on neutral-cue auditory correct vs incorrect trials should mimic the comparison of auditory-cue vs visual-cue trials. Likewise, neutral-cue visual correct vs incorrect trials should mimic the attention-related differences in visual-cue vs auditory-cue trials.

      (7) In the abstract, the authors state that "This implies that alpha modulation does not solely regulate 'gain control' in early sensory areas but rather orchestrates signal transmission to later stages of the processing stream." However, I don't see any supporting evidence for the latter claim, that alpha orchestrates signal transmission to later stages of the processing stream. If the authors are claiming an alternative function to alpha, this claim should be strongly substantiated.

    3. Reviewer #2 (Public review):

      Brickwedde et al. investigate the role of alpha oscillations in allocating intermodal attention. A first EEG study is followed up with a MEG study that largely replicates the pattern of results (with small to be expected differences). They conclude that a brief increase in the amplitude of auditory and visual stimulus-driven continuous (steady-state) brain responses prior to the presentation of an auditory - but not visual - target speaks to the modulating role of alpha that leads them to revise a prevalent model of gating-by-inhibition.

      Overall, this is an interesting study on a timely question, conducted with methods and analysis that are state-of-the-art. I am particularly impressed by the author's decision to replicate the earlier EEG experiment in MEG following the reviewer's comments on the original submission. Evidently, great care was taken to accommodate the reviewer's suggestions.

      Nevertheless, I am struggling with the report for two main reasons: It is difficult to follow the rationale of the study, due to structural issues with the narrative and missing information or justifications for design and analysis decisions, and I am not convinced that the evidence is strong, or even relevant enough for revising the mentioned alpha inhibition theory. Both points are detailed further below.

      Strength/relevance of evidence for model revision: The main argument rests on 1) a rather sustained alpha effect following the modality cue, 2) a rather transient effect on steady-state responses just before the expected presentation of a stimulus, and 3) a correlation between those two. Wouldn't the authors expect a sustained effect on sensory processing, as measured by steady-state amplitude irrespective of which of the scenarios described in Figure 1A (original vs revised alpha inhibition theory) applies? Also, doesn't this speak to the role of expectation effects due to consistent stimulus timing? An alternative explanation for the results may look like this: Modality-general increased steady-state responses prior to the expected audio stimulus onset are due to increased attention/vigilance. This effect may be exclusive (or more pronounced) in the attend-audio condition due to higher precision in temporal processing in the auditory sense or, vice versa, too smeared in time due to the inferior temporal resolution of visual processing for the attend-vision condition to be picked up consistently. As expectation effects will build up over the course of the experiment, i.e., while the participant is learning about the consistent stimulus timing, the correlation with alpha power may then be explained by a similar but potentially unrelated increase in alpha power over time.

      Structural issues with the narrative and missing information: Here, I am mostly concerned with how this makes the research difficult to access for the reader. I list the major points below:

      In the introduction the authors pit the original idea about alpha's role in gating against some recent contradictory results. If it's the aim of the study to provide evidence for either/or, predictions for the results from each perspective are missing. Also, it remains unclear how this relates to the distinction between original vs revised alpha inhibition theory (Fig. 1A). Relatedly if this revision is an outcome rather than a postulation for this study, it shouldn't be featured in the first figure.

      The analysis of the intermodulation frequency makes a surprise entrance at the end of the Results section without an introduction as to its relevance for the study. This is provided only in the discussion, but with reference to multisensory integration, whereas the main focus of the study is focussed attention on one sense. (Relatedly, the reference to "theta oscillations" in this sections seems unclear without a reference to the overlapping frequency range, and potentially more explanation.) Overall, if there's no immediate relevance to this analysis, I would suggest removing it.

    4. Reviewer #3 (Public review):

      Brickwedde et al. attempt to clarify the role of alpha in sensory gain modulation by exploring the relationship between attention-related changes in alpha and attention-related changes in sensory-evoked responses, which surprisingly few studies have examined given the prevalence of the alpha inhibition hypothesis. The authors use robust methods and provide novel evidence that alpha likely exhibits inhibitory control over later processing, as opposed to early sensory processing, by providing source-localization data in a cross-modal attention task.

      This paper seems very strong, particularly given that the follow-up MEG study both (a) clarifies the task design and separates the effect of distractor stimuli into other experimental blocks, and (b) provides source-localization data to more concretely address whether alpha inhibition is occurring at or after the level of sensory processing, and (c) replicates most of the EEG study's key findings.

      There are some points that would be helpful to address to bolster the paper. First, the introduction would benefit from a somewhat deeper review of the literature, not just reviewing when the effects of alpha seem to occur, but also addressing how the effect can change depending on task and stimulus design (see review by Morrow, Elias & Samaha (2023). Additionally, the discussion could benefit from more cautionary language around the revision of the alpha inhibition account. For example, it would be helpful to address some of the possible discrepancies between alpha and SSEP measures in terms of temporal specificity, SNR, etc. (see Peylo, Hilla, & Sauseng, 2021). The authors do a good job speculating as to why they found differing results from previous cross-modal attention studies, but I'm also curious whether the authors think that alpha inhibition/modulation of sensory signals would have been different had the distractors been within the same modality or whether the cues indicated target location, rather than just modality, as has been the case in so much prior work?

      Overall, the analyses and discussion are quite comprehensive, and I believe this paper to be an excellent contribution to the alpha-inhibition literature.

    1. eLife Assessment

      This study presents a valuable finding on the importance of the plasma metabolome in glaucoma risk prediction. The authors have used the UK Biobank data to interrogate the association between plasma metabolites and glaucoma. The evidence supporting the claims of the authors is solid and the work offers insights into the design of protective therapeutic strategies for glaucoma.

    2. Reviewer #1 (Public review):

      Summary:

      The authors explore associations between plasma metabolites and glaucoma, a primary cause of irreversible vision loss worldwide. The study relies on measurements of 168 plasma metabolites in 4,658 glaucoma patients and 113,040 controls from the UK Biobank. The authors show that metabolites improve the prediction of glaucoma risk based on polygenic risk score (PRS) alone, albeit weakly. The authors also report a "metabolomic signature" that is associated with a reduced risk (or "resilience") for developing glaucoma among individuals in the highest PRS decile (reduction of risk by an estimated 29%). The authors highlight the protective effect of pyruvate, a product of glycolysis, for glaucoma development and show that this molecule mitigates elevated intraocular pressure and optic nerve damage in a mouse model of this disease.

      Strengths:

      This work provides additional evidence that glycolysis may play a role in the pathophysiology of glaucoma. Previous studies have demonstrated the existence of an inverse relationship between intraocular pressure and retinal pyruvate levels in animal models (Hader et al. 2020, PNAS 117(52)) and pyruvate supplementation is currently being explored for neuro-enhancement in patients with glaucoma (De Moraes et al. 2022, JAMA Ophthalmology 140(1)). The study design is rigorous and relies on validated, standard methods. Additional insights gained from a mouse model are valuable.

      Weaknesses:

      Caution is warranted when examining and interpreting the results of this study. Among all participants (cases and controls) glaucoma status was self-reported, determined on the basis of ICD codes or previous glaucoma laser/surgical therapy. This is problematic as it is not uncommon for individuals in the highest PRS decile to have undiagnosed glaucoma (as shown in previous work by some of the authors of this article). The authors acknowledge a "relatively low glaucoma prevalence in the highest decile group" but do not explore how undiagnosed glaucoma may affect their results. This also applies to all controls selected for this study. The authors state that "50 to 70% of people affected [with glaucoma] remain undiagnosed". Therefore, the absence of self-reported glaucoma does not necessarily indicate that the disease is not present. Validation of the findings from this study in humans is, therefore, critical. This should ideally be performed in a well-characterized glaucoma cohort, in which case and control status has been assessed by qualified clinicians.

      The authors indicate that within the top decile of PRS participants with glaucoma are more likely to be of white ethnicity, while they are more likely to be of Black and Asian ethnicity if they are in the bottom half of PRS. Have the authors explored how sensitive their predictions are to ethnicity? Since their cohort is predominantly of European ancestry (85.8%), would it make sense to exclude other ethnicities to increase the homogeneity of the cohort and reduce the risk for confounders that may not be explicitly accounted for?

      The authors discuss the importance of pyruvate, and lactate for retinal ganglion cell survival, along with that of several lipoproteins for neuroprotection. However, there is a distinction to be made between locally produced/available glycolysis end products and lipoproteins and those circulating in the blood. It may be useful to discuss this in the manuscript, and for the authors to explore if plasma metabolites may be linked to metabolism that takes place past the blood-retinal barrier.

    3. Reviewer #2 (Public review):

      Summary:

      The authors have used the UK Biobank data to interrogate the association between plasma metabolites and glaucoma.

      (1) They initially assessed plasma metabolites as predictors of glaucoma: The addition of NMR-derived metabolomic data to existing models containing clinical and genetic data was marginal.

      (2) They then determined whether certain metabolites might protect against glaucoma in individuals at high genetic risk: Certain molecules in bioenergetic pathways (lactate, pyruvate, and citrate) conferred protection.

      (3) They provide support for protection conferred by pyruvate in a murine model.

      Strengths:

      (1) The huge sample size supports a powerful statistical analysis and the opportunity for the inclusion of multiple covariates and interactions without overfitting the models.

      (2) The authors have constructed a robust methodology and statistical design.

      (3) The manuscript is well written, and the study is logically presented.

      (4) The figures are of good quality.

      (5) Broadly, the conclusions are justified by the findings.

      Weaknesses:

      (1) Although it is an invaluable treasure trove of data, selection bias and self-reporting are inescapable problems when using the UK Biobank data for glaucoma research. The high-impact glaucoma-related GWAS publications (references 26 and 27) referenced in support of the method suffer the same limitations. This doesn't negate the conclusions but must be taken into consideration. The authors might note that it is somewhat reassuring that the proportion of glaucoma cases (4%) is close to what would be expected in a population-based study of 40-69-year-olds of predominantly white ethnicity.

      (2) As noted by the authors, a limitation is the predominantly white ethnicity profile that comprises the UK Biobank.

      (3) Also as noted by the authors, the study is cross-sectional and is limited by the "correlation does not imply causation" issue.

      (4) The optimal collection, transport, and processing of the samples for NMR metabolite analysis is critical for accurate results. Strict policies were in place for these procedures, but deviations from protocol remain an unknown influence on the data.

      (5) In addition, all UK Biobank blood samples had unintended dilution during the initial sample storage process at UK Biobank facilities. (Julkunen, H. et al. Atlas of plasma NMR biomarkers for health and disease in 118,461 individuals from the UK Biobank. Nat Commun 14, 604 (2023) Samples from aliquot 3, used for the NMR measurements, suffered from 5-10% dilution. (Allen, Naomi E., et al. Wellcome Open Research 5 (2021): 222.) Julkunen et al. report that "The dilution is believed to come from mixing of participant samples with water due to seals that failed to hold a system vacuum in the automated liquid handling systems. While this issue is likely to have an impact on some of the absolute biomarker concentration values, it is expected to have limited impact on most epidemiological analyses."

      Impact:

      The findings advance personalized prognostics for glaucoma that combine metabolomic and genetic data. In addition, the protective effect of certain metabolites influences further research on novel therapeutic strategies.

    1. eLife Assessment

      This important study examines the neuronal mechanisms underlying visual perception of integrated face and body cues. The innovative paradigm, which employs monkey avatars in combination with electrophysiological recordings from fMRI-defined brain areas, is a compelling approach. These results should be of wide interest to system and cognitive neuroscientists, psychologists, and behavioural biologists working on visual and social cognition.

    2. Reviewer #1 (Public review):

      Summary:

      The study addresses how faces and bodies are integrated in two STS face areas revealed by fMRI in the primate brain. It builds upon recordings and analysis of the responses of large populations of neurons to three sets of images, that vary face and body positions. These sets allowed the authors to thoroughly investigate invariance to position on the screen (MC HC), to pose (P1 P2), to rotation (0 45 90 135 180 225 270 315), to inversion, to possible and impossible postures (all vs straight), to the presentation of head and body together or in isolation. By analyzing neuronal responses, they found that different neurons showed preferences for body orientation, head orientation, or the interaction between the two. By using a linear support vector machine classifier, they show that the neuronal population can decode head-body angle presented across orientations, in the anterior aSTS patch (but not middle mSTS patch), except for mirror orientation.

      Strengths:

      These results extend prior work on the role of Anterior STS fundus face area in face-body integration and its invariance to mirror symmetry, with a rigorous set of stimuli revealing the workings of these neuronal populations in processing individuals as a whole, in an important series of carefully designed conditions.

      Minor issues and questions that could be addressed by the authors:

      (1) Methods. While monkeys certainly infer/recognize that individual pictures refer to the same pose with varying orientations based on prior studies (Wang et al.), I am wondering whether in this study monkeys saw a full rotation of each of the monkey poses as a video before seeing the individual pictures of the different orientations, during recordings.

      (2) Experiment 1. The authors mention that neurons are preselected as face-selective, body-selective, or both-selective. Do the Monkey Sum Index and ANOVA main effects change per Neuron type?

      (3) I might have missed this information, but the correlation between P1 and P2 seems to not be tested although they carry similar behavioral relevance in terms of where attention is allocated and where the body is facing for each given head-body orientation.

      (4) Is the invariance for position HC-MC larger in aSTS neurons compared to mSTS neurons, as could be expected from their larger receptive fields?

      (5) L492 "The body-inversion effect likely results from greater exposure to upright than inverted bodies during development". Monkeys display more hanging upside-down behavior than humans, however, does the head appear more tilted in these natural configurations?

      (6) Methods in Experiment 1. SVM. How many neurons are sufficient to decode the orientation?

      (7) Figure 3D 3E. Could the authors please indicate for each of these neurons whether they show a main effect of face, body, or interaction, as well as their median corrected correlation to get a flavor of these numbers for these examples?

      (8) Methods and Figure 1A. It could be informative to precise whether the recordings are carried in the lateral part of the STS or in the fundus of the STS both for aSTS and mSTS for comparison to other studies that are using these distinctions (AF, AL, MF, ML).

      Wang, G., Obama, S., Yamashita, W. et al. Prior experience of rotation is not required for recognizing objects seen from different angles. Nat Neurosci 8, 1768-1775 (2005). https://doi-org.insb.bib.cnrs.fr/10.1038/nn1600

    3. Reviewer #2 (Public review):

      Summary:

      This paper investigates the neuronal encoding of the relationship between head and body orientations in the brain. Specifically, the authors focus on the angular relationship between the head and body by employing virtual avatars. Neuronal responses were recorded electrophysiologically from two fMRI-defined areas in the superior temporal sulcus and analyzed using decoding methods. They found that: (1) anterior STS neurons encode head-body angle configurations; (2) these neurons distinguish aligned and opposite head-body configurations effectively, whereas mirror-symmetric configurations are more difficult to differentiate; and (3) an upside-down inversion diminishes the encoding of head-body angles. These findings advance our understanding of how visual perception of individuals is mediated, providing a fundamental clue as to how the primate brain processes the relationship between head and body - a process that is crucial for social communication.

      Strengths:

      The paper is clearly written, and the experimental design is thoughtfully constructed and detailed. The use of electrophysiological recordings from fMRI-defined areas elucidated the mechanism of head-body angle encoding at the level of local neuronal populations. Multiple experiments, control conditions, and detailed analyses thoroughly examined various factors that could affect the decoding results. The decoding methods effectively and consistently revealed the encoding of head-body angles in the anterior STS neurons. Consequently, this study offers valuable insights into the neuronal mechanisms underlying our capacity to integrate head and body cues for social cognition-a topic that is likely to captivate readers in this field.

      Weaknesses:

      I did not identify any major weaknesses in this paper; I only have a few minor comments and suggestions to enhance clarity and further strengthen the manuscript, as detailed in the Private Recommendations section.

    4. Reviewer #3 (Public review):

      Summary:

      Zafirova et al. investigated the interaction of head and body orientation in the macaque superior temporal sulcus (STS). Combining fMRI and electrophysiology, they recorded responses of visual neurons to a monkey avatar with varying head and body orientations. They found that STS neurons integrate head and body information in a nonlinear way, showing selectivity for specific combinations of head-body orientations. Head-body configuration angles can be reliably decoded, particularly for neurons in the anterior STS. Furthermore, body inversion resulted in reduced decoding of head-body configuration angles. Compared to previous work that examined face or body alone, this study demonstrates how head and body information are integrated to compute a socially meaningful signal.

      Strengths:

      This work presents an elegant design of visual stimuli, with a monkey avatar of varying head and body orientations, making the analysis and interpretation straightforward. Together with several control experiments, the authors systematically investigated different aspects of head-body integration in the macaque STS. The results and analyses of the paper are mostly convincing.

      Weaknesses:

      (1) Using ANOVA, the authors demonstrate the existence of nonlinear interactions between head and body orientations. While this is a conventional way of identifying nonlinear interactions, it does not specify the exact type of the interaction. Although the computation of the head-body configuration angle requires some nonlinearity, it's unclear whether these interactions actually contribute. Figure 3 shows some example neurons, but a more detailed analysis is needed to reveal the diversity of the interactions. One suggestion would be to examine the relationship between the presence of an interaction and the neural encoding of the configuration angle.

      (2) Figure 4 of the paper shows a better decoding of the configuration angle in the anterior STS than in the middle STS. This is an interesting result, suggesting a transformation in the neural representation between these two areas. However, some control analyses are needed to further elucidate the nature of this transformation. For example, what about the decoding of head and body orientations - dose absolute orientation information decrease along the hierarchy, accompanying the increase in configuration information?

      (3) While this work has characterized the neural integration of head and body information in detail, it's unclear how the neural representation relates to the animal's perception. Behavioural experiments using the same set of stimuli could help address this question, but I agree that these additional experiments may be beyond the scope of the current paper. I think the authors should at least discuss the potential outcomes of such experiments, which can be tested in future studies.

    1. eLife Assessment

      This study presents SegPore, a valuable new method for processing direct RNA nanopore sequencing data, which improves the segmentation of raw signals into individual bases and boosts the accuracy of modified base detection. The evidence presented to benchmark SegPore is solid and the authors provide a fully documented implementation of the method. If updated to process newer RNA nanopore sequencing data types, SegPore will be of great interest to researchers studying RNA modifications.

    2. Reviewer #1 (Public review):

      Summary:

      In this manuscript, the authors describe a new computational method (SegPore), which segments the raw signal from nanopore-direct RNA-Seq data to improve the identification of RNA modifications. In addition to signal segmentation, SegPore includes a Gaussian Mixture Model approach to differentiate modified and unmodified bases. SegPore uses Nanopolish to define a first segmentation, which is then refined into base and transition blocks. SegPore also includes a modification prediction model that is included in the output. The authors evaluate the segmentation in comparison to Nanopolish and Tombo, and they evaluate the impact on m6A RNA modification detection using data with known m6A sites. In comparison to existing methods, SegPore appears to improve the ability to detect m6A, suggesting that this approach could be used to improve the analysis of direct RNA-Seq data.

      Strengths:

      SegPore addresses an important problem (signal data segmentation). By refining the signal into transition and base blocks, noise appears to be reduced, leading to improved m6A identification at the site level as well as for single-read predictions. The authors provide a fully documented implementation, including a GPU version that reduces run time. The authors provide a detailed methods description, and the approach to refine segments appears to be new.

      Weaknesses:

      In addition to Nanopolish and Tombo, f5c and Uncalled4 can also be used for segmentation, however, the comparison to these methods is not shown. The overall improvement in accuracy appears to be relatively small. The run time and resources that are required to run SegPore are not shown, however, it appears that the GPU version is essential, which could limit the application of this method in practice. The method was only applied to data from the RNA002 direct RNA-Sequencing version, which is not available anymore, currently, it remains unclear if the methods still work on RNA004.

    3. Reviewer #2 (Public review):

      Summary:

      The work seeks to improve the detection of RNA m6A modifications using Nanopore sequencing through improvements in raw data analysis. These improvements are said to be in the segmentation of the raw data, although the work appears to position the alignment of raw data to the reference sequence and some further processing as part of the segmentation, and result statistics are mostly shown on the 'data-assigned-to-kmer' level.

      As such, the title, abstract, and introduction stating the improvement of just the 'segmentation' does not seem to match the work the manuscript actually presents, as the wording seems a bit too limited for the work involved.

      The work itself shows minor improvements in m6Anet when replacing Nanopolish eventalign with this new approach, but clear improvements in the distributions of data assigned per kmer. However, these assignments were improved well enough to enable m6A calling from them directly, both at site-level and at read-level.

      Strengths:

      A large part of the improvements shown appear to stem from the addition of extra, non-base/kmer specific, states in the segmentation/assignment of the raw data, removing a significant portion of what can be considered technical noise for further analysis. Previous methods enforced the assignment of all raw data, forcing a technically optimal alignment that may lead to suboptimal results in downstream processing as data points could be assigned to neighbouring kmers instead, while random noise that is assigned to the correct kmer may also lead to errors in modification detection.

      For an optimal alignment between the raw signal and the reference sequence, this approach may yield improvements for downstream processing using other tools.<br /> Additionally, the GMM used for calling the m6A modifications provides a useful, simple, and understandable logic to explain the reason a modification was called, as opposed to the black models that are nowadays often employed for these types of tasks.

      Weaknesses:

      The work seems limited in applicability largely due to the focus on the R9's 5mer models. The R9 flow cells are phased out and not available to buy anymore. Instead, the R10 flow cells with larger kmer models are the new standard, and the applicability of this tool on such data is not shown. We may expect similar behaviour from the raw sequencing data where the noise and transition states are still helpful, but the increased kmer size introduces a large amount of extra computing required to process data and without knowledge of how SegPore scales, it is difficult to tell how useful it will really be. The discussion suggests possible accuracy improvements moving to 7mers or 9mers, but no reason why this was not attempted.

      The manuscript suggests the eventalign results are improved compared to Nanopolish. While this is believably shown to be true (Table 1), the effect on the use case presented, downstream differentiation between modified and unmodified status on a base/kmer, is likely limited as during actual modification calling the noisy distributions are usually 'good enough', and not skewed significantly in one direction to really affect the results too terribly.

      Furthermore, looking at alternative approaches where this kind of segmentation could be applied, Nanopolish uses the main segmentation+alignment for a first alignment and follows up with a form of targeted local realignment/HMM test for modification calling (and for training too), decreasing the need for the near-perfect segmentation+alignment this work attempts to provide. Any tool applying a similar strategy probably largely negates the problems this manuscript aims to improve upon.

      Finally, in the segmentation/alignment comparison to Nanopolish, the latter was not fitted(/trained) on the same data but appears to use the pre-trained model it comes with. For the sake of comparing segmentation/alignment quality directly, fitting Nanopolish on the same data used for SegPore could remove the influences of using different training datasets and focus on differences stemming from the algorithm itself.

      Appraisal:

      The authors have shown their method's ability to identify noise in the raw signal and remove their values from the segmentation and alignment, reducing its influences for further analyses. Figures directly comparing the values per kmer do show a visibly improved assignment of raw data per kmer. As a replacement for Nanopolish eventalign it seems to have a rather limited, but improved effect, on m6Anet results. At the single read level modification modification calling this work does appear to improve upon CHEUI.

      Impact:

      With the current developments for Nanopore-based modification largely focusing on Artificial Intelligence, Neural Networks, and the like, improvements made in interpretable approaches provide an important alternative that enables a deeper understanding of the data rather than providing a tool that plainly answers the question of whether a base is modified or not, without further explanation. The work presented is best viewed in the context of a workflow where one aims to get an optimal alignment between raw signal data and the reference base sequence for further processing. For example, as presented, as a possible replacement for Nanopolish eventalign. Here it might enable data exploration and downstream modification calling without the need for local realignments or other approaches that re-consider the distribution of raw data around the target motif, such as a 'local' Hidden Markov Model or Neural Networks. These possibilities are useful for a deeper understanding of the data and further tool development for modification detection works beyond m6A calling.

    4. Reviewer #3 (Public review):

      Summary:

      Nucleotide modifications are important regulators of biological function, however, until recently, their study has been limited by the availability of appropriate analytical methods. Oxford Nanopore direct RNA sequencing preserves nucleotide modifications, permitting their study, however, many different nucleotide modifications lack an available base-caller to accurately identify them. Furthermore, existing tools are computationally intensive, and their results can be difficult to interpret.

      Cheng et al. present SegPore, a method designed to improve the segmentation of direct RNA sequencing data and boost the accuracy of modified base detection.

      Strengths:

      This method is well-described and has been benchmarked against a range of publicly available base callers that have been designed to detect modified nucleotides.

      Weaknesses:

      However, the manuscript has a significant drawback in its current version. The most recent nanopore RNA base callers can distinguish between different ribonucleotide modifications, however, SegPore has not been benchmarked against these models.

      I recommend that re-submission of the manuscript that includes benchmarking against the rna004_130bps_hac@v5.1.0 and rna004_130bps_sup@v5.1.0 dorado models, which are reported to detect m5C, m6A_DRACH, inosine_m6A and PseU.

      A clear demonstration that SegPore also outperforms the newer RNA base caller models will confirm the utility of this method.

    1. eLife Assessment

      The study is a valuable contribution to the question of evolutionary shifts in neuronal proliferation patterns and the timing of developmental progressions. The authors present convincing data which confirm the presence of type-II NB lineages in beetle with the same molecular characteristics as the Drosophila counterparts but differing in lineage size and number. The data lay the foundation for future analysis of the role and molecular characteristics of individual lineages and of whether differences in the identity, proliferation pattern and timing of developmental progression can be linked to differences in the development of functionality of the central complex.

    2. Reviewer #1 (Public review):

      Summary:

      Insects inhabit diverse environments and have neuroanatomical structures appropriate to each habitat. Although the molecular mechanism of insect neural development has been mainly studied in Drosophila, the beetle, Tribolium castaneum has been introduced as another model to understand the differences and similarities in the process of insect neural development. In this manuscript, the authors focused on the origin of the central complex. In Drosophila, type II neuroblasts have been known as the origin of the central complex. Then, the authors tried to identify those cells in the beetle brain. They established a Tribolium fez enhancer trap line to visualize putative type II neuroblasts and successfully identified 9 of those cells. In addition, they also examined expression patterns of several genes that are known to be expressed in the type II neuroblasts or their lineage in Drosophila. They concluded that the putative type II neuroblasts they identified were type II neuroblasts because those cells showed characteristics of type II neuroblasts in terms of genetic codes, cell diameter, and cell lineage.

      Strengths:

      The authors established a useful enhancer trap line to visualize type II neuroblasts in Tribolium embryos. Using this tool, they have identified that there are 9 type II neuroblasts in the brain hemisphere during embryonic development. Since the enhancer trap line also visualized the lineage of those cells, the authors found that the lineage size of the type II neuroblasts in the beetle is larger than that in the fly. They also showed that several genetic markers are also expressed in the type II neuroblasts and their lineages as observed in Drosophila.

      Comments on revisions:

      The revisions have improved the manuscript greatly. However, I still have some concerns about the lack of examination of the expression of NB markers. Without examining the expression of at least one unequivocal neuroblast marker, no one can say confidently that it is a neuroblast. However, it is acknowledged that such a marker is currently not available for Tribolium.

    3. Reviewer #2 (Public review):

      The authors address the question of differences in the development of the central complex (Cx), a brain structure mainly controlling spatial orientation and locomotion in insects, which can be traced back to the neuroblast lineages that produce the Cx structure. The lineages are called type-II neuroblast (NB) lineages and assumed to be conserved in insects. While Tribolium castaneum produces a functional larval Cx that only consists of one part of the adult Cx structure, the fan-shaped body, in Drosophila melanogaster a non-functional neuropile primordium is formed by neurons produced by the embryonic type-II NBs which then enter a dormant state and continue development in late larval and pupal stages.

      The authors present a meticulous study demonstrating that type-II neuroblast (NB) lineages are indeed present in the developing brain of Tribolium castaneum. In contrast to type-I NB lineages, type-II NBs produce additional intermediate progenitors. The authors generate a fluorescent enhancer trap line called fez/earmuff which prominently labels the mushroom bodies but also the intermediate progenitors (INPs) of the type-II NB lineages. This is convincingly demonstrated by high resolution images that show cellular staining next to large pointed labelled cells, a marker for type-II NBs in Drosophila melanogaster. Using these and other markers (e.g. deadpan, asense), the authors show that the cell type composition and embryonic development of the type-II NB lineages are similar to their counterparts in Drosophila melanogaster. Furthermore, the expression of the Drosophila type-II NB lineage markers six3 and six4 in subsets of the Tribolium type-II NB lineages (anterior 1-4 and 1-6 type-II NB lineages) and the expression of the Cx marker skh in the distal part of most of the lineages provide further evidence that the identified NB lineages are equivalent to the Drosophila lineages that establish the central complex. However, in contrast to Drosophila, there are 9 instead of 8 embryonic type-II NB lineages per brain hemisphere and the lineages contain more progenitor cells compared to the Drosophila lineages. The authors argue that the higher number of dividing progenitor cells supports the earlier development of a functional Cx in Tribolium.

      While the manuscript clearly shows that type-II NB lineages similar to Drosophila exist in Tribolium, it does not establish a direct link between the characteristics of these lineages and a functional larval Cx in Tribolium, i.e., it does not identify the cause of the heterochronic development of the Cx in these insects. However, the detailed study lays the foundation for lineage tracing and gene function experiments that will elucidate if the higher number of Tribolium type-II NB lineage progenitors, the additional lineage and the timing of developmental progression of the progenitors can indeed be linked with the earlier function of the Cx and/or if other components are required for establishing the functional larval neural circuits in Tribolium such as e.g. larval born neurons as is the case in Drosophila.