10,000 Matching Annotations
  1. Last 7 days
    1. Reviewer #1 (Public review):

      Summary:

      This carefully executed study uncovers the functional relevance of curl signals that impinge on the retina every time an observer's gaze direction and movement direction are not aligned.

      Strengths:

      This finding is important, highlighting the functional role of an abundant incidental signal (curl in retinal motion) that has thus far believed to be a nuisance that needs to be filtered out of the retinal motion stream.

      The study's evidence is compelling: a combination of psychophysical experiments and critical manipulations, control theory and neural modeling, which together make an internally consistent and biologically plausible case for the role of curl signals in estimating heading direction.

      This study uncovers the functional relevance of curl signals that occur on the retina when an observer is moving, and gaze is not straight ahead. The experimental and modeling results clearly go beyond previous studies and significantly advance our understanding of vision-based navigation.

      Another clear strength is that the study uses tightly controlled experimental manipulation to provide strong test cases for the hypothesis that curl is used for visual navigation. These conditions are important to constrain the proposed model (and future models) of heading control.

      The modeling is very clearly described, and the modeling and analysis code is published and freely available. The authors go beyond a back-of-the-envelope control model and show how it might be implemented at the neural-circuit level. The model is biologically plausible.

      Weaknesses:

      The discussion would benefit from an extension of the implications of the study and predictions of their model.

    2. Reviewer #2 (Public review):

      This study examines how curl in the retinal flow field can be used as a control variable for estimating and controlling the heading of a moving observer. The basic idea (which is not entirely new, see Matthis et al. 2022) is that translation along a path with eccentric gaze (meaning that the subject is not heading toward the point they are looking at) produces a pattern of optic flow on the retina with a rotational component around the point of fixation (which can be captured by the mathematical "curl" operator). The sign and magnitude of retinal curl vary with heading relative to the point of fixation, such that curl can be used as a control variable to steer rightward or leftward to move toward the fixated target. The authors perform behavioral experiments and show that there are biases in perceived heading that seem to be largely governed by retinal curl. They also show that a simple controller model can use curl to steer toward a target, and they provide a neural network model that provides a biologically plausible implementation of the controller (although there are some questions about that).

      There is a core of interesting work here that I think can be important to the field. However, there is a lack of clarity on several important fronts, including design of the behavioral experiments, presentation of the behavioral data, conceptual framing of what curl can and cannot do, etc. Equally importantly, the manuscript is not written in a manner that will make it accessible to most vision scientists. I consider myself to be pretty knowledgeable about optic flow, and I had to read most of the manuscript 3 or 4 times to be able to understand the bulk of it. And my experience is that most vision scientists do not understand optic flow well, so I fear that most of the readers that the authors should want to reach would struggle to understand the work. As written, this is mainly going to make an impact on a handful of optic flow gurus. Thus, I consider that this manuscript will need a major overhaul to clarify important issues and make it more accessible.

      Major issues:

      (1) The manuscript contains inconsistent, if not misleading, messaging about what information retinal curl does, and does not, provide regarding heading estimation. In the Abstract, the authors state: "We propose an alternative: the visual system utilizes retinal curl directly to estimate heading, rendering the explicit recovery of the FOE unnecessary." Based on my understanding of the rest of the manuscript, I find this statement to be a misrepresentation for two main reasons:

      a) To "directly estimate heading" relative to what? When not qualified, most people interpret "heading" to mean an observer's heading relative to the world (or some allocentric reference frame). But retinal curl only gives information about an observer's heading relative to the point on which their eyes are fixated. Moreover, that point of fixation will change every few hundred milliseconds in natural viewing, so the retinal curl will change with each new fixation even as heading relative to the world remains unchanged. So I think most readers would grossly misinterpret the claim that retinal curl can be used "directly to estimate heading". Indeed, in the authors' controller model, the initial heading needs to be given, and then the controller can work. But from where does the visual system get the initial heading, since it does not come from curl? These issues are left hanging. Thus, while curl can provide a very useful input for steering toward a fixated target, other signals are needed to estimate heading relative to the world. This has to be made much clearer early on, and a conceptual schematic diagram might help. Also, the authors generally do not specify the reference frame of the variables they are talking about, leaving lots of room for misinterpretations. It should be clear each time they are talking about a variable, such as heading, whether it is relative to the fixation target, body, world, etc.

      b) It seems to me that retinal curl will depend on other variables, in addition to heading relative to the fixation target. For example, it seems to me that the magnitude of retinal curl will depend on self-motion speed, the depth structure of the scene, the angle of elevation of the fixated target, and perhaps others. This is not discussed at all, and many readers would get the misguided impression that there is a 1:1 mapping from curl to heading (relative to fixation). If I am right that this is not correct, it means that retinal curl can tell the observer whether to steer right or left to move toward the fixated target, but it cannot tell them how much to steer. Indeed, in the authors' controller model, there is a free parameter that calibrates curl to angle. It makes sense that this works to fit trajectory data that are given from a fixed environment, but it is unclear how the brain would use retinal curl to control steering when these other variables are uncertain or changing unpredictably. Moreover, how does the system change the mapping from curl to steering command as the location of fixation changes relative to the current heading? These are issues that need to be brought up in framing the problem and discussed at some length. If the authors can show mathematically that retinal curl is only dependent on heading (relative to fixation) and not any of these other variables, it would be very valuable to show the equations for this relationship.

      (2) The description of the behavioral experiment and presentation of behavioral data leaves a lot to be desired.

      a) First, it is stated (line 158) that "Participants continuously reported their perceived direction of self-motion while maintaining fixation on the yellow dot." Again, the reference frame is completely unspecified. Participants were reporting their perceived heading relative to what? The fixation target? The world? What exactly were the instructions given to the subjects to perform the task? Based on the description of how perceived paths are computed (line 166-), it seems to be presumed that subjects are reporting their heading relative to the world because those angles are then converted into x and z coordinates in what I presume is a world-centered reference frame. But how do we know that subjects are accurately reporting their heading relative to the world? What if they are biased in their reports by the location of the fixation target relative to the scene, or by some other reference signal? Is it possible for the authors to rule out the possibility that perceptual biases seen in the unaltered curl condition result from observers not fully adopting the assumed reference frame of the task? If this cannot be firmly excluded, it seems to create problems for the rest of the study.

      b) I also feel that there is a mismatch between what the behavioral task requires and what the controller model does. Subjects are apparently asked to report their heading relative to the world, but the controller model only controls their heading relative to the point that they are fixating. I understand how this is resolved in the model, but I think this type of distinction is buried and will not be apparent to most readers. Again, the reference frames of what is being measured and controlled need to be specified explicitly in all parts of the paper, and the authors need to explain how the system would combine curl-based control with some other measures of (at least initial) heading for world-centered heading to be computed. All of the assumptions need to be clearly specified.

      c) In addition, I found it frustrating that the authors never present raw perceptual data from the observers. Rather, in Figure 2, we see reconstructed trajectories that are perfectly smooth with no indications of noise whatsoever. Since these paths are computed from the perceptual reports, there must be some noise inherent in them. The figures should represent this uncertainty somehow, and it should be explained how these perfectly smooth trajectories are obtained.

      (3) "...the magnitude of retinal curl in the fovea can specify the body trajectory relative to gaze (Matthis et al., 2022)." The main idea put forward by the authors here seems to overlap heavily with this statement that they attribute to Matthis et al. 2022. While I think this paper still adds importantly to the topic, the authors do not discuss how their findings are different from those of Matthis et al. 2022, why they are an important extension, etc. Readers should not have to go read this other paper to have any idea how the present findings are placed in importance relative to the literature.

      (4) The analysis and treatment of eye movements is extremely weak. The authors discarded trials for which gaze deviated from the fixation point by more than 3 degrees (which is a LOT given that the eye speeds are generally in the neighborhood of 0.5 deg/sec), and they provide basic stats on the distribution of positions. But this largely misses the point: it is not small position errors that are likely to matter, but rather velocity errors. Even a small amount of retinal slip of the target while it is being pursued will cause image motion that is going to alter the optic flow field around the fixation target. So, for example, the retinal curl field may no longer be centered on the fixation target. How do we know that some of the perceptual biases are not influenced by image motion resulting from imperfect tracking of the fixation target? This needs to be analyzed and discussed.

      (5) I found the sections of text comparing the separate and joined fits (starting line 287) to be a bit too rosy. The authors show the separate fits in the main text, and it is not very surprising that these fits are good, given that the model has 30 parameters, and these data are pretty low-dimensional. The authors only show the joined fits in the supplement, and they say that they are almost as good as the separate fits (indeed, they are better in a model comparison sense, but this is 30 parameters vs. 2 parameters). However, when I look at the fits of the joined model in the supplement, I don't find them to be very impressive. In particular, the model grossly misses the data for the straight paths for several subjects (e.g., id5, id6, id8, id10). And fitting the straight paths would presumably be easiest. This implies that the joined model is really missing something and that fitting the curved paths interacts strongly with fitting the data for different fixation target locations on the straight path. I think that the authors should discuss the results a bit more soberly and tone down their conclusions here.

      (6) The section of the paper on neural simulations (starting line 387) has a few weaknesses. First, why are only straight paths simulated here? This does not seem to provide a very rigorous test of the model. Second, it is awkward that the simulation results are presented in units of pixels, rather than degrees. Third, the authors seem to downplay the fact that the neural estimates of heading seem to oscillate rather wildly (over a range of hundreds of pixels, whatever that means, see especially Figure S16). It was far from clear to me how an estimate of heading with these large oscillations is useful. It would seem to require that heading estimates are integrated over substantial lengths of time to be reliable. It was therefore unclear how the model produces such smooth paths from these oscillating estimates.

    3. Reviewer #3 (Public review):

      Summary:

      This manuscript uses a novel paradigm to demonstrate that rotational motion patterns in the retinal image, called curl, directly influence perception of heading direction. This means that it is not necessary to recover the focus of expansion, defined by the point of zero motion when moving along a straight trajectory toward a target, as is commonly thought.

      Strengths:

      It has long been accepted that the focus of expansion of the optic flow field generated by self-motion is used to guide heading direction. While there have been many challenges to the need to recover the focus of expansion when gaze is not in the direction of travel, it is still not well understood how retinal motion patterns contribute to heading perception. Recent work has demonstrated the complexity of the retinal motion patterns during natural walking, where body motion adds a rotational component. A rotational component also results from curved paths as well as gaze off the direction of travel. This rotational component is called curl. The primary contribution of this manuscript is to demonstrate convincingly that curl influences perception of heading, and that it is not necessary to recover the focus of expansion.

      A strength of the manuscript is that realistic retinal motion patterns are generated by recording the image sequences generated by a walker in a virtual environment, and then using those patterns as stimuli in the experiment. This allows the creation of the more complex flow patterns that are a consequence of the bob and sway of natural walking, which are often considered a minor factor. The elegant experimental design allows direct manipulation of the curl signal, and this in turn directly influences measured heading perception. Another strength is that the authors ground their findings in control theory and neural computations, using a model that produces human-like path trajectories.

      The study is timely, given the long history of this question, together with the growing understanding of the complexity of naturally generated retinal motion and the absence of direct evidence for the way that these motion patterns are used in heading perception. It adds an important piece of evidence for how retina-centered optic flow may be used by the visual system, which is critical for our understanding of motion processing in the brain.

      Weaknesses:

      The primary limitation of the paper is that it avoids discussion of some of the inevitable complexities of heading perception. The main issue is what exactly is meant by heading. Different behaviors evolve over different timescales. The geometry of retinal motion defines instantaneous heading, which varies widely through the gait cycle. Time-varying information like this is known to be important in the momentary control of balance. Heading can also be thought of as steering the body toward a distant goal, which evolves over longer timescales. The current manuscript appears to be concerned with heading information integrated over a few seconds and seems to provide evidence that heading is indeed integrated over the gait cycle. The issue of the time scale of the computation is touched on, but it is not related to how it might be used in normal walking or what situations it might apply to. Steering toward a distant goal during walking is not a very difficult problem and may not require evaluation of retinal motion, but control of balance is more challenging and may depend critically on curl. Consequently, the timescale of the computation needs to be considered in order to understand what is meant by heading.

    1. Reviewer #1 (Public review):

      Summary:

      The study is technically extensive and employs a wide range of experimental approaches, including in vivo analyses, cell-based assays, and transcriptomic data integration. The authors provide a detailed characterization of inflammatory and stress-related pathways activated following IMQ exposure in mouse skin. These datasets may be informative for researchers specifically interested in IMQ-induced dermatitis or in stress responses triggered by chemical skin irritants.

      Strengths:

      The study is technically extensive and employs a wide range of experimental approaches, including in vivo analyses, cell-based assays, and transcriptomic data integration. The authors provide a detailed characterization of inflammatory and stress-related pathways activated following IMQ exposure in mouse skin. These datasets may be informative for researchers specifically interested in IMQ-induced dermatitis or in stress responses triggered by chemical skin irritants.

      Weaknesses:

      A major limitation of the manuscript is its exclusive reliance on the IMQ model, which does not adequately represent the immunological drivers, cellular interactions, or therapeutic responsiveness of human psoriasis, despite the manuscript's framing. IMQ-induced inflammation is dominated by innate immune activation and mouse-specific pathways, whereas human psoriasis is driven primarily by IL-23/IL-17-mediated interactions between keratinocytes and Th17/Tc17 cells. As a result, conclusions drawn entirely from IMQ-based experiments have limited relevance to human disease biology.

      Consistent with this issue, the manuscript places strong emphasis on pathways such as TLR signaling, inflammasome activation, and IL-1-associated responses, none of which are established as central drivers of plaque psoriasis in patients. Therapeutic strategies targeting these pathways have failed to achieve clinical efficacy comparable to IL-23 or IL-17 blockade, yet this translational gap is not adequately addressed.

      The in vitro keratinocyte experiments further limit interpretability. Stimulation of keratinocytes with IMQ is not an accepted model of psoriasis-relevant keratinocyte activation, and the study does not demonstrate induction of well-established psoriasis signature gene programs. Without this validation, it is difficult to assess the relevance of the observed cellular stress responses to human disease.

      The RNA-sequencing analyses raise additional concerns regarding rationale and interpretation. The basis for selecting specific mouse and human datasets is unclear, including the use of unpublished or non-psoriasis inflammatory datasets. Key methodological details related to data processing, normalization, cross-species comparison, and statistical analysis are insufficiently described. In addition, the limited number of differentially expressed genes identified does not align with the extensive psoriasis transcriptomic literature, raising concerns about analytical rigor.

      Finally, the manuscript emphasizes a small number of genes described as "psoriasis-associated" while failing to demonstrate regulation of widely accepted psoriasis signature genes known to correlate with disease activity and therapeutic response in patients.

    2. Reviewer #2 (Public review):

      Summary:

      This paper shows that imiquimod, a compound often used to induce a psoriasis-like skin inflammation in mice, has a TLR7-independent effects that induce the unfolded protein response and amplify cytokine expression in dendritic cells. Although these effects of imiquimod have been described in the literature before, this study provides more detailed evidence and different contexts to this observation. These findings add to existing literature that imiquimod has a pleotropic mechanism of action involving changes in mitochondrial functions and cellular stress responses. Specifically, the authors show that imiquimod can induce calcium signaling in immune cells and potentiate two branches of the unfolded protein response in a TLR7-independent and MyD88-independent manner. They also show that some of these effects might be partially mediated by direct binding of imiquimod to Gelsolin. These findings expand our understanding of imiquimod-mediated inflammation and are useful for the field of experimental skin immunology and mouse models of psoriasis. However, the molecular and cellular mechanisms connecting Gelsolin to the unfolded protein response and skin inflammation presented in this paper require further investigation in the context of TLR-mediated inflammation.

      Strengths:

      (1) TLR7-independent effects of imiquimod on the expression of genes and proteins involved in the unfolded protein response are well demonstrated.

      (2) Gelsolin is identified as a new imiquimod-binding protein in mouse cells.

      Weaknesses:

      (1) Effects of imiquimod on mitochondrial Ca signaling are not clear from the presented data.

      (2) The mechanism of action connecting imiquimod to Gelsolin on the unfolded protein response and cytokine production remains not fully explained.

      (3) It remains unclear if Gelsolin contributes to regulating TLR7 (or other types of TLR-mediated) inflammation in vivo.

    1. Reviewer #1 (Public review):

      Porte et al. investigate how observers form confidence judgments about the presence vs absence of near-threshold audiovisual stimuli. In two psychophysical detection experiments, human participants judged whether a stimulus (visual, auditory, or audiovisual) was present or absent, reported amodal confidence, and then gave modality-specific detection and confidence ratings using a bidimensional scale. The authors report that audiovisual (AV) stimuli are detected more accurately than unimodal stimuli, but that multisensory stimulation does not improve metacognitive efficiency. Participants are more confident in absence than in presence judgments. They extend a previously proposed model to an audiovisual setting, assuming evidence is available only for presence and that absence is inferred via counterfactual detectability. Detection is modeled with a disjunctive integration rule across modalities, while confidence is explained by a combination of conjunctive (for presence) and disjunctive/negation-of-disjunction (for absence) rules.

      There are several points I wish to have clarified, outlined below:

      (1) Framing of bimodal vs unimodal detection

      On p.3, the introduction states that "Adults typically show higher detection rates and faster reaction times for bimodal than for unimodal stimuli." This is broadly consistent with the literature, but as written, it obscures the fact that these effects depend critically on experimenter-defined stimulus strengths. It is trivial to construct cases where a strong unimodal stimulus is more detectable than a bimodal stimulus made of two very weak unimodal stimuli. If "bimodal" is understood as the co-presentation of two unimodal components matched in detectability, then Bayes-rule-based arguments indeed predict better detection for the bimodal case; how much better is theoretically interesting, but not quantified in this paper. There is an entire literature on the combination of two unimodal stimuli, which is not touched on. For a pertinent reference, see Ernst & Banks 2002. I recommend clarifying that the statement assumes comparable unimodal intensities.

      (2) Relationship to signal detection theory and counterfactual perceptibility

      In the introduction, the authors write, "If sensory evidence is only available for presence," motivating counterfactual perceptibility as a necessary ingredient to infer absence. However, standard signal detection theory (SDT) already provides a widely accepted framework in which a continuous internal response is present on both signal and noise (absent) trials, with absence corresponding to the noise distribution and decisions implemented by a criterion.

      Thus, there is no logical need to invoke counterfactual perceptibility simply to define absence; rather, the Mazor-style framework adds an explicit belief model about detectability and an optimal stopping policy. It would strengthen the paper to more clearly state how the proposed model goes beyond SDT conceptually, acknowledge that SDT can account for presence/absence decisions without counterfactuals, and position the counterfactual account as a hypothesis about how observers actually compute absence/confidence, not as a necessity. One of the central claims of the paper is that detection in the case of absence requires counterfactual reasoning. The authors should demonstrate whether or not an SDT-based generative model can describe these amodal and uni- and bi-modal stimulus decisions. In such an SDT model, an SDT-based generative model in which the noise distribution is shared across conditions, and unimodal vs bimodal differences are captured by changes in the mean or variance of the signal+noise distribution.

      (3) Confidence vs performance: is AV confidence special?

      The paper's central claims about multisensory confidence and metacognition would be stronger if the authors showed that AV confidence deviates from what is expected given performance alone. From the reported results, AV accuracy is around 80%, with visual and auditory at about 60% and 40%, respectively. Given that confidence typically monotonically scales with accuracy, the first question is whether AV confidence is entirely explained by improved performance, or whether there is an additional multisensory contribution. A simple, informative analysis would be for each subject, plot mean confidence vs per cent correct for AV, V, A, and absent conditions, and to test whether AV confidence lies above the trend predicted by accuracy alone.

      (4) Metacognitive measures: logistic regression slopes vs meta-d′/d′

      In the "Multisensory effects on metacognitive performance" section, the authors define "metacognitive sensitivity" as the slope of a Bayesian logistic regression predicting accuracy from confidence. There is substantial literature showing that logistic-slope measures of metacognitive sensitivity are criterion-dependent and can be affected by both task and confidence criteria (for one example, see Rausch & Zehetleitner, 2017). In contrast, meta-d′/d′ was specifically developed to provide a bias-invariant measure of metacognitive efficiency. Though this, too, is dated (see Boundy-Singer et al., 2023). Given that the authors already estimate HMeta-d-based M-ratios, it is unclear why they rely on logistic regression slopes as their primary "metacognitive sensitivity" metric in Figure 4A. I suggest either replacing the logistic-slope metric with SDT-based measures (meta-d′, meta-d′/d′) or providing a clear justification for using logistic slopes, along with a discussion of their known limitations.

      Additionally, Figure 3 reports M-ratios without showing the corresponding d′ or meta-d′ for judge-present vs judge-absent conditions. Presenting these would help contextualize the metacognitive efficiency results and clarify whether differences are driven mainly by changes in metacognitive sensitivity, changes in task performance, or both. The d' values per condition could be added to Figure 2A.

      (5) Interpretation of confidence in absence vs presence

      The authors emphasise that it is surprising subjects are more confident in absence than in presence judgments, both at amodal and modality-specific levels. However, Figure 2B suggests that absent responses are very accurate: absent is reported as present only in about 10% of absent trials, implying a high correct rejection rate. If confidence tracks outcome probability, higher confidence for absence may be at least partly expected. Before attributing this asymmetry primarily to counterfactual reasoning, it would be important to explicitly relate confidence to accuracy for hits, misses, false alarms, and correct rejections and show whether absence confidence remains elevated relative to presence after controlling for accuracy differences across judgment types and conditions. Without this, the interpretation that higher absence confidence is inherently "unexpected" seems overstated.

      (6) Model: integration rules, confidence, and evidence strength

      The modeling section extends the Mazor et al. ideal observer to two modality-specific sensors, with disjunctive integration for detection and then disjunctive vs conjunctive integration rules for confidence. I have a few comments.

      First, the detection rule is disjunctive and is reported as a finding. However, the conclusion that detection relies on a disjunctive rule ("present if A or V") closely mirrors the task instructions-participants are explicitly told to respond "present" if they detect the stimulus in any modality. As such, this seems more like a sanity check than a novel empirical finding.

      Relatedly, the conjunctive detection is a weak null. The conjunctive rule ("present only if both A and V") is behaviorally implausible given the task instructions. A more informative baseline would be an SDT-style scalar-evidence model (see comment 2), rather than a conjunctive rule that participants would have to actively violate the instructions to follow.

      Second, confidence in the model is defined as the probability of being correct at the time of the detection decision. However, this implies a fixed amount of evidence at decision time unless additional mechanisms are invoked. This issue is well known in diffusion modeling (see Kiani et al. 2014) and deserves explicit discussion; otherwise, it is unclear how the model produces graded confidence from a bound-crossing rule alone.

      Third, the authors do not consider a straightforward evidence-strength account of confidence. When both modalities indicate presence, there is, on average, more total sensory evidence than in unimodal trials, making correct decisions more likely and, under most frameworks, confidence higher. Likewise, weak evidence in both modalities can be stronger evidence for absence than moderate in one and weak in the other. Many of the patterns that motivate the presence-conjunctive/absence-disjunctive mix could arise from a model where confidence simply reflects the amount of evidence for the chosen option, without positing distinct logical integration rules for presence vs absence. As the authors note, purely disjunctive or purely conjunctive confidence rules fail to capture the trends in confidence reports in Figure 7, leading them to adopt a combined presence-conjunctive / absence-disjunctive rule. A more parsimonious alternative-that confidence scales with evidence magnitude and cross-modal agreement-should be explicitly considered and, ideally, implemented as a competing model.


Finally, if the model is intended as a good account of the data, it would be useful to report whether it also reproduces the metacognitive efficiency patterns (M-ratios) beyond the mean confidence patterns shown in Figures 7-8. At present, the model appears systematically over-confident, which should be acknowledged and quantified.

      (7) Confidence asymmetry index (CAI) and modality weighting

      The confidence asymmetry index (CAI) is defined as the difference between auditory and visual confidence on AV vs absent trials, and the authors report strong correlations between observed and simulated CAI across participants. They interpret this as evidence that subjects place different weights on auditory vs visual signals. Several questions arise. First, does CAI capture asymmetries beyond what is expected from accuracy differences between modalities and conditions? Second, because the simulated data are generated from model fits to the observed data, a correlation between observed and simulated CAI is expected: the model is built to reproduce the individual patterns it is then compared to. A stronger test would compare CAI from data simulated with modality-specific belief parameters, versus CAI from data simulated with constrained equal belief parameters (same θs). Relatedly, the paper would benefit from a plot showing the distribution of θs for A and V- present stimuli across subjects. These values could also be related to unimodal sensitivity measured in the calibration/training phases. A natural prediction is that higher unimodal sensitivity should correspond to higher belief parameters for presence.

    2. Reviewer #2 (Public review):

      Summary:

      In this study, across two experiments, the authors wrestle with the question: What is the profile of confidence judgments in presence/absence decisions for audiovisual stimuli? After thresholding observers to 50% target detection rates in each modality, the authors conducted one experiment that included 75% target presence (spread equally across bimodal, auditory, and visual targets) and one experiment with 50% overall target presence. Results showed that, overall, detection performance was higher for audiovisual stimuli compared to unimodal ones, and that a recent model for stimulus detection could be extended to this multisensory scenario. By incorporating a disjunctive rule for absence judgments and a conjunctive rule for presence judgments, the model was able to qualitatively reproduce some of the trends observed in the human data regarding confidence.

      Strengths:

      (1) The paper makes novel contributions to the study of multisensory confidence judgments for yes/no target detection.

      (2) The paper further extends the use of a leading model of stimulus detection (from Mazor et al., 2025).

      (3) Pre-registration of the study was implemented, and the code is publicly available (although the GitLab link requires registration to access the materials).

      (4) One of the empirical results (higher confidence for absence compared to presence judgments) is especially interesting, contributing another empirical finding to a very mixed literature on this topic (as the authors note).

      Weaknesses:

      (1) Page 5 - I have concerns about the use of the equal-variance model from Signal Detection Theory to analyze the data. For example, the authors should read the recent paper by Miyoshi, Rahnev, and Lau in iScience, found at this link: https://www.cell.com/iscience/fulltext/S2589-0042(26)00373-1. In this paper, the authors note how the equal variance model should be used with caution in yes/no detection tasks, since the variances of the "stimulus present" and "stimulus absent" distributions are often different from one another. In a revision, I highly recommend that the authors explicitly discuss this paper and review whether the assumptions for the equal-variance model have been met (e.g., since they have confidence data, one way to do this would be to evaluate if the slope of the line in zROC space differs from 1). The authors may also want to incorporate methods from this iScience paper into the current manuscript, or potentially move to using an unequal variance SDT model and compute d'a and c'a.

      (2) Related to the computation/measurement of the response criterion, the authors note on page 18 in the Methods that for Experiment 1, signals are actually present on 75% of trials, since a bimodal stimulus is present on 25% of trials, the visual circle only occurs on 25% of trials, the sinusoidal tone occurs on 25% of trials, and then only noise is present on 25% of trials. Did the authors have any a priori hypotheses about the response criteria that participants would exhibit in Experiment 1, considering the unbalanced target presentation rate in this task? Also, in Experiment 2, what did it mean to equate target present and target absent trials? Is it that they broke 50% target present trials down into 16.67% bimodal targets, 16.67% visual targets, and 16.67% auditory targets? A few more details would be good to explicitly note for those trying to replicate the task.

      (3) It is important to plot the individual data for Figure 2. If the authors didn't match detection performance for the visual and auditory modalities, it would be good to see the individual data to know why. Is it that the thresholding procedure didn't work for some of the participants in the visual modality, and that's why the "yes" response rate is (on average) ~60% or higher across the two experiments? Similarly, in the auditory domain, do the authors have participants that are at floor? Or is it simply that the staircases failed to successfully target 50% detection on average?

      (4) The authors mentioned that data were collected on the Prolific platform. What checks did they conduct to ensure that this data wasn't produced by bots? There are recent high-profile publications in PNAS and Behavioral Research Methods that indicate how online data collection is problematic (e.g., https://www.pnas.org/doi/10.1073/pnas.2535585123 and https://link.springer.com/article/10.3758/s13428-025-02852-7). What analyses or quality checks are there to ensure that humans were the ones completing the task?

      (5) Page 7 - Since confidence was collected on a continuous scale, the authors should say a bit more about how they were able to compute measures of metacognitive efficiency. My understanding is that to compute meta-d', the data has to be binned. How was the binning implemented? With whatever bin size the authors chose, would it make any difference to the results if they changed the number of the bins in the analysis?

      (6) Page 8 - Is there a prior precedent for using slope of the Bayesian logistic regression predicting accuracy from confidence as a measure of metacognitive sensitivity? If so, can the authors cite those papers as a reference? If not, can they place this analysis within the context of other measures of metacognitive sensitivity that exist? (meta-d', AUROC (Type 2), etc.)

      (7) Page 8 - Another one of the results on page 8 is worth reflecting further upon: the authors note how in Experiment 1, no credible difference was found between unimodal and bimodal trials (DeltaM = -0.25 [-0.59, 0.10]), but in Experiment 2, "we observed higher metacognitive efficiency in unimodal compared to bimodal trials (DeltaM = -0.28 [-0.54, -0.02]. Those DeltaM values are nearly identical, so without a power analysis motivating the number of participants the authors collected, how certain are they that the results from these two experiments are really that distinct? It reminds me a bit of the Andrew Gelman blog post, "The difference between significance and non-significance is not significant".

      (8) Is there any way to look at whether the presence of multisensory hallucinations (or perhaps that word is too strong, and we should simply consider them miscategorizations) increased as the task progressed? That is, the authors have repeated presentations of audiovisual stimuli for at least some percentage of the trials. Since the percentages for auditory stimuli being correctly categorized as auditory are at 85% in Experiment 1 and 79% in Experiment 2, were the trials where they miscategorized these stimuli equally spread throughout the task? Or did they come later in the experiment, after being repeatedly exposed to multisensory trials?

      (9) Would the authors obtain the same results if they got rid of the amodal confidence judgment in their task, and simply had participants report the bimodal confidence following the presence/absence judgment? Part of the reason for asking this is that, according to page 11, the model is only fitted to amodal detection accuracy and response time data. This surprised me. I would have expected that the bimodal confidence would provide more useful information for the model fit. The authors should further explain this rationale in the paper. It seems odd to me to have the multisensory confidence ratings and not have them play a central role in the modeling work.

      (10) In Figure 6, it appears the model is a bit off in its estimate of auditory responses (panel B, E) in the AV condition. Do the authors have any intuitions about why this might be happening?

      (11) The authors talk about how the model is reproducing effects in the human data, but there's no systematic comparison, quantitatively, of how the two things relate. The authors should include some quantitative measure that reflects this.

      (12) Related to this, I am not sure I agree with the characterization in Figure 7 that "when confidence followed a disjunctive rule, the model failed to capture important aspects of the data. On the other hand, when confidence followed a conjunctive rule, it reproduced confidence in presence judgments but failed to capture variability in confidence ratings for absence judgments." What, quantitatively, is the basis of this claim? This applies to Figure 8, too. I am not clear how, specifically, and quantitatively, the authors are justifying their claims about model fits. I don't think the confidence asymmetry index in Figure 8 is enough to quantify the quality of the model fitting procedure.

      (13) Is there any chance the higher metacognitive efficiency for auditory trials is simply driven by differences in the d' values across the modalities? It might be good to probe this effect further.

      (14) Lastly, I think it would be interesting to look at how instructions about modality-specific attention could modulate these findings, in terms of how unimodal (unimodal visual, unimodal auditory) or bimodal attention might modulate these effects. This is an idea for future work.

    3. Reviewer #3 (Public review):

      Summary:

      This study used a pre-registered novel behavioural paradigm and computational modelling to investigate multi-sensory influences on detection and confidence. Participants performed amodal detection of auditory and visual stimuli (indicating that a stimulus was there when either an auditory stimulus or a visual stimulus or both were present), followed by amodal and unimodal confidence ratings. Detection was higher when both stimuli were present, and the presence of one modality increased the confidence in the presence of the other modality. In contrast to previous detection studies, confidence was higher for absent than for present judgements, but metacognitive efficiency was higher for present judgements. Metacognitive sensitivity was higher for bimodal stimuli, but this was not the case for metacognitive efficiency, suggesting that the sensitivity might be driven by first-order performance. The computational model showed that both detection and confidence in absence followed a disjunctive evidence integration rule, while confidence in presence followed a conjunctive integration rule.

      Strengths:

      The paper has several major strengths. Firstly, it addresses a novel research question using an innovative and well-controlled paradigm. Furthermore, the paradigm and analyses were pre-registered, and all effects that were interpreted were replicated in two independent samples. Finally, the paper uses an advanced computational model to capture counterintuitive patterns in the data.

      Weaknesses:

      The major weakness of the paper is the narrative structure. It is not always clear how the different analyses relate to the main research question. Many different effects are reported in terms of detection accuracy, bias, confidence and metacognition, as well as cross-modal and unimodal versus bimodal effects. It would help readability if the paper were streamlined in terms of the research question that is being answered, which I believe is specifically about multimodal absence judgements. Relatedly, for a reader not intimately familiar with the metacognition literature, the difference between MRatio, metacognitive sensitivity and metacognitive efficiency is not obvious. It would be good to clarify this more in the manuscript.

      In general, the conclusions drawn by the authors seem to be supported by the results. However, I was missing quantitative model comparisons between the conjunctive and the disjunctive models and an explanation of why the models systematically overestimated the confidence ratings. Furthermore, the 'perceptual multisensory interference' section reports on very interesting effects, but these are not supported by statistical tests in the main text. It would help to assess the strength of the claims if the statistical evidence in favour of these claims were presented together in the main text.

      One other concern is that in real-world multi-sensory perception, such as the mosquito example in the introduction, the auditory and visual signals have a strong natural association, which means that if you hear the auditory signal, you expect that you will see the visual signal soon and vice versa. As far as I understood, this association was not present in the current paradigm, which might influence the type of effects that one would expect to see.

    1. Reviewer #1 (Public review):

      Summary:

      Bot et al. introduce GeneSLand, a computational framework to quantify and visualize gene expression specificity across diverse transcriptomic datasets. The method leverages expression level-breadth (L-B) relationships to construct multi-level specificity landscapes and derives metrics such as lbSpec and dRate to characterize gene specificity in a threshold-independent manner. The authors showed the applicability of the approach across bulk RNA-seq, single-cell datasets, and cross-species primate brain data, showing that specificity patterns captured by this approach reflect both tissue-specific expression and evolutionary distances. Overall, the framework represents an interesting and potentially useful contribution to the analysis of gene expression specificity.

      Strengths:

      (1) Introduces an original conceptual framework based on expression level-breadth relationships to characterize gene specificity.

      (2) Provides a threshold-independent approach that could overcome some limitations of classical specificity metrics.

      (3) Demonstrates the applicability of the framework across different biological datasets.

      Weaknesses:

      (1) The method relies on predefined binning thresholds for expression levels, and the sensitivity of the derived metrics to this parameter is not fully explored.

      (2) The advantages of lbSpec relative to established metrics could be more clearly shown with some biological examples.

      (3) The robustness of the framework with noisy datasets, small sample sizes, or lower sequencing depth is not well evaluated.

    2. Reviewer #2 (Public review):

      Summary:

      Bot & Davila-Velderrain present a new method to understand expression specificity, based on an analysis of the relation between expression level and breadth for each gene. They show that the method captures biological differences across organs, diverse cell types, and specific cell subtypes, for different biological processes and across species.

      Strengths:

      This manuscript addresses an important question in an original manner, and was a pleasure to read. The authors frame the question very clearly: gene expression is a complex trait, which can be summarized in an informative manner by its specificity. The method the authors propose (which I'll call "LB" in this review) has several attractive features, summarising different specificity profiles in a more nuanced manner than the widely used tau. They show convincingly that their method captures relevant biology at different scales. I especially appreciated the comparative analyses of specificity within broad cell types and within neuronal subtypes.

      Weaknesses:

      Surprisingly, while the method works well, the authors never compare it to the state-of-the-art. Thus, comments 1 and 2 are my only "major" comments.

      (1) In the Introduction, the authors should explain which shortcomings of existing methods motivate the development of a new one.

      (2) In the Results section, the authors should compare the results of LB with other methods, at least tau and Gini (which is conceptually quite similar to LB).

      (3) It would be good to show the sensitivity of LB to different numbers of bins.

      (4) The conservation of specificity across primates was already reported in Kryuchkova-Mostacci 2016 (https://doi.org/10.1371/journal.pcbi.1005274). But also see Dunn et al 2018 (https://doi.org/10.1073/pnas.1707515115) for criticism of this type of naive pairwise comparisons.

    1. Reviewer #1 (Public review):

      Summary:

      This manuscript investigates the degradation dynamics of extracellular DNA in soils and its impact on estimates of microbial abundance and diversity. By combining a broad geographic sampling design with a primer-labeling strategy, qPCR quantification, amplicon sequencing, and PMA treatment, the authors aim to disentangle total versus intracellular DNA signals and explore sequence-specific degradation patterns. The topic is relevant, particularly given the increasing awareness of relic DNA as a confounding factor in microbial ecology. The experimental design is ambitious and potentially impactful. However, several conceptual inconsistencies, methodological ambiguities, and statistical limitations currently weaken the robustness of the conclusions. These issues need to be addressed.

      Strengths:

      The manuscript addresses a timely and important question in microbial ecology, particularly given the growing recognition that relic DNA can bias interpretations of community composition derived from amplicon sequencing. The study is ambitious in scope, incorporating a broad geographic sampling design across multiple soil types, which enhances the generalizability of the findings. The use of a controlled microcosm experiment combined with a primer-labeling strategy to track extracellular DNA dynamics is conceptually innovative and provides a structured framework to investigate degradation processes.

      In addition, the integration of multiple approaches, including qPCR for absolute quantification, high-throughput sequencing for community profiling, and PMA treatment to differentiate extracellular from intracellular DNA, represents a comprehensive attempt to disentangle complex sources of bias in soil microbiome analyses. The effort to link degradation dynamics with environmental variables and to explore sequence-level patterns further demonstrates the authors' intent to move beyond descriptive analyses toward a mechanistic understanding.

      Weaknesses:

      Several conceptual and methodological issues currently limit confidence in the study's conclusions. Key terms such as "sequence-specific degradation" are not clearly defined or supported by a mechanistic or structural hypothesis, making it difficult to interpret the biological meaning of the results. In addition, the bioinformatic workflow presents inconsistencies, particularly the use of ASVs followed by clustering at 97% similarity, which undermines the resolution required to support sequence-level inferences. Statistical analyses are also insufficiently described, including unclear definitions of "T values," a lack of detail on pairing structure, and no indication of multiple testing correction.

      Furthermore, important methodological details are missing or unclear, including primer design (e.g., GAPDH tag vs ACTF), Illumina library preparation (e.g., adapter and indexing strategy), and validation of PMA treatment efficiency. The interpretation of PMA-treated samples as representing "living communities" is likely overstated, given the known limitations of the method in soil systems. Finally, typographical errors, inconsistent terminology, and unclear phrasing throughout the manuscript reduce readability and further complicate interpretation.

    2. Reviewer #2 (Public review):

      Summary:

      This manuscript describes the results of an interesting study examining the rate of degradation of extracellular DNA in soil ecosystems using a clever experimental approach. 16S ribosomal RNA genes were amplified from soil samples, and then purified PCR amplicons, containing a 5' linker sequence on the forward primer, were introduced to soils and monitored over time using real-time quantitative PCR and NGS amplicon sequencing. The study was able to measure rates of overall extracellular DNA degradation, but also sequence-specific degradation rates. I like the idea and execution of the study, and the results are interesting. The manuscript needs some help to improve the overall readability. Please see general and editorial comments below.

      Strengths:

      Innovative experimental design that is well deployed across a large number of soil types, revealing interesting variability in extracellular DNA degradation.

      Weaknesses:

      (1) The manuscript needs another review to improve the readability of the document.

      (2) The authors have used 16S genes to look at sequence-specific degradation. But 16S rRNA genes are actually pretty well conserved, and there isn't as much genetic variation across this gene among organisms as there is for other genes. It might be more relevant to look at metagenomic DNA degradation from high AT, high GC organisms, etc. This would be more generalizable than 16S genes.

      (3) Consideration of differential cell lysis during soil DNA extraction needs to be considered as well.

      (4) It is not clear why the authors didn't put GAPDH linkers on the reverse primer as well. This would have given an easier amplicon to amplify (no degeneracies at all).

    1. Reviewer #1 (Public review):

      Summary:

      Sugarman, Vanselow et al. adapted a microCT instrument to permit imaging of an entire organism, a hatchling octopus. In the resulting 3D dataset, they segmented the major organ systems, including the vascular, respiratory, digestive, and nervous systems. The authors released the dataset through a public web interface, and present some observations about body-wide neuroanatomy.

      Strengths:

      - The dataset is of good quality and access to a whole-cephalopod anatomical resource will be useful for the scientific community.

      - The interactive web interface facilitates exploration of the dataset.

      Weaknesses:

      - The authors identify a series of bundles of nerve fibers between the suckers and the central brain and propose that these structures together constitute the chemotactile pathway, linking sensation to learning and memory. This is an over-interpretation of the available evidence. The data is not presented in a way that allows the reader to independently assess the proposed anatomical relationships: many images include near-opaque colored overlays on the fibers of interest, making it difficult to determine whether these bundles truly merge or interface. Additionally, the mesoscale resolution achieved here reveals the presence of large nerve bundles but cannot resolve the origin or synaptic relationships of the neurons in the bundles - including those from the chemotactile receptors of the suckers. There are likely multiple synapses between the periphery and the central brain, and the location and connectivity of individual neurons are not visible at this resolution. Consequently, this dataset does not demonstrate structural connectivity. Elucidating a neural circuit would require complementary approaches such as neuronal tracing or electron microscopy connectomics.

      - The language used in the manuscript is often overly complex and convoluted, making it difficult to follow the main arguments and to assess the strength of the claims. In addition, some vocabulary in the main text is overly technical (e.g. related to microCT or anatomy), making it difficult for a general biology or cephalopod audience to understand, while some neuroscience vocabulary is used imprecisely or in ways that overstate what can be concluded from anatomical data. A substantial rewrite using clearer, more cautious language is recommended. Additionally, a deeper discussion of the observed octopus arm anatomy, and how this may relate to its semi-autonomous function would make this article of greater interest to a broader audience.

    2. Reviewer #2 (Public review):

      Sugarman et al show a major advance in the volumetric imaging of the cephalopod body and nervous system, using wide field high resolution micro-CT imaging. The new detection optics are striking in their performance, and the conclusions made from the images seem well-founded. The technical advance and the conclusions both justify the reader's attention, but the authors should make the figures and the text teach the reader so that the findings are more accessible and convincing.

      The paper is now written in a style that will impress those ready to be impressed and fail to impress many of the readers, although it should.

      (1) The authors must improve the text so that it cleanly states what was known previously, and how the current results extend this. For example. putting a section in the middle of the results section (page 3) that states: "Long-range connections between sucker and brain were demonstrated by fine chemical and tactile sensing by suckers in behavioral experiments with live O. bimaculoides (Buresch et al., 2022, 2024; Sepela et al., 2025; van Giesen et al., 2020; Wells, 1978a; Wells & Young, 1969) and by loss of chemotactile learning and memory observed after ablation of the "inferior frontal system" (i.e., inferior frontal/subfrontal/buccal lobe complex) (Wells, 1978a)..." seems a bit confusing to me. Similarly, putting in a reference to optical imaging approaches for combining data sets (Preibisch et al., 2009) as only the citation does little to make the work accessible. Please expand the text so that it teaches what the authors are thinking.

      (2) The authors must improve the figures so the work is more digestible. The data is a pyramid, and the "google earth" range of magnifications and details is not clear in the figures. In short, the figure will impress those who know to be impressed and fail to impress the majority.

      (3) The videos are far more useful in this contribution that in almost any other paper. Use them more so the reader realizes how key they are. Revising them to demonstrate the amazing range of scales in the data would be wise.

      (4) The demonstration of the data visualization tool is excellent as far as it goes. Expanding the treatment of the multi-scale rendering would be wise.

      With proper expansion of the text and the figures, it will become far more obvious that this is landmark work.

    3. Reviewer #3 (Public review):

      Summary:

      Sugarman et al. present a microCT scan of a hatchling octopus from the species Octopus bimaculoides. The scan is publicly available and poses as a valuable tool for the field of cephalopod biology. Using this scan, the authors uncover two undescribed neural pathways: the intermediate longitudinal tract (iLTs) in the axial nerve cord linking the suckers to the brain, and the arm-to-arm u-tracts (AAUTs) interconnecting neighboring arms. How the eight sucker-lined octopus arms are coordinated with one another and with the brain have been long standing questions in the octopus motor control field, and the results presented here have promise for addressing these questions. However, major weaknesses addressed below limit the interpretability of the dataset.

      Strengths:

      The authors have publicly published a scan of an entire hatchling octopus, with major organs and subdivisions of the nervous system already segmented. Accessing the data is straightforward, and the authors provide adequate instructions on how to navigate the dataset.

      The authors provide validation of the AAUTs using lucifer yellow and wheat germ agglutinin. To overcome motion artifact in the hatchling dataset, the connections between the iLTs and the suckers are validated with a microCT scan of a distal section of adult arm.

      Weaknesses:

      Given the resolution of the dataset, neural connectivity is determined by texture differences alone, which can be misleading. As such, any claims of anatomical connectivity will need further validation, ideally with tracing techniques. While the authors investigated the AAUTs with other techniques, no such validation exists for the iLTs. Furthermore, the authors themselves state that as the iLTs converge with the brachial nerve, they become indistinguishable from other fibers. Any connections of the iLTs to the brain are only hypothesized, despite their claim of demonstrating a clear pathway from the suckers to the brain.

      The relevant prior research on octopus neurobiology is not well explained, making it challenging to understand the significance of the results in a broader context.

    1. Reviewer #1 (Public review):

      Mutations in CDHR1, the human gene encoding an atypical cadherin-related protein expressed in photoreceptors, are thought to cause cone-rod dystrophy (CRD). However, the pathogenesis leading do this disease is unknown. Previous work has led to the hypothesis that CDHR1 is part of a cadherin-based junction that facilitates the development of new membranous discs at the base of the photoreceptor outer segments, without which photoreceptors malfunction and ultimately degenerate. CDHR1 is hypothesized to bind to a transmembrane partner to accomplish this function, but the putative partner protein has yet to be identified.

      The manuscript by Patel et al. makes an important contribution toward improving our understanding of the cellular and molecular basis of CDHR1 associated CRD. Using gene editing, they generate a loss of function mutation in the zebrafish cdhr1a gene, an ortholog of human CDHR1, and show that this novel mutant model has a retinal dystrophy phenotype, specifically related to defective growth and organization of photoreceptor outer segments (OS) and calyceal processes (CP). This phenotype seems to be progressive with age. Importantly, Patel et al, present intriguing evidence that pcdh15b, also known for causing retinal dystrophy in previous Xenopus and zebrafish loss of function studies, is the putative cdhr1a partner protein mediating the function of the junctional complex that regulates photoreceptor OS growth and stability.

      This research is significant in that it:

      (1) provides evidence for a progressive, dystrophic photoreceptor phenotype in the cdhr1a mutant and, therefore, effectively models human CRD; and

      (2) identifies pcdh15b as the putative, and long sought after, binding partner for cdhr1a, further supporting the theory of a cadherin-based junction complex that facilitates OS disc biogenesis.

      Comments on the revised version of the manuscript:

      The authors adequately addressed previous comments related to lack of details on quantitative and statistical analyses and methods. In this regard, I believe the revised manuscript presents a stronger analysis of the data. I also appreciated the revised discussion section, which better contextualizes their new data with previous observations in different animal models.

      The authors provided additional evidence in Fig 1C-H for the co-localization of pcdh15b and actin within CPs using immunolabeling with super resolution imaging. This data firmly supports their other observations. A similar approach tends to also show co-localization of actin and cdhr1a, although the authors suggest that the pattern of expression is less overlapping, which would be expected if cdhr1a is predominately expressed in the OS membranes whereas pcdh15b is predominantly expressed in the CP membranes. In my opinion the data presented to support this separation is still not that convincing. Moreover, the authors show that both cdhr1a and pcdh15b are expressed in CPs using immuno-TEM (Fig 1I). This is a difficult question to address experimentally, and it is, of course, still plausible that pcdh15b within the CP membrane and cdhr1a within the OS membrane are interacting in trans. However, I just don't think that the data unequivocally support mutually exclusive localization of these proteins as suggested by the authors and depicted in the model in Fig 1J.

    2. Reviewer #2 (Public review):

      Summary:

      The goal of this study was to develop a model for CDHR1-based Con-rod dystrophy and study the role of this cadherin in cone photoreceptors. Using genetic manipulation, a cell binging assay, and high- resolution microscopy the authors find that like rods, cones localize CDHR1 to the lateral edge of outer segment (OS) discs and closely opposes PCDH15b which is known to localize to calyceal processes (CPs). Ectopic expression of CDHR1 and PCDH15b in K652 cells indicate these cadherins promote cell aggregation as heterophilic interactants, but not through homophilic binding. This data suggests a model where CDHR1 and PCDH15b link OS and CPs and potential stabilize cone photoreceptor structure. Mutation analysis of each cadherin results in cone structural defects at late larval stages. While pcdh15b homozygous mutants are lethal, cdhr1 mutants are viable and subsequently show photoreceptor degeneration by 3-6 months.

      Strengths:

      A major strength of this research is the development of an animal model to study the cone specific phenotypes associate with CDHR1-based CRD. The data supporting CDHR1 (OS) and PCDH15 (CP) binding is also a strength, although this interaction could be better characterized in future studies. The quality of the high-resolution imaging (at the light and EM levels) is outstanding. In general, results support the conclusions of the authors.

      Weaknesses:

      While the cellular phenotyping is strong, the functional consequences of CDHR1 disruption is not addressed. While this is not the focus of the investigation, such analysis would raise the impact of the study overall. This is particularly important given some of the small changes observed in OS and CP structure. While statistically significant, are the subtle changes biologically significant? Examples include cone OS length (Fig 4F, 6E) as well as other morphometric data (Fig 7I in particular). Related, for quantitative data and analysis throughout the manuscript, more information regarding the number of fish/eyes analyzed as well as cells per sample would provide confidence in the rigor. The authors should also not whether analysis was done in an automated and/or masked manner.

      Comments on revisions:

      Most of my concerns were addressed in this revised version.

    3. Reviewer #3 (Public review):

      Summary:

      The manuscript by Patel et al investigates the hypothesis that CDHR1a on photoreceptor outer segments is the binding partner for PCDH15 on the calyceal processes, and the absence of either adhesion molecule results in separation between the two structures, eventually leading to degeneration. PCDH15 mutations cause Usher syndrome, a disease of combined hearing and vision loss. In the ear, PCDH15 binds CDH23 to form tip links between stereocilia. The vision loss in less understood. Previous work suggested PCDH15 is localized to the calyceal processes, but the expression of CDH23 is inconsistent between species. Patel et al suggest that CDHR1a (formerly PCDH21) fulfills the role of CDH23 in the retina.

      The experiments are mainly performed using the zebrafish model system. Expression of Pcdh15b and Cdhr1a protein is shown in the photoreceptor layer through standard confocal and structured illumination microscopy. The two proteins co-IP and can induce aggregation in vitro. Loss of either Cdhr1a or Pcdh15, or both, results in degeneration of photoreceptor outer segments over time, with cones affected primarily.

      The idea of the study is logical given the photoreceptor diseases caused by mutations in either gene, the comparisons to stereocilia tip links, and the protein localization near the outer segments. The work here demonstrates that the two proteins interact in vitro and are both required for ongoing outer segment maintenance. The major novelty for this paper would be the demonstration that Pcdh15 localized to calyceal processes interacts with Cdhr1a on the outer segment, thereby connecting the two structures. Unfortunately, the data as presented are inadequate proof of this model.

      Strengths:

      The in vitro data to support the ability of of Pcdh15b and Cdhr1a to bind is well done. The use of pcdh15b and cdhr1a single and double mutants is also a strength of the study, especially being that this would be the first characterization of a zebrafish cdhr1a mutant.

      This is a large body of data.

      Weaknesses:

      (1) I have serious concerns about the quality of the imaging here. The premise that cdhr1a/pcdh15 juxtaposition is evidence for the two proteins mediating the connection between outer segments and calyceal processes requires very careful microscopy. The SIM images have two major issues - one being that the red and green channels are misaligned and the other being evidence of bleed through between the channels. This is obvious in Fig 2A but likely true across all the panels in Fig 2, and possibly applies to confocal images in Fig 1 as well. The co-labelling with actin shows very uneven, punctate staining for actin bundles.

      (2) The newly added TEM and transverse sections include colored regions that obscure the imaging.

      (3) The quantification should be done with averages from individual fish. Counting individual measurements as single data points artificially inflates the significance. Also, the cone subtypes are still lumped together for analysis despite their variable sizes.

      (4) I highlighted previously that the measurement of calyceal processes was incorrect. The redrawn labels in Fig 7 are now more accurate, although still difficult to interpret. However, the quantification in Fig 7O is exactly the same. How can that be if the measurement region is now different?

      (5) Lower magnification views would provide context for the TEM data.

      (6) The statement describing the separation between calyceal processes and the outer segment in the mutants is still not backed up by the data.

      (7) The authors state "from the fact that rod CPs are inherently much smaller than cone CPs". This is now referenced, but incorrectly. Also, the issue of pigment interference was not addressed.

      (8) The images in panels B-F of the Supplemental Figure are uncannily similar, possibly even of the same fish at different focal planes.

    1. Reviewer #1 (Public review):

      Summary:

      The manuscript by Tsukamoto et al. describes a compelling approach to understanding whether inter-species differences in social behavior might emerge from differential expression patterns of the oxytocin receptor (Oxtr) in the brain. To this end, they genetically engineer BAC transgenic mouse lines with insertions of a large construct incorporating prairie vole Oxtr gene and surrounding regulatory elements. They name these lines Koi lines. They first evaluate if prairie vole-like Oxtr expression is reproduced in the Koi mouse lines, and they find heterogenous patterns across different lines that do not depend on the number of insertions. While they found that Koi mice can reproduce vole-like expression in PFC, NAc, and BLA, the reproduction was never complete: one Koi line had NAc and mPFC expression, another had BLA expression, etc. They confirmed major expression patterns across 3 methods: crossing with LacZ reporter line, in situ hybridization, and ligand binding (autoradiography). To determine the expression pattern of the BAC insert but not endogenous Oxtr, the authors generated new mouse lines by crossing Koi lines with Oxtr -/- line. Importantly, they found that Oxtr expression pattern in the mammary gland was similar across all lines, and wild-type mice.

      The authors used Koi:Oxtr-/- lines to test social behavior, specifically partner preference ( a behavior specific to prairie voles) and maternal behavior. They find that different Koi lines showed different changes in these behaviors compared to wild-type mice. Moreover, while some lines showed changes in partner preference, others seemed to show changes in maternal behavior. For one of the lines (Koi4), the partner preference and the maternal behavior were incongruent.

      The manuscript then hypothesizes that the Oxtr gene is positioned in different 3D chromatin structures across species and across tissues, leading to more rigid expression in the mammary glands, but more flexible expression patterns in the brain.

      Strengths:

      This study has major implications in the field of oxytocin research, and more broadly in the field of neuromodulation. It is novel, bold, and rigorous.

      Weaknesses:

      (1) The expression in the brain and mammary gland (Figure 2) was not quantified, preventing a more objective conclusion that the brain has flexible expression and mammary gland expression is rigid.

      (2) In Figure 7, a similar heatmap for the mammary gland is missing.

      (3) Partner preference in males was not tested.

      (4) It is unclear if in the behavioral testing the stimulus animals were the same genotype as the focal female or were wild-types. This could have an impact on the behavioral outcome.

    2. Reviewer #2 (Public review):

      Summary:

      This is a bold and important study and addresses an important question in the field: how species-specific variation in brain oxytocin receptor expression relates to differences in social behavior.

      Tsukamoto et al. generated eight independent transgenic mouse lines (Koi lines) carrying a bacterial artificial chromosome (BAC) encompassing the prairie vole Oxtr locus along with flanking intergenic regions, with the goal of probing the behavioral consequences of species-specific variation in brain Oxtr expression. Across these "volized" lines, the authors claim conserved Oxtr expression in the mammary gland but strikingly divergent patterns of brain expression, none of which fully recapitulate endogenous prairie vole Oxtr distribution, and instead exhibit expression patterns that diverge from both mouse and prairie vole brain Oxtr distribution. Nevertheless, some lines exhibit partial overlap with vole Oxtr expression pattern reported in the literature within specific brain regions, and one line displays partner preference behavior reminiscent of prairie voles. The authors further report line-dependent differences in maternal pup retrieval and crouching behaviors, which they interpret as evidence that variation in brain Oxtr expression can drive variation in social behaviors. Together with analyses of topologically associating domain (TAD) architecture, the authors conclude that brain, but not peripheral- Oxtr expression, is shaped by distal regulatory elements beyond the BAC insert, and propose that such regulatory flexibility underlies evolutionary diversification of social behavior.

      Strengths:

      A particular strength of the study is the generation of multiple independent transgenic lines, which provides a valuable resource for probing regulatory influences on Oxtr expression.

      Weaknesses:

      While the study addresses an important question, I have several methodological and conceptual concerns regarding the study in its current form. Some aspects of the study fall outside my primary area of expertise, and I am therefore not in a position to fully evaluate the technical difficulty or rigor of those components, or to judge whether my suggestions would be feasible to implement. I defer to reviewers with relevant expertise for a more detailed assessment of these aspects.

      (1) Each independent Koi line exhibits a distinct brain expression pattern that differs from both wild-type mouse and prairie vole Oxtr expression, complicating the interpretation of the results. The manuscript does not include a direct comparison of brain Oxtr expression patterns in these transgenic lines with those of prairie voles. Instead, expression similarity is inferred primarily from regional localization and compared indirectly with prior literature (Figures 2-5). For those lines that show partial resemblance to prairie vole Oxtr expression patterns, the authors do not assess whether Oxtr-expressing neurons share comparable anatomical projections or transcriptomic identity with prairie vole Oxtr-expressing neurons. Quantification of expression remains largely descriptive, illustrating expression patterns (Figure 2), OXTR protein distribution (Figure 3; images are difficult to evaluate due to low contrast), or Oxtr mRNA levels across selected brain regions in Koi lines, wild-type mice, and mOxtr-/- mice (Figures 4-5), without directly testing similarity to prairie vole expression. In addition, whole-brain expression data are lacking, with analyses restricted to selected sections. While such analyses may be beyond the scope of the present study, these limitations nonetheless complicate interpretation of the central question - namely, whether the observed behavioral phenotypes arise from vole-like Oxtr circuits rather than from distinct, line-specific expression configurations.

      (2) The authors state that Oxtr expression in the mammary gland is similar across all Koi lines and the mOxtr-IRES-Cre knock-in line. However, the images presented in Figure 2 appear to show differences in anatomical detail across lines, and no quantitative analysis is provided to support the claim of equivalence.

      (3) The conclusion that integration site rather than copy number determines the observed BAC transgene expression patterns (Lines 202-203) is not fully supported by the data. First, the authors did not compare multiple copy numbers at the same genomic insertion site, making it impossible to disentangle copy-number effects from position effects. Second, BAC copy number does not necessarily scale linearly with expression; higher copy numbers can have a repressive effect on gene expression (Garrick et al, Nat Genet, 1998).

      (4) While I am not an expert in TAD analysis, the observed differences in 3D architecture around Oxtr are consistent with a role for long-range regulatory interactions. However, these analyses appear largely descriptive and correlative, and establishing a causal contribution of 3D chromatin organization to Oxtr regulation by distal elements would likely require direct perturbation of TAD boundaries or looping interactions. I recognize that such experiments may be beyond the scope of the present study, but clarifying this limitation in the interpretation would be helpful.

    1. Reviewer #1 (Public review):

      Summary:

      The authors describe co-regulated gene modules underlying stage differentiation in Leishmania donovani through a system-level analysis of multiple molecular layers. Using amastigotes isolated from infected hamster spleens and corresponding culture-derived promastigotes, they analyzed genomic variation, transcript abundance, protein levels, phosphorylation states, and metabolite profiles. By combining these, the study identified potential regulatory mechanisms associated with parasite differentiation and generated hypotheses regarding how gene expression is coordinated across different levels.

      Strengths:

      A major strength of the study is the breadth of the dataset generated. The integration provides an unusually comprehensive view of molecular changes associated with Leishmania differentiation in vitro. Such multi-layer datasets involving bona fide vertebrate host stages remain relatively rare in parasitology and will likely become a valuable resource for the molecular parasitology community. In addition, the use of amastigotes isolated from infected hamsters rather than relying on axenic models provided a biologically relevant framework for the analyses.

      The revised manuscript improved several aspects of the original. The RNA-seq analysis is described with a clearer pipeline, and several claims regarding causal regulatory feedback associations have been appropriately toned down. Among the observations reported, the association between parasite differentiation and proteasome-mediated protein degradation is particularly remarkable. The combination of quantitative proteomics with pharmacological inhibition of the proteasome with lactacystin provides support for a role for protein turnover in developmental transitions and paves the way for future mechanistic studies.

      Weaknesses:

      Most regulatory interpretations remain largely inferential or indirect. The integration identifies correlations between different levels, but direct functional validation is limited/absent. Many of the descriptions should not be interpreted as validated. As highlighted by the authors in this revised version, the mechanistic studies will be part of future work and are beyond the scope of the current work. Of note, the attempt to confirm lactacystin-induced inhibition of proteasomal activity via anti-polyUb immunoblotting did not demonstrate the expected outcome of increase in overall poly-ubiquitylation.

      Comments on revised version:

      The authors have appropriately addressed my comments and questions from the initial review process. My remaining concern relates to the lack of evidence to confirm proteasomal inhibition by lactacystin in both promastigotes and amastigotes. The immunoblotting experiment newly presented does not reveal a clear increase in the levels of poly-ubiquitylated proteins in treated parasites. In fact, poly-Ub levels were lower at both the 4h and 18h timepoints of treatment. If alternative antibodies or additional immunoblots are not available, the manuscript would benefit from an expanded discussion of this observation and potential explanations. In particular, the interpretation that lactacystin stabilizes ama- and pro-specific degradation would be greatly strengthened by such validation.

    2. Reviewer #2 (Public review):

      Pescher and colleagues present a revised manuscript detailing the multi-omic characterisation of Leishmania donovani amastigote to promastigote differentiation and integration of this data. The molecular pathways that regulate Leishmania life-stage transitions are still poorly understood, with many approaches exploring single proteins/RNAs etc in a reductionist manner. This paper takes a systems-scale approach and does a good job of integrating the disparate -omics datasets to generate hypotheses about the intersections of regulatory proteins that are associated with life-cycle progression. The differentiation step studied is from amastigote to promastigote using hamster-derived amastigotes which is a major strength. The use of hamsters permits the extraction of parasites that are host adapted and represent "normal", host-adapted Leishmania ploidy; the promastigote experiments are performed at a low passage number. Therefore, this is a strength or the work as it reduces the interference from the biological plasticity of Leishmania when it is cultured outside the host for prolonged periods. The multi-omics datasets presented are robust in their acquisition and analysis and will form an excellent resource for researchers studying the molecular events (particularly proteasomal protein degradation, and phosphorylation) during life-stage progression.

      General comments on the revisions:

      My view is that the authors have made significant, satisfactory changes that address the comments and queries I made on the original manuscript (Review Commons).

      There are two areas where the authors had to make major changes/justifications where further comment is merited, these were:

      RNA-seq.<br /> The most significant issue was the originally underpowered RNA-seq which had only two replicates. This has been repeated with four replicates now. This has not led to changes in the interpretation of the data between the original study and this one. One comment that the authors make in the response to this was : "Given the robustness of the stage-specific transcriptome, and the legal constrains associated with the use of animals, we chose to limit the number of replicates to the necessary". Ensuring that animal experiments are properly powered and that maximum robustness of the data from the minimum sample size is an important part of experimental design for ethical use of animal models. Essentially the replication here could have been avoided if the original study had used 1 more animal. However, the new version of RNA-seq brings appropriate confidence to the interpretation of the data.

      Phosphoproteomics.<br /> The authors provide a robust justification of their strategy for the phosphoproteomics and highlight the inclusion criteria for phosphosites: "Phosphosites were only considered if detected with high confidence (identification FDR<1%) and high localisation confidence (localisation probability >0.75) in at least one replicate". The way missing values were dealt with is explained "For statistical analyses, missing values within a given condition were imputed with a well-established algorithm (MLE) only when at least one observed value was present in that condition." This fills in some of the gaps I was missing from the original manuscript, and I am satisfied that the data analysis is entirely appropriate for a discovery/system -based approach such as this one. The authors also edit the manuscript to reflect that "occupancy" or "stoichiometry" might not be the best description of what they were presenting and switched to the terminology of "normalised phosphorylation level" - I think this is an appropriate response.

      Overall, in the absence of follow up experiments on specific individual examples, some of the claims in the original submission were toned down and reflect a more neutral description of the data now. Significantly, the data still underpin a key role for regulation of the ribosome between the amastigote and promastigote stages (and during the differentiation process). The recursive and reciprocal links between the phosphorylation and ubiquitination systems are interesting and present many opportunities for future investigation.

    3. Reviewer #3 (Public review):

      Summary:

      The authors proposed to use 5-layer systems level analysis (genomics, transcriptomics, proteomics / protein degradation, metabolomics, phosphoproteomics) to uncover how post-transcriptional mechanisms regulate stage differentiation in Leishmania donovani.<br /> This enabled the identification of several potential regulatory networks, including the regulation of stage-specific gene clusters by RNA stabilisation or decay, proteasomal degradation and protein phosphorylation.<br /> In the new version of this manuscript, the authors have addressed all questions raised by the reviewers.

      Strengths:

      Although some observations in this study have already been described in the literature, the integrated analysis applied here provides a novel view on how different levels of post-transcriptional networks regulate Leishmania differentiation. This "5-layer system" represents the first analysis of this depth in kinetoplastid parasites.<br /> The revised version with an increased sample number for the RNA-seq now made the authors assumptions adequate to their obtained data.<br /> The use of a proteasomal inhibitor adds an interesting insight in how protein degradation is involved in the parasite differentiation, confirming previous observations in the literature, and help to explain the discrepancies between mRNA and protein expression in the different stages.

      Weaknesses:

      While this work provides an impressive and foundational dataset, it opens the door for future research to rigorously validate these initial findings and conclusions.

      Significance and Impact in the field.

      The different datasets generated in this study will be of great interest to the parasitology community, either to be used for hypothesis generation, to validate data from other sources, etc.

      The multi-layered analysis performed here identified a series of potential feedback loops and regulatory networks to be further explored in organisms that lack transcriptional control.

    1. Reviewer #1 (Public review):

      Summary:

      The authors aim to study mutational paths connecting WW domains with different binding specificities. Their approach combines an unsupervised sequence generative model based on RBMs with a path-sampling algorithm. The key result is that most intermediate sequences along the designed transition paths retain measurable binding activity in wet-lab assays, whereas paths containing the same mutations introduced in a randomized order are largely non-functional. This difference is attributed to epistatic interactions captured by the RBM model.

      Strengths:

      Exploring mutational paths in high-dimensional protein sequence space is a challenging problem. The computational framework used here is state-of-the-art and is strengthened by systematic experimental characterization of binding activity. The study is comprehensive in scope, including multiple transition paths both within and across WW specificity classes, and the integration of modeling with high-throughput experimental validation is a clear strength.

      Weaknesses:

      A major concern is whether the stated goal of specificity switching is fully achieved. Along the sampled transition paths, most intermediate variants appear to retain specificity close to either the initial or the final class, rather than exhibiting gradually shifting specificity. For example, in Figure 4G (Class I to Class II/III), binding appears largely binary, with intermediates behaving similarly to one of the endpoints. A similar pattern is observed in Figure 3H for the Class I to Class IV transition, where binding responses are close to 0 or 1. In this sense, the specificity-switching objective is only partially realized by assigning two endpoints with different specificity. This raises a broader conceptual question: is it possible that different WW specificities evolved from a common ancestor without passing through intermediates that exhibit mixed or intermediate specificity? If so, then inferring specificity-switching pathways purely from extant natural sequences may be fundamentally challenging.

    2. Reviewer #2 (Public review):

      This is an extremely important work that shows how one can use generative models to construct specificity-switching mutational paths in complex fitness landscapes. The experimental evidence is very clear, and the theoretical tools are innovative.

      The work will likely have a deep impact on future research aimed at understanding how evolution navigates fitness landscapes as well as reconstructing ancestral sequences.

      The manuscript is extremely clear and well written, the experimental evidence is strong, and the methods are clearly described, so I do not have major issues to raise. A few minor issues are listed below.

      (1) I consider the WW domain as an 'easy' case from the point of view of generative modelling. The domain is rather short, epistatic effects are not very strong (e.g. Boltzmann learning usually converges very quickly to a very paramagnetic state), and the resulting models are well interpretable (e.g. the hidden units of the RBM correlate well with subclasses).

      This is not always (not often?) the case, however. In more complex proteins, the learning procedures can be slower and the resulting models less interpretable. Just for completeness, perhaps the authors could comment on the generality of the results and what they would expect for other systems based on their experience.

      (2) In Section 3.3, the authors say that direct paths connecting Class I and Class IV behave similarly to indirect paths, despite having lower scores according to the RBM. How generic is this? Does it also happen for other classes? This might be an important point to address, as direct paths are easier to sample.

      (3) The path shown in Figure 4 goes through a region of non-functionality around sequences 18-19. It seems that the sample path is basically exploring the functional regions for Class I and Class II/III separately, trying to approach the other class, but then it can't really make the switch.

      By contrast, the path going from Class I to Class IV seems able to perform the functional switch in a single step (20-21) without losing too much of the function.

      Perhaps the authors could better comment on this? Is this a limitation of the sampling method, or a fundamental biological fact?

      (4) On page 12, it is stated that the temperature was chosen to 1/3 to maximize the score. This is important and should be mentioned earlier (I didn't notice it until that point).

      (5) On page 13, it is stated that: "However, the scores of the ancestral sequences along the phylogenetic pathways assigned by the RBM are significantly lower than the ones of the RBM-designed sequences. This result is expected as ASR reconstruction does not take into account epistasis, differently from RBM, and we expect ASR sequences to generally be of lesser quality."

      I was very surprised by this result. My own experience with ASR shows that, on the contrary, sequences found by ASR (via maximum likelihood) tend to have high scores in the (R)BM, and tend to be more stable than extant sequences. I attribute this to the fact that ASR typically finds a "consensus" sequence that maximizes the contribution to the score coming from the fields (the profile), which is typically dominant over the epistatic signal, resulting in a bigger score. Maybe the authors did not use maximum likelihood in the ASR? Some clarification might be useful here.

    1. Reviewer #1 (Public review):

      Summary:

      In this paper, the authors investigate the effects of Miro1 on VSMC biology after injury. Using conditional knockout animals, they provide the important observation that Miro1 is required for neointima formation. They also confirm that Miro1 is expressed in human coronary arteries. Specifically, in conditions of coronary diseases, it is localized in both media and neointima and, in atherosclerotic plaque, Miro1 is expressed in proliferating cells.

      However, the role of Miro1 in VSMC in CV diseases is poorly studied and the data available are limited; therefore, the authors decided to deepen this aspect. The evidence that Miro-/- VSMCs show impaired proliferation and an arrest in S phase is solid and further sustained by restoring Miro1 to control levels, normalizing proliferation. Miro1 also affects mitochondrial distribution, which is strikingly changed after Miro1 deletion. Both effects are associated with impaired energy metabolism due to the ability of Miro1 to participate in MICOS/MIB complex assembly, influencing mitochondrial cristae folding. Interestingly, the authors also show the interaction of Miro1 with NDUFA9, globally affecting super complex 2 assembly and complex I activity.<br /> Finally, these important findings also apply to human cells and can be partially replicated using a pharmacological approach, proposing Miro1 as a target for vasoproliferative diseases.

      Comments on revisions:

      The authors have adequately addressed all the concerns raised by the reviewers, and the manuscript has been substantially improved

    2. Reviewer #2 (Public review):

      Summary:

      This study identifies the outer‑mitochondrial GTPase MIRO1 as a central regulator of vascular smooth muscle cell (VSMC) proliferation and neointima formation after carotid injury in vivo and PDGF-stimulation ex vivo. Using smooth muscle-specific knockout male mice, complementary in vitro murine and human VSMC cell models, and analyses of mitochondrial positioning, cristae architecture and respirometry, the authors provide solid evidence that MIRO1 couples mitochondrial motility with ATP production to meet the energetic demands of the G1/S cell cycle transition. However, a component of the metabolic analyses are suboptimal and would benefit from more robust methodologies. The work is valuable because it links mitochondrial dynamics to vascular remodelling and suggests MIRO1 as a therapeutic target for vasoproliferative diseases, although whether pharmacological targeting of MIRO1 in vivo can effectively reduce neointima after carotid injury has not been explored. This paper will be of interest to those working on VSMCs and mitochondrial biology.

      Strengths:

      The strength of the study lies in its comprehensive approach assessing the role of MIRO1 in VSMC proliferation in vivo, ex vivo and importantly in human cells. The subject provides mechanistic links between MIRO1-mediated regulation of mitochondrial mobility and optimal respiratory chain function to cell cycle progression and proliferation. Finally, the findings are potentially clinically relevant given the presence of MIRO1 in human atherosclerotic plaques and the available small molecule MIRO1.

      Weaknesses:

      (1) High-resolution respirometry (Oroboros) to determine mitochondrial ETC activity in permeabilized VSMCs would be informative.

      (2) Therapeutic targeting of MIRO1 failed to prevent neointima formation, however, the technical difficulties of such an experiment is appreciated.

      Comments on revisions:

      The authors have addressed the concerns I previously raised.

    3. Reviewer #3 (Public review):

      Summary:

      This study addresses the role of MIRO1 in vascular smooth muscle cell proliferation, proposing a link between MIRO1 loss and altered growth due to disrupted mitochondrial dynamics and function. While the findings are useful for understanding the importance of mitochondrial positioning and function in this specific cell type, the main bioenergetic and mechanistic claims are not strongly supported.

      Strengths:

      - This study focuses on an important regulatory protein, MIRO1, and its role in vascular smooth muscle cell (VSMC) proliferation, a relatively underexplored context.<br /> - This study explores the link between smooth muscle cell growth, mitochondrial dynamics, and bioenergetics, which is a significant area for both basic and translational biology.<br /> - The use of both in vivo and in vitro systems provides a useful experimental framework to interrogate MIRO1 function in this context.

      Weaknesses:

      - Some key bioenergetic aspects may require further investigation.

      Comments on revisions:

      The authors have adequately addressed most of the concerns I raised. I would suggest adding some of the justifications provided to the reviewers to the manuscript to further clarify and aid interpretation of the data, especially for the bioenergetic part (e.g., the proposed interaction with CI components, which might otherwise appear implausible to readers).

    1. Reviewer #1 (Public review):

      Summary:

      This article provides new insights into the organisational changes of the X4-tropic HIV-1 co-receptor CXCR4 upon binding of the viral receptor-binding protein X4-gp120, either in its soluble form or when displayed as Env on virus-like particles (VLPs). The study employs single-particle tracking total internal reflection fluorescence (SPT-TIRF) microscopy to quantify the dynamics and clustering of CXCR4 on CD4+ T cells. The data show that CXCR4 clusters in the presence of X4-gp120 and VLPs, a phenomenon that is also observed for the primary HIV-1 receptor CD4. The authors also show that a WHIM mutant of CXCR4 (CXCR4-R334X) that does not cluster in the presence of its natural ligand, CXCL12, clusters in the presence of X4-gp120 and VLPs.

      Major strengths:

      The data are well presented, discussed, and supported by solid evidence. Literature is cited appropriately.

      Major weaknesses:

      The authors have addressed my concerns in the revised manuscript.

      Significance:

      In summary, the work is presented in a clear fashion, and the main findings are properly highlighted. The paper will be of interest to the broader virology community as well as to researchers studying cell receptor clustering. The findings are not entirely surprising because it has been shown previously that the binding of Env to CD4 mediates CD4 clustering, which would also suggest clustering of the co-receptor. Nonetheless, the paper provides strong evidence that CXCR4 clusters and changes its dynamics in the presence of CD4 and X4-gp120. Moreover, the evidence that X4-gp120 clusters CXCR4-R334X is of high interest as it suggests a different binding mechanism for X4-gp120 from that of the natural ligand CXCL12, raising questions for further research.

    2. Reviewer #2 (Public review):

      Summary:

      The author investigates how the HIV-1 Env glycoprotein modulates the nanoscale organisation and dynamics of the CXCR4 co-receptor on CD4⁺ T cells. The author demonstrates that HIV-1 Env induces CXCR4 clustering distinct from that triggered by its natural ligand (CXCL12), implicating spatial receptor organization as a determinant of infection. This study investigates how HIV-1 Env (specifically X4-tropic gp120) alters the membrane organization and dynamics of the chemokine receptor CXCR4 and its WHIM-associated mutant, CXCR4R334X, in a CD4-dependent manner. Using single-particle tracking total internal reflection fluorescence microscopy (SPT-TIRF-M), the authors demonstrate that both soluble gp120 and virus-like particles (VLPs) displaying gp120 induce CXCR4 nanoclustering, reduce receptor diffusivity, and promote immobile nanoclusters of CXCR4 at the membrane of Jurkat T cells and primary CD4⁺ T cell blasts. The work offers new insights into the spatial organisation of receptors during HIV-1 entry and infection. The manuscript is well-written, and the findings are significant.

      Significance:

      Nature and significance of the advance:<br /> This work marks a conceptual and mechanistic breakthrough in understanding HIV-1 entry. It goes beyond the static view of Env-co-receptor interaction to show that nanoscale reorganization of CXCR4, distinct from chemokine-induced clustering, occurs during HIV-1 Env engagement and may be essential for infection.

      Context within existing literature. Previous studies established Env-induced CD4 clustering (Yin et al., 2020) and chemokine-induced CXCR4 nanocluster formation (Martínez-Muñoz et al., 2018), but the exact nanoscale rearrangement of CXCR4 in the context of HIV-1 Env and physiological Env densities remains unquantified. This study addresses this gap using SPT-TIRF, STED microscopy, and functional assays.

      Audience and influence: The findings will be of interest to researchers in HIV virology, membrane receptor biology, viral entry mechanisms, and therapeutic target development. The receptor-clustering aspect could also influence broader fields of study, such as GPCR organization and immune receptor signalling.

      Reviewer expertise: I can evaluate HIV-1 entry mechanisms, viral glycoprotein-host-host-host receptor interactions, single-molecule fluorescence microscopy, and membrane protein dynamics. I am less equipped to evaluate the deep structural modelling aspects, though the in silico AlphaFold results are straightforward to interpret in context.

    3. Reviewer #3 (Public review):

      Summary:

      The authors investigate how HIV-1 Env engagement affects the nanoscale organization and dynamics of the CXCR4 coreceptor on target cells. Using single-particle tracking TIRF microscopy, they analyze CXCR4 distribution following exposure to gp120 or HIV virus-like particles, including both wild-type CXCR4 and the WHIM-associated CXCR4.R334X variant. The study further examines the role of CD4-CXCR4 heterodimerization and contrasts Env-induced receptor organization with that elicited by the natural ligand CXCL12.

      Evaluation:

      A major strength of this work is the integration of high-resolution imaging with functional and comparative analyses that distinguish Env-induced CXCR4 clustering from chemokine-driven effects. The experiments are clearly described, include appropriate controls, and are supported by quantitative analyses that are consistent across experiments. The revised manuscript appears to have addressed many of the technical and interpretive issues raised during initial review, improving clarity around data analysis and strengthening confidence in the conclusions.

      I am not an expert in TIRF microscopy or single-molecule tracking and defer to other reviewers regarding limits of imaging and tracking methods. However, I did not identify major inconsistencies between the biological data presented and the conclusions drawn.

      The authors data support the conclusion that HIV-1 Env, delivered as gp120 or virus-like particles, promotes CD4-dependent nanoscale clustering of CXCR4, including the CXCR4.R334X variant associated with WHIM syndrome, in a manner distinct from CXCL12-induced receptor organization. The authors are generally careful to frame their conclusions in proportion to the evidence and avoid overinterpretation.

      Overall, this study builds on prior work on CXCR4 distribution and HIV entry by providing higher-resolution insight into receptor nanoclustering and its modulation by Env. The findings provide a mechanistic refinement rather than a conceptual paradigm shift but is a valuable dataset useful to researchers studying HIV entry, coreceptor biology, and membrane receptor organization.

      Reviewer expertise: HIV-1 Envelope glycoproteins and entry assays, HIV broadly neutralizing antibodies, HIV vaccine design

      Comments on revised version:

      This reviewer has no further recommendations and thanks the authors for clarifying that the Env content in gp120-VLPs was lower than the NL4-3deltaIN particles but that the percentage of mature particles in the gp120-VLPs was higher.

    4. Reviewer #4 (Public review):

      Summary:

      The authors investigate the impact of surface bound HIV gp120 and VLPs on CXCR4 dynamics in Jurkat T cells expressing WT or WHIM syndrome mutated CXCR4, which has a defective response to CXCL12. Jurkat cells were transfected with CXCR4-AcGFP. Images were acquired and a single particle tracking routine was applied to generate information about nanoclustering and diffusion, and FRET was used to investigate CD4-CXCR4 proximity. They compare effects of soluble gp120 to immature and mature VLPs, which include varying degrees of gp120 clustering. They find that solid phase gp120 or VLP can increase CXCR4 clustering size and decrease diffusion in Jurkat cells. Surprisingly, VLP lacking gp120 could increase CXCR4 clustering and speed, which is paradoxical as there were no known ligands on the VLPs, but they likely carry many cellular proteins with potential interactions. The impact of CXCL12 and gp120 binding to CXCR4 was different in terms of clustering and receptor down-regulation.

      Significance:

      The strengths are that it's an important question and the reagents are well prepared and characterised. They are detecting quantitative effects that will likely be reproducible. The information generated is potentially useful for those studying HIV infection processes and strategies to prevent infection.

      The major weakness is that the conditions for the SPT experiments are not ideal in that the density of particles is too high for SPT and the single molecule basis for assessing nanoclusters is not clear. This means that the data is getting at complex molecules phenomena and less likely be generating pure single molecules measurements.

      Comments on revised version:

      The authors should make the tracking data available and this will aid others in following up on it.

    1. Reviewer #1 (Public review):

      Summary:

      Yang et al. investigate the central pathways underlying nociceptive responses in Drosophila. The authors employ a behavioral platform they previously developed, which uses laser stimulation to deliver nociceptive stimuli while enabling automated tracking of fly behavior. By combining large-scale behavioral screening with circuit tracing approaches, the study identifies a set of dopaminergic neurons (DANs) and mushroom body output neurons (MBONs) that participate in the transmission of nociceptive signals. Nociceptive escape behavior has generally been regarded as largely reflexive. It is therefore intriguing that the mushroom body, a neural circuit classically associated with learning, is involved in this process. In particular, the recruitment of dopaminergic neurons typically linked to both appetitive and aversive valence is noteworthy and raises interesting questions about how nociceptive information is integrated within the circuits. Overall, the findings are conceptually interesting and may provide useful insights into dissecting the nociceptive escape behavior.

      Strengths:

      The behavioral assay used in this study is high-throughput and appears reproducible. The authors screened a large number of genetic lines, and the behavioral responses were carefully quantified. The trans-Tango tracing results are consistent with the behavioral screening results. And the observation that circuits typically associated with learned behaviors (mushroom body) contribute to a nociceptive escape response, generally considered a hard-wired reflex, is conceptually interesting.

      Weaknesses:

      The use of laser stimulation to induce nociceptive stimuli makes the paradigm difficult to combine with calcium imaging or optogenetic manipulations. As a result, the study lacks functional and temporally precise tests of the proposed circuit mechanisms.

      Several aspects of the Methods section require additional detail:

      (1) How was the behavioral potency level calculated? Since some of the split-GAL4 lines label multiple neurons, and the individual neurons may innervate multiple compartments. It is therefore unclear how a single "behavioral potency level" value was assigned to a compartment.

      (2) Additional details are needed on how velocity was calculated, particularly the time window used for the analysis. In the Kir-silenced condition, the variation in velocity appears smaller than in the control group, which would benefit from clarification.

      (3) Connectome analysis. More details are needed regarding how DAN-MBON connectivity was quantified in Figure 5. For example, were only DAN → MBON connections considered, or were bidirectional connections included?

    2. Reviewer #2 (Public review):

      Summary:

      This manuscript aims to identify the central nervous system circuitry, specifically within the mushroom body (MB), that mediates nociception-induced escape behavior in adult Drosophila. The authors provide a detailed map of the neural pathways underlying defensive actions in flies. Overall, the study is technically solid, clearly written, and conceptually<br /> interesting.

      Strengths:

      The authors present compelling evidence by integrating multiple complementary approaches. The ALTOMS laser system enables precise, automated measurement of escape latency, allowing for high-throughput and objective behavioral quantification. Neuronal silencing experiments assess functional necessity and demonstrate that specific dopaminergic neurons (DANs) and mushroom body output neurons (MBONs) are critical for escape behavior. Trans-Tango anatomical mapping further supports the proposed circuit by identifying putative synaptic connections consistent with the authors' model.

      Weaknesses:

      A central limitation of the study is its heavy reliance on chronic Kir2.1-mediated neuronal silencing as the primary functional manipulation. This approach raises concerns about potential developmental compensation and indirect network effects. The authors could strengthen their conclusions by incorporating more temporally precise, reversible silencing strategies, such as recently developed optogenetic- or chemogenetic-based methods.

      In addition, the study relies on the trans-Tango system to identify downstream synaptic partners, which has several inherent limitations. Trans-Tango detects only chemical synapses and cannot reveal electrical coupling. The system may also yield false negatives due to reporter sensitivity, and anatomical labeling alone does not establish functional connectivity in the context of the specific behavior examined.

    3. Reviewer #3 (Public review):

      Summary:

      Yang et al sought to describe central brain circuits that underlie nociception-induced escape in Drosophila using a combination of neurogenetic tools to silence subsets of neurons and to trace their postsynaptic connections. They present interesting data that identify subsets of DANs and MBONs that are required for a jumping response to an aversive stimulus, but not for baseline locomotion, and present a model for linking peripheral nociception to MB- dependent escape behavior.

      Strengths:

      They use an innovative avoidance assay to elicit a robust behavioral response and use trans-tango to identify downstream targets of painless and TrpA1-expressing neurons.

      Weaknesses:

      This reviewer's enthusiasm for the study is lowered due to an incomplete description of methods, methods section, appropriate behavioral controls, immunohistochemistry data, and a complete behavioral screen of DANs and MBONs. Below I list my suggestions, questions, and criticisms.

      (1) Behavioral studies are interesting. The assay is simple, yet innovative. However, there is no power analysis or explanation of how sample sizes were selected. I commend the authors for including a positive control; however, although UAS-controls are present, there are no GAL4-controls included in the study. Given that many of the lines used for behavior are split-GAL4's, it's unclear if the additional transgene influenced behavior. This should be addressed.

      (2) It is also not clear from the methods how the behavior was run and how it was analyzed. Was baseline locomotion recorded before the laser was introduced? I assume this is the case; however, more importantly, how long after the flies were introduced to the arena were baseline recordings collected? How much data was used to calculate velocity? Were the experimenters blind to the conditions they were assessing? More detail in the methods is essential for understanding the data and providing an opportunity to replicate results.

      (3) At times, the authors describe "locomotion velocity" as baseline locomotion, but other times, they describe it as escape velocity (see reference to Figure 1F). The authors should clarify whether escape velocity was calculated.

      (4) Immunohistochemistry: There is a lack of detail regarding a description of the flies used for trans-tango experiments. How many brains were evaluated? Was there variability across brains? Were the flies males or females? This is an important detail as sex could impact the level of expression of the ligand and therefore the results. It is also not clear at what age these flies were dissected and at what temperature they were raised. This can also significantly affect the post-synaptic signal that is measured (see Talay et al 2017).

      (5) Figure 2 shows the overlap of trans-tango and dopamine signal, but there is no signal for the GAL4-line to evaluate the overlap between presynaptic signal and postsynaptic signal. This expression is an important consideration and should be included.

      (6) Expression of the GAL4 lines in the central brain is also important to show because the authors suggest that, because painless and TrpA1 expression does not fully overlap in peripheral tissue, it might converge in the central brain. Does that central brain expression of painless and TrpA1 overlap?

      (7) Further, although the authors clearly label the different dopamine subsets (PPL1, PAL, and PAM), some orientation with regard to where these images were taken would be helpful. I recommend a stack showing the location of the cell bodies and then a zoom in to see the overlap.

      (8) Behavioral data for DANs and MBONSs: I recommend that the authors discuss the results by the neurons that are targeted and not the driver lines. For instance, the authors suggest they get the largest effects for 433B, 434B, and 298B, but all of these lines target very similar neuronal subsets y4>y1y2. It's also not clear why different split-lines were selected. Several of the lines have overlapping expression, and other compartments were not included at all. In order to determine which MBONs and DANs are required for escape behavior, all MBONs and DANs should be included. See Aso et al for a list of recommended lines for behavior based on specificity and intensity.

      (9) Based on trans-tango data, it is not clear why the authors focus exclusively on PPL1 and PAM when PAL, PPM1, 2, 3, and PPL2 also overlap with painless and trpA1. Certainly, PPL1 and PAM DANs innervate the MB, but so do some of the other DANs identified.

      (10) For Figure 5, the titles of A and B are DANs and MBONs, but it is really showing the average jumping response when neurons that innervate MB compartments are silenced. Many DANs and MBONs innervate multiple compartments (PPL1-a`2a2, etc.); thus, if the intention is to identify neural circuits that modulate escape response, the analysis should focus on the neurons, not the MB compartments. I recommend reorganizing this data so it highlights the DANs and MBONs instead of the MB compartments. I also recommend showing error bars for averages and/or raw data and organizing the x-axes so DAN and MBON compartments can be easily compared.

      (11) Lastly, nuance is lost here in the Behavioral Potency Level, given that some of these compartments are over-represented and not adjusted for the strength of expression in different split-GAL4 lines. Aso et al. (2014) recommended specific split-GAL4 lines based on specificity and intensity. Some of the lines that are included in the average Behavioral Potency are not recommended for behavior based on the intensity of expression, which could significantly influence the potency score.

    1. Reviewer #1 (Public review):

      Summary:

      This paper presents a wireless device for closed-loop control of optogenetic stimulation based on behavioral triggers. The authors demonstrate the device through two behavioral experiments in mice, showcasing the device's capabilities and emphasizing open accessibility and using off-the-shelf components.

      Strengths:

      The paper presents a device that is open access and easily reproducible for wireless stimulation in a closed loop based on behavioral triggers. Other strengths of the device include the simultaneous use of multiple devices in parallel and the claimed ease of integration with existing frameworks. The paper shows to behavioral experiments on multiple mice along with some device validation results.

      Weaknesses:

      The main weakness of the presented device lies in the lack of flexibility in stimulation power. For a device that is intended for stimulation only, having to physically change a component on the board to adapt stimulation power is a major downside. Reprogrammable stimulation current is not complex to implement and should really have been included on this device. Another weakness lies in the limited battery life of the device. While using a battery-powered device decreases spatial constraints, allowing for the maze experiment presented in the paper, it also means the lifespan of the device is limited compared to an inductively powered device, limiting its ability for long-term experiments.

    2. Reviewer #2 (Public review):

      Summary:

      The authors have developed an elegant, lightweight, open-source system that should be able to be widely disseminated to the community. They have used this system in multiple experimental paradigms and demonstrate its functionality quite elegantly. One of these experiments involves two of three animals in the arena being stimulated, a situation that clearly requires an untethered approach. They have appropriately quantified key system parameters (latency and battery life).

      Strengths:

      The introduction places this work in a broader context. That context includes a number of previous solutions, many of which are smaller or more technically complex. However, I agree with the authors that there is a need for something that is easy for labs to acquire and deploy in terms of both what goes on the head and the broader infrastructure (i.e., not needing complex wireless power delivery approaches).

      The paper does an excellent job of describing the system architecture. And the architecture is good! Their system comprises more than just the bluetooth enabled head-mounted devices - they also have built an interface that allows for TTL triggers that link into existing workflows.

      The key metrics for a device like this are weight, battery life, and latency. The weight is 1.4g, which is appropriate for adult mice; the battery life is ~100 minutes of continuous stimulation, which should be sufficient for many experiments, and the latency is typically less than 30 ms, which is fine for all but the most demanding closed-loop experiments.

      Performance is demonstrated in two experiments, a continuous Y-maze, which elegantly demonstrates how transfected animals learn to sense optogenetic closed-loop stimulation to drive their choice behavior in a way that control-stimulated animals do not. While authors claim that the ~2m diameter apparatus is "large scale", the second behavior more convincingly demonstrates the need for wireless stimulation.

      They used closed-loop monitoring of animal pose to selectively stimulate animals for approaching the tails of a dominant conspecific (based on pre-experimental pairwise assessments). It seems that the original hope was that the increases in following that they observe would result in long-lasting changes in the hierarchy of a cage, but as they report, this was not observed. Critically, their supplementary video demonstrates that they conducted this experiment with two instrumented animals simultaneously. This is a situation where a tether would have been hopelessly tangled within a few moments!

      The online documentation seems complete, and it seems quite possible for other labs to adopt and deploy the system.

      Weaknesses:

      The battery life is highly dependent on the stimulation paradigm. It makes sense that the LED is a major component of power consumption. It would have been elegant to measure the total optical energy that can be provided by the system. In addition, Bluetooth transmission is probably a major consumer of power, and receiving may not be "free". Quantifying power as a function of Bluetooth message rates would have been useful.

      Presumably, the major constraint on latency is that the Bluetooth receiver polls at ~10 Hz, resulting in latency blocks of 20+, 30+, or 40+ ms. Why latency is never less than 10 ms is unclear. Could latency be reduced by changing a setting? Having a low-latency option would be very helpful for some experimental situations. Latency is probably the primary weakness of the system.

      The programming process sounds quite complicated. It would be nice if they had OTA updates. But described and open source. Similarly, the configuration process (Arduino IDE) seems a bit complex. It would be nice if there were a dedicated cross-platform application.

      It is unclear what the maximum number of devices that could be used without wireless interference is. The base station has two charging stations, but it would have been nice to understand the limits beyond this number.

      There is a very nice website for the system, but there is some concern that the code and design files are not archived. Could they be deposited with the paper?

    3. Reviewer #3 (Public review):

      Summary:

      This study presents a novel device for wireless control of optogenetic stimulation of the mouse brain, the Blueberry, using Bluetooth Low Energy (BLE) communication for parallel activation of up to 4 devices through an Arduino interface. The authors also present two types of brain implants for light delivery that can be connected to the Blueberry: one using uLEDs for surface cortical stimulation, and another using optical fibers for intra- or sub-cortical implants. The architecture of the system, including electronics, communication, and programming, is thoroughly described. Because the system was especially designed to be integrated with existing software used for neuroscience behavioral experiment for closed-loop experiments, validation of the system is shown on two different scenarios: a learning task in a "infinite" Y-maze, where light delivery at precise locations conditions arm choice for navigation; and a social interaction analysis where 3 animals are simultaneously stimulated in order to alter social dynamics among the group.

      Strengths:

      (1) The full system can be built by individual labs with simple PCB printing, off-the-shelf components, and readily available hardware (Arduino) for widespread dissemination.

      (2) Four headstages can be controlled in parallel for simultaneous experiments with multiple mice.

      (3) Validation across different relevant behavioral tests, demonstrating the potential of integrating Bluberry in closed-loop setups.

      Weaknesses:

      (1) Some details in the manuscript regarding system characterization (latency, battery life, etc) are included only in the supplementary materials.

      (2) The practical details of integration with other commercial and open-source software used for the closed-loop experiments, which could help third-party researchers interested in using the system, are lacking sufficient detail.

      (3) System range (3 meters reported) is limited for a BLE device.

      (4) Light output amplitude is not programmable, limiting the choice of stimulation protocols and LEDs used.

      (5) Thermal modeling of the cortical surface stimulator was not performed, and it is unclear if the brain implant for this purpose is within the safety limits.

      (6) The paper is missing a comparison with other state-of-the-art devices for wireless control of optogenetic stimulation in mice.

    1. Reviewer #1 (Public review):

      Summary:

      Mancl et al. present an integrative structural and mechanistic analysis of the human insulin-degrading enzyme (IDE), combining cryo‑EM, time‑resolved cryo‑EM, SEC‑SAXS, enzymatic assays, all-atom molecular dynamics (MD) simulations, and coarse‑grained MD simulations. Their study delineates how IDE undergoes coordinated open-close transitions and interdomain rotations, how these motions relate to its unfoldase and protease activities, and how a single residue, R668, acts as a molecular latch governing these conformational changes. Through expanded structural datasets and computational analyses, the authors propose a mechanistic model for how IDE captures, unfolds, and degrades diverse amyloidogenic substrates such as insulin and Aβ.

      Strengths:

      A major strength of this study is its integration of structural, biophysical, biochemical, and computational approaches. The authors now provide six cryo‑EM structures, including a new time‑resolved O/O state captured 123 ms after substrate mixing, which clarifies the early structural response of IDE to insulin binding. The combination of multibody analysis, 3D variability analysis, all‑atom MD, and coarse‑grained Upside simulations yields a coherent picture in which rotational interdomain motions and charge‑swapping events at the IDE‑N/C interface underpin substrate unfolding and repositioning.

      The identification of R668 as a central determinant of the open-close transition, supported by MD, HDX‑MS data from prior work, SEC‑SAXS, and functional assays on the R668A mutant, represents a significant mechanistic advance. The inclusion of Aβ degradation assays adds biological breadth and supports the conclusion that R668 modulates activity in a substrate‑dependent manner.

      The authors have also substantially improved clarity by reorganizing figures, refining section headers, and adding introductory structural schematics. Taken together, the revised manuscript now provides a rigorous and accessible framework for understanding IDE dynamics and their relevance to amyloid peptide turnover.

      Weaknesses:

      At this stage, remaining limitations are modest and inherent to the system rather than the approach. While the study convincingly demonstrates substrate‑dependent modulation of IDE dynamics, it does not experimentally assess additional endogenous substrates (e.g., amylin, glucagon), which would be needed to fully generalize the role of R668 across the substrate spectrum of IDE. Furthermore, the timescale mismatch between MD simulations and catalytic turnover, which the authors clearly acknowledge, means that correlations between simulated motions and enzymatic kinetics remain inferential. Finally, some flexible cryo‑EM states (particularly O/pO) continue to exhibit moderate local resolution, which constrains atomic interpretation of highly dynamic regions, although this is addressed transparently.

    2. Reviewer #2 (Public review):

      Summary:

      The manuscript describes various conformational states and structural dynamics of the Insulin degrading enzyme (IDE), a zinc metalloprotease by nature. Both open and closed state structures of IDE have been previously solved using crystallography and cryo-EM which reveal a dimeric organization of IDE where each monomer is organized into N and C domains. C-domains form the interacting interface in the dimeric protein while the two N-domains are positioned on the outer sides of the core formed by C-domains. It remains elusive how the open state is converted into the closed state but it is generally accepted that it involves large-scale movement of N-domains relative to the C-domains. Authors here have used various complementary experimental techniques such as cryo-EM, SAXS, size-exclusion chromatography and enzymatic assays to characterize the structure and dynamics of IDE protein in the presence of substrate protein insulin whose density is captured in all the structures solved. The experimental structural data from cryo-EM suffered from high degree of intrinsic motion amongst the different domains and consequently, the resultant structures were moderately resolved at 3-4.1 Å resolution. Total five structures were generated in the originally submitted manuscript using cryo-EM. Another cryo-EM reconstruction (sixth) at 5.1Å resolution was mentioned after first revision which was obtained using time-resolved cryo-EM experiments. Authors have extensively used Molecular dynamics simulation to fish out important inter-subunit contacts which involves R668, E381, D309, etc residues. In summary, authors have explored the conformational dynamics of IDE protein using experimental approaches which are complemented and analyzed in atomic detail by using MD simulation studies. The studies are meticulously conducted and lay the ground for future exploration of the protease structure-function relationship.

      Strengths:

      The manuscript presents a powerful integrative structural biology study that combines high-resolution cryo-EM, particle heterogeneity analysis, time-resolved cryo-EM, multiscale molecular dynamics simulations, SAXS, and biochemical assays to dissect the conformational dynamics of human insulin-degrading enzyme. A major strength is the identification of a previously unappreciated rotational component of IDE-N relative to IDE-C and the discovery of R668 as a molecular latch governing the open-close transition, supported consistently by structural, computational, mutational, and functional data. The work provides a coherent mechanistic framework linking IDE dynamics to substrate unfolding, allostery, and substrate-dependent catalysis, with clear relevance to diabetes and Alzheimer's disease biology.

      Weaknesses:

      Despite its depth, several key mechanistic conclusions-particularly substrate unfolding and the proposed "β-grabbing" mechanism-rely heavily on coarse-grained and all-atom MD simulations rather than direct experimental observation. Cryo-EM density for insulin is limited and heterogeneous, restricting definitive structural interpretation of substrate binding modes. The time-resolved cryo-EM experiment captures only a single dominant state at modest resolution, limiting insight into transient intermediates. In addition, the study focuses primarily on insulin, leaving the generality of the proposed mechanism for other IDE substrates insufficiently tested, and the therapeutic implications remain largely speculative without direct pharmacological modulation data.

    1. Reviewer #2 (Public review):

      Summary

      This study addresses the hypothesis that the higher prevalence of autoimmune diseases in women could result from sex-dependent differences in thymic generation or selection of TCR repertoires. The biological question is important and the dataset is valuable. However, the study has major conceptual and analytical limitations.

      In particular:

      - The conclusions cannot be generalized to autoimmune diseases as a whole, as only type 1 diabetes (T1D) and celiac disease (CeD) antigens were analyzed.<br /> - The central interpretation is not supported by the data, as the observed signal is strongly influenced by TCRs associated with T1D, which shows a male-biased incidence and therefore does not align with the female bias the study aims to explain.

      Strengths

      The key strength of this work is the newly generated dataset of TCR repertoires from sorted thymocyte subsets (DP and SP populations). This approach enables the authors to distinguish between biases in TCR generation (DP) and thymic selection (SP). Bulk TCR sequencing allows deeper repertoire coverage than single-cell approaches, which is valuable here. However, the absence of TRA-TRB pairing and HLA context limits the interpretability of antigen specificity analyses.

      Weaknesses

      The authors did not adequately address the central concerns raised in the previous review. As a result, the major issues remain unresolved.

      (1) Generalization to autoimmune diseases is not justified.

      The study aims to explain the higher prevalence of autoimmune diseases in females. The main conclusion is based on enrichment in females of TCRs annotated as autoimmune-associated using database matching.<br /> However, these matches correspond exclusively to TCRs specific for T1D and CeD. This already limits the conclusions to these two diseases and does not justify generalization to autoimmune diseases as a whole.

      (2) Contradiction with epidemiology of T1D which is male-biased

      T1D and CeD have opposite sex biases in European populations. While CeD is more frequent in females (~60%; doi:10.1016/j.cgh.2018.11.013), T1D is more frequent in males (male:female = 1.11 in France; doi:10.1111/dom.70124).<br /> Importantly, T1D constitutes a substantial fraction of the autoimmune-associated dataset (42 out of 48 epitopes; 83 out of 185 TRB sequences). Therefore, the observed signal is strongly influenced by a disease that does not follow the female bias the study aims to explain.

      The authors argue that T1D sex bias varies globally, including female-biased incidence in East Asia and Africa. However, this argument does not resolve the issue, as the cohort analyzed in this study was derived from France, where T1D shows a male-biased incidence. Thus, the interpretation remains inconsistent with the population context of the dataset.

      (3) Lack of disease-level and donor-level resolution

      The authors combine T1D and CeD into a single "autoimmune" category and do not provide per-disease, per-donor or per-epitope distributions, despite explicit reviewer's requests.

      This prevents evaluation of whether the observed signal is driven by:<br /> - a specific disease (T1D or CeD), or<br /> - a small number of donors

      Without this analysis, the conclusions cannot be properly interpreted.

      (4) Use of "polyspecificity" concept is not supported by experimental evidence

      The authors extensively use the concept of "polyspecific TCRs," defined as single-chain CDR3 sequences annotated across databases as recognizing distinct and unrelated antigenic categories. This concept is not supported by experimental evidence (except for a single TCR in Quiniou et al., as acknowledged by the authors).

      In the absence of robust validation, a more parsimonious explanation for such ambiguously annotated TCR chains is the presence of false-positive annotations in public databases (see, e.g., Ton Schumacher's preprint https://www.biorxiv.org/content/10.1101/2025.04.28.651095.abstract) or alternatively, distinct TRA pairing for identical TRB sequences resulting in different specificities.

      The observation that these TCRs have high generation probability is expected, as TCRs found in independent studies are likely to have high generation probability. The interpretation of these sequences as biologically meaningful entities (e.g., a "first line of defense") is therefore speculative and not supported by the data.

      The authors also refer to in silico-generated polyspecific TCRs (ref. to Nature Machine Intelligence). However, such sequences are generated ex vivo and do not undergo thymic selection. A TCR capable of recognizing multiple unrelated foreign antigens would likely also recognize self-antigens and be eliminated during negative selection. Therefore, this argument does not support the biological relevance and in vivo existence of the proposed polyspecific TCR class.

      (5) Insufficient statistical analysis of diversity

      The absence of statistically significant differences in repertoire diversity between sexes (Figure 3), despite an apparent visual trend, may reflect limited sample size and insufficient statistical power rather than a true absence of differences. A more appropriate statistical approach, such as mixed-effects modeling, was requested in the previous review but was not performed.

    1. Reviewer #1 (Public review):

      Summary:

      Age-related synaptic dysfunction can have detrimental effects on cognitive and locomotor function. Additionally, aging makes the nervous system vulnerable to late-onset neurodegenerative diseases. This manuscript by Marques et al. seeks to profile the cell surface proteomes of glia to uncover signaling pathways that implicated in age-related neurodegeneration. They compared the glial cell-surface proteomes in the central brain of young (day 5) and old (day 50) flies and identified the most up- and down-regulated proteins during the aging process. 48 genes were selected for analysis in a lifespan screen, and interestingly, most sex-specific phenotypes. Among these, adult-specific pan-glial DIP-β overexpression (OE) significantly increased the lifespan of both males and females and improved their motor control ability. To investigate the effect of DIP-β in the aging brain, Marques et al. performed snRNA-seq on 50-day old Drosophila brains with or without DIP-β OE in glia. Cortex and ensheathing glia showed the most differentially expressed genes. Computational analysis revealed that glial DIP-β OE increased the cell-cell communication, particularly with neurons and fat cells.

      Strengths:

      (1) State-of-the-art methodology to reveal the cell surface proteomes of glia in young and old flies.

      (2) Rigorous analyses to identify differentially expressed proteins. 3

      (3) Examination of up- and down-regulated candidates and identification of glial-expressed mediators that impact fly lifespan.

      (4) Intriguing sex-specific glial genes that regulate life span.

      (5) Follow-up RNA-seq analysis to examine cellular transcriptomes upon overexpression of an identified candidate (DIP-β).

      (6) A compelling dataset for the community that should generate extensive interest and spawn many project.

      Weaknesses:

      (1) DIP-β OE using flySAM:

      a) These flies showed a larger increase in lifespan compared to using UAS-DIP-β (Figure 2 C,D). Do the authors think that flySAM is a more efficient way of OE than UAS? Also, the UAS construct would be specific to one DIP-β isoform while flySAM likely would likely express all isoforms. Could this also contribute to the phenotypes observed?

      b) The Glial-GS>DIP-β flySAM flies without RU-486 have significantly shorter lifespans (Figure 2C) than their UAS-DIP-β counterparts. flySAM is lethal when expressed under the control of tubulin-GAL4 (Jia et al. 2018) likely due to toxicity of such high levels of overexpression. Is it possible that larger increase in lifespan is due to the already reduced viability of these flies?

      c) Statistics: It is stated in the Methods that "statistical methods used are described in the figure legend of each relevant panel." However, there is no description of the statistics or sample sizes used in Figure 2.

      (2) Figure 3: The authors use a glial GeneSwitch (GS) to knock down and overexpress candidate genes. In Figure 3A, they look at glial-GS>UAS-GFP with and without RU. Without RU, there is no GFP expression, as expected. With RU, there is GFP expression. It is expected that all cell body GFP signal should colocalize with a glial nuclear marker (Repo). However, there is some signal that does not appear to be glia. Also, some many glia do not express GFP, suggesting the glial GS driver does not label all glia. This could impact which glia are being targeted in several experiments.

      (3) It is interesting that sex-specific lifespan effects were observed in the candidate screen.

      a) The authors should provide a discussion about these sex-specific differences and their thoughts about why these were observed.

      b) The authors should also provide information regarding the sex of the flies used in the glial cell surface proteome study.

      c) Also, beyond the scope of this study, examining sex-specific glial proteomes could reveal additional insights into age-related pathways affecting males and females differentially.

      (4) The behavioral assay used in this study (climbing) tests locomotion driven by motor neurons. The proteomic analysis was performed with the central adult brain, which does not include the nerve cord where motor neurons reside. While likely beyond the scope of this study, it would be informative to test other behaviors including learning, circadian rhythms, etc.

      (5) It is surprising that overexpressing a CAM in glia has such a broad impact on the transcriptomes of so many different cell types. Could this be due to DIP-β OE maintaining the brain in a "younger" state and indirectly influencing the transcriptomes? Instead of DIP-β OE in glia directly influencing cell-cell interactions? Can the authors comment on this?

      Comments on revisions:

      The authors have conducted additional experiments, updated text/figures, and included discussions to address the concerns raised by the reviewers. I commend the authors on a thorough, rigorous study that will undoubtedly impact the field and spawn many projects for years to come.

      One minor comment: In Figure S2, the figure legend states "A-C"; however, the figure itself only has an A and B.

    1. Reviewer #3 (Public review):

      Summary:

      Nigro et al examine how the locus coeruleus (LC) influences the medial prefrontal cortex (mPFC) during attentional shifts required for behavioral flexibility. Specifically, they propose that LC-mPFC inputs enable mice to shift attention effectively from texture to odor cues to optimize behavior. The LC and its noradrenergic projections to the mPFC have previously been implicated in this behavior. The authors further establish this by using chemogenetics to inhibit LC terminals in mPFC and show a selective deficit in extradimensional set shifting behavior. But the study's primary innovation is the simultaneous inhibition of LC while recording multineuron patterns of activity in mPFC. Analysis at the single neuron and population levels revealed broadened tuning properties, less distinct population dynamics, and disrupted predictive encoding when LC is inhibited. These findings add to our understanding of how neuromodulatory inputs shape attentional encoding in mPFC and are an important advance. There are some methodological limitations and/or caveats that should be considered when interpreting the findings and these are described below.

      Strengths:

      The naturalistic set-shifting task in freely-moving animals is a major strength, and the inclusion of localized suppression of LC-mPFC terminals builds confidence in the specificity of the behavioral effect. Combining chemogenetic inhibition of LC while simultaneously recording neural activity in mPFC with miniscopes is state-of-the-art. The authors apply analyses to population dynamics, in particular, that can advance our understanding of how the LC modifies patterns of mPFC neural activity. The authors show that neural encoding at both the single cell level and the population level are disrupted when LC is inhibited. They also show that activity is less able to predict key aspects of the behavior when the influence of LC is disrupted. This is quite interesting and adds to a growing understanding of how neuromodulatory systems sharpen tuning of mPFC activity.

      Weaknesses:

      Weaknesses are mostly minor, but there are some caveats that should be considered. First, the authors use a DBH-Cre mouse line and provide histological confirmation of overlap between HM4Di expression and TH immunostaining. While this strongly suggests modulation of noradrenergic circuit activity, the results should be interpreted conservatively as there is no independent confirmation that norepinephrine (NE) release is suppressed and these neurons are known to release other neurotransmitters and signaling peptides. In the absence of additional control experiments, it is important to recognize that effects on mPFC activity may or may not be directly due to LC-mPFC NE.

      Another caveat is that the imaging analyses are entirely from the extradimensional shift session. Without analyzing activity data from the intradimensional shift (IDS) session, one cannot be certain that the observed changes are to some feature of activity that is specific to extradimensional shifts. Future experiments should examine animals with LC suppression during the IDS as well, which would show whether the observed effects are specific to an extradimensional shift and might explain behavioral effects.

      Comments on revisions:

      The authors overall do a nice job of addressing reviewer comments, and I believe the manuscript is significantly improved.

    1. Reviewer #1 (Public review):

      Genetically encoded fluorescent proteins expressed in specific cell types allow recognising them in vivo and, if the protein is a functional indicator, as in the case of genetically encoded calcium indicators (GECIs), to record activity from the same cellular ensemble. Ideally, if proteins (fluorophores) have perfectly distinct spectral properties, signals can be distinguished from as many cell types as the number of employed fluorophores. In practice, fluorescent proteins have non-negligible crosstalk both in absorption and emission bands. In addition, fluorescence contribution of each fluorophore normally varies from cell to cell and therefore spectral properties of cells expressing two or more proteins are different. The work of Phillips et al. addresses this challenge. The authors present an approach defined as "Neuroplex", allowing identification of up to nine cell types from the same number of fluorophores. The fingerprint of each cell is then associated with functional fluorescence from the GECI GCaMP, allowing recording calcium activity from that specific cell. The method is implemented in vivo using head-mounted miniscopes.

      The authors used a mouse line expressing GCaMP in cortical pyramidal neurons and developed an experimental pipeline. First, they injected the nine AAV viruses, causing expression of fluorophores in a different brain area. The idea was not to image that area, but a non-infected medial prefrontal cortex (mPFC) section where neurons could be infected by their axons projecting in an injected area, in this way being identified by their targeting region(s). A GRIN lens, allowing spectral analysis, was mounted in the mPFC section, and GCaMP fluorescence was then recorded during behavioural tasks and analysed to identify regions of interest (ROIs) corresponding to neuron somata. After functional imaging, the head of the mouse was fixed, spectral analysis was performed, and after necessary correction for chromatic distortions, the fluorophore contribution was determined for each ROI (neuron) from where GCaMP signals were detected. Notably, the procedures for estimation and correction of chromatic aberration and light transmission (described in Figure 2) were a major challenge in their technical achievements. The selection of the nine fluorophores was another big effort. This was done by combining computer simulations and direct measurement of spectra from individual proteins expressed in HEK293 cells. It is important to say that the authors could simulate arbitrary combinations of two or more different fluorophores and evaluate the ability of their algorithm to detect the correct proteins against wrong estimations of false-negative (absence of an expressed protein) or false-positive (presence of a non-expressed protein). Not surprisingly, this ability decreases with the level of GCaMP expression. The authors underline that most errors were false-negatives, which have a milder impact in terms of result interpretation, but the rate of false positives was, nevertheless, relevant in detecting a second fluorophore from a cell expressing only one protein. The experimental profiles of fluorophores were dependent both on the specific fluorescent protein and on the projecting area, and the distribution of double-labelled did not match anatomical evidence. This result should be taken as the limitation of the present pioneering experiments, presented as proof-of-principle of the approach, but Neuroplex may provide far improved precision under different experimental conditions.

      In my view, the work of Phillips et al. represents a significant advance in the state-of-the-art of the field. The rigorous analysis of limitations in the use of Neuroplex must be considered an important guideline for future uses of this approach.

      Comments on revision:

      The authors have adequately addressed my comments.

    2. Reviewer #2 (Public review):

      Summary:

      The manuscript introduces Neuroplex, a pipeline that integrates miniscope Ca²⁺ imaging in freely moving mice with multiplexed confocal and spectral imaging to infer projection identities of recorded neurons. This technical approach is promising and could broaden access to projection-resolved population imaging. However, the core quantitative analyses apply a winner-take-all single-label assignment per neuron even when multiple fluorophores exceed threshold, with additional labels treated descriptively as "secondary hits." While the authors acknowledge and simulate dual labeling, the extent to which this single-label decision rule affects subtype fractions and behavioural comparisons remains uncertain without a multi-label (or probabilistic) sensitivity analysis and propagation of classification uncertainty.

      Strengths:

      (1) Conceptual advance and practicality: Decoupling acquisition from identity readout constitutes an innovative approach that is, in principle, applicable in laboratories currently using single-color miniscopes.

      (2) Engineering thoroughness: The manuscript offers detailed consideration of GRIN optics, spectral libraries, registration procedures, and simulations that address signal-to-noise ratio, background, and class imbalances.

      (3) Immediate community value: If demonstrated to be robust, the pipeline could enable projection-resolved analyses without reliance on specialized multicolor miniscopes.

      Comments on revision:

      The authors have addressed my comments, and I have no further remarks.

    1. Reviewer #1 (Public review):

      In this study, the authors investigated a specific subtype of SST-INs (layer 5 Chrna2-expressing Martinotti cells) and examined its functional role in motor learning.

      Most of the issues remain unaddressed. The findings across experiments are inconsistent, and it is unclear how the authors performed their analyses or why specific time points and comparisons were chosen. The study will require major re-analyzing and additional experiments to substantiate its conclusions.

      After reading the reviewers' responses, my major concerns about the manuscript remain unresolved, particularly regarding the arbitrarily defined stages of learning in the motor learning task and how the calcium imaging data align with the animal's movements.

      - In line 331, the authors refer to session 5 as "training," describing it as the final spoon session, and session 6 as "re-training," because it is the first session in which the pellet is presented on the plate rather than on the spoon. However, in Fig. 1F-H, even in the Ctrl group, it is clear that the performance drops significantly in session 5, which is supposed to be the easiest session before switching to the more difficult plate condition.

      - In the classic pellet-reaching task, the spoon sessions would typically be considered "shaping", while the plate sessions would represent the actual training phase. However, in this manuscript, the authors still insist on referring to session 2 as "learning" and session 5 as "training." I don't understand the difference between session 2 and session 5, especially when session 5's performance is lower than session 2 (even in Fig 1H when you compare succ ratio).

      - Since session 6 (on the plate) is considered as "retraining," why don't the authors present the behavioral results beyond session 6? As a result, it remains unclear whether the animals improved their performance during the retraining phase.

      - Lastly, in Fig. 4B the authors present only the success ratio and claim that performance improves with CLZ application. However, when comparing sessions 8-10 between the Ctrl and Cre⁺ groups, there already appears to be a baseline difference. CLZ treatment in Cre⁺ mice seem to bring performance only to the WT level rather than producing a clear improvement beyond baseline.

      - Regarding the alignment between imaging and behavior, the authors report ~100 prehensions per minute. However, the calcium imaging traces show fewer than 20-30 spikes over 150 seconds (~2.5 min; Fig. 1E). This discrepancy raises concerns about whether the authors can truly isolate calcium signals corresponding to individual prehension events (either successful ones or multiple combined events for unsuccessful attempts). The manuscript still does not present behavioral data that directly aligns prehension events with calcium imaging activity. Although the authors performed analyses suggesting that prehension-related activity does not systematically alter non-prehension epochs, this claim is difficult to evaluate without seeing the underlying traces. It is therefore unclear how the authors selected the example calcium traces aligned to prehension onset, given that there are more than 100 prehension events per minute.

      - In Fig. 1I, the authors also did not address why neural activity during successful trials is already lower one second before movement onset. The longer traces provided do not help to explain this observation or clarify the origin of this pre-movement reduction in activity. It actually further suggests that there may be some artifacts in the imaging that could affect the analysis.

      - Overall, because it remains difficult to understand exactly what the authors are analyzing (and because the definitions of the motor learning stages appear arbitrary) it is difficult to agree with the authors' conclusion that Ma2s cells reduce PyrN cell assembly plasticity during learning, thereby possibly facilitating already acquired motor skills.

    2. Reviewer #2 (Public review):

      Summary:

      In this manuscript, Malfatti et al. study the role of Chrna2 Martinotti cells (Mα2 cells), a subset of SST interneurons, for motor learning and motor cortex activity. The authors trained mice on a forelimb prehension task while recording neuronal activity of pyramidal cells using calcium imaging with a head mounted miniscope. While chemogenetically increasing Mα2 cell activity did not affect motor learning, it changed pyramidal cell activity such that activity peaks become sharper and differently timed than in control mice. Moreover, co-active neuronal assemblies become more stable with a smaller spatial distribution. Increasing Mα2 cell activity in previously trained mice did increase performance on the prehension task and led to increased theta and gamma band activity in the motor cortex. On the other hand, genetic ablation of Mα2 cells affected fine motor movements on a pasta handling task while not affecting the prehension task. While overall this study addresses an important and timely question, limitations in the design of the motor learning task and data analysis significantly weaken the conclusions drawn in this manuscript.

      Strengths:

      The proposed question of how Chrna2-expressing SST interneurons affect motor learning and motor cortex activity is important and timely. The study employs sophisticated approaches to record neuronal activity and manipulate the activity of a specific neuronal population in behaving mice over the course of motor learning. The authors analyze a variety of neuronal activity parameters, comparing different behavior trials, stages of learning, and the effects of Mα2 cell activation. The analysis of neuronal assembly activity and stability over the course of learning by tracking individual neurons throughout the imaging sessions is notable, since technically challenging, and yielded the interesting result that neuronal assemblies are more stable when activating Mα2 cells.

      Overall, the study provides compelling evidence that Mα2 cells regulate certain aspects of motor behaviors, likely by shaping circuit activity in the motor cortex.

      Weaknesses:

      While the authors addressed some of the concerns raised by the reviewers, several major limitations still exist in the revised manuscript.

      (1) I appreciate the authors now showing more measures of the prehension task (total reaches, success reaches/min, and success ratio) and providing more details on the task design. However, it is unclear why the authors chose a task design that is somewhat different from the commonly used approach. Here they increase the distance of the food pellet each session and are thus making the task increasingly harder, whereas commonly the target distance is kept stable (See 10.1038/nature08389 for example). The result is that important readouts of learning (e. g. success rate) thus remain stable, making it impossible to judge if learning has occurred, without a control group of non-trained mice. This makes it impossible to judge if the task is affected by increased Mα2 cell excitability, since there is no reference of how these measurements are supposed to change in a mouse that learns or doesn't learn the task.

      (2) Regarding the analysis of the calcium imaging data, it is still unclear why the authors cannot report a commonly used dF/F0 or z-score value, as recommended by both reviewers. The authors state the 1 sec time window prior to the prehension cannot be used as a baseline (F0), as there might be preparatory motor activity. In that case an even earlier window (such as -2 to -1sec) or z-scores should be used. The current version relabeling the background subtracted fluorescence signal as dF/F0 is misleading. Relatedly, it is unclear why the authors don't think the 1 sec window before prehension cannot be used as baseline, but at the same time use the difference in calcium activity before and after prehension onset as a cut-off criterion for defining cells as modulated during prehension and including in the analysis.

      (3) While the authors have improved their statistical reporting, key information is still missing in several places. For example, no N-numbers are listed in legends for figure 3, and there is no mention of the number of mice for analysis in figures 2 and 3. For clarity, the authors should also include the statistical test performed in the figure legends for any p-values shown in the figure.

    1. Reviewer #1 (Public review):

      Nio and colleagues address an important question about how the cerebellum and ventral tegmental area (VTA) contribute to extinction learning of conditioned fear associations. This work tackles a critical gap in the existing literature and provides new insights into this question in humans through the use of high-field neuroimaging with robust methodology. The presented results are novel and will broadly interest both the extinction learning and cerebellar research communities. As such, this is a very timely and important contribution.

      Strengths:

      The core finding - coupling of cerebellum and VTA as a reward-like prediction errors during fear extinction - is novel and addresses a genuine gap in the literature. Also the paradigm spanning several sessions, a well-powered sample, 7T imaging and complementary analytical approaches to target the question is commendable.

      Weaknesses:

      The authors have satisfactorily addressed the concerns raised in the previous version of the manuscript. Several results, as well as conclusions drawn from them, still rest on trend-level evidence, although the revised presentation of the results now provides a more balanced interpretation of these findings.

    1. Reviewer #1 (Public review):

      Summary:

      This study presents a map of neurons responding to aversive stimuli in zebrafish and suggests that the regions containing these neurons are homologous to mammalian brain areas involved in aversive processing. Specifically, this study found that neurons in a part of the pallium, the homolog of the amygdala, responded vigorously to strongly noxious and fully looming stimuli, but not to the milder cues. In contrast, neurons in another part of the pallium responded to all of these stimuli. The findings provide valuable insights into the neural mechanisms underlying negative-valence computation in zebrafish.

      Strengths:

      This study performed whole-brain functional imaging using two-photon light-sheet microscopy and identified the activity of individual neurons in awake zebrafish. This technique is highly valuable and will be broadly applicable to future studies aimed at elucidating the neural mechanisms underlying zebrafish behavior at single-neuron resolution.

      Weaknesses:

      Although this study reports neuronal responses to aversive stimuli, it did not directly assess how aversive these stimuli were for zebrafish. In general, studies of this kind quantify the aversiveness of test stimuli by measuring behavioral indices such as avoidance or escape responses. The present study states that "neurons responded vigorously to strongly noxious and fully looming stimuli, but not to milder cues." However, the authors did not provide behavioral evidence demonstrating that the stimuli were indeed aversive or that the so-called milder cues were perceived as less aversive by the animals. Without a behavioral measure of aversiveness, it is difficult to determine whether the reported neural responses reflect negative-valence processing, rather than general sensory salience or stimulus intensity.

    2. Reviewer #2 (Public review):

      Summary:

      The authors aim to map neurons encoding negative valence at the whole-brain scale in larval zebrafish. Using two-photon light-sheet imaging combined with various aversive stimuli, they visualize and quantify stimulus-evoked neural responses, identify the anatomical locations of responsive neurons, and explore the possibility of genetically accessing Rl neurons that respond preferentially to strongly noxious stimuli.

      Strengths:

      The major strength of this study lies in its use of two-photon light-sheet imaging, which provides a system-level characterization of neuronal response to aversive stimuli. The authors systematically compare multiple classes of aversive stimuli (heat, electric shock, looming, etc.), showing that strongly threatening stimuli converge on a compact neuronal population in the Rl, supporting the robustness of the finding. Finally, the identification of Tiam2a expression in these neurons provides a potential genetic handle for future functional studies.

      Weaknesses:

      The main weakness of the study is the lack of causal evidence supporting the functional role of the identified neurons. Without optogenetic, chemogenetic, or ablation experiments, it is difficult to determine whether these neurons are required for or sufficient to encode negative valence. In addition, the study does not include positive-valence or neutral stimuli controls, making it difficult to distinguish whether the observed neural responses reflect valence per se or more general downstream response such as motor output. Finally, the lack of behavioral readouts limits the ability to directly link the identified neural populations to defensive behaviors.

    3. Reviewer #3 (Public review):

      Overview and Strengths:

      Accurate evaluation of threat levels allows animals to determine whether to escape. The precise mechanism underlying threat evaluation remains unclear. Smith et al. identified a cluster of neurons in the zebrafish rostrolateral dorsal pallium (Rl) that respond differentially to varying levels of negative-valence stimuli.

      This work leverages the small size and optical transparency of the larval zebrafish, using two-photon selective plane illumination microscopy to assay the response of pallial neurons to various negative-valence stimuli. Interestingly, unlike the ventromedial pallium and habenula, which responded to all stimuli tested, neurons in the Rl were activated by a selection of stimuli representing relatively higher levels of threats. By leveraging a zebrafish brain atlas, the authors identified a transgenic line labeling a tiam2a+ cluster of neurons that appears to be the activated population in the Rl. Together, these results demonstrate a subpopulation of pallial neurons that likely categorizes the strength of negative valence in larval zebrafish.

      The primary conclusions of this work are well supported by the data. The identification of a neuronal cluster that may underlie the categorization of threat-associated sensory stimuli is significant. Furthermore, this study generates a high-quality functional imaging dataset using cutting-edge microscopy, setting the foundation for understanding the neuronal encoding of emotions in zebrafish.

      Results from this work set the stage to answer further exciting questions: How do tiam2a+ Rl neurons modulate the activity of the hindbrain escape circuit? What is the functional role of the Rl population inhibited by threat stimuli? Computationally, how does Rl integrate sensory signals and classify threat levels? How does the activity of Rl change in the context of habituation and conditioning? Future work may use more nuanced stimuli and combine new genetic tools, behavioral recording, and circuit-level analysis to systematically reveal how emotions modulate defensive behaviors.

      Weaknesses:

      The impact of this work could be further enhanced by incorporating more sophisticated data analysis and by more clearly anchoring the findings within the known framework of zebrafish defensive behavior.

      (1) The authors performed statistical analyses across six ROIs per experiment in Figures 1E/J, 3E/J, and 6B/D/F. This increases the probability of Type I errors. Applying multiple comparison corrections would mitigate this concern. Given that most stimuli (except for the "IR heating") are non-directional, the authors may consider first testing for the response symmetry following each stimulus and then combining ROIs from the two hemispheres to calculate a single averaged measurement per region per fish for comparisons of regional dF/F.

      (2) I found the topographical mapping of activated and inhibited ROIs very informative. There appear to be two subpopulations of Rl: a posterior-medial population often activated by negative valence stimuli, and an anterior-lateral population that is frequently inhibited. I wonder if it is possible to decode the valence or category of a stimulus based on the topography and response profiles of these neurons? These results would provide additional evidence for the Rl's roles of threat evaluation.

      (3) Findings in this paper, especially differential responses of the Rl to full and partial looming, deserve an expanded discussion. The authors should better anchor these findings to established literature to emphasize their significance in the Discussion. For example, how might this potential categorization mechanism contribute to, or differ from, the mechanisms underlying habituation (Fotowat & Engert, 2023, eLife); what are the possible connections between the pallium and the hindbrain escape circuits that could relay these Rl signals (Kunst et al., 2019, Curr Biol)?

      (4) The authors make conservative claims associating the tiam2a+ cluster with Rl neurons activated by noxious stimuli, and their data support this conclusion. However, this link could be further strengthened by testing whether the tiam2a+ cluster shows differential responses to full vs partial looming. This could be achieved by performing pERK staining following the stimulus paradigm. While future tools may allow for direct functional imaging of this population, I believe such experiments are beyond the scope of this paper.

      (5) Figure 1E/J, Figure 3E/J: Please clarify whether the dashed red vertical lines indicate the onset or the offset of the stimuli. Additionally, different time windows were used for AUC calculations across these experiments; the authors should provide a rationale for these varying windows in the Results or Methods.

    1. Reviewer #1 (Public review):

      Summary:

      The aim of this paper is to model the spontaneous emergence of sequences in networks of plastic spiking neurons. By spontaneous, they mean that the inputs have no structure, no sequences, but the network nevertheless generates sequences. To obtain this, they assume several synaptic plasticity and single neuron plasticity rules. The primary findings are that sequences can emerge, that they slowly drift over time, that weights also constantly change over time, but that very strong weights are more stable. The main driver of this result is the plasticity rules assumed.

      Strengths:

      The paper is based on simulations of a relatively large network of conductance based integrate and fire neurons. There are two different pair-based STDP rules assumed for excitatory-to-excitatory synapses and for inhibitory-to-excitatory synapses. In addition, weights are normalized, and there is an adaptation due to plasticity of the spiking threshold. The network is analyzed via simulations and data processing akin to what would be done for physiological data. The simulations are extensive, and the analysis seems rigorous.

      Weaknesses:

      There are several fundamental problems with the paper:

      (1) The plasticity mechanisms used assumed that pair-based STDP is sufficient to account for synaptic plasticity in vivo. This is unrealistic. Various different papers have shown that pair-based STDP models do not account well for experimental data. If this model is a simulation of the visual cortex (unclear), then firing rates can be sufficiently high, such that firing rates are more important than spike times. We already know that firing rates matter due to the original Markram et al paper from 1997. Even if pair-based STDP is used, we already know from Bi and Poo 1998 that there is a weight dependence of synaptic plasticity such that strong weights potentiate less and decay more. This additional assumption alone might completely change the results in this study. We don't really know how to model realistic synaptic plasticity, but we know pair-based STDP is a bad model. Would these results be robust enough for a change in the learning rule, for example, to triplet-based, calcium-based, or voltage-based? Are the results shown even robust enough to include slight modifications to the learning rule, for example, weight dependence of pair-based STDP?

      (2) The first stage of training, in which the network reaches a steady state, is unclear. What type of activity is exhibited in this network? Does most of it arise from the external inputs? What firing rates are obtained? What are the spike statistics? This is important because this activity is responsible for generating the emergent sequences, and also depends (I think) on the plasticity mechanisms. Does the 'spontaneous activity' in the network depend strongly on the external input? Figure 1E is where we see a raster plot, but we see only neurons within a sequence, and it seems neurons within the sequence fire almost only once. Before showing sequences that more general structure of the spiking activity and how it evolves should be explained and quantified.

      (3) Do these sequences really emerge without structured inputs? Is there any evidence to suggest that such sequences emerge without a structured input? If yes, please cite it. It makes sense that it would, because the time scale of these sequences is much faster than the sensory or behavioral time scale. However, experimental evidence to support this will make the paper much more interesting.

      (4) This paper is a phenomenological paper. It does not really say what these sequences might be good for, except for a cite or two, and it does not model any specific experiment. There is a medium here (a plastic spiking network) which generates a phenomenon (sequences). It also generates other measurable phenomena, such as connectivity motifs. Such motifs have been quantified in animals. It would be natural to compare the motif statistics found here to motifs characterized experimentally. This would make these results more substantial.

      (5) There are implicit predictions in the work. For example, about the stability of strong vs. weak efficacies or the stability of different motifs. Such predictions should be made more explicit.

    2. Reviewer #2 (Public review):

      Summary:

      This paper investigates how a combination of spike-timing-dependent plasticity rules in recurrent spiking networks leads to the spontaneous emergence of repeating neuronal sequences. The authors show that despite the weight distribution reaching a steady state, individual synaptic connections undergo constant turnover with timescales that depend on connection strength. The plasticity rules promote fan-in/out connectivity motifs that appear to support sequence generation.

      Strengths:

      The question addressed is important and biologically relevant. The most interesting finding of the paper is the coexistence of a stable weight distribution with constant turnover of individual synaptic connections.The simulations seem to be carefully executed.

      Weaknesses:

      The paper does not make a sufficient attempt to explain why the observed phenomena arise under the specific learning rules employed. There is no theoretical reduction, no analytical argument, and no mechanistic intuition. As it stands, this reads as a descriptive simulation study.

      It is never made clear which results reflect robust qualitative phenomena and which are specific to the particular hyperparameter choices of these simulations. Specific percentages and parameter values are reported throughout the main text without justification of their importance or generality.

      The finding that sequence composition undergoes continual turnover while the global weight distribution remains stable is interesting, but the authors should more carefully situate this result within the existing theoretical literature on synaptic drift and sequence stability under ongoing plasticity. Several modeling papers have addressed related phenomena, and the novelty of the present contribution relative to this body of work is not clearly established.

    3. Reviewer #3 (Public review):

      Summary:

      This modelling study connects synaptic plasticity, connectivity motifs, and representational drift. The authors combine excitatory and inhibitory STDP with weight normalization and intrinsic plasticity in a recurrent spiking network of AdEx neurons. This combination generates heavy-tailed synaptic weight distributions and supports repeating spike sequences under both unstructured and structured inputs. While global network statistics stabilize over time, individual synapses continue to change, creating a form of drift. Structured inputs further stabilize sequences, yet the network retains flexibility to learn new patterns.

      Strengths:

      (1) Multi-scale turnover analysis:

      The authors study the evolution of individual synapses, 3-neuron motifs, follower neurons, and entire neuronal sequences, revealing distinct turnover timescales.

      (2) Fan-in/out motif analysis:

      A specific connectivity motif (fan-in/out) is shown to be over-represented in the network and preferentially stabilised by the plasticity rules compared to other possible motifs. This generates interesting insights and testable predictions.

      (3) Connection to representational drift:

      The connection of ongoing synaptic plasticity to drift is timely and interesting, reproducing observations of macro-level stability and synapse-level turnover with a relatively simple mechanism.

      (4) Rigour and thoroughness:

      The overall quality of the numerical experiments performed in this study is high, with extensive supplementary material performing various controls to solidify the claims.

      Weaknesses:

      (1) Limited connection to network function:

      Sequence detection relies on a rather artificial protocol (forced spiking of a single neuron 1,000 times), which I suspect mostly tests whether the lognormal tail of the weight distribution can propagate activity. This risks being circular. I think performing the same sequence analysis on a random network/a network with the same weight distribution but shuffled would help understand what comes from a generic heavy-tailed weight distribution and the particular weights potentiated by the plasticity rules used here.

      The network, which would classically be evaluated as a memory network, is not assessed on this aspect. While the authors do not overclaim, this limits the impact.

      Relatedly, the relearning experiment (Figure 5G) shows catastrophic forgetting. This is acknowledged in the discussion, but the suggested solutions (alternating patterns, plastic readout) are speculative without supporting simulations. This limits the applicability of the model as a memory model or, more broadly, as a model of a brain region/function.

      Additionally, in the sequence learning experiments with structured input, the ability to learn seems tied to the very specific timescale of pattern presentation (~10 ms per pattern, comparable to the STDP kernel time constants), arguably faster than the timescale of external stimuli. The stability of sequences may also owe more to the normalization scheme than to STDP per se.

      (2) Novelty claims and positioning within the literature:

      On page 16, the authors write: "Our results demonstrate that spiking sequences can be generated in randomly connected networks trained by synaptic plasticity even under unstructured inputs, which supports STDP being the main actor, while stabilizing mechanisms such as weight normalization and intrinsic plasticity play a complementary role." (c1).

      Several aspects of this work are less novel than the presentation suggests:

      (a) The fact that STDP can create sequence-like dynamics/asymmetric connectivity matrices in recurrent networks has been studied theoretically [1,2] and in simulations [3,4,5]. While [3] is cited, the manuscript underplays the similarity. [4] (uncited) considers e+iSTDP with a different homeostatic term to represent sequential stimuli in large recurrent spiking networks. [5] (uncited) also considers a recurrent spiking network with several STDP-like rules and shows that many combinations can store and recall sequential inputs.

      (b) Lognormal weight distributions emerging from STDP-based plasticity and the autonomous emergence of connectivity structures have extensive literature. While many of these articles are already cited in the manuscript, I fail to see what this work brings to this matter compared to existing work (particularly [6]).

      (c) Several published works challenge the manuscript's implicit claim (c1) that sequences require their particular combination of rules. Many other plasticity mechanisms can create sequences [3,4,5,7,8,9]. Some interpretations may also need to be dialed down: [10] (uncited) showed that sequences can be stored and retrieved using EI and IE plasticity alone. iSTDP may be doing more computational work than acknowledged, which complicates the interpretation of which mechanisms are truly driving the phenomena.

      Overall, most of the relevant work is already cited in the manuscript, but not necessarily acknowledged adequately.

      (3) Justification of plasticity model/robustness analysis:

      The parameters in Tables 1 and 2 are quite specific without strong justification (for instance, different sparsity values for each connection type and specific normalization factors). Without parameter sweeps, it is difficult to know whether the key findings are robust or overfit to this particular network configuration. Given the number of parameters, exhaustive sweeps are out of question, and the argument made previously would still prevent the rule combination proposed from being considered as more than one possible mechanism for sequence generation among many others. However, this deserves to be acknowledged, and potentially a few sweeps to be run (e.g., over LTP/LTD ratio, normalization threshold, and network size). I don't think that Figure S12, which shows that removing any component of the model causes it to break down in some way, is enough to cover alternative plasticity rules.

      A related concern is that the network is small by current standards (1,200E + 240I neurons), especially with sparse connectivity (6-20%). Small networks with few connections are susceptible to synchronization (other studies typically consider networks of at least 10k neurons). The authors should discuss whether the phenomena they observe would persist at larger scales and under more biologically realistic connectivity. Specifically, are the intrinsic and normalization plasticity terms as crucial in this case?

      (4) Fan-in/out motif evidence is correlational:

      The evidence linking the fan-in/out motif to sequence stability appears to be correlational. Properly establishing causality would require targeted ablations or rewiring of fan-in/out connections. While designing a clean causal intervention may be difficult, the correlational nature of the evidence should be stated explicitly.

      Conclusion:

      To summarize, the manuscript would benefit from:

      (1) Reframing the contribution:

      Multi-scale turnover analysis and the discussion around representational drift as the core novelties. I would reposition sequence emergence and lognormal distributions as reproducing known results under a specific plasticity model and analysis method.

      (2) Acknowledging that many rule combinations could produce equivalent outcomes, and not suggesting that the combination chosen here is special.

      (3) Adding parameter sensitivity analysis or, at a minimum, discussing robustness.

      References:

      [1] Kempter, Gerstner and van Hemmen, Hebbian learning and spiking neurons, 1999, PRE

      [2] Ocker, Litwin-Kumar and Doiron, Self-organization of microcircuits in networks of spiking neurons with plastic synapses, 2015, plos CB<br /> (Theoretical account of STDP in spiking networks and motifs, though it only looks at 2-synapse motifs (not fan-in/fan-out)).

      [3] Fiete et al., Spike-Time-Dependent Plasticity and Heterosynaptic Competition Organize Networks to Produce Long Scale-Free Sequences of Neural Activity, 2010, Neuron

      [4] Duarte and Morrison, Dynamic stability of sequential stimulus representations in adapting neuronal networks, 2014, Frontiers in Comp Neuro

      [5] Confavreux et al., Memory by a thousand rules: Automated discovery of functional multi-type plasticity rules reveals variety and degeneracy at the heart of learning, 2025, bioRxiv

      [6] Zheng, Dimitrakakis and Triesch , Network Self-Organization Explains the Statistics and Dynamics of Synaptic Connection Strengths in Cortex, 2013, plos CB

      [7] Zheng and Triesch, Robust development of synfire chains from multiple plasticity mechanisms, 2014, Front Comp Neuro

      [8] Ravid Tannenbaum and Burak, Shaping Neural Circuits by High Order Synaptic Interactions, 2016, plos CB

      [9] Bell, Duffy, and Fairhall, Discovering plasticity rules that organize and maintain neural circuits, 2024, NeurIPS

      [10] Gong and Brunel, Inhibitory Plasticity Enhances Sequence Storage Capacity and Retrieval Robustness, 2024, bioRxiv

    1. Reviewer #1 (Public review):

      The wide-ranging serotonergic projections emerging from the Dorsal Raphe nucleus (DRN) are suggestive of a central role in regulating brain-wide activity and behavioural states. DRN activity has been associated with diverse functions, ranging from mood, motivation and pain regulation to sleep and cognitive flexibility. Its far-reaching connectivity made it challenging to assess the brain-wide effect of its activation, especially during behaviour.

      The present study by Qi et al. addresses these challenges by combining state-of-the-art tracking microscopy with the whole-brain accessibility of the larval zebrafish model. To investigate the effect of DRN activation, the authors leveraged the Tg(tph2:ChrimsonR) line to optogenetically activate tph2-positive neurons in the DRN, while monitoring changes in brain-wide activity, locomotion and auditory-stimuli evoked responses.

      Optogenetic activation had a suppressing effect on locomotion, which the authors distinguished from inducing sleep by the maintenance of posture and its sleep disturbing effect of nighttime stimulations. Further, the authors report a distinct effect of DRN activation on motor-related, but not auditory-related neuronal subspaces, identified by demixed principal component analysis.

      In addition, rather than affecting all motor-correlated neurons similarly, tph2+ DRN-mediated suppression focused on neurons encoding high-amplitude or turning motion.

      In summary, the work of Qi et al. provides solid evidence for a predominant role of the DRN in wake-state motor suppression by aptly combining the vast data-acquisition possibilities of the larval zebrafish model with computational methods to extract relevant information.

      The brain-wide scope of the analysis is a key strength, reducing bias, confirming the involvement of known motor and auditory regions, and providing a valuable dataset for future analyses.

      While the results well support the conclusion of the authors, certain biological and technical aspects demand discussion.

    2. Reviewer #2 (Public review):

      Summary:

      The authors examine the effects of activating the dorsal raphe nucleus serotonergic system using a combination of calcium imaging and optogenetics in freely moving larval zebrafish. Their findings show that optogenetic stimulation induces a state of behavioral quiescence.

      They further investigate whether this state corresponds to sleep or reduced motor activity. Analyses of posture and sleep-related paradigms indicate that serotonergic activation primarily suppresses motor output rather than promoting sleep. Notably, this suppression appears to be bout type-dependent, with stronger effects on neurons associated with larger tail amplitudes and turning angles.

      In addition, auditory stimulation experiments reveal no significant impact of serotonin on sound encoding.

      Strengths:

      The study combines advanced experimental techniques with state-of-the-art analytical methods, enabling precise and compelling insights into the role of serotonergic modulation. The experiments and analyses are well aligned with the questions being addressed, and the results appear robust and reliable.

      Moreover, the implementation of experiments that combine calcium imaging and optogenetics in freely moving animals is technically challenging and appears well justified in the context of the research questions.

      Weaknesses:

      While the analytical techniques employed are sophisticated and appear to be appropriately applied, their presentation makes the manuscript difficult to follow. Although the explanations are provided in the Methods section, including more guidance in the main text, such as how to interpret each analytical approach and what outcomes would be expected under different scenarios, would help readers who are less familiar with these techniques.

      Providing this context would better guide the reader in navigating the figures, broaden the accessibility of the work, and ultimately increase its impact.

      While the authors discuss different quiescent states mediated by serotonin reported in previous studies, their interpretation is limited to stating that "a common feature shared by these distinct behavioral states is a pronounced reduction in movement," and consequently proposing that activation of dorsal raphe nucleus is not sufficient to specify a particular behavioral state, but rather plays a primary role in driving motor suppression.

      In my view, a more thorough attempt to determine whether the observed state corresponds to any of the previously described forms of quiescence, or represents a subset or variant of them, would strengthen the manuscript. This would help better integrate the findings with the existing literature.

      For example, given that the authors have access to whole-brain activity data, it would be valuable to examine and discuss whether there are shared patterns of activation with previously reported quiescent states.

      The manuscript largely avoids discussing the mechanisms underlying the observed motor suppression. For instance, is this effect driven directly by serotonin release onto target neurons? Is it mediated by glial activity, as suggested in other studies? Are additional neuromodulatory systems being recruited?

      While addressing these questions may require substantial further work, potentially beyond the scope of the present study, the availability of whole-brain data provides an opportunity to at least explore or discuss these possibilities. In particular, it would be interesting to examine the recruitment of regions not directly stimulated but known to be associated with other neuromodulatory systems or promoting glial activation (e.g., the locus coeruleus).

    1. Reviewer #1 (Public review):

      Summary:

      This article investigates the application of commonly employed analytic methods in electrophysiological neuroscience to the speech envelope taken from 17 different languages' audio corpora. The findings indicate that features observed in speech-brain tracking responses, specifically theta and gamma oscillations, as well as their phase-amplitude coupling, are actually present within the speech envelope itself. This suggests that the neural data recorded in response to speech primarily reflects an evoked response to the temporal statistical properties of the envelope, rather than an inherent neural mechanism. Data from 18 individuals with epilepsy listening to French speech further support this interpretation: theta and gamma oscillations, along with their phase-amplitude coupling, are absent at rest and are linearly driven by the acoustic envelope during speech perception.

      Strengths:

      I find these results very interesting and convincing, with a strong take-home message: we should exercise caution when interpreting observed theta/gamma activity and the associated phase-amplitude coupling during speech comprehension tasks.

      Weaknesses:

      I mostly have comments on clarifications regarding the methods, specifically on the criteria for language exclusion, and on the statistical testing and reporting.

      (1) Clarification is needed regarding the rationale for the number of languages analysed: initially, 17 languages were considered, six were excluded due to the absence of PAC in the high gamma range, yet the analysis was ultimately conducted on only nine languages, not eleven. Could you please explain this discrepancy?

      (2) Considering the six languages that did not exhibit any statistically significant high-frequency PAC, do you have potential reasons for this result? Might it be related to the fundamental frequency (F0) of the speakers' voices? If six languages out of seventeen do not show PAC, can we argue that this feature is universal across languages?

      (3) How is inter-subject variability addressed within the SEEG analysis? The authors report the percentage of SEEG independent components showing significant effects in power spectral changes, PAC, and other measures, but it is unclear whether these components are consistent across participants or whether only a few participants drive the effect. It would be helpful to report how many participants are retained for each selection of SEEG-ICs in the article. Currently, the statistical testing of the SEEG-ICs also appears to assume independent samples. It would be helpful to include group-level statistical tests across subjects, for instance by performing mixed-effects models and including participant as a random factor.

    2. Reviewer #2 (Public review):

      Summary:

      This paper nicely demonstrates that "speech tracking" in the auditory cortex extends all the way up to 100Hz-150Hz. Specifically, the study asks whether the fluctuations in sound amplitude found in speech at various time scales relate to fluctuations found in similar time scales in intracranial recordings in auditory brain areas. First, it analyzes amplitude fluctuations in speech of 17 different languages, and characterizes fluctuations due to syllabic rate (2-6Hz), vocalic features (30-50 Hz), and fundamental frequency (100-150 Hz, in male speakers). It then analyzes whether neural activity occurs while listening to male and female speakers in French. By measuring changes in power spectrum relative to rest, it links the sound amplitude fluctuations to fluctuations in neural activity in the same frequency bands, referring to them as "theta", "low-gamma", and "high-gamma". Using Grange "causality," it clearly shows that the neural fluctuations can be predicted linearly from the sound fluctuations. Using a cross-frequency coupling measure, they further show that, in the neural dynamic, high-gamma fluctuations precede theta fluctuations.

      Strengths:

      (1) Analysis of neural activity (Figure 2 is a very compelling account of how theta, low, and high gamma observed in neural recordings closely follow the properties of the acoustic speech signal itself.

      (2) This includes phase amplitude coupling, a property that I had not previously seen described for the speech signal itself, and is here nicely demonstrated in Figure 1.

      (3) The Grange "causality" analysis makes a compelling case that neural fluctuations in these frequency bands are driven by the stimulus itself.

      (4) The finding in Figure 4 that female fundamental emerges at half the frequency in the neural activity is, to my knowledge, an entirely novel observation, not just in speech but in amplitude modulated sounds in general. This non-linear phenomenon is very interesting and prompts a host of interesting questions for future research: Does this happen only for voiced speech, does it depend on the harmonic stack of speech, or can it be produced with a single AM frequency? Are there preferred frequencies for this phenomenon?

      (5) The cross-frequency coupling measure shows a number of directed effects in the neural signal which seem to counter the predominant view in neuroscience, namely, that the phase of the slower fluctuations "organize" or "drive" the faster fluctuations seen in power, e.g. theta→gamma coupling, which here is seen to be reversed as gamma→ theta coupling, and this is not a property of sound itself. This, too, should lead to a number of follow-up studies (although there are some potential confounds here).

      Weaknesses:

      (1) The claim that different frequency bands are processed in different locations, referred to in the abstract as "multiplexing" is less well supported. The neural analysis is performed on independent components that are spatially distributed, making this claim less transparent than it could be, with other, more direct ways of treating electrode location, such as bipolar referencing.

      (2) The writing in the Introduction and Results section obscures the source of sound amplitude fluctuations at different timescales. Instead, it treats these fluctuations as some sort of discovery. This is strange because the abstract and discussions are fairly accurate on this point - namely, they are all due to well-known properties of speech. The descriptions are accurate, although I would put it slightly differently: fluctuations below 6Hz are due to varying length of sentences and words, 25Hz-50Hz are well-established stationary times of the vocal tract, and 100-150Hz are the vibration of the vocal cords in male speakers.

      (3) The problem of guiding the analysis of sound by notions from neural signals is most glaring when they restrict their analysis to less than 150Hz, which leaves out female-voiced speech.

      (4) Along with this, there is a heavy emphasis on notions of "rhythms" and "oscillations" when clearly, aside from the vocal cords, there is no evidence for rhythmic fluctuations. Any reasonable definition of a rhythm would need at least 2 or 3 cycles of a repeated pattern. A spectral "peak" for the sound envelope is shown at 5Hz. But this is not indicative of a regular rhythm. Instead, the peak appears to be an artifact of displaying power per octave rather than power spectral density. A peak in a power per octave is not a reliable indicator of a coherent oscillation, and the speech envelope does not exhibit a clear 5Hz rhythm. Unfortunately, prior literature has not been clear on this. It would be more accurate if the word "rhythm" were replaced with "fluctuation" and/or "activity" for the case of speech envelope and neural activity, respectively.

      (5) The Introduction also omits the literature on neural responses to amplitude-modulated sounds that go up at least to 200Hz and more. So the findings here on "high-gamma" are well in line with prior literature.

      (6) The fact that neural analysis was cut off at 150Hz to me is a missed opportunity to test if neural speech tracking goes all the way up to 200Hz of the typical female fundamental.

      (7) The gamma→theta effects reported here could be confounded by a simple longer delay in the analysis of theta. In fact, Figure S5 confirms that delay. It is unclear whether the CFD metric captures anything more than a temporal delay between the two signals. The term "functionally interconnected" in the abstract is a bit of a stretch; it may be essentially delayed correlation.

      (8) There is a minor concern with the claim that low-gamma drives theta amplitude. While statistics on this are reported, the corresponding figure may be suggesting an alpha-harmonic instead of theta (Figure 5c).

    3. Reviewer #3 (Public review):

      Summary:

      This manuscript investigates whether the theta-gamma phase-amplitude coupling in the human auditory cortex serves as an intrinsically generated neural mechanism for hierarchically parsing speech or not. By analyzing speech corpora across 17 languages alongside human intracranial EEG recordings, the authors demonstrate that these nested oscillatory dynamics are actually inherent, robust acoustic properties embedded within the speech envelope itself. Consequently, they claim that rather than generating parsing windows internally, the early auditory cortex acts as a temporal demultiplexer that segregates syllabic, vocalic, and pitch features into distinct, stimulus-driven neural channels. Furthermore, the study presents evidence for a reversed functional directionality wherein fast-varying gamma activity drives the phase alignment of slower theta rhythms, fundamentally reframing auditory PAC as a stimulus-evoked alignment to a highly structured external signal rather than an endogenous cognitive parsing tool.

      Strengths:

      (1) The authors demonstrated robust theta-gamma acoustic structure across languages. They analyzed the acoustic speech envelope across 17 typologically distinct languages. This establishes that the nested theta-gamma acoustic structure is a universal feature of human speech, rather than an artifact of one language's specific phonology.

      (2) The use of time-resolved, high-SNR intracranial recordings is a critical strength of this study. This approach provides the precise spatiotemporal fidelity required to confidently separate and delineate multiplexed high-frequency dynamics, particularly the low- and high-gamma bands, that are essential for accurate speech decoding but are typically attenuated or lost in non-invasive scalp recordings.

      (3) The authors move beyond standard correlational PAC metrics by employing a suite of converging analyses, including the isolation of true oscillations from aperiodic noise and the directional index. Together, these metrics demonstrate that auditory PAC is a stimulus-evoked alignment to a highly structured external speech signal, rather than an intrinsically generated top-down parsing mechanism.

      Weaknesses:

      (1) A major methodological concern is the use of ICA across SEEG electrode shafts to define distinct neural sources (SEEG-ICs). SEEG electrodes traverse complex macroanatomy, including multiple cortical layers, sulcal banks, and white matter. By constructing components derived from weights across the entire electrode, and subsequently localizing each component solely to the contact with the maximal contribution, the authors risk generating biologically implausible signals. Such an approach potentially mixes true localized cortical gray matter activity with deep structure or white matter signals. Given that a central claim of this manuscript is the spatial and functional segregation of theta and gamma neural populations, the authors could consider further validating these core findings (such as the gamma-to-theta directionality) using single-channel or bipolar-referenced data.

      (2) Another methodological concern is the use of GC to evaluate the directional causality between speech and neural signal. As noted in Bastos & Schoffelen (2015) and indeed acknowledged by the authors' own citation of Nolte et al. (2010), Granger Causality is highly sensitive to SNR imbalances and filtering artifacts. Given the inherent SNR disparity between a cleanly extracted acoustic envelope and noisy SEEG data, coupled with the known distortions introduced by distinct filtering pipelines (Barnett & Seth, 2011), the GC results may reflect methodological artifacts rather than true physiological driving.

      (3) The third concern is the study's exclusive reliance on linear metrics applied to the envelopes of band-filtered speech and neural signals, e.g., linear Granger Causality and cross-correlations. The human auditory system is an inherently non-linear dynamical system. Complex acoustic features, such as rapid spectrotemporal transitions or dynamic pitch trajectories, often drive non-linear neural responses and complex phase-locking behaviors. While the linear models provide strong interpretable results, by restricting their connectivity and directionality metrics to linear autoregressive models, the authors may be missing substantial non-linear interactions, or conversely, forcing a linear fit onto non-linear data, which can distort estimations of causality and temporal lags. The authors should consider explicitly addressing this limitation in their discussion. Ideally, they should validate their core directional claims on a subset of the data using an information-theoretic, non-linear metric (e.g., Transfer Entropy or Mutual Information), or apply linear methods to nonlinearly abstracted features (e.g., phonemic, syllabic, intonational-level features), to ensure their linear assumptions are not masking or misrepresenting the true underlying dynamics.

    1. Reviewer #1 (Public review):

      Summary:

      The authors report results from an EEG study investigating neural oscillations in 8-month-old infants, as well as an adult control group. Participants were presented with cartoon figures flickering at different frequencies, as well as a broadband condition. While adults showed the well-known dominant response at 10 Hz, infants showed dominance resonance at 4 Hz, irrespective of stimulation frequency. The authors interpret this finding as evidence for the fundamental role of 4 Hz oscillations in early development and discuss two conflicting theories regarding the underlying functionality.

      Strengths:

      Overall, this is a very well-designed and rigorous study, and the results significantly add to our understanding of a very fundamental aspect of early brain activity. The study is embedded in a coherent theoretical framework, and the authors discuss possible implications and next steps with great clarity.

      Weaknesses:

      I see relatively few weaknesses in this paper. It does not statistically compare infant and adult responses, which would add to the argument that infant responses actually differ from adult ones, but I don't think this is necessary at this point for the authors' argument.

      In contrast, I actually like about the paper that the authors had a very clear vision of what they wanted to look at - 4 Hz oscillation responses in 8-month-olds - and this is exactly what they did. Yes, this does not answer all questions one might have, especially about the function of 4-Hz-oscillations in infants, but it goes a long way in characterising properties in 4 Hz oscillations, which provides the starting point for several potential future lines of research.

    2. Reviewer #2 (Public review):

      Summary:

      This study combines EEG with frequency-tagging and broadband stimulation paradigms to investigate the developmental precursors of brain rhythms in 8-month-old human infants. The manuscript employs state-of-the-art methods, focusing on theta and alpha rhythms to assess their functional significance in visual information processing.

      By evaluating responses to visual stimulation at different frequencies and broadband stimulation presented simultaneously with sounds, the authors report a stimulation frequency-independent response at ~4 Hz. They interpret this as the precursor of the adult alpha rhythm involved in perceptual echo mechanisms. However, I have a number of questions regarding the hypotheses, experimental framework, and analytical approach that need to be addressed before confirming the conclusions.

      Strengths:

      (1) The analyses are innovative, and the frequency-tagging paradigm is particularly well-suited for studying challenging populations with short protocols.

      (2) The sample size is adequate.

      Weaknesses:

      There is a gap between the hypotheses and the experimental paradigm, as well as between the hypotheses and the analytical choices. These gaps could alter the interpretation of the findings and thus require clarification (or perhaps a reformulation of the theoretical framework).

      I am not convinced that the conclusion - that the theta rhythm is the functional precursor of the alpha rhythm in the infant visual system - holds without addressing the following questions.

      In brief, my specific concerns are the following:

      (1) Gap Between Hypotheses and Experimental Paradigm:

      The experimental paradigm involves the simultaneous presentation of sound and image, i.e., cross-modal sensory information, which contrasts with the manuscript's theoretical framework and conclusions, all of which are grounded in visual information processing. Previous work has shown that preverbal infants spontaneously engage in cross-modal associative learning in such audiovisual paradigms (e.g., Kabdebon et al., 2019). This raises the question of whether the paradigm taps into different mechanisms - such as associative learning - rather than those hypothesized, and whether these mechanisms might better explain the observed 4 Hz response. Associative learning mechanisms are particularly relevant to theta rhythm, involving hippocampal learning and the engagement of wider networks, including frontal areas.

      Given this cross-modal design, I question whether it might alter the interpretation of the paradigm and the conclusions drawn. The current framing of the manuscript suggests that theta/4 Hz is the functional equivalent of the alpha rhythm for visual processing in the 8-month-old brain. However, the use of multisensory input complicates this conclusion for the visual domain and the parallel to adult mechanisms.

      Kabdebon, C., & Dehaene-Lambertz, G. (2019). Symbolic labeling in 5-month-old human infants. Proceedings of the National Academy of Sciences, 116(12), 5805-5810.

      (2) Analytical Focus - Gap Between Hypothesis and Analysis Choices:

      The link between the literature described in the introduction and the hypothesis of a 4 Hz inherent rhythm in the visual system remains unclear. This puzzles me as to why the analyses focused on 4 Hz and a control band that is not adapted to the infant population. The focus of the analyses on 4 Hz (and the control band analyses) overlooks the critical frequency range (~6-8 Hz), which other studies have suggested may serve as proxies for the adult alpha rhythm. This omission does not align with the hypotheses regarding the role of the alpha rhythm in visual information processing.

      The introduction discusses both alpha rhythm and its significance in perceptual echo phenomena, and theta rhythm and its role in mnemonic function, but these remain as separate phenomena. While the paradigm aims to assess perceptual echo phenomena in infants, one would expect the hypothesis to relate to precursors of the alpha rhythm in infancy (slower frequencies, yet related to alpha, ~6 Hz; Stroganova et al., 1999). However, the authors hypothesize that theta rhythm (4 Hz) is a precursor of the alpha rhythm in infancy: "Given the prominence of the theta rhythm in infancy, we expected the presence of a 4 Hz theta response and resonant activity in the infant visual system upon periodic stimulation and broadband visual input, respectively."

      Why did the authors not study the 6-9 Hz frequency range, which previous work suggests may serve as a proxy for alpha in infants? Currently, the analyses are restricted to the theta range (i.e., 4 Hz) and a control band (adult-classical alpha range [8-14 Hz]), but [8-14 Hz] is not adapted to the infant population. At this age, prior work has reported ~6 Hz as the age-adapted range corresponding to alpha. It would be more appropriate to investigate this range. I can see some trace of this in Figure 2a, but perhaps this is weaker compared to the 4 Hz stimulation due to the cross-modal nature of the paradigm.

      Stroganova, T. A., Orekhova, E. V., & Posikera, I. N. (1999). EEG alpha rhythm in infants. Clinical Neurophysiology, 110(6), 997-1012.

      In the adult results, we also see similar ("two types of") responses: the main response at 8 Hz, which to me is the upper band of the theta rhythm (related to cross-modal learning), and traces around 10 Hz, which are more in line with perceptual echo mechanisms. The cited literature in adults (VanRullen & Macdonald, 2012), on which the authors base their framework and analysis, indicates a response at 10 Hz (not 8 Hz). This supports the idea that the 8 Hz response observed in this work might be related to the cross-modal presentation of stimuli. The authors could evaluate this more easily through a control group of adults with an unimodal (visual-only) presentation of stimuli.

      (3) Methodological Approach and Clarity:

      The methodological approach is not sufficiently detailed, which is crucial for reproducibility and wider contribution, especially given the difficulties in studying infants. Key points requiring clarification include preprocessing, choice of electrode clusters, and statistical details.

    3. Reviewer #3 (Public review):

      Summary:

      The authors aim to characterize the intrinsic temporal dynamics of the infant visual system by examining how it responds to rhythmic visual stimulation. Using EEG in 8-month-old infants, they present visual stimuli that flicker at different periodic frequencies as well as broadband (aperiodic) luminance sequences to probe resonance properties of the visual system. The central goal is to determine whether the infant brain exhibits a characteristic oscillatory response independent of the external stimulation frequency, analogous to the well-known alpha (~10 Hz) resonance of the adult visual system. The results are then compared with data from a small adult sample to assess whether the dominant processing rhythm of the visual system shifts across development.

      Strengths:

      This manuscript presents a compelling and carefully executed study with intriguing findings, and I greatly enjoyed reading it. Several strengths deserve particular mention:

      (1) Clear and focused research approach. The study addresses a well-defined question regarding the intrinsic rhythmic dynamics of the infant visual system and applies an elegant experimental paradigm to probe these dynamics directly.

      (2) Well-designed parametric stimulation paradigm. The use of rhythmic visual stimulation across multiple frequencies (2-30 Hz), combined with broadband stimulation, provides a systematic way to characterize resonance properties of the visual system. This parametric approach allows the authors to clearly visualize the relationship between stimulation frequency and neural response, making the key effects easy to grasp.

      (3) Strong statistical power in the infant sample. The relatively large infant sample (N = 42) is a major strength, particularly given the challenges of infant EEG research. This sample size provides sufficient power to support the conclusions about the robustness of the ~4 Hz response in infants.

      (4) Converging analytical approaches. The authors combine periodic stimulation analysis with impulse-response-function (IRF) analyses of broadband stimulation, which provides complementary evidence for the presence of a ~4 Hz resonance in the infant visual system. This convergence strengthens the interpretation of the results.

      (5) Direct developmental comparison. Although the adult sample is small, including adults in the same paradigm provides a useful benchmark showing the expected alpha-band response (~8-9 Hz), thereby contextualizing the infant findings within a developmental framework.

      Weaknesses:

      (1) Potential oculomotor contribution to the frontal 4 Hz effect. My main concern relates to the interpretation of the prominent ~4 Hz response in infants, particularly at frontal electrodes. The frequency range is close to what might be expected for oculomotor activity such as microsaccades, and the scalp distribution appears suggestive of such a contribution. Notably, the topography of the 4 Hz response differs substantially from the topography of the harmonic responses (Figure 2B), which show the expected occipital dominance. The latter is more clearly visual, whereas the former is more complex, definitely going beyond visual responses. This should be considered more in the discussion.

      (2) Differences in topography between periodic and IRF effects. The spatial distribution of the 4 Hz response during periodic stimulation also appears to differ from the topography of the 4 Hz impulse response function (IRF; Figure 2B vs 3D). The IRF response appears not really "visual" in its spatial distribution, as compared to, e.g. the harmonic responses in 2B. This difference could indicate distinct underlying generators, but the implications of this discrepancy are not discussed in detail.

      (3) Strength of the interpretation of neural resonance. Taken together, these observations make it difficult to determine conclusively whether the observed 4 Hz activity reflects genuine neural resonance of the visual system or potentially other processes (e.g., oculomotor dynamics). While the current findings remain interesting under either interpretation, the manuscript tends to favor the neural resonance account quite strongly without fully addressing alternative explanations.

      (4) Relation to known developmental shifts in resting-state oscillations. The dominance of lower-frequency rhythms (theta range) in infancy is well documented in the resting-state EEG literature. Although this point is briefly mentioned in the discussion, it would be interesting to relate the current findings more directly to this literature. For example, it would be informative to know whether peak frequencies observed here align with resting-state theta peaks in infants and whether similar spatial distributions are observed.

      (5) Limited follow-up of the proposed theoretical accounts. The discussion introduces both mnemonic and inhibition accounts for infant theta activity. However, these frameworks are not fully developed in relation to the present data. In particular, the mnemonic account might generate testable predictions within the current dataset, for example, whether theta responses change over time with repeated stimulus exposure or learning.

      (6) Characterization of the adult alpha response. A minor point concerns the characterization of the adult resonance frequency. The manuscript often refers to a 10 Hz alpha resonance, whereas the data presented here show a peak around ~8 Hz (Figure 5A). In that frequency range, that is a lot. Also, there seems to be some variability, such that for the topography, the authors use the "individual alpha frequency". It would be interesting to see the distribution of peak frequencies across participants to appreciate the actual range. Interestingly, the spatial distribution of the alpha response also appears quite similar to the infant 4 Hz effect (Figure 5B) and differs from the harmonic responses, which may deserve further discussion. A comparison with resting-state alpha characteristics could also be informative here (e.g., does the peak IAF during visual stimulation relate to IAF recorded at "rest").

    1. Reviewer #1 (Public review):

      Summary and Strengths:

      Shin et al deepen our understanding of high-frequency oscillations in the frontal cortex during REM in a manner that sheds important light on the roles of these events. In particular, they reveal that cortical HFOs are modulated by theta oscillations, occur in chains and recruit cortical neuronal activation patterns in a manner that is distinct from other high-frequency events during non-REM or in the hippocampus. They also show that these events occur during increased oscillatory cross-talk between hippocampus and cortex and may protect cortical neurons from downregulation of firing during sleep. Overall, this is important work with several novel observations pointing towards an important role for these events that will become increasingly understood over time.

      I also wanted to comment that 2D is a beautiful illustration of separate and essentially exclusive communication channels used during HF events in NREM vs REM. They almost perfectly complement each other's frequencies.

      Weaknesses:

      I have only one major scientific critique: I believe we need to see quantification of how phasic REM theta waves with versus without HFOs differ. What do REM HFOs add to the "normal" theta oscillation? Without this comparison, it is more difficult to interpret the meaning of these events. Given that HFO chains have IEIs around the time of a theta cycle duration, are the repeating spiking activities stronger during HFO repeats than during adjacent theta waves without HFOs? What percentage of theta waves contain HFOs, and what is the firing rate during those theta waves with vs without HFOs? Is there differential firing rate modulation? The authors may even consider that all REM-HFO-specific quantifications should be shown as differential from phasic theta cycles without HFOs.

      As a non-scientific comment on the manuscript itself: unfortunately, the paper is difficult to read and understand at times, requiring great effort by the reader. This is to an extent that communication is hindered. The paper is dense with changing methods, often from panel to panel. Unfortunately, the panel quantifications are not explained in the results section in a manner that readers can understand without going to read the methods, often for each individual panel. These measures should be explained in a way that lets readers understand the conclusions of each panel and what gross calculations were used to reach those. Instead, too much jargon is used rather than clear descriptions of the overall calculations being done for each panel. 


      The authors mention in the discussion section that they see increased functional connectivity between mPFC and CA1, but most data suggesting this seems to be based on LFP rather than spiking. Functional connectivity is best defined by spiking-spiking relationships. And these authors have spiking data. So I believe either the descriptive language should be pulled back to something like "oscillatory coupling" or more analyses should be dedicated to showing spike-spike coordination across regions.

    2. Reviewer #2 (Public review):

      Summary:

      In this study, the authors investigate high-frequency oscillations (HFOs) in the prefrontal cortex during REM sleep. They identify a specific pattern where these HFOs occur in "chains" that are phase-locked to theta oscillations, primarily during the "phasic" periods of REM. The study contrasts these events with isolated HFOs and NREM ripples, suggesting a unique role for these chains in coordinating activity between the prefrontal cortex and the hippocampus. Most notably, the authors report that a specific subset of hippocampal cells-those that co-fire with the prefrontal cortex during these HFOs-increase their firing rates over the course of sleep, suggesting a potential mechanism for selective memory consolidation.

      Strengths:

      The study addresses an under-explored area of sleep physiology: the fine-grained temporal coordination between the cortex and hippocampus during REM sleep. The identification of HFO "chains" and their association with higher theta power provides an interesting framework for understanding how the brain might organize information transfer outside of NREM sleep. The observation that specific hippocampal populations show differential firing rate changes based on their participation in these HFO events is a striking finding that warrants further investigation.

      Weaknesses:

      The primary weakness of the study lies in the lack of a clear distinction between global brain states and the specific events being analyzed. Because the authors compare HFOs across different sleep stages (NREM, tonic REM, and phasic REM) without sufficient controls, it is difficult to determine if the observed differences are intrinsic to the HFOs themselves or simply a reflection of the different physiological states in which they occur.

      Furthermore, the evidence for "structured reactivation" is not yet convincing. The temporal alignment of these reactivation events appears inconsistent, with peaks occurring well before the HFO itself, and the analysis does not sufficiently control for pre-existing cellular assembly strengths. Additionally, some of the sleep architecture presented appears atypical, such as very short REM bouts and direct NREM-to-REM transitions that bypass standard progression, raising questions about the consistency of the sleep detection across animals. Finally, the study does not account for potential confounds like baseline firing rates when interpreting the behavior of "high-cofiring" neurons, which may simply be the most active cells in the population.

    3. Reviewer #3 (Public review):

      Summary:

      Shin et al. examine hippocampal-prefrontal interactions during sleep using simultaneous CA1 and prefrontal cortex recordings in rats performing a spatial memory task. They identify high-frequency oscillation (HFO) events in PFC during REM sleep that occur in theta-modulated chains and are associated with increased CA1-PFC coherence and sequential, sparse reactivation of cortical ensembles. This pattern contrasts with the synchronous reactivation observed during NREM cortical ripples. Together with a simple cholinergic network model, the authors propose that REM HFO chains represent a distinct mechanism for hippocampal-cortical coordination that complements NREM ripple-mediated processing during sleep.

      Strengths:

      A major strength of the work is the extensive electrophysiological dataset, which includes simultaneous recordings of large neuronal populations in both hippocampus and prefrontal cortex across behaviour and subsequent sleep. The analyses linking high-frequency events to population dynamics, interregional coherence, and ensemble reactivation are technically sophisticated and provide an incredibly detailed description of REM-associated cortical activity patterns. In particular, the demonstration that REM HFOs occur in chains aligned to theta phase and organise sequential activation of cortical assemblies represents a potentially important advance in understanding the neural structure of REM sleep activity. The integration of experimental data with a computational model further provides a useful framework for interpreting the observed differences between REM and NREM network states in terms of neuromodulatory influences.

      Weaknesses:

      While overall this study provides a highly valuable body of work, there are two primary limitations, which, if overcome, would provide substantially more significance to the overall characterisation of REM HFOs. Specifically:

      (1) Distinction from wake HFOs

      The results largely support the authors' claim that REM HFO chains represent a distinct pattern of neural coordination compared to NREM cortical ripples. The analyses consistently show differences between REM and NREM events in terms of neuronal modulation, ensemble structure, and interregional coupling. However, similar high-frequency events during wake are not examined. Since REM sleep shares several network features with wakefulness, including strong theta oscillations, evaluating whether comparable PFC HFOs occur during wake would provide clarity on whether these events are specific to REM sleep (and its associated functions) or represent a more general theta-associated phenomenon.

      (2) Link to memory consolidation

      The manuscript proposes throughout that REM HFO chains may contribute to memory consolidation by coordinating hippocampal-cortical reactivation, but the evidence for this functional role remains indirect. The authors do highlight this as a limitation of the study - the inability to link their findings to learning - but it is not clear why. Further details of the behaviour results should be included. If no learning occurred across the eight behavioural sessions, this should be reported. If learning did occur, but could not be linked to HFO events, this should also be reported.

    1. Reviewer #1 (Public review):

      Summary:

      The manuscript presents a three-dimensional and molecular atlas of the adult hagfish brain to investigate the evolutionary origin and early diversification of vertebrate brain organization. Using whole-brain tissue clearing, light-sheet microscopy, and computational reconstruction, the authors generate a high-resolution 3D anatomical model of the hagfish brain. They complement this structural analysis with gene-expression profiling of neurotransmitter systems and receptors, including glutamatergic, GABAergic, cholinergic, serotonergic, and dopaminergic markers.

      Strengths:

      Together, the work aims to establish a modern neuroanatomical reference for the hagfish. Given the phylogenetic importance of hagfish as one of two extant species of cyclostomes (the other being lamprey), and the fact that the hagfish brain has barely been studied in contrast to the lamprey, the atlas provides a foundational resource and should be of interest to evolutionary and comparative neurobiology.

      Weaknesses:

      However, there are several places where both data presentation and the narrative can be improved and clarified, and particularly some of the homology and evolutionary claims seem to be superlative and need to be toned down. I present more detailed comments below:

      (1) The authors spend too much effort trying to convince readers of the monophyly of hagfish and lamprey to stress its importance for evolutionary comparisons. This is now well accepted; instead, there could be more details on some of the specific, unique features of the hagfish brain relevant to a comparative atlas. For instance, the unusual fusion of the telencephalon anteriorly with the olfactory bulb and posteriorly with the diencephalon (Wicht and Northcutt, 1992), the degenerate visual system, the absence of the pineal gland, and the oculomotor system can be discussed in reference to the generated atlas and examined marker expression in related structures and their possible identity.

      (2) The assertion that the MGE is absent in the lamprey is incorrect based on Sugahara et al. (2016; 2017), who identified lamprey paralogues of Nkx2.1/2.4 that are expressed in the ventral subpallium. This should be corrected.

      (3) The major contribution of this study, in my mind, is the "three-dimensional atlas" of the hagfish brain. However, the atlas itself is not presented; A video of the 3D reconstructed Nissl-stained hagfish brain would be an important data resource and should be added. Annotations of forebrain, midbrain and hindbrain regions and constituent major structures can also be illustrated, which will be a useful resource.

      (4) In the pallium, there seems to be an inner GABAergic cell layer and inner and outer glutamatergic cell layers, as noticed in lampreys (Suryanarayana et al., 2017). What are the overall proportions of glutamatergic and GABA neurons? In the images, it does seem that vGlut neurons are present in both P2 and P4, while there appear to be more GAD neurons in P4.

      (5) As a general comment, homology claims should be toned down throughout the manuscript. This would at least require some connectivity data or transcriptomic analysis for any possible suggestions; the current data, with few markers, are insufficient for any reasonable comparisons.

      (6) Expression of Pax6 and AChE is not sufficient to suggest a cerebellum-like structure. While it is true that embryonic Pax6 expression in the rhombic lip of the hagfish embryo is more comparable to other vertebrates than lamprey, and the presence of a rudimentary cerebellum-like structure would be of great interest, the evidence is too limited for such claims and should be toned down.

      (7) Again, expression of Tbr1 and GAD1 in NCvl neurons does not suggest that these could be hippocampal neurons. One would at least need to rule out expression of prethalamic markers and demonstrate the presence of pallial markers through transcriptomic data (as in Lamanna et al., 2023).

      (8) Presence of GABAergic neurons in the striatum - is there any data on expression of dopamine receptors, particularly given the seeming loss of the D2 receptor subtype in the hagfish?

    2. Reviewer #2 (Public review):

      Summary:

      The work of Harada and collaborators fills an important gap in our knowledge of neuronal identities in the adult hagfish brain. There is essentially no modern, cell-type-level characterisation of neuronal identity in the hagfish brain yet. Existing data are limited to classical neuroanatomy (e.g. Nieuwenhuys) and sparse transmitter/gene-expression studies, mostly in embryos (e.g. work from the Kuratani lab). This study reveals a very broad peculiar pattern of dopaminergic identities and a strikingly unusual pattern of serotonergic transmission, with serotonergic cell bodies present in the telencephalon, which is uncommon for vertebrates and contrasts with previous reports (e.g., Kadota, 1991).

      Strengths:

      The three-dimensional reconstruction of the brain, including the ventricular system, is novel and very useful. Most of the neurotransmitter identity patterns presented here have not been previously described, and those that were published earlier, such as the serotonergic system (e.g. Kadota, Nieuwenhuys, Wicht), are old and would clearly benefit from re-evaluation using more modern approaches.

      Weaknesses:

      Neurotransmitter identities are highly relevant for interpreting the possible presence of LGE/MGE territories in hagfish (e.g. GABAergic patterns), for characterising the raphe nuclei (e.g. serotonergic system), and for refining our understanding of the central prosencephalic complex in relation to other vertebrate brain architectures. However, the authors do not address these points and overlook recent evidence from the amphioxus brain that could help interpret their results in an evolutionary context. Overall, the results are insufficiently discussed in relation to the current state of the art.

      The study would clearly benefit from complementary gene expression profiling to place these neurotransmitter patterns within a broader framework of brain partitions, to enable more direct comparisons with other vertebrates, and, importantly, to interpret them in relation to the prosomeric model. Furthermore, the work lacks appropriate controls for the in situ hybridization experiments; Datx2 does not show any expression, so there is currently no evidence that this probe is functional. Including such controls would also strengthen the overall description of the dopaminergic system, especially given that the expression patterns of the different genes analysed appear very diffuse and somewhat random.

    1. Reviewer #1 (Public review):

      Summary:

      The manuscript provides a well‑argued discussion of the misalignment between common predictive performance evaluations reported in the literature and actually measuring clinical utility in the context of predictive psychiatry. Specifically, the authors discuss measurement reliability and prevalence as two neglected factors which can substantially inflate the assessment of model performance for clinical practice. To mitigate this, the authors offer a concrete framework and an accompanying web tool, with which to adjust performance metrics and additional predictive‑value and decision‑analytic measures.

      Strengths:

      The manuscript speaks convincingly about the risk of face validity and the practical irrelevance of seemingly promising predictive models in psychiatry. The authors outline how predictive performance estimations often fail to generalize to clinical contexts and thereby potentially mislead scientific efforts. In the face of ubiquitous biomarker models and incremental improvements in the literature, the reader is reminded that, irrespective of the glory of the proposed model, low reliability of clinical measurements fundamentally affects (and limits) both effect sizes and predictive performance ("garbage in, garbage out"), and that neglecting this can ultimately lead to misinformed decisions in the treatment of individual patients. The provision of an online tool with a user‑friendly interface and clearly worked examples is a major practical asset that will facilitate the adoption of the proposed framework beyond quantitative methodologists.

      Weaknesses:

      While the outlined issues highlight important aspects in the translational gap, the suggested solutions remain somewhat theoretical. For example, the use of prevalence might not reflect what a model would see in practice, assuming that population prevalence and the composition of actual clinical cohorts are aligned. Accounting for who presents to care, and under which referral or triage patterns, is a crucial determinant of effective base rates. While the authors do acknowledge the importance of using base rates from the target population, these nuances could be emphasized more prominently at the points where practical recommendations are made. Relatedly, the analytical context and the methodological assumptions are not clearly specified. Many arguments and demonstrations are derived in univariate, group‑comparison settings and then discussed in a way that can be read as broadly applicable.

    2. Reviewer #2 (Public review):

      Summary and strengths:

      The authors present a description of their online tool to estimate real-world performance of predictive models. The authors bring together different calculations to make better-informed implementation choices. It is a very nice tool to go from effect sizes to base rates to decision curve analysis. The paper describes the background and use of the tool with examples and seems like an extended version of their online how-to. The methods themselves are not new, but I think the tool will be valuable for researchers from different fields. Tools already exist for the conversion of effect sizes (my current favorite is https://www.escal.site/), but I haven't seen measurement noise being incorporated previously. The main benefit is the evaluation of performance under different real-world scenarios. Code is available on GitHub, and the manuscript is well-written.

      Weaknesses:

      While comprehensive explanation and examples are important for correct use of the tool, I don't really see the added value above their online how-to guide, as the software itself has already been published (Karvelis, P. and Diaconescu, A. O. (2025b). E2p simulator: An interactive tool for estimating real world predictive utility of research findings. Journal of Open Source Software, 10(114):8334.)

    3. Reviewer #3 (Public review):

      Summary:

      This important work provides a web-based tool to contextualize effect sizes in psychiatry with respect to reliability and base rates (collectively referred to as predictive utility analysis). The methods for the tool incorporate established psychometric principles that I think are of use for multiple fields in this seemingly easy-to-use tool. I agree with the critical importance of this tool and the methodological points made in this manuscript. Enthusiasm for the manuscript is weakened by a lack of clarity on the formulation of the paper and stated goals of the examples used, with the inferences and impact on clinical decision making from various parameterizations via this tool left open-ended.

      Strengths:

      This paper presents a well-considered and, what I think will be highly useful, web-based tool to contextualize effect sizes with respect to reliability and base rates. As the authors rightly point out, such a tool could be used in conjunction with widespread analytic power analysis tools in study planning. The paper also well contexualizes the need for such a tool in the relatively recent history of concerns of power, reliability, and inference in psychiatry specifically, and more general meta-scientific debates in psychology and neuroscience.

      Weaknesses:

      My primary feedback on this manuscript is the lack of clarity in what the paper itself, specifically, separate from the tool, is hoping to achieve. There is a central, but unresolved, tension in whether the reader is supposed to:

      (1) focus on the specifics of the examples used and whether to reevaluate the substantive claims from the studies, (2) buy in to how various reliability and base rate parameters impact modeling outcomes, (3) receive an introduction to the tool itself.

      In my estimation, the largest contribution to the field here is in (2) and (3), but currently much of the real estate of the paper is dedicated to several examples of (1). While these specific examples may be illustrative to some degree, I think given the number and brevity of such, they are unlikely to incidentally achieve points (2) and (3) above. Specific examples include the assertion of kappas for DSM diagnoses, without much nuance (e.g., see https://psycnet.apa.org/buy/2015-27500-001). Given the relatively limited space given to this example, however, it's hard to be entirely certain what the reviewer should take away.

      A second point of concern is where this tool would be situated in the research pipeline. I agree with the authors that this tool could be used in ways that parallel power analysis. With that in mind, it seems the most common use of this tool for an individual investigator is likely to be in a priori study planning. In contrast, and with my point above in mind, the use of the tool for existing results is likely best done with multiple estimates of effect sizes, reliability, and base rates, as is common in meta-analysis or consensus reviews. Nevertheless, there is no real example or guidance around how this influences new study planning.

      A third point is that more nuance would be useful in the introduction about the current state of psychiatry research. For example, I share many of the authors' concerns about reliability, power, reproducibility, and barriers to translation. That said, it is the case that while effect sizes should be considered considerably more, they are widely considered in psychiatry research via the common place of meta-analysis and other data pooling approaches. Another such example that the authors state in the context of reliability: "However, this [reliability] attenuation is rarely accounted for in routine analyses in psychiatry". This is true in practice, but somewhat misleading insofar as the method by which to do this remains unclear. For example, should we all report disattenuated associations, assuming there is no error and everything is perfectly reliable? This, of course, would be unrealistic to expect zero error. That we can achieve this with the new tool is clear, but the nuance of how and under what circumstances it should be done is not clear, and such nuance should be better reflected in the framing of the problem. That is, there is also a lack of clarity on what ought to be best practices and field-wide goals, rather than simply the lack of an ability to model these factors.

      Minor point

      For conceptual clarity, it would benefit the manuscript to at least briefly mention the role of validity in translational importance. Of course, the current psychometric issues of reliability, base rate, power, etc are critical, but it should at least be mentioned, given the potential wide audience of this manuscript, validity is important as well. For example, highly reliable measures may not be valid indicators of underlying disease etiology (e.g., fMRI head motion is a highly reliable trait-level feature, but typically not considered an important predictor or consequence of mental health worth investing translational resources in). Relatedly, confounding as a general topic would be useful to mention just briefly, to help with the spirit of considering underlying issues in translation.

    1. Reviewer #1 (Public review):

      Summary:

      Here the authors attempted to test whether the function of Mettl5 in sleep regulation was conserved in Drosophila, and if so, by which molecular mechanisms. To do so they performed sleep analysis, as well as RNA-seq and ribo-seq in order to identify the downstream targets. They found that the loss of one copy of Mettl5 affects sleep, and that its catalytic activity is important for this function. Transcriptional and proteomic analyses show that multiple pathways were altered, including the clock signaling pathway and the proteasome. Based on these changes the authors propose that Mettl5 modulate sleep through regulation of the clock genes, both at the level of their production and degradation, possibly by altering the usage of Aspartate codon.

      Comments on revisions:

      The authors addressed all my comments satisfactorily.

    2. Reviewer #3 (Public review):

      Xiaoyu Wu and colleagues examined a potential role in sleep of a Drosophila ribosomal RNA methyltransferase, mettl5. Based on sleep defects reported in CRISPR generated mutants, the authors performed both RNA-seq and Ribo-seq analyses of head tissue from mutants and compared to control animals collected at the same time point. A major conclusion was that the mutant showed altered expression of circadian clock genes, and that the altered expression of the period gene in particular accounted for the sleep defect reported in the mettl5 mutant. In this revision, the authors have added a more thorough analysis of clock gene expression and show that PER protein levels are increased relative to wild type animals a specific times of day, indicating increased stability of the protein. Given that PER inhibits its own transcription, the per RNA is low in the mutants. The revised manuscript included efforts toward a more detailed understanding of how clock gene expression was altered in the mutants, as well as other clarification of sleep phenotypes.

      Comments on revisions:

      All critiques have been addressed by the authors; the manuscript is much improved from its original submission. Thank you.

    1. Reviewer #1 (Public review):

      Summary:

      This study aims to clarify MATR3's function and molecular mechanism in oocyte growth and maturation, explore its association with OMA, and its potential as a diagnostic and therapeutic target using specific knockout mouse models, human OMA samples, and multi-omics technologies. And it has fully achieved preset objectives with results strongly supporting conclusions. Specifically, it addresses the gap in the synergistic mechanism of epigenetic and secretory signals regulated by RNA-binding proteins (RBPs) in oocyte growth and enriches the molecular etiological spectrum of oocyte maturation disorders. It is the first time the conservative function of MATR3 has been revealed in multiple species, providing a paradigm for cross-species research on RBPs in the field of reproductive biology. It also provides a new candidate target for OMA, a clinically refractory infertility disease, and is expected to promote the optimization of assisted reproductive technology and the development of precision medicine.

      Strengths:

      The strengths of this study are significant and prominent. First, the research system is comprehensive, integrating knockout mouse models, in vitro knockdown models, multi-species (mouse, porcine, and human) verification, combined with scRNA-seq, LACE-seq, CO-IP, and other multi-omics and molecular biology technologies, forming a complete and progressive evidence chain. Second, the mechanism analysis is in-depth, clarifying the dual molecular mechanisms of MATR3 regulating the transcriptional synthesis and secretion of GDF9 through "recruiting KDM3B to regulate H3K9me2 demethylation" and "directly binding to Rdx mRNA", with a clear logical closed loop. Third, the clinical correlation is close. It is the first time to find abnormal nuclear localization of MATR3 in oocytes of OMA patients, providing new clues for clinical disease mechanism research, and verifying the downstream function of GDF9 through rescue experiments, effectively enhancing the translational value of the results.

      Weaknesses:

      This study included only one OMA patient's oocyte sample. Without clinical screening for MATR3 mutations or abnormal expression, establishing a causal relationship between MATR3 and OMA remains difficult.

    2. Reviewer #2 (Public review):

      Summary:

      This study investigates the role of MATR3 in oocyte development and folliculogenesis using conditional knockout mouse models together with in vitro follicle culture and molecular analyses. The authors aim to determine whether MATR3 regulates oocyte maturation and follicle development and to explore potential mechanisms linking MATR3 function to transcriptional and epigenetic regulation in growing oocytes.

      Strengths:

      A major strength of the work is the use of a conditional knockout mouse model combined with complementary in vitro follicle culture approaches, which together provide a useful framework for examining gene function during oocyte development. The study also attempts to integrate cellular phenotypes with molecular analyses of transcriptional activity and epigenetic markers.

      Weaknesses:

      Several weaknesses limit the strength of the conclusions. These include insufficient validation of key experimental manipulations (such as the efficiency of MATR3 knockdown in siRNA experiments), limited quantification or statistical analysis for some datasets, inconsistencies between the text and presented data in certain figures, and incomplete methodological descriptions that make it difficult to fully evaluate reproducibility.

    3. Reviewer #3 (Public review):

      Summary:

      The study aims to elucidate the dual molecular mechanisms of the RNA-binding protein MATR3 in oocyte growth and maturation. The authors propose that MATR3, highly expressed in growing oocytes (GOs), regulates oocyte quality through two pathways: epigenetically, by recruiting KDM3B to remove the repressive H3K9me2 mark at the Gdf9 locus to activate transcription; and post-transcriptionally, by binding Rdx mRNA to maintain microvillus structure for GDF9 secretion. This mechanism ensures oocyte-granulosa cell communication and female fertility. The study also explores the link between MATR3 and human oocyte maturation arrest (OMA).

      Strengths:

      The study proposes an innovative dual-mechanism model encompassing "epigenetic transcriptional activation and cytoskeletal regulation," which not only expands the functional understanding of RNA-binding proteins in chromatin regulation but also reveals the coordination between nuclear transcription and organelle structure. By integrating scRNA-seq and LACE-seq, the authors constructed a comprehensive regulatory network for MATR3, identifying both key targets and numerous potential molecules, thereby providing rich resources for future mechanistic studies. Furthermore, the inclusion of oocyte samples from human OMA patients directly links the basic findings to clinical reproductive disorders. Despite the limited sample size, this approach demonstrates strong translational potential.

      Weaknesses:

      The partial phenotypic improvement achieved by exogenous GDF9 supplementation suggests that the downstream effector pathways may involve a more complex network regulation, implying that the current interpretation of GDF9's central role could be further explored. Regarding the developmental abnormalities of granulosa cells in the conditional knockout model, their pathological origins require in-depth analysis to determine whether they represent primary alterations or secondary adaptive responses resulting from the loss of oocyte signaling.

    1. Reviewer #1 (Public review):

      The manuscript by Fisher et al describes the molecular mechanism underlying how G beta gamma subunits engage with the beta 3 isoform of PLC. The paper used a combination of cryo EM, BRET assays, and biochemical assays of PLC beta activity. A key discovery is that G beta gamma is not sufficient to drive membrane binding by itself, and instead promotes G alpha activation. The work is important, but suffers slightly from some ambiguity in the actual interface that is present in their cryo EM model, as crosslinkers could stabilise a transient and non-native complex. This is somewhat abrogated by the careful mutational analysis, which shows that mutation of any of these three sites does somewhat block PLC beta G beta gamma activation. However, there could be some improvement in the presentation of this data, as well as possible mutant selection. Overall, this paper is a nice complement to the Falzone et al paper, showing the membrane-bound complex of PLCB3 on membranes, with this work building on this work, highlighting the importance this will have in our full understanding of PLC beta activation.

      Major concerns:

      My biggest concern is the potential that this interface is artefactual based on the crosslinking strategy utilised. Here are thoughts on how this could be better validated, presented in a more convincing way.

      (1) The authors' main claim is that there is a degree of plasticity of G beta gamma binding to the PLC beta 3 isoform, with three possible binding sites. The main complication of this is, of course, the possibility that the crosslinking stabilises a non-native complex, driven by a mutated cysteine.

      Because of this, any other additional details about this interface are going to be critical for the scientific audience to judge if this is accurate.

      What would greatly help Figure 1 is an evolutionary conservation analysis of the novel Gbg interface in PLC, to see how well this is conserved, and compare this to the conservation of the previously annotated sites. Conservation of these sites on both the G beta gamma and PLC side would help justify this as a native complex.

      This will also help orient the reader to the identity of the mutated residues assayed in Figure 3.

      (2) The g beta gamma orientation is also different than what I have observed in previous g beta gamma effector structures. Is there any precedent for this as an effector interface? A supplemental figure comparing this structure to other g beta gamma interfaces from other enzymes, for example recent Tesmer structure with PI3K.

      (3) The mutational analysis in Figure 2D-G seems to give some strange results, and I have some question why certain residues were chosen rather than others. Mutation of the Gbg side will be more complicated, as of course that can affect any of the three surfaces. My main question is that, from the way Figure 2A is oriented, the main salt bridge in their novel interface to me looks like R199-D228, with K183 being in the wrong orientation to E226, and D167 being far from any charged residues. Why did the authors not make the corresponding R199 to D or E mutation?

      (4) To help the reader's interpretation of Figure 2A, I would recommend a supplemental figure showing the density for interfacial residues, as that also would increase confidence in the interface.

    2. Reviewer #2 (Public review):

      In this manuscript, the authors dissect how Gβγ potentiates PLCβ3 signaling in cells. Using engineered crosslinking to stabilize a Gβγ-PLCβ3 complex, single particle cryo-EM, and cell-based functional assays, they identify and map multiple putative Gβγ interaction surfaces on PLCβ3, including a previously unrecognized binding mode. Structure-guided mutagenesis supports the functional relevance of these interactions and suggests that Gβγ potentiation is not primarily mediated by PLCβ3 membrane recruitment, but instead enhances PLCβ3 activity after the lipase is already at the membrane.

      Previous reconstitution work on the membrane surface (Falzone & MacKinnon, 2023) proposed a recruitment/partitioning-centric model in which Gβγ increases PLCβ3 output largely by elevating its membrane surface concentration, whereas Gαq primarily increases catalytic turnover; under those reconstitution conditions, the two inputs can combine approximately multiplicatively. In receptor-driven cellular signaling, however, PLCβ3 is robustly recruited to the plasma membrane upon Gαq activation, which raises the question of whether Gβγ contributes mainly through additional recruitment or through a post-recruitment mechanism once PLCβ3 is already at the membrane.

      This manuscript helps address that gap by using membrane-anchored PLCβ3 and complementary cellular readouts to separate "getting PLCβ3 to the membrane" from "boosting activity once PLCβ3 is already there." Their results argue that, in cells, membrane recruitment is largely dominated by Gαq·GTP, while Gβγ can further potentiate PIP2 hydrolysis after membrane association, consistent with a modulatory role at the membrane rather than primary recruitment.

      Overall, the work provides a structural and mechanistic framework for Gβγ-PLCβ3 cooperation and helps clarify the basis of Gq pathway amplification. The manuscript is generally strong, but some issues need to be addressed.

      Major comments:

      (1) BMOE/BM(PEG)2 crosslinking may enforce a non-native docking geometry, potentially compromising the physiological relevance and precision of the Gβγ-PLCβ3 interface as described. Although a >50% 1:1 crosslinked complex is formed and remains active, the solution maps show lower local resolution for Gβγ, consistent with a dynamic, potentially heterogeneous, interface. One interface is captured via a single engineered cysteine pair (PLCβ3 E60C-Gβ C271), which could potentially bias the pose. It would be helpful if the authors could provide additional orthogonal support (e.g., alternative crosslinked sites) and bolster the clarification of its uniqueness and relevance.

      (2) In the crosslinked structure, the authors report that GβD228 interacts with PLCβ3 R199 and K183. In Figure 2A, R199 appears closer to Gβ D228 than K183, yet only K183 is functionally tested. Testing R199 (e.g., R199E/R199A) would strengthen the structure-guided validation of this interface.

      (3) The mutagenesis strategy appears inconsistent across figures/assays, which makes it difficult to interpret phenotypes and directly link the functional data to the proposed interfaces. For example, in Figure 2E, we see R185L but R215E, while residue L40 is mutated to Gly in the IP accumulation assays but to Glu/Lys (L40E/K) in the BRET assays (Figures 3B/3D/3F). The authors should (i) clearly justify the rationale for each substitution (conservative vs charge-reversal, interface disruption, etc.) and (ii), where possible, test the same mutants across assays (or provide evidence that alternative substitutions yield consistent conclusions).

    3. Reviewer #3 (Public review):

      Summary:

      PLCβ3 is activated by both Gαq and Gβγ subunits. This paper follows previous solutions and cryoEM studies of PLCβ3 / Gβγ, trying to understand the molecular details of activation using cellular BRET assays and cryoEM.

      Strengths:

      The authors find evidence for multiple binding sites on PLCβ3 for Gβγ and suggest that Gβγ is not bone fide activator per se but enhances Gαq activation by positioning the catalytic site towards substrate, although this is not completely convincing. Although these sites may not naturally be operative, the authors might want to develop the potential role of these sites.

      The authors also find that this activation is not through recruitment of the enzyme to the membrane by Gβγ released upon G protein activation, in accord with other PLCβ enzymes, but not for PLCβ3, and again, the authors might want to develop this point further.

      Weaknesses:

      (1) I'm confused as to why the authors feel that their mechanism is distinct from the two-state enzyme, the synergistic activation proposed by Ross in 2011, using a primarily thermodynamic argument. As written, the authors appear to be very reliant on structural and BRET studies that do not give the details that would disprove this interpretation. The main issue is that the author's mechanism does not fully explain how Gβγ activation occurs for PLCβ2 in reconstituted systems in the absence of Gαq subunits.

      (2) In a recent study, McKinnon presents a model showing that Gαq and Gβγ activate PLCβ3 by two distinct pathways and that activation by Gβγ occurs through membrane recruitment. It is not surprising that the authors find that this is not true since the pelleting method used by McKinnon is subject to error. The authors should directly address the limitations of this previous work and the changes in proteoliposomes with sedimentation that alter partition coefficients. Although the inability of Gβγ to drive membrane binding is in accord with the quantitative studies of Scarlata, showing that the affinity of PLCβ3 to Gβγ is fairly weak as compared to the intrinsic membrane partition coefficient.

      (3) It was proposed many years ago that in signaling complexes Gαq - Gβγ may not have to fully dissociate when binding PLCβ, but rather shift their relative orientation when binding to PLCβ to allow activation. Is their model consistent with this? Is it possible that PLCβ3 keeps Gβγ from diffusing to enhance the rate of Gq / Gβγ re-association?

      (4) The authors find that Gβγ binds multiple sites, and it is clear that the PH domain site is the primary one in accord with previous work. Could these weaker sites be an artifact of the elevated concentrations used in cryoEM and BRET assays?

      (5) Although their assays infer differences in binding affinities, it would strengthen the paper if the authors could estimate the association energies of these different binding sites. This estimation would also address the concern stated above.

    1. Reviewer #1 (Public review):

      Summary:

      The authors set out to understand the complex regulation of the assembly of the Type 3 Secretion System of S. typhimurium. They found that the gene synteny as well as specific mRNA stem loops were important for the translational coupling of sctS and sctT. Without this regulation, SctT self-oligomerizes, which disrupts the export of effector proteins and leads to a decreased fitness of the pathogen. The work was done using a variety of convincing methods and leads to an updated picture of how T3SS assembly occurs. Since the same genetic synteny is found in a large majority of T3SS in different bacteria, it is likely that this is a general mechanism, but one that needs to be further experimentally validated.

      Strengths:

      The paper uses an impressive amount of experiments, with different techniques, to describe how they identified the genetic regulation of SctT production.

      Weaknesses:

      Only minor weaknesses are found.

      (1) Regarding the use of the complex being unique. It is not well explained what makes this a unique complex.

      (2) The paper would benefit from a discussion regarding how regulation might work in the minority of bacterial strains where the T3SS gene synteny is largely different. One would expect that those bacteria would have a different way of regulating T3SS assembly, but that is not discussed at all by the authors.

    2. Reviewer #2 (Public review):

      Summary:

      In this manuscript, Samuel Wagner and colleagues describe an elegant mechanism to prevent promiscuous assembly of a core virulence type III secretion system protein, SctS. Starting from a bioinformatic standpoint, they demonstrate that synteny is highly conserved, and sctT occurs immediately downstream of sctS. Secretion is greatly reduced when sctT is removed or scrambled from its genomic context, and sctT expression is accordingly reduced (sctS synteny is also important, though less so). The distance between sctS and sctT is crucial. An elegant series of genetic experiments leads the authors to pinpoint a stem loop structure that occludes the Shine-Dalgarno sequence of sctT. This property is independent of the actual gene preceding sctT. In sum, this means that SctS is already expressed before SctT is expressed, preventing SctT from forming cytotoxic homooligomers.

      Strengths:

      The manuscript is very well-written, easy to follow, and describes a substantial amount of genetic detective work to identify the underlying mechanism. I have only a number of textual suggestions, mainly for the Introduction text, which I believe could be revised for a flagellar and broader audience.

      Weaknesses:

      Major concern:

      While the work is rigorous and substantial, I am unsure as to whether its findings will appeal beyond a niche audience.

      Minor points:

      (1) Line 117: The number here seems to be very small. RefSeq has ~200,000 genomes. My guess is that at least 100,000 of these will be bacterial. Many (most?) bacteria have flagella, and some unflagellated strains have injectisomes, meaning I would have guessed that the authors would have ~50,000 genomes with SctRSTU. This estimate is error-prone, but not by too much. Can the authors explain the discrepancy between my estimate and their figure of almost two orders of magnitude? (SctRSTU/FliPQGFlhB should also be easy to pick up by sequence searches, so I don't think this is due to false negatives).

      (2) Discussion: I would appreciate some discussion of how species that do not conserve the synteny of sctS and sctT prevent problems of sctT oligomerisation? It doesn't need to be evidence-based at this stage, but I'm sure the authors have thought about this, and the Discussion is an appropriate place to share their speculations.

    3. Reviewer #3 (Public review):

      At the core of the bacterial type III secretion system (T3SS), a nanomachine used to inject effector proteins into eukaryotic cells, five highly conserved proteins, SctRSTUV, form the export apparatus, which is the actual gate for effector proteins. Not only are these proteins the most strongly conserved parts of the system, but also their gene order is conserved, which is not the case for most other components of the T3SS. Interestingly, this order does not completely recapitulate the assembly order, which is SctR5-T4-S-U-V. Looking into the reasons for the conserved synteny, the authors noted a stem-loop in the mRNA of the Salmonella SPI-1 sctS gene, which is present in many other T3SS as well (and in fact had been found in Yersinia before). They then use an array of clever gene permutations and modifications to discern the benefit of this order for the bacteria. The combination of thorough sequence analysis with different, partly quantitative, protein expression and secretion assays and growth curves, both in the native Salmonella background and in heterologous systems, provides strong evidence for the interpretation of the authors: The stem-loop in sctS prevents the premature expression of SctT, which can otherwise assemble into "futile multimers" that can lead to ion loss. The presence of stem-loops in many other sctS/T genes gives weight to this finding.

      This is a very nice and thorough study addressing an important point in the assembly of type III secretion systems. I only have a few suggestions.

      (1) Conserved gene orders have been shown for many complexes, and the findings presented in this manuscript might be applicable to other membrane complexes.

      The conservation of gene order and the presence of the stem loop give weight to the authors' findings. However, it is only mentioned quite late in the discussion that a similar stem loop was found in Yersinia upstream sctT earlier, and was interpreted differently. The authors' current discussion is somewhat evasive on this point. Why would these similar structures be used differently? Why would temperature not play a role in Salmonella SPI-1? And wouldn't the stem-loop also couple sctS and sctT expression in Yersinia? This should be addressed, if possible, by experiments (at least, the influence of temperature on the SPI-1 mRNA structure should be testable for the authors) and by a more detailed discussion (given the redundancy of RNA thermometers in the Yersinia T3SS, the interpretation in the current paper might well be the more compelling one).

      (2) A point that deserves more attention is that a similar finding in Yersinia has been interpreted differently before (as a temperature sensor rather than translational coupling) - are these systems really different? Testing the different interpretations in the respective other system (at least the influence of temperature in the Salmonella SPI-1 system used in this manuscript) would have made the interpretation even more compelling.

      (3) Another point that should be discussed in more detail is why this mechanism is present when replacement of the sctT ATG by weaker start codons and the simple omission of a separate SD sequence upstream sctT would achieve the same outcome. This could be tested in one of the nice heterologous systems, as used in Figure 4.

    1. Reviewer #1 (Public review):

      Summary:

      This is an interesting and well-written manuscript in which the authors set out to answer a simple, old question with a modern toolkit: where in crab evolution did sideways walking arise, how often has it been lost or regained, and is it plausibly linked to the ecological and taxonomic success of true crabs. To do this, they record locomotion from 50 live species, convert each species' movements into a quantitative index that compares forward versus sideways bouts, and then map the resulting states onto a recent crab phylogeny to infer the most likely evolutionary history of locomotor direction.

      Strengths:

      The strongest part of the study is the dataset itself. Comparable behavioral measurements across dozens of crab species are rare. The authors have done the field and husbandry work needed to make this possible. The overall pattern they recover, that most true crabs are strongly biased toward sideways movement (while a smaller set of lineages move predominantly forward), is interesting and likely to be useful to others. The phylogenetic mapping is also a reasonable way to address the "how many times" question (although this is peripheral to my expertise). The manuscript makes a convincing case that sideways locomotion is not simply a trivial byproduct of a crab-like body plan.

      Weaknesses:

      Where I am less convinced is in how strongly the authors describe the discreteness of the behavioral categories and the absence of intermediates. The manuscript states that the Forward-Sideways Index shows a clear separation between two locomotor types with little evidence for intermediates, and it cites a statistical test rejecting a single peak in the distribution. However, the histogram in Figure 3 appears structured within each labeled category, with subclusters inside both the forward and sideways groups rather than a single tight peak per group. This matters because the index is built by first placing each movement bout into "forward" versus "sideways" bins using a fixed angle boundary and then collapsing the result into a single ratio. That approach is simple and transparent enough, but it can also hide mixed strategies. For example, a species that produces substantial amounts of both forward and sideways walking can still end up with a strongly positive or negative index, and therefore be classified as a pure "type," even though the underlying behavior is mixed. In that context, rejecting a single peak in the across-species distribution does not, by itself, justify the stronger claim that intermediates are rare or absent.

      Related to this, a key methodological choice is the use of 60 degrees as the cutoff between forward and sideways bouts. This boundary may be reasonable as a convention, but the paper does not explain why it is the right place to draw the line, and there is a plausible biological concern that a fixed angular cutoff does not mean the same thing across taxa.

      Crabs vary in body shape and in how the legs are arranged around the body. In my own comparative work, for example, some species show an elliptical stance pattern elongated along the preferred direction of travel, while others show a more circular leg arrangement, and the latter can express more mixed forward and sideways behavior. When limb arrangement and body geometry differ across species, the same measured angle can correspond to different underlying mechanics and different functional "degree of sidewaysness." The practical implication is that the reported binary separation may partly reflect the imposed classification rule, rather than a sharp biological divide.

      Another limitation that affects interpretation is the decision to use one individual per species. I understand the logistics, and for some questions, a single representative individual can be a reasonable first pass. But it is not strong support for negative claims about intermediates, especially in a group where individuals can change substantially with growth and allometry. Crabs can grow dramatically, often with pronounced allometric shifts in limb proportions that can alter the center of mass location. Size alone can alter the kinematics and choice of locomotor behaviors in crustaceans. In species where appendage proportions change with size, or where certain legs become disproportionately large (or calcified), it is plausible that locomotor direction and the distribution of movement angles shift across ontogeny. That makes it hard to treat a single individual as a complete description of a species-level strategy, particularly for species that fall closer to the boundary between categories.

      In sum, this is a valuable and useful behavioral comparative study with a dataset that many in the field will appreciate. The main conclusions about the likely evolutionary placement of sideways walking are plausible, but several of the stronger claims about discrete locomotor types, the absence of intermediates, and the relationship to diversification would be more convincing if the analysis were less dependent on a fixed angular cutoff and on single individuals per species, or if the manuscript framed those points more cautiously so the conclusions track the strength of the evidence.

    2. Reviewer #2 (Public review):

      Summary:

      The current work investigates the evolution of sideward locomotion in Brachyura in light of a single evolutionary origin. To this end, the authors first analysed the mode of locomotion in 50 crab species and observed mutually exclusive presence of sideways vs. forward movement. The phylogenetic analysis confirmed that there is indeed a single evolutionary origin for sideways movement, which was sometimes followed by several reversions to forward locomotion. This way, authors demonstrate how locomotor movement modes shape evolutionary diversification in animals by showing that species richness is much higher in side-ways-moving crabs than in the nearest groups. This is an interesting work that integrates behavioural analysis and phylogenetic relations, capitalising largely on crabs. I have a few suggestions and questions.

      Firstly, I think the paper spends too much time on a straightforward analysis of the mode of locomotion. I was also wondering whether the phylogenetic analysis could be simply achieved by maximising an objective function in which the modes of movement are inversely coded for two putative groups, with all values calculated at all possible nodes.

      Unfortunately, I find that the authors did not sufficiently discuss differences in the ecological niches of species with forward vs. sideways locomotion modes (including challenges of locomotion and substrate).

      Likewise, what are the anatomic correlates of forward vs. sideways locomotion? For instance, how are the advantages assumed for sideways movement associated with a flattened body? Is it possible that the mode of motion is secondary to flattened/narrow body structure, which basically limits the distance between legs and thus makes the forward movement difficult - under this logic, the mode of movement would be a secondary phenomenon to body shape traits. How can one differentiate between this alternative and the one that puts the mode of movement in the centre of the story? On a related note, how do different modes of movement relate to the ability to fit into tight spaces - how does it relate to differences in leg joints?

      Is it possible that the sideways movement maximises the scanned visual field per unit time/displacement, which may be beneficial for mostly forward-moving predators?

      It is really difficult to decipher the information contained in the nodes (circles) in the printed black-and-white version of the manuscript.

      Briefly, although I find the study interesting, the presented complexity may not be necessary given the endpoints; it can be achieved much more simply. Furthermore, the degree to which the conceptual analysis of different modes of locomotion was exercised was limited. The general approach may serve as a good model for the evolutionary analysis of other traits. The demonstration of traceability of the relations in question is a major contribution of the work.

      Strengths:

      The research question and the novel combination of different data types.

      Weaknesses:

      The complexity of the methods used, along with a limited discussion of the potential dynamics that may underlie the evolution of the sideways movement mode.

    1. external evaluations of the passing paper also uncovered hallucinations, faked results, and overestimated novelty

      通过了同行评审,但独立评估发现了幻觉、伪造结果和夸大新颖性——这个细节极为重要,却经常被忽视。它揭示了一个深刻的系统性漏洞:AI 已经学会了「通过评审」,但没有学会「诚实做科学」。这两件事在人类评审员看来是同一件事,但在 AI 系统的优化目标中可能是分离的。这是 AI 安全在科学领域的具体表现。

    2. one manuscript achieved high enough scores to exceed the average human acceptance threshold, marking the first instance of a fully AI-generated paper successfully navigating a peer review.

      史上第一篇完全由 AI 自主生成并通过同行评审的论文——这个里程碑的重要性不亚于 AlphaFold 折叠蛋白质。令人惊讶的是,这篇论文得分超越了 55% 的人类作者投稿(平均分 6.33,高于人类投稿平均录取线)。学术界存在了数百年的「同行评审」制度,第一次被一个 AI 系统悄悄穿越了。

    1. an agent does not care about the structure, unless you specifically ask it to. But even in this case you have to review the changes.

      【启发】「AI 天然不在意结构,除非你明确要求」——这个发现定义了人类工程师在 AI 时代最不可替代的职责:做代码结构的「守门人」。这与 Every 文章里「每个人都是管理者」的洞见形成呼应:人类的工作从「执行代码」转变为「审查代码质量并为 AI 设定标准」。对工程团队文化的启发:代码 Review 的重要性不是在下降,而是在上升——因为现在需要 Review 的代码量是以前的 10 倍。

    1. Reviewer #1 (Public review):

      [Editors' note: this version has been assessed by the Reviewing Editor without further input from the original reviewers. The authors have submitted a second revision, largely to address a comment from Reviewer 2, which was "The failure to model the neural data with an explicit model is a missed opportunity." The authors have now included a computational model.]

      This study makes a fundamental contribution to our understanding of interocular suppression, particularly continuous flash suppression (CFS). Using neuroimaging data from two macaque monkeys, the study provides compelling evidence that CFS suppresses orientation responses in neurons within V1. These findings enrich the CFS literature by demonstrating that neural activity under CFS may prevent high-level visual and cognitive processing.

      Comments on previous revisions:

      The authors have addressed all my previous comments.

    2. Reviewer #2 (Public review):

      Summary:

      The goal of this study was to investigate the degree to which low-level stimulus features (i.e., grating orientation) are processed in V1 when stimuli are not consciously perceived under conditions of continuous flash suppression (CFS). The authors measured the activity of a population of V1 neurons at single neuron resolution in awake fixating monkeys while they viewed dichoptic stimuli that consisted of an oriented grating presented to one eye and a noise stimulus to the other eye. Under such conditions, the mask stimulus can prevent conscious perception of the grating stimulus. By measuring the activity of neurons (with Ca2+ imaging) that preferred one or the other eye, the authors tested the degree of orientation processing that occurs during CFS.

      Strengths:

      The greatest strength of this study is the spatial resolution of the measurement and the ability to quantify stimulus representations during CSF in populations of neurons preferring the eye stimulated by either the grating or the mask. There have been a number of prominent fMRI studies of CFS, but all of them have had the limitation of pooling responses across neurons preferring either eye, effectively measuring the summed response across ocular dominance columns. The ability to isolate separate populations offers an exciting opportunity to study the precise neural mechanisms that give rise to CFS, and potentially provide insights into nonconscious stimulus processing.

      Weaknesses:

      (The authors have now included a computational model in the second revision.)

    3. Reviewer #3 (Public review):

      Summary:

      In this study, Tang, Yu & colleagues investigate the impact of continuous flash suppression (CFS) on the responses of V1 neurons using 2-photon calcium imaging. The report that CFS substantially suppressed V1 orientation responses. This suppression happens in a graded fashion depending on the binocular preference of the neuron: neurons preferring the eye that was presented with the marker stimuli were most suppressed, while the neurons preferring the eye to which the grating stimuli were presented were least suppressed. Binocular neuron exhibited an intermediate level of suppression.

      Strengths:

      The imaging techniques are cutting-edge.

      Weaknesses:

      The strength of CFS suppression varies across animals, but the authors attribute this to comparable heterogeneity in the human psychophysics literature.

      Comments on previous revisions:

      The authors have addressed my comments from the previous round of review, and I have no further comments.

    1. Reviewer #1 (Public review):

      In this paper, Stanojcic and colleagues attempt to map sites of DNA replication initiation in the genome of the African trypanosome, Trypanosoma brucei. Their approach to this mapping is to isolate 'short-nascent strands' (SNSs), a strategy adopted previously in other eukaryotes (including in the related parasite Leishmania major), which involves isolation of DNA molecules whose termini contain replication-priming RNA. By mapping the isolated and sequenced SNSs to the genome (SNS-seq), the authors suggest that they have identified origins, which they localise to intergenic (strictly, inter-CDS) regions within polycistronic transcription units and suggest display very extensive overlap with previously mapped R-loops in the same loci. Finally, having defined locations of SNS-seq mapping, they suggest they have identified G4 and nucleosome features of origins, again using previously generated data. Though there is merit in applying a new approach to understand DNA replication initiation in T. brucei, where previous work has used MFA-seq and ChIP of a subunit of the Origin Replication Complex (ORC), there are two significant deficiencies in the study that must be addressed to ensure rigour and accuracy.

      (i) The suggestion that the SNS-seq data is mapping DNA replication origins that are present in inter-CDS regions of the polycistronic transcription units of T. brucei is novel and does not agree with existing data on the localisation of ORC1/CDC6, and it is very unclear if it agrees with previous mapping of DNA replication by MFA-seq due to the way the authors have presented this correlation. For these reasons, the findings essentially rely on a single experimental approach, which must be further tested to ensure SNS-seq is truly detecting origins. Indeed, in this regard, the very extensive overlap of SNS-seq signal with RNA-DNA hybrids should be tested further to rule out the possibility that the approach is mapping these structures and not origins.

      (ii) The authors' presentation of their SNS-seq data is too limited and therefore potentially provides a misleading view of DNA replication in the genome of T. brucei. The work is presented through a narrow focus on SNS-seq signal in the inter-CDS regions within polycistronic transcription units, which constitute only part of the genome, ignoring both the transcription start and stop sites at the ends of the units and the large subtelomeres, which are mainly transcriptionally silent. The authors must present a fuller and more balanced view of SNS-seq mapping, across the whole genome, to ensure full understanding and clarity.

      In the revised manuscript, the authors have improved the presentation and analysis of their data, expanding the description of SNS-seq mapping across the genome, and more clearly assessing to what extent there is correlation between SNS-seq signal and previous mapping approaches to predict origins (by MFA-seq and ChiP-chip of ORC1/CDC6). With regard the correlation between SNS-seq and ORC/1CDC6 ChIP-chip, it should be noted that two datasets were generated in distinct strains of T. brucei (Lister 427 and TREU927, respectively), and it is unclear if the latter dataset can be accurately mapped to the strain used here. Notwithstanding this concern, these improvements clarify a number of aspects of the SNS-seq mapping: (1) the signal is more prevalent in the transcribed core of the genome than in the largely transcriptionally silent subtelomeres; and (2) whereas previous work revealed strong correlation between ORC1/CDC6 localisation and MFA-seq peaks at the ends of multigene transcription units, neither of these data show significant overlap with SNS-seq signal, which is not seen at transcription start or stop sites ('SSRs'; supplementary Fig.8D) and shows marked depletion at predicted ORC1/CDC6 sites (supplementary Fig.8C). To the authors' credit, they acknowledge this lack of correlation in the discussion.

      The authors have not provided any new data to substantiate their assertion that SNS-seq accurately detects origins in T. brucei, and therefore the work rests on a single experimental approach, without validation. As a result, the suggestion of abundant, previously undetected origins in the intergenic regions of multigene transcription remains a prediction. One key untested limitation of the work lies in the observation that the very large majority of SNS-seq signal overlaps with previously RNA-DNA hybrids; without an experimental test, the suggestion that the authors have 'disclosed for the first time a strong link between RNA:DNA hybrid formation and DNA replication initiation' remains conjecture.

    2. Reviewer #2 (Public review):

      Summary:

      Stanojcic et al. investigate the origins of DNA replication in the unicellular parasite Trypanosoma brucei. They perform two experiments, stranded SNS-seq and DNA molecular combing. Further, they integrate various publicly available datasets, such as G4-seq and DRIP-seq, into their extensive analysis. Using this data, they elucidate the structure of origins of replications. In particular, they find various properties located at or around origins, such as polynucleotide stretches, G-quadruplex structures, regions of low and high nucleosome occupancy, R-loops, and that origins are mostly present in intergenic regions. Combining their population-level SNS-seq and their single-molecule DNA molecular combing data, they elucidate the total number of origins as well as the number of origins active in a single cell.

      Between the initial submission and this revision, the raised major concerns have not been resolved, and no additional validation has been provided.

      Strengths:

      (1) A very strong part of this manuscript is that the authors integrate several other datasets and investigate a large number of properties around origins of replication. Data analysis clearly shows the enrichment of various properties at the origins, and the manuscript is concluded with a very well-presented model that clearly explains the authors' understanding and interpretation of the data.

      (2) The DNA combing experiment is an excellent orthogonal approach to the SNS-seq data. The authors used the different properties of the two experiments (one giving location information, one giving single-molecule information) well to extract information and contrast the experiments.

      (3) The discussion is exemplary, as the authors openly discuss the strengths and weaknesses of the approaches used. Further, the discussion serves its purpose of putting the results in both an evolutionary and a trypanosome-focused context.

      Weaknesses:

      I have major concerns about the origin of replication sites determined from the SNS-seq data. As a caveat, I want to state that, before reading this manuscript, SNS-seq was unknown to me; hence, some of my concerns might be misplaced.

      (1) There are substantial discrepancies between the origins identified here and those reported in previous studies. Given that the other studies precede this manuscript, it is the authors' duty to investigate these differences. A conclusion should be reached on why the results are different, e.g., by orthogonally validating origins absent in the previous studies.

      (2) I am concerned that up to 96% percent of all SNS-seq peaks are filtered away. If there is so much noise in the data, how can one be sure that the peaks that remain are real? Upon request, the authors have performed a control, where randomly placed peaks were run through the same filtering process. Only approximately twice as many experimental peaks passed filtering compared to random peaks. While the authors emphasize reproducibility between replicates, technical artifacts from the protocol would also be reproducible. Moreover, in other SNS-seq studies, for example, Pratto et al. Cell 2021, Fig. 1B, + and − strand peaks always appear closely paired. This pattern contrasts strongly with Fig. 2A in this manuscript.

      Further, I have some minor concerns that do not affect the main conclusions of the manuscript:

      - Fig 2C: The regions shown in the heatmap have different sizes, and I presume that the regions are ordered by size on the y-axis? If so, does the cone-shaped pattern, which is origin-less for genic regions and origin-enriched for intergenic regions, arise from the size of the regions? (I.e., for each genic region, the region itself is origin-less and the flanking intergenic regions contain origins.) If this is the case, then the peaks/valleys, centered exactly on the center of the regions on the mean frequency plots, arise from the different sizes of the analyzed regions, not from the fact that origins are mostly found at the center of intergenic regions. This data would be better presented with all regions stretched to the same size. This has not been addressed in the revision.

      - Line 123, "and the average length of origins was found to be approximately 150 bp.": To determine origins, the authors filter away overlapping peaks and peaks that are too far from each other. Both restrict the minimal and maximal length of origins that can be observed, and this, in turn, affects the average length. This has not been addressed in the revision.

      Are claims well substantiated?:<br /> The identification of origins via SNS-seq appears to be incompletely supported to me.<br /> All downstream analyses depend on the reliability of origin identification.

      Impact:<br /> This study has the potential to be valuable for two fields: In research focused on T. brucei as a disease agent, where essential processes that function differently than in mammals are excellent drug targets. Further, this study would impact basic research analyzing DNA replication over the evolutionary tree, where T. brucei can be used as an early-divergent eucaryotic model organism.

    1. Reviewer #1 (Public review):

      Summary:

      The manuscript "Adapting Clinical Chemistry Plasma as a Source for Liquid Biopsies" addresses a timely and practical question: whether residual plasma from heparin separator tubes can serve as a source of cfDNA for molecular profiling. This idea is attractive, since such samples are routinely generated in clinical chemistry labs and would represent a vast and accessible resource for liquid biopsy applications. The preliminary results are encouraging, and likely to benefit the research community.

      Comments on revisions:

      The concerns raised have been addressed. The heparin separator-based cfDNA method described in this study is likely to benefit the research community. I have no further scientific concerns.

    2. Reviewer #2 (Public review):

      Summary:

      The authors propose that leftover heparin plasma can serve as a source for cfDNA extraction, which could then be used for downstream genomic analyses such as methylation profiling, CNV detection, metagenomics, and fragmentomics. While the study is potentially of interest, several major limitations reduce its impact; for example, the study does not adequately address key methodological concerns, particularly cfDNA degradation, sequencing depth limitations, statistical rigor, and the breadth of relevant applications.

      Strengths:

      The paper provides a cheap method to extract cfDNA, which has broad application if the method is solid.

      Weaknesses:

      (1) The introduction lacks a sufficient review of prior work. The authors do not adequately summarize existing studies on cfDNA extraction, particularly those comparing heparin plasma and EDTA plasma. This omission weakens the rationale for their study and overlooks important context.

      (2) The evaluation of cfDNA degradation from heparin plasma is incomplete. The authors did not compare cfDNA integrity with that extracted from EDTA plasma under realistic sample handling conditions. Their analysis (lines 90-93) focuses only on immediate extraction, which is not representative of clinical workflows where delays are common. This is in direct conflict with findings from Barra et al. (2025, LabMed), who showed that cfDNA from heparin plasma is substantially more degraded than that from EDTA plasma. A systematic comparison of cfDNA yields and fragment sizes under delayed extraction conditions would be necessary to validate the feasibility of their proposed approach.

      (3) The comparison of methylation profiles suffers from the same limitation. The authors do not account for cfDNA degradation and the resulting reduced input material, which in turn affects sequencing depth and data quality. As shown by Barra et al., quantifying cfDNA yield and displaying these data in a figure would strengthen the analysis. Moreover, the statistical method applied is inappropriate: the authors use Pearson correlation when Spearman correlation would be more robust to outliers and thus more suitable for methylation and other genomic comparisons.

      (4) The CNV analysis also raises concerns. With low-coverage WGS (~5X) from heparin-derived cfDNA, only large CNVs (>100 kb) are reliably detectable. The authors used a 500 kb bin size for CNV calling, but they did not acknowledge this as a limitation. Evaluating CNV detection at multiple bin sizes (e.g., 1 kb, 10 kb, 50 kb, 100 kb, 250 kb) would provide a more complete picture. In addition, Figure 3 presents CNV results from only one sample, which risks bias. Similar bias would exist for illustrations of CNVs from other samples in the supplementary figures provided by the authors. Again, Spearman correlation should be applied in Figure 3c, where clear outliers are visible.

      (5) It is important to point out that depth-based CNV calling is just one of the CNV calling methods. Other CNV calling software using SNVs, pair-reads, split-reads, and coverage depth for calling CNV, such as the software Conserting, would be severely affected by the low-quality WGS data. The authors need to evaluate at least two different software with specific algorithms for CNV calling based on current WGS data.

      (6) The authors omit an important application of cfDNA: somatic mutation detection. Degraded cfDNA and reduced sequencing depth could substantially impact SNV calling accuracy in terms of both recall and precision. Assessing this aspect with their current dataset would provide a more comprehensive evaluation of heparin plasma-derived cfDNA for genomic analyses.

      Comments on revisions:

      As suggested previously, the Pearson correlation analysis tends to be overstated; please replace it with Spearman correlation in the whole manuscript. Currently, the authors include both of them in the abstract, method, results, and graphics, all of which are required to be updated to only use Spearman correlation results.

      I don't have other concerns about the manuscript.

    1. AIサイエンティストは、アイデアの創出から実験、分析、論文執筆、そして査読に至るまでの科学的研究サイクル全体をAIが自律的に遂行する仕組みです。この仕組みの定量的評価も含めた結果を、共同研究者とともにNature誌の論文として公開しています。

      AI Scientist 研究——一个让 AI 自动化完整科研周期的系统——被 Nature 正式发表了。令人震惊的是:一篇关于「AI 能否替代科学家」的论文,本身就是通过「AI 辅助科研」的过程产生的,并通过了人类同行评审。这个自指性质让 Nature 的认可变成了一个双重背书:既是对内容的认可,也是对方法论的认可。Sakana 将这个成果作为 Marlin 的技术背书,是极为聪明的品牌叙事策略。

    1. Imagine every report has the following: Agent's best-guess about what comments you'd get from Beth, Hjalmar, Ajeya. Agent's best-guess about survey results. Agent's best-guess about benchmark results. Agent's best-guess about how this will be received on Twitter.

      「预测反馈」的概念令人惊讶:AI 在报告发出前,预测各位审阅者会说什么、Twitter 会怎么反应、调查结果会是什么——研究者先在「预测反馈」中迭代,只有当预期信息增量足够高时,才真正发出去等待真实反馈。这是一种「反馈的预计算」——把等待时间转化为优化时间,本质上是把「串行等待」变成了「并行模拟」。

    1. Reviewer #1 (Public review):

      Summary:

      This manuscript investigates the biological mechanism underlying the assembly and transport of the AcrAB-TolC efflux pump complex. By combining endogenous protein purification with cryo-EM analysis, the authors show that the AcrB trimer adopts three distinct conformations simultaneously and identify a previously uncharacterized lipoprotein, YbjP, as a potential additional component of the complex. The work aims to advance our understanding of the AcrAB-TolC efflux system in near-native conditions and may have broader implications for elucidating its physiological mechanism.

      Strengths:

      Overall, the manuscript is clearly presented, and several of the datasets are of high quality. The use of natively isolated complex is a major strength, as it minimizes artifacts associated with reconstituted systems and enables the discovery of a novel subunit. The authors also distinguish two major assemblies-the TolC-YbjP sub-complex and the complete pump-which appear to correspond to the closed and open channel states, respectively. The conceptual advance is potentially meaningful, and the findings could be of broad interest to the field.

      Weaknesses:

      (1) As the identification of YbjP is a key contribution of this work, a deeper comparison with functional "anchor" proteins in other efflux pumps is needed. Including an additional supplementary figure illustrating these structural comparisons would be valuable.

      (2) The observation of the LTO states in the presence of TolC represents an important extension of previous findings. A more detailed discussion comparing these LTO states to those reported in earlier structural and biochemical studies would improve the clarity and significance of this point.

      Comments on revisions:

      In the revision, the authors have addressed the above concerns to improve this study.

    2. Reviewer #2 (Public review):

      Summary:

      This manuscript reports the high-resolution cryo-EM structures of the endogenous TolC-YbjP-AcrABZ complex and a TolC-YbjP subcomplex from E. coli, identifying a novel accessory subunit. This work is an impressive effort that provides valuable structural insights into this native complex.

      Strengths:

      (1) The study successfully determines the structure of the complete, endogenously purified complex, marking a significant achievement.<br /> (2) The identification of a previously unknown accessory subunit is an important finding.<br /> (3) The use of cryo-EM to resolve the complex, including potential post-translational modifications such as N-palmitoyl and S-diacylglycerol, is a notable highlight.

      Weaknesses:

      (1) Clarity and Interpretation: Several points need clarification. Additionally, the description of the sample preparation method, which is a key strength, is currently misplaced and should be introduced earlier.<br /> (2) Data Presentation: The manuscript would benefit significantly from improved figures.<br /> (3) Supporting Evidence: The inclusion of the protein purification profile as a supplementary figure is essential. Furthermore, a discussion comparing the endogenous AcrB structure to those obtained in other systems (e.g., liposomes) and commenting on observed lipid densities would strengthen the overall analysis.

      Comments on revisions:

      In the revision, all my concerns have been addressed.

    1. Reviewer #1 (Public review):

      Summary:

      Witte et al. examined whether canonical behavioral functions attributed to the cerebellum decline with age. To test this, they recruited younger, old, and older-old adults in a comprehensive battery of tasks previously identified as cerebellar-dependent in the literature. Remarkably, they found that cerebellar function is largely preserved across the lifespan-and in some cases even enhanced. Structural imaging confirmed that their older adult cohort was representative in terms of both cerebellar gray- and white-matter volume. Overall, this is an important study with strong theoretical implications and compelling evidence supporting the motor reserve hypothesis, demonstrating that cerebellar-dependent measures remain largely intact with aging.

      Strengths:

      (1) Relatively large sample size.

      (2) Most comprehensive behavioral battery to date assessing cerebellar-dependent behavior.

      (3) Structural MRI confirmation of age-related decline in cerebellar gray and white matter, ensuring representativeness of the sample.

      Weaknesses:

      The absence of a voxel-based morphometry (VBM) analysis limits the anatomical and functional specificity of the conclusions. Such an analysis would help identify which functions are truly cerebellar-dependent, rather than relying primarily on inferences drawn from prior neuropsychological literature. Notably, the authors have undertaken this analysis in a separate manuscript.

      As acknowledged in the Discussion, the classification of tasks as "cerebellar-dependent" versus "general" remains somewhat ambiguous. Some measures labeled as "general" may still engage cerebellar processes. Moreover, analyses in the authors' forthcoming manuscript show weak structure-behavior correlations, casting further doubt on how clearly cerebellar-specific functions can be distinguished from more general processes.

    2. Reviewer #2 (Public review):

      Summary:

      The authors are investigating cerebellar-mediated motor behaviors in a large sample of adults, including 30 individuals over the age of 80 (a great strength of this work). They employed a large battery of motor tasks that are tied to cerebellar function, in addition to a cognitive task and motor tasks that are more general. They also evaluated cerebellar structure. Across their behavioral metrics, they found that even with cerebellar degeneration, cerebellar-mediated motor behavior remained intact relative to young adults. However, this was not the case for measures not directly tied to cerebellar function. The authors suggest that these functions are preserved and speak to the resiliency and redundancy of function in the cerebellum. They also speculate that cerebellar circuits may be especially good for preserving function in the face of structural change. The tasks are described very well, and their implementation is also well-done with consideration for rigor in the data collection and processing. The inclusion of Bayesian estimates is also particularly useful, given the theoretically important lack of age differences reported. This work is methodologically rigorous with respect to the behavior, and certainly thought-provoking.

      Strengths:

      The methodological rigor, inclusion of Bayesian statistics, and the larger sample of individuals over the age of 80 in particular are all great strengths of this work. Further, as noted in the text, the fact that all participants completed the full testing battery is of great benefit. Please note, upon my second review the strengths remain. This is a really wonderful investigation and amazingly comprehensive from a behavioral perspective given the numerous tasks and domains that were considered.

      Weaknesses:

      The suggestion of cerebellar reserve, given that at the group level there is a lack of difference for cerebellar specific behavioral component,s could be more robustly tested. That is, the authors suggest that this is a reserve given that volume of cerebellar gray matter is smaller in the two older groups, though behavior is preserved. This implies volume and behavior are seemingly dissociated. However, there is seemingly a great deal of behavioral variability within each group and likewise with respect to cerebellar volume. Is poorer behavior associated with smaller volume? If so, this would suggest still that volume and behavior are linked; but, rather than being age that is critical it is volume. On the flip side, a lack of associations between behavior and volume would be quite compelling with respect to reserve. More generally, as explicated in the recommendations, there are analyses that could be conducted that, in my opinio,n would more robustly support their arguments given the data that they have available.

      The authors have done wonderful work to address the comments from the initial feedback/reviews. While I may ultimately disagree with the approach of including the imaging data in another manuscript, that is at the same time, a reasonable decision. This, however, does not change the impression that the paper would be stronger with the inclusion of the volumetric imaging data. I can understand why it may be published separately - it would be a very long paper to include both. At the same time the assertions made here, which are largely nicely supported by the preprint, would ultimately strengthen this work. The behavior certainly stands on its own as an excellent and needed investigation; together, both pieces make for a truly excellent contribution to the literature.

    1. Reviewer #1 (Public review):

      This is an important article, which represents the culmination of 25 years of research on the spore coat protein, SafA. Reading this paper is not necessarily easy because it requires time, patience, and attention to detail, but it is truly rewarding. The attentive reader will certainly appreciate the description of a biochemical tour de force, providing convincing experimental evidence for every aspect of a step-by-step inner coat assembly model. It was previously known that SafA was a coat morphogenetic protein responsible for the assembly of the inner layer of the spore coat in Bacillus subtilis, and SafA was already viewed as a hub that directly or indirectly recruited several dozens of coat proteins to the spore envelope. It was also known that there were isoforms of SafA (the most important being the C30 form), and SafA was a substrate of Tgl, a transglutaminase involved in crosslinking some of the coat proteins, especially those found in the inner coat. Several studies have combined genetics and various types of microscopy approaches, including fluorescence microscopy, to decipher the mechanism of coat assembly, but the current study brings top-notch biochemistry into the picture and, therefore, is able to go much further into the molecular characterization of this important mechanism. It should be noted that spore coat assembly is a notoriously difficult process to study biochemically. It was also suspected to be a complex mechanism, because coat assembly is a protracted process involving at least 80 different proteins, whose production is controlled both temporally and spatially, but the current paper manages to connect specific chemical reactions to well-known stages of spore formation. The authors did so by generating several constructs with specific substitutions of Cys and Lys residues, interfering with the completion of disulfide bond formation and crosslinking events, thus determining the order of events and the structural consequences when one of these steps is impaired. Importantly, their conclusions are consistent with previous work. In the updated model, self-assembly of SafA is the first step, promoted by disulfide bond formation between C30 complexes. This is followed by recruitment of inner coat proteins and, finally, transglutamination to stabilize the scaffold structure (referred to as a "spotwelding activity".

      The work is extremely thorough. I did not identify any weaknesses and could not think of any experiment that would have been omitted.

    2. Reviewer #2 (Public review):

      Summary:

      The authors assemble a variety of information from biochemical experiments on oligomeric and higher-order assembly of the spore coat protein SafA, which functions as a hub in spore coat development. Together, the data indicate a robust process of assembly, guided initially by an organized process of disulfide bond formation and ultimately leading to cross-linking by the enzyme Tgl. Interestingly, neither process is strictly necessary for the formation of highly assembled oligomeric forms of SafA, but instead, these processes are mutually supportive in creating a strong, intercrosslinked assembly. Given this lead-up, it is somewhat disappointing to find that the cross-linking defective SafA mutants do not exhibit any obvious defects in sporulation in vivo, and one is left with the conclusion that this stage of spore coat assembly is accomplished by multiple independent co-occurring activities. The information is sufficient to support a detailed model for SafA assembly, which is significant in that it helps to explain the process of building a critically important hub-scaffold for spore coat development.

      Strengths:

      The main body of experiments supports a detailed model for the assembly of SafA monomers into spore coat superstructures. This is interesting because it shows how a protein can be used as both a scaffold and a hub in contributing to the assembly of a super-resilient biological material.

      Weaknesses:

      (1) The weak sporulation phenotype of the crosslinking mutants diminishes the significance of the mechanism that is described.

      (2) The narrative flow of the originally submitted manuscript could be improved by removing some unnecessary and confusing figures on peripheral subjects and rearranging some of the latter figures to arrive at a conclusion that focuses more on SafA assembly.

      (3) The original manuscript appears to have a labeling error in the supplementary figures, but a correctly labeled version of the figures would not support one of the manuscript's claims.

    3. Reviewer #3 (Public review):

      The manuscript by Amara et al. provides novel mechanistic insight into how SafA, a spore coat morphogenetic protein, self-assembles and is later crosslinked by the Tgl transglutaminase during spore coat assembly. Through rigorous, carefully executed biochemical analyses of SafA's oligomerization and crosslinking states, the authors demonstrate that SafA forms dimers that promote disulfide bond formation between two cysteine pairs found in its C30 region; this disulfide bond-mediated crosslinking promotes, but is not essential for, Tgl-mediated crosslinking of lysine residues within SafA. Specifically, one pair in its N-terminal C30 region promotes the formation of higher-order oligomers, while the second pair in its C-terminus C30 region promotes its ability to form a tetramer. Mutation of both cysteine pairs prevents higher-order SafA structures and reduces the efficiency of Tgl-mediated crosslinking via lysines in close proximity to the cysteines. They further show that disulfide bond formation promotes, but is not essential for, SafA to self-assemble into structures ~1200 kDa via SAXS analyses and kinetic analyses of Tgl-mediated crosslinking of purified SafA in vitro.

      Major Comments:

      (1) While the authors' detailed and thorough biochemical analyses advance our understanding of how SafA forms higher-order structures in the presence and absence of Tgl, they could broaden the significance of their findings with additional functional analyses of their mutants in B. subtilis. Figure 8 shows that loss of Tgl and SafA disulfide bond formation renders SafA more extractable (presumably leading to a less resilient spore coat), and FRAP analyses indicate that SafA in ∆tgl sporulating cells is more mobile than in its lysine crosslinked form. Some ideas that the authors could test to try and identify additional functions for the Cys and Lys residues in SafA:<br /> - Analyze the Cys mutants in the FRAP assay?<br /> - Does loss of SafA-mediated crosslinking via the Cys and/or Lys mutations affect its localization to the forespore or the recruitment of its client proteins like GerQ?<br /> - Have the authors tested higher concentrations of lysozyme? Or chloroform?

      (2) While the authors show in supplementary data that the safA point mutants they generated do not affect spore germination in the single condition tested, the Rudner group previously showed that SafA plays a role in spore germination by affecting CwlJ localization to the forespore. Perhaps the authors might see a more significant phenotype on spore germination with their Cys and Lys mutants if they tried to complement a ∆safA∆sleB double mutant with mutant safA constructs? For the germination assays, it was unclear to me whether the authors used heat activation prior to inducing spore germination.

      (3) Have the authors looked at whether the Cys or Lys mutations affect the sensitivity of spores to oxidative insults, especially since the Cys residues might temper the effects of oxidizing agents?

      (4) Did the authors test the effect of single Cys mutations on disulfide bond formation, since intermolecular disulfide bond formation might still be possible even if one of the Cys residues has been changed?

      (5) Finally, I was unsure how many times each experiment was replicated and how many experiments had been conducted in total.

    1. Reviewer #1 (Public review):

      Summary:

      In this manuscript, Frangos at al. used a transcriptomic and proteomic approach to characterise changes in HER2-driven mammary tumours compared to healthy mammary tissue in mice. They observed that mitochondrial genes, including OXPHOS regulators, were among the most down-regulated genes and proteins in their datasets. Surprisingly, these were associated with higher mitochondrial respiration, in response to a variety of carbon sources. In addition, there seems to be a reduction in mitochondrial fusion and an increase in fission in tumour tissues compared to healthy tissues.

      Strengths:

      The data are clearly presented and described.

      The author reported very similar trends in proteomic and transcriptomic data. Such approaches are essential to have a better understanding of the changes in cancer cell metabolism associated with tumorigenesis.

      The authors provided a direct link between HER2 inhibition and OXPHOS, strengthening the mechanistic aspect of the work.

      Weaknesses:

      The manuscript would have benefited from more ex-vivo approaches to further dissect mechanistic links and resolve the contradiction of elevated respiration with reduced expression of most associated proteins (but these points are clearly articulated in the discussion).

      The results presented support the authors' conclusions, and limitations are addressed in the discussion. This work will likely impact the progression of the field, and the provided data will benefit the scientific community.

      Comments on revisions:

      The authors addressed all my concerns.

    2. Reviewer #2 (Public review):

      Frangos et al present a set of studies aiming to determine mechanisms underlying initiation and tumour progression. Overall, this work provides some useful datasets, further establishing mitochondrial dysfunction during the cellular transformation process.

      A key strength is the coordinated analysis of transcriptomics and proteomics from tumour samples derived from a Neu-dependent mouse model for breast cancer. This analysis provides rigorous datasets that show robust patterns, including down-regulation across many components of mitochondrial OXPHOS that were generally consistent at both the mRNA and protein level. Parallel analysis of corresponding tumour samples thereby clearly shows the opposite trend of increased mitochondrial function, which is unexpected. As such, this work further establishes altered mitochondrial phenotypes in tumour contexts and further illustrates that mitochondrial function is not necessarily always tightly correlated with mitochondrial gene expression patterns.

      Several key weaknesses remain. It remains unclear how increased mitochondrial function is being sustained despite wide decreases in mRNA and protein levels of OXPHOS components. In terms of mechanism, the study confirmed that pharmacologic EGFR inhibition decreases OXPHOS in a EGFR-dependent breast cancer line. However, it remains unclear if the cell culture system recapitulates other key observations of the tumour model (namely decreased expression with increased function).

      Therefore, the mechanistic basis of increased mitochondrial function in light of decreased mitochondrial content remains speculative, as does the role of these changes for tumour initiation or progression.

      Comments on revisions:

      We agree with the overall findings of the study and appreciate that the claims in text and title have been appropriately toned down.

      As additional suggestions eg for presentation, many of the graphics/labels are still too small to be useful. It would be interesting to see if this cell line is similar to the tumours in terms of all the phenotypes. The lapatinib experiment was good. I wonder how quick this drug affects the mitochondria. Also it would be interesting to see if these cells have higher OXPHOS than other non-transformed breast epithelial cells.

      The WB on oxphos components is good with ab110413 but this looks like many subunits are detected so this should be made clear.

    1. Reviewer #1 (Public review):

      Summary:

      The study investigates the role of vascular mural cells, specifically pericytes and vascular smooth muscle cells (vSMCs), in maintaining blood-brain barrier (BBB) integrity and regulating vascular patterning. Analyzing zebrafish pdgfrb mutants that lack brain pericytes and vSMCs, the show that mural cell deficiency does not impair BBB establishment or maintenance during larval and early juvenile stages. However mural cells seem to be crucial for preventing vascular aneurysms and hemorrhage in adulthood as focal leakage, basement membrane disruption and increased caveolae formation are observed in adult zebrafish at aneurysm hotspots. The authors challenge the paradigm that mural cells are essential for BBB regulation in early development while highlighting their importance for long-term vascular stability.

      Strengths:

      Previous studies have established that the zebrafish BBB shares molecular and morphological homology with e.g. the mammalian BBB and therefore represents a suitable model. By examining mural cell roles across different life stages-from larval to adult zebrafish-the study provides an unprecedented comprehensive developmental analysis of brain vascular development and of how mural cells influence BBB integrity and vascular stability over time. The use of live imaging, whole-brain clearing, and electron microscopy offers high-resolution insights into cerebrovascular patterning, aneurysm development, and structural changes in endothelial cells and basement membranes. By analyzing "leakage hotspots" and their association with structural endothelial defects in adults the presented findings add novel insights into how mural cell loss may lead to vascular instability.

    2. Reviewer #2 (Public review):

      Summary:

      The authors generated a zebrafish mutant of the pdgfrb gene. The presented analyses and data confirm previous studies demonstrating that Pdgfrb signaling is necessary for mural cell development in zebrafish. In addition, the data support previously published studies in zebrafish showing that mural cell deficiency leads to hemorrhages later in life. The authors presented quantified data on vessel density and branching, assessed tracer extravasation, and investigated the vasculature of adult mice using electron microscopy.

      Strengths:

      The strength of this article is that it provides independent confirmation of the important role of Pdgfrb signaling for the development of mural cells in the zebrafish brain. In addition, it confirms previous literature on zebrafish that provides evidence that, in the absence of pericytes/VSMC, hemorrhages appear (Wang et al, 2014, PMID: 24306108 and Ando et al 2021, PMID: 3431092)".

      The Reviewing Editor has carefully reviewed the revised manuscript and is fully satisfied with the authors' revisions.

    1. Reviewer #3 (Public review):

      This manuscript provides novel insights into altered glucose metabolism and KC status during early MASLD. The authors propose that hyperactivated glycolysis drives a spatially patterned KC depletion that is more pronounced than the loss of hepatocytes or hepatic stellate cells. This concept significantly enhances our understanding of early MASLD progression and KC metabolic phenotype.

      Through a combination of TUNEL staining and MS-based metabolomic analyses of KCs from HFHC-fed mice, the authors show increased KC apoptosis alongside dysregulation of glycolysis and the pentose phosphate pathway. Using in vitro culture systems and KC-specific ablation of Chil1, a regulator of glycolytic flux, they further show that elevated glycolysis can promote KC apoptosis.

      However, it remains unclear whether the observed metabolic dysregulation directly causes KC death or whether secondary factors, such as low-grade inflammation or macrophage activation, also contribute significantly. Nonetheless, the results, particularly those derived from the Chil1-ablated model, point to a new potential target for the early prevention of KC death during MASLD progression.

      The manuscript is clearly written and thoughtfully addresses key limitations in the field, especially the focus on glycolytic intermediates rather than fatty acid oxidation. The authors acknowledge the missing mechanistic link between increased glycolysis and KC death. A few things require clarification.

      Strengths:

      • The study presents the novel observation of profound metabolic dysregulation in KCs during early MASLD and identifies these cells as undergoing apoptosis. The finding that Chil1 ablation aggravates this phenotype opens new avenues for exploring therapeutic strategies to mitigate or reverse MASLD progression.

      • The authors provide a comprehensive metabolic profile of KCs following HFHC diet exposure, including quantification of individual metabolites. They further delineate alterations in glycolysis and the pentose phosphate pathway in Chil1-deficient cells, substantiating enhanced glycolytic flux through 13C-glucose tracing experiments.

      • The data underscore the critical importance of maintaining balanced glucose metabolism in both in vitro and in vivo contexts to prevent KC apoptosis, emphasizing the high metabolic specialization of these cells.

      • The observed increase in KC death in Chil1-deficient KCs demonstrates their dependence on tightly regulated glycolysis, particularly under pathological conditions such as early MASLD.

      Weaknesses:

      • The TUNEL staining in the overview in Figure 2 is not convincing. Typically the signal overlaps with DAPI, which is mostly not the case in the figures shown.

      • The mechanistic link between elevated glycolytic flux and KC death remains unclear.

      • Figure S5: shows deltadelta CT values, not relative values. What are the housekeeping genes? There should be at least 2, and they should not have metabolically related functions such as Gapdh.

      • Figure 1C: shows WT and KO gating side by side

      • The following point has not been answered: "While BMDMs from Chil1 knockout mice are used to demonstrate enhanced glycolytic flux, it remains unclear whether Chil1 deficiency affects macrophage differentiation itself." Expression of certain genes that indicate function does not show whether BMDMs isolated from these KO mice are fully differentiated. Here, counting BM input/ BMDM output, flow cytometry on BMDMs, morphology etc. should be tested.

    2. Reviewer #4 (Public review):

      Summary:

      In this study, He et al. investigate the mechanisms underlying Kupffer cell (KC) loss during metabolic stress. It has long been observed that embryonically derived KCs decline in obesity and liver disease, a loss that is compensated by monocyte recruitment, although the underlying mechanisms remain unclear. The authors propose that metabolic reprogramming, particularly excessive glycolysis, drives KC death. Using an original murine genetic model to modulate glycolysis, they further demonstrate that enhanced glycolytic activity exacerbates KC damage.

      Strengths:

      Overall, the study is extremely clearly presented, with a convincing and simple message destined to a vast audience.

      Weaknesses:

      This manuscript has already undergone one round of revisions in which I was not involved. The authors have tried to address several points raised by the previous reviewers, notably regarding the unexpectedly high level of TUNEL staining observed in KCs. However, I share these concerns expressed by the three reviewers that the reported levels remain difficult to reconcile with the biology. A TUNEL positivity rate of ~60% at week 16 of the HFHC diet would imply massive KC death, which should have led to a near-complete depletion of the KC population, something that is not observed. While I agree that the KC compartment is clearly affected under this dietary challenge, I would strongly encourage the authors to carefully rule out potential technical biases that could account for this implausibly high rate of cell death.

      Considering the new in-vivo experiment with 2-DG, it is definitely convincing and clearly adds some value to the full study.

      So the full story deserves publication.

    1. Reviewer #1 (Public review):

      In this manuscript, the authors aimed to identify the molecular target and mechanism by which α-Mangostin, a xanthone from Garcinia mangostana, produces vasorelaxation that could explain the antihypertensive effects. Building on on prior reports of vascular relaxation and ion channel modulation, the authors convincingly show that large-conductance potassium BK channels are the primary site of action. Using electrophysiological, pharmacological, and computational evidence, the authors achieved their aims and showed that BK channels are the critical molecular determinant of mangostin's vasodiltory effects, even though the vascular studies are quite preliminary in nature.

      Strengths:

      (1) The broad pharmacological profiling of mangostin across potassium channel families, revealing BK channels - and the vascular BK-alpha/beta1 complex - as the potently activated target in a concentration-dependent manner.

      (2) Detailed gating analyses showing large negative shifts in voltage-dependence of activation and altered activation and deactivation kinetics.

      (3) High-quality single-channel recordings for open probability and dwell times.

      (4) Convincing activation in reconstituted BKα/β1-Caᵥ nanodomains mimicking physiological condition and functional proof-of-concept validation in mouse aortic rings.

      Weaknesses are minor:

      (1) Some mutagenesis data (e.g., partial loss at L312A) could benefit from complementary structural validation.

      The author's rebuttal provides alphafold3 models for mutants. While there are interesting preliminary observations, the authors decided not to include these in the main manuscript, awaiting further structual validation. I concur.

      (2) While Cav-BK nanodomains were reconstituted, direct measurement of calcium signals after mangostin application onto native smooth muscle could be valuable.

      In their response, the authors acknowledge the importance of measuring Ca2+ sparks in smooth muscle cells to further validate their findings. However, this is not provided in the manuscript. Part of my earlier comment alludes to the possibility of α-Mangostin directly affecting Cav1.2 or ryanodine receptor activity, and therefore BK activity would go up. With the current provided evidence, these possibilities cannot be excluded and need to be acknowledged.

      (3) The work has impact for ion channel physiology and pharmacology, providing a mechanistic link between a natural product and vasodilation. Datasets include electrophysiology traces, mutagenesis scans, docking analyses, and aortic tension recordings. The latter however are preliminary in nature.

      The authors acknowledge that additional vascular physiology experiments would strengthen the argument they make. They are however unable to provide such evidence in the present manuscript. Therefore, I strongly suggest that the authors tune down the physiological implications of α-Mangostin that they include in the manuscript. I'd also suggest that "vasorelaxation" is removed from the manuscript title, given the preliminary nature of the findings.

    2. Reviewer #2 (Public review):

      Summary:

      In the present manuscript, Cordeiro et al. show that α-mangostin, a xanthone obtained from the fruit of the Garcinia mangostana tree, behaves as an agonist of the BK channels. The authors arrive at this conclusion by examining the effects of mangostin on macroscopic and single-channel currents elicited by BK channels formed by the α subunit and α + β1 subunits, as well as αβ1 channels coexpressed with voltage-dependent Ca2+ (CaV1,2) channels. The single-channel experiments show that α-mangostin produces a robust increase in the probability of opening without affecting the single-channel conductance. The authors contend that α-mangostin activation of the BK channel is state-independent, and molecular docking and mutagenesis suggest that α-mangostin binds to a site in the internal cavity. Importantly, α-mangostin (10 μM) alleviates noradrenaline-induced contracture. Mangostin is ineffective if the contracted muscles are pretreated with the BK toxin iberiotoxin.

      In this revised version of the manuscript by Cordeiro et al., the authors have adequately answered my previous concerns. However, as I stated in my comments, without determining the probability of opening across a wide range of voltages, any conclusion about the drug's mechanism of action can be questioned. For example, the statement in Discussion line 481: "The higher shift observed in 1 μM Cai 2+ may reflect the steep Cai2+-dependence of the closed-open equilibrium (Cui, Cox and Aldrich, 1997) and the allosteric coupling of voltage and Cai2+ signals (Horrigan and Aldrich, 2002; Magleby, 2003; Clay, 2017), which are effective in this concentration range, which may lead to a higher apparent activation when voltage activation is facilitated by Cai 2+ (Sun and Horrigan, 2022)." has no support in the data and is not predicted by the allosteric model. In order to have a larger shift induced by the drug in the presence of Ca2+, you need either to alter the Ca2+ binding or the allosteric coupling factor C.<br /> Please note that in the manuscript, there are several problems with the English in this sentence.

      Minor

      In Figure 1E, BKa should read BKalpha.

    3. Reviewer #3 (Public review):

      Summary:

      This research shows that a-mangostin, a proposed nutraceutical, with cardiovascular protecting properties, could act through the activation of large conductance potassium permeable channels (BK). The authors provide convincing electrophysiological evidence that the compound binds to BK channels and induces a potent activation, increasing the magnitude of potassium currents. Since these channels are important modulators of the membrane potential of smooth muscle in vascular tissue, this activation leads to muscle relaxation, possibly explaining cardiovascular protecting effects.

      Strengths:

      The authors have satisfactorily answered my previous comments and present evidence based on several lines of experiments that a-mangostin is a potent activator of BK channels. The quality of the experiments and the analysis is high and represents an appropriate level of analysis. This research is timely and provides a basis to understand the physiological effects of natural compounds with proposed cardio protective effects.

      Weaknesses:

      The identification of the binding site continues to be the least developed point of the manuscript. The authors show that the binding site is probably located in the hydrophobic cavity of the pore and show that point mutations reduce the magnitude of the negative voltage shift of activation produces by a-mangostin. This binding site should be demonstrated in the future using structural techniques such as cryo-EM.

    1. Reviewer #1 (Public review):

      [Editors' note: this version has been assessed by the Reviewing Editor without further input from the original reviewers.]

      Summary:

      This study resolves a cryo-EM structure of the GPCR, GPR30, in the presence of bicarbonate, which the author's lab recently identified as the physiological ligand. Understanding the ligand and the mechanism of activation is of fundamental importance to the field of receptor signaling. This solid study provides important insight into the overall structure and suggests a possible bicarbonate binding site.

      Strengths:

      The overall structure, and proposed mechanism of G-protein coupling are solid. Based on the structure, the authors identify a binding pocket that might accommodate bicarbonate. Although assignment of the binding pocket is speculative, extensive mutagenesis of residues in this pocket identifies several that are important to G-protein signaling. The structure shows some conformational differences with a previous structure of this protein determined in the absence of bicarbonate (PMC11217264). To my knowledge, bicarbonate is the only physiological ligand that has been identified for GPR30, making this study an important contribution to the field. However, the current study provides novel and important circumstantial evidence for the bicarbonate binding site based on mutagenesis and functional assays.

      Weaknesses:

      Bicarbonate is a challenging ligand for structural and biochemical studies, and because of experimental limitations, this study does not elucidate the exact binding site. Higher resolution structures would be required for structural identification of bicarbonate. The functional assay monitors activation of GPR30, and thus reports on not only bicarbonate binding, but also the integrity of the allosteric network that transduces the binding signal across the membrane. However, biochemical binding assays are challenging because the binding constant is weak, in the mM range.

      The authors appropriately acknowledge the limitations of these experimental approaches, and they build a solid circumstantial case for the bicarbonate binding pocket based on extensive mutagenesis and functional analysis. However, the study does fall short of establishing the bicarbonate binding site.

    2. Reviewer #2 (Public review):

      Summary:

      In this manuscript, "Cryo-EM structure of the bicarbonate receptor GPR30," the authors aimed to enrich our understanding of the role of GPR30 in pH homeostasis by combining structural analysis with a receptor function assay. This work is a natural development and extension of their previous work on Nature Communications (PMID: 38413581). In the current body of work, they solved the cryo-EM structure of the human GPR30-G-protein (mini-Gsqi) complex in the presence of bicarbonate ions at 3.15 Å resolution. From the atomic model built based on this map, they observed the overall canonical architecture of class A GPCR and also identified 3 extracellular pockets created by ECLs (Pockets A-C). Based on the polarity, location, size, and charge of each pocket, the authors hypothesized that pocket A is a good candidate for the bicarbonate binding site. To identify the bicarbonate binding site, the authors performed an exhaustive mutant analysis of the hydrophilic residues in Pocket A and analyzed receptor reactivity via calcium assay. In addition, the human GPR30-G-protein complex model also enabled the authors to elucidate the G-protein coupling mechanism of this special class A GPCR, which plays a crucial role in pH homeostasis.

      Strengths:

      As a continuation of their recent Nature Communications publication, the authors used cryo-EM coupled with mutagenesis and functional studies to elucidate bicarbonate-GPR30 interaction. This work provided atomic-resolution structural observations for the receptor in complex with G-protein, allowing us to explore its mechanism of action, and will further facilitate drug development targeting GPR30. There were 3 extracellular pockets created by ECLs (Pockets A-C). The authors were able to filter out 2 of them and hypothesized that pocket A was a good candidate for the bicarbonate binding site based on the polarity, location, and charge of each pocket. From there, the authors identified the key residues on GPR30 for its interaction with the substrate, bicarbonate. Together with their previous work, they mapped out amino acids that are critical for receptor reactivity.

      Weaknesses:

      When we see a reduction of a GPCR-mediated downstream signaling, several factors could potentially contribute to this observation: 1) a reduced total expression of this receptor due to the mutation (transcription and translation issue); 2) a reduced surface expression of this receptor due to the mutation (trafficking issue); and 3) a dysfunctional receptor that doesn't signal due to the mutation.

      Altogether, the wide range of surface expression across the different cell lines, combined with the different receptor function readouts, makes the cell functional data only partially support their structural observations.

    3. Reviewer #3 (Public review):

      Summary

      GPR30 responds to bicarbonate and plays a role in regulating cellular pH and ion homeostasis. However, the molecular basis of bicarbonate recognition by GPR30 remains unresolved. This study reports the cryo-EM structure of GPR30 bound to a chimeric mini-Gq in the presence of bicarbonate, revealing mechanistic insights into its G-protein coupling. Nonetheless, the study does not identify the bicarbonate-binding site within GPR30.

      Strengths

      The work provides strong structural evidence clarifying how GPR30 engages and couples with Gq.

      Weaknesses

      Several GPR30 mutants exhibited diminished responses to bicarbonate, but their expression levels were also reduced. As a result, the mechanism by which GPR30 recognizes bicarbonate remains uncertain.

    1. Reviewer #1 (Public review):

      Summary:

      The authors introduce ImPaqT, a modular toolkit for zebrafish transgenesis, utilizing the Golden Gate cloning approach with the rare-cutting enzyme PaqCI. The toolkit is designed to streamline the construction of transgenes with broad applications, particularly for immunological studies. By providing a versatile platform, the study aims to address limitations in generating plasmids for zebrafish transgenesis.

      Strengths:

      The ImPaqT toolkit offers a modular method for constructing transgenes tailored to specific research needs. By employing Golden Gate cloning, the system simplifies the assembly process, allowing seamless integration of multiple genetic elements while maintaining scalability for complex designs. The toolkit's utility is evident from its inclusion of a diverse range of promoters, genetic tools, and fluorescent markers, which cater to both immunological and general zebrafish research needs. Even small DNA fragements, such as the viral 2a sequence, can be cloned into a multi-component plasmid in one step. The components can be assembled from PCR fragments or synthesized DNA fragments, forgoing the need for "entry" vectors. Further, the authors show that the exisiting PaqCI sites can be domesticated to improve the versatility of the system. The validation provided in the manuscript is Convincing, demonstrating the successful generation of several functional transgenic lines. These examples highlight the toolkit's efficacy, particularly for immune-focused applications.

      Comments on revisions:

      The authors have addressed all the concerns raised in the first review. Congratulations to the authors for their effort.

    2. Reviewer #2 (Public review):

      Summary:

      Hurst et al. developed a new Tol2-based transgenesis system, ImPaqT, an Immunological toolkit for PaqCl-based Golden Gate Assembly of Tol2 Transgenes, to facilitate the production of transgenic zebrafish lines. This Golden Gate assembly-based approach relies on only a short 4-base-pair overhang sequence in the final construct, and the insertion construct and backbone vector can be assembled in a single-tube reaction using PaqCl and a ligase. This approach can also be expandable by introducing new overhang sequences while maintaining compatibility with existing ImPaqT constructs, allowing users to add fragments as needed.

      The generation of several transgenic zebrafish lines for immunological studies demonstrates the feasibility of the ImPaqT in vivo. Lineage tracing of macrophages via LPS injection demonstrates the approach's functionality and validates its use in vivo.

      Comments on revisions:

      The authors have addressed all my concerns.

    1. Reviewer #1 (Public review):

      Summary:

      For each of three key transcription factor (TF) proteins in E. coli, the authors generate a large library of TF binding site (TFBS) sequences on plasmids, such that each TFBS is coupled to the expression of a fluorescence reporter. By sorting the fluorescence of individual cells and sequencing their plasmids to identify each cell's TFBS sequence (sort-seq), they are able to map the landscape of these TFBSs to the gene expression level they regulate. The authors then study the topographical features of these landscapes, especially the number and distribution of local maxima, as well as the statistical properties of evolutionary paths on these landscapes. They find the landscapes to be highly rugged, with about as many local peaks as a random landscape would have, and with those peaks distributed approximately randomly in sequence space. This is quite different from previous work on landscapes for eukaryotic TFBSs, which tend to be rather smooth. The authors find that there are a number of peaks that produce regulation stronger than that of the wild-type sequence for each TF, and that it is not too unlikely to reach one of those "high peaks" from a random starting sequence. Nevertheless, the basins of attraction for different peaks have significant overlap, which means that chance plays a major role in determining which peak a population will evolve to.

      Strengths:

      (1) The apparent differences in landscape topography between prokaryotic TFBSs and other molecular landscapes is a fascinating discovery to add to the field of genotype-phenotype maps. I am really excited to learn the molecular mechanisms of this in the future.

      (2) The experiments and analysis of this paper are very well-executed and, by and large, very thorough. I appreciated the systematic nature of the project, both the large-scale experiments done on three TFs with replicates, and the systematic analysis of the resulting landscapes. This not only makes the paper easy to follow, but also inspires confidence in their results since there is so much data and so many different ways of analyzing it. It's a great recipe for other studies of genotype-phenotype landscapes to follow.

      (3) Considering how technical the project was, I am really impressed at how easy to read I found the paper, and the authors deserve a lot of credit for making it so. They do a great job of building up the experiments and analyses step-by-step, and explaining enough of the basics of the experimental design and essence of each analysis in the main text without getting too complicated with details that can be left to the Methods or SI.

      Weaknesses:

      (1) Regarding the effect of measurement uncertainties, one way in which they attempt to test their effect is to simulate dynamics on noisy and noise-free versions of the landscape and measure visitation frequencies. While they show that visitation frequencies are highly correlated between these cases, I'd prefer a more direct test of epistasis or navigability (e..g, number of local peaks), since that's how they are characterizing the landscapes, and the connection between that and visitation frequency of individual states is unclear.

      (2) I am still a little concerned about the fraction of sequences missing from the data due to filtering, although I appreciate the difficulties in testing the importance of this (requiring additional assumptions) and the authors' good-faith efforts to do their best with the data they have.

    2. Reviewer #2 (Public review):

      The authors aim to investigate the ability of evolution to create strong transcription factor binding sites (TFBSs) de novo in E. coli. They focus on three global transcriptional regulators: CRP, Fis, and IHF, using a massively parallel reporter assay to evaluate the regulatory effects of over 30,000 TFBS variants. By analyzing the resulting genotype-phenotype landscapes, they explore the ruggedness, accessibility, and evolutionary dynamics of regulatory landscapes, providing insights into the evolutionary feasibility of strong gene regulation. Their experiments show that de novo adaptive evolution of new gene regulation is feasible. It is also subject to a blend of chance, historical contingency, and evolutionary biases that favor some peaks and evolutionary paths.

      (1) Strengths of the methods and results:

      The authors successfully employed a well-designed sort-seq assay combined with high-throughput sequencing to map regulatory landscapes. The experimental design ensures reliable measurement of regulation strengths. Their system accounts for gene expression noise and normalizes measurements using appropriate controls.

      Comprehensive Landscape Mapping:<br /> The study examines ~30,000 TFBS variants per transcription factor, providing statistically robust and thorough maps of the regulatory landscapes for CRP, Fis, and IHF. The landscapes are rigorously analyzed for ruggedness (e.g., number of peaks) and epistasis, revealing parallels with theoretical uncorrelated random landscapes.

      Evolutionary Dynamics Simulations:<br /> Through simulations of adaptive walks under varying population dynamics, the authors demonstrate that high peaks in regulatory landscapes are accessible despite ruggedness. They identify key evolutionary phenomena, such as contingency (multiple paths to peaks) and biases toward specific evolutionary outcomes.

      Biological Relevance and Novelty:<br /> The author's work is novel in focusing on global regulators, which differ from previously studied local regulators (e.g., TetR). They provide compelling evidence that rugged landscapes are navigable, facilitating de novo evolution of regulatory interactions. The comparison of landscapes for CRP, Fis, and IHF underscores shared topographical features, suggesting general principles of global transcriptional regulation in bacteria.

      (2) Weaknesses of the methods and results:

      Undersampling of Genotype Space:<br /> Approximately 40% of the theoretical TFBS genotype space remains uncharacterized after quality filtering. The authors now discuss this limitation more explicitly and provide analyses suggesting that undersampling does not strongly bias their conclusions at the landscape level. Nevertheless, predictive modeling approaches could further extend these landscapes in future work.

      Simplified Regulatory Architecture:<br /> The study considers a minimal system consisting of a single TFBS upstream of a reporter gene. While this simplification allows clean interpretation and high-throughput measurement, natural promoters often involve combinatorial regulation and chromosomal context effects that may alter landscape topography.

      Lack of Experimental Evolution Validation:<br /> The evolutionary conclusions are based on simulations rather than direct experimental evolution. The authors provide a reasonable justification for this choice and frame their conclusions at the statistical level rather than for specific trajectories, but experimental validation would be a valuable future extension.

      Impact on the Field:<br /> This study advances our understanding of adaptive landscapes in gene regulation and offers a critical step toward deciphering how global regulators evolve de novo binding sites. The findings provide foundational insights for synthetic biology, evolutionary genetics, and systems biology by highlighting the evolutionary accessibility of strong regulation in bacteria.

      Utility of Methods and Data:<br /> The sort-seq approach, combined with landscape analysis, provides a robust framework that can be extended to other transcription factors and systems. If made publicly available, the study's data and code would be valuable for researchers modeling transcriptional regulation or studying evolutionary dynamics.

      Additional Context:<br /> The study builds on a growing body of work exploring regulatory evolution. For instance, recent studies on local regulators like TetR and AraC have revealed high ruggedness and epistasis in TFBS landscapes. This study distinguishes itself by focusing on global regulators, which are more complex biologically and more influential in bacterial gene networks. The observed evolutionary contingency aligns with findings in other biological systems, such as protein evolution and RNA folding landscapes, underscoring the generality of these evolutionary principles.

      Conclusion:<br /> The authors successfully mapped the genotype-phenotype landscapes for three global regulators and simulated evolutionary dynamics to assess the feasibility of strong TFBS evolution. They convincingly demonstrate that ruggedness and epistasis, while prominent, do not preclude the evolution of strong regulation. Their results support the notion that gene regulation evolves through a blend of chance, contingency, and evolutionary biases.

      This paper makes a significant contribution to the understanding of regulatory evolution in bacteria. While minor limitations exist, the authors' methods are robust, and their findings are well-supported. The work will likely be of broad interest to researchers in molecular evolution, synthetic biology, and gene regulation.

    1. Reviewer #1 (Public review):

      The work presented by Cheung et al. used a quantitative proteomics method to capture molecular changes in B cells exposed to LPS and IL-4, a combination of stimuli activating naive B cells. Amino acid transporters, cholesterol biosynthetic enzymes, ribosomal components, and other proteins involved in cell proliferation were found to increase in stimulated B cells. Experiments involving genetic loss-of-function (SLC7A5), pharmacological inhibition (HMGCR, SQLE, prenylation), and functional rescue by metabolites (mevalonate, GGPP) validated the proteomics data and revealed that amino acid uptake, cholesterol/mevalonate biosynthesis, and cholesterol uptake played a crucial role in B cell proliferation, survival, biogenesis, and immunoglobulin class switching. Experiments involving cholesterol-free medium showed that both biosynthesis and LDLR-mediated uptake catered to the cholesterol demand of LPS/IL-4-stimulated B cells. A role for protein prenylation in LDLR-mediated cholesterol uptake was postulated and backed by divergent effects of GGPP rescue in the presence and absence of cholesterol in culture medium.

      Strengths:

      The discovery was made by proteome-wide profiling and unbiased computational analysis. The discovered proteins were functionally validated using appropriate tools and approaches. The metabolic processes identified and prioritized from this comprehensive survey and systematic validation highly likely represent mechanisms of high importance and influence. Analysis of immune cell metabolism at the protein level is relatively compared to transcriptomic and metabolomic analysis.

      The conclusions from functional validation experiments were supported by clear data and based on rational interpretations. This was enabled by well-established readouts/analytical methods used to determine cell proliferation, viability, size, cholesterol content, and transporter/enzyme function. The data generated from these experiments strongly support the conclusions.

      This work reveals a complex, yet intriguing, relationship between cholesterol metabolism and protein prenylation as they serve to promote B cell activation. The effects of pharmacological inhibition and metabolite replenishment on the cholesterol content and activation of B cells were determined and logically interpreted.

      Weaknesses:

      The findings of this study were obtained almost exclusively from ex vivo B cell stimulation experiments. Their contribution to B cell state and B cell-mediated immune responses in vivo was not explored. Without in vivo data, the study still provides valuable mechanistic information and insights, but it remains unknown, and there is no discussion about, how the identified mechanisms may play out in B cell immunity.

      The role of HMGCR, SQLE, and prenylation in B cell activation was assessed using pharmacological inhibitors. Evidence from other loss-of-function approaches, which could strengthen the conclusions, does not exist. This is a moderate weakness and somewhat offset by other data, including those obtained from the tests involving multiple distinct pharmacological inhibitors and the metabolite replenishment experiments.

    2. Reviewer #2 (Public review):

      This study uses mass spectrometry to quantify how LPS + IL-4 modify the mouse B cell proteome as naïve cells undergo blastogenesis and enter the cell cycle. This analysis revealed changes in key proteins involved in amino acid transport and cholesterol biosynthesis. Genetic and pharmacological experiments indicated important roles for these metabolic processes in B cell proliferation.

      This work provides new information about the regulation of TI B cell responses by changes in cell metabolism and also a comprehensive mass spectrometry dataset which will be an important general resource for future studies. The experiments are thorough and carefully carried out. The majority of conclusions are backed up by data that is shown to be highly significant statistically. The comprehensive mass spectrometry dataset will be an important general resource for future studies.

      After revision, the study now includes new data showing that the up regulation of amino acid uptake and cholesterol metabolism is not restricted to LPS + IL-4 (TLR4 + IL4R) stimulation but is also observed after stimulation of TLR7, TLR9, CD40 and the BCR. This increases the impact of this work and shows that this metabolic rewiring is a common feature of B cell activation. The inclusion of inhibitor data showing important roles for MTOR and ERK/p38a MAP kinases in the metabolic changes identified and provides preliminary insights into the mechanisms involved.

    1. Reviewer #1 (Public review):

      Summary:

      Adult laboratory mice produce ultrasonic vocalizations during free social interactions, as well as lower-frequency, voiced calls (squeaks) during aversive contexts. The question of whether mice possess a more complex repertoire of vocalizations has been of great interest to scientists studying rodent vocal behavior. In the current study, the authors analyze the rates and acoustic features of vocalizations produced by pairs of mice that are allowed to interact across a barrier, which prevents direct physical interaction. In this context, they find that same-sex (but not opposite-sex) pairs of mice produce vocalizations that are lower in frequency than the typical 70 kHz ultrasonic vocalizations produced during free interactions and that are also distinct from squeaks. These lower frequency vocalizations were observed in both male-male and female-female pairs, as well as in same-sex pairs from multiple mouse strains. The authors also report that call rates and acoustic features are not affected in male-male pairs that have been treated with the anxiolytic drug buspirone, suggesting that anxiety is not a major driver of vocalization in this behavioral context.

      Strengths:

      (1) The observation that same-sex pairs of mice produce lower frequency (<70 kHz) vocalizations in this behavioral context is novel.

      (2) The consideration of multiple types of pairs (female-female, male-male, and female-male), as well as the inclusion of multiple strains of mice and barriers with different hole diameters, are all strengths of the study.

      (3) The authors include detailed analyses of vocalization acoustic features, as well as detailed tracking of mouse positions relative to the barrier.

      Weaknesses:

      The categorization applied to vocalizations based on their mean frequencies is poorly supported and ignores the distinction in laryngeal production mechanism between voiced and ultrasonic vocalizations. Specifically, the authors are likely lumping together voiced and ultrasonic vocalizations into their "low frequency" (< 30 kHz) category, while they reserve the term "ultrasonic" exclusively for the subset of ultrasonic vocalizations with the highest mean frequencies (> 50 kHz). This categorization scheme also does not align well with past work on lower frequency rodent vocalizations, which complicates the comparison of the present findings to that past work.

      In some analyses, the authors report that different groups of mice produce different relative proportions of vocalization types (as defined by mean frequency) but then compare acoustic features of vocalizations between groups after pooling all vocalizations together. The analyses of acoustic features conducted in this way may be confounded by the different proportions of vocalization types across groups.

    2. Reviewer #2 (Public review):

      Summary:

      In this manuscript, the authors examine vocal communication during same-sex dyadic interactions in mice, comparing periods of physical separation (with limited sensory access) to direct social contact. They report that separation dramatically alters the vocal repertoire, shifting it away from canonical ultrasonic vocalizations (USVs) toward low-frequency vocalizations (LFVs) and broadband "noisy" calls. While LFVs and noisy calls have been described previously, largely in aversive contexts, this study provides a detailed, systematic characterization of these vocalizations during social interactions, thereby extending prior work.

      The authors explore several experimental manipulations and analyses, including divider hole size, strain and sex differences, anxiolytic drug treatment, and correlations with spatial proximity, to infer potential functions of these call types. Although the dataset is rich, the results are largely descriptive, and many conclusions remain tentative. Several experimental variables are not fully controlled, and in some cases, the interpretation exceeds what the data can clearly support. Nonetheless, with improved experimental framing, additional analyses of existing data, and a clearer discussion of limitations, this work has the potential to make a valuable contribution by broadening the field's focus beyond USVs to understand a wider vocal repertoire relevant to social context.

      Strengths:

      Much work on mouse vocal communication focuses almost exclusively on USVs. This manuscript convincingly demonstrates that non-USV vocalizations (LFVs and noisy calls) are prominent and systematically modulated by social context, highlighting an underappreciated dimension of mouse communication. Furthermore, the authors employ several experimental manipulations, including sensory access, strain, sex, and pharmacological treatment, to assess changes in vocalization repertoire. This provides a valuable resource for the field and reveals robust context dependence of vocalization. The discussion is thoughtful and integrative, particularly in its consideration of potential communicative roles of LFVs and noisy calls and their relationship to sensory constraints and signal propagation, although these ideas will require further experimental validation.

      Weaknesses:

      There are several concerns regarding experimental design and data interpretation that could be addressed to strengthen the manuscript.

      (1) The terminology used for vocalization types is confusing and needs better clarification. The authors refer to Grimsley et al. (2016) multiple times, yet they use the same names for their vocalizations while applying different definitions. This makes it very difficult to compare the two papers. Since this study and Grimsley et al. use different mouse strains (FVB vs CBA), a direct comparison of absolute frequencies may also not be appropriate. Please explicitly clarify the definitions of the call types (e.g., frequency range, voiced vs. USV) and explain how they relate to those in the previous study earlier in the manuscript.

      (2) In the initial experiment, mice always experience separation first (15 minutes), followed by unification (5 minutes), using novel same-sex dyads. Multiple factors besides physical contact could influence vocalization across this sequence, including habituation to the arena, reduced anxiety over time, or increasing familiarity with the partner despite physical separation. It is unclear whether the authors have tested the reverse order (unification first, followed by separation). If not, this limitation should be explicitly acknowledged. In addition, examining whether vocalizations or behaviors change over the course of the 15-minute separation period, for example, by comparing early vs late phases, could help disentangle effects of habituation from those of physical separation per se.

      (3) The conclusion that separation-induced LFVs are unlikely to be anxiety-driven may overinterpret the buspirone experiment (Figure 8). Vehicle injections themselves produced large changes in call rate and call-type distribution, raising concerns about stress or arousal induced by the injection procedure. Comparisons between buspirone-treated animals and untreated animals are therefore problematic, as these groups differ in their experimental histories, including the number of exposures. The manuscript would benefit from independent measures confirming the anxiolytic efficacy of buspirone compared to vehicle injection in this paradigm, such as behavioral readouts of anxiety. In addition, the experimental design requires a clearer description. It is not always clear whether the same dyads were tested twice, or how social familiarity, contextual familiarity, and habituation to injections were handled. Male data comparing first and second exposures should also be included as supplementary figures to allow direct comparison with the excluded female dataset.

      (4) The idea that noisy calls function to attract conspecific attention is intriguing. However, in Figure 5, all call types, including LFVs and USVs, are most likely to occur when mice are already in close proximity during separation, which seems inconsistent with a long-distance signaling role. Analyses of the temporal relationship between vocalizations and behavior would strengthen this claim. For example, it would be informative to test whether bouts of noisy calls precede approach behavior or a reduction in inter-animal distance. Examining whether calls occur before, during, or after orientation toward the partner could further clarify whether these vocalizations actively modulate social behavior.

      (5) The effects of divider hole size on vocal repertoire are striking but difficult to interpret. Unexpectedly, small holes and no holes yield similar call distributions, whereas large holes produce a markedly different profile dominated by LFVs, which also differs from free interactions. If large holes allow greater tactile or close-range interaction, the reduction in USVs and MFV is counterintuitive. Incorporating behavioral metrics such as distance, orientation, or specific interaction types alongside call classification would greatly aid interpretation and help link vocal output to interaction quality rather than divider type alone.

      (6) Throughout the study, vocalizations are pooled across both animals in the dyad. Because the arena is neutral rather than a home cage, either animal could be initiating vocalization. Assigning calls to individuals, where possible, using spatial or acoustic cues, would substantially strengthen functional interpretations. Even limited analyses, e.g., identifying which animal vocalizes first or whether calls precede approach by the partner, could provide important insight into the communicative role of different call types.

    1. Reviewer #1 (Public review):

      Summary:

      The authors aimed to uncover novel therapeutic vulnerabilities in APC-mutant colorectal cancer (CRC), which constitutes the majority of CRC cases. They hypothesized that modulating oxygen-sensing pathways (via PHD inhibition) could disrupt adaptive stress responses in these tumours.

      Strengths:

      The study employs a powerful, two-pronged approach to identify Molidustat's targets. By using both Thermal Proteome Profiling (TPP) and an orthogonal chemical proteomic competition assay, the authors provide compelling evidence that GSTP1 is a genuine, direct off-target, effectively addressing the common limitation of indirect effects in proteomic screens.

      Weaknesses:

      (1) In Figure 1, the current data rely on a single guide RNA (sgRNA). To make the data solid, at least two independent sgRNAs targeting different regions of PHD2 should be used.

      (2) Figure 3E: Asn205 site should be mutated to prove that whether Molidustat inhibits GSTP1 activity via Asn205 or not.

      (3) Figure 5B and 5C: The metabolic imbalance phenotype observed upon dual knockout of PHD2 and GSTP1 requires rescue experiments to confirm on-target specificity.

    2. Reviewer #2 (Public review):

      Summary:

      The authors aimed to determine Molidustat targets and the potential utility of these findings. They clearly demonstrate that Molidustat interferes with GSTP1 and some other proteins on top of PHD2. They also demonstrate that PHD2 deletion is not sufficient to recapitulate Molidustat effects in cells and proteomes. Finally, they demonstrate synthetic lethality in organoids for Molidustat and APC deletion.

      Strengths:

      The data on Molidustat proteomes, GSTP1 binding, inhibition and metabolic health of organoids is really clear. All biochemical, docking and omic data are really strong. The potential impact of these findings could be the use of Molidustat in APC null tumours and awareness of potential off-target effects.

      Weaknesses:

      A main but minor weakness is that Molidustat also inhibits other PHDs, although these are less expressed. PHD1 has been shown to control the cell cycle and be expressed in the colon, where it is needed for viability. Although this does not explain the lack of effect of other PHD inhibitors, it does warrant some discussion. The use of MTT is not very good to detect viability when it measures metabolism; this also needs to be discussed and perhaps supplemented with colony or cell number measurements.

      Reviewer #3 (Public review):

      In this paper, the authors revealed that Molidustat can induce a dose-dependent increase in Caspase-3/7 activity in the HT29 cell line, which is an APC-mutant colorectal cancer cell line. More importantly, they found that targeting PHD2 alone cannot cause cell death. By using thermal proteome profiling (TPP) and orthogonal chemical proteomic competition assays, they determined GTSP1 as a previously undiscovered off-target of Molidustat. They also revealed that combined PHD2 and GSTP1 loss leads to an increase in intracellular ROS and apoptosis. Moreover, they evaluated the effects of Molidustat in colonic organoids and showed that Molidustat has a high selectivity for colonic organoids with activated WNT signaling and/or KRAS pathway alterations, and this effect is not reproduced by hydroxylase inhibition alone, providing a new potential approach to targeting both PHD2 and GTSP1 for the treatment of APC-mutant CRC.

      Specific comments:

      (1) What is the possible molecular mechanism of dual GSTP1/PHD2 loss, inducing cell death?

      (2) Can the authors mutate the binding site of Molidustat on GTSP1 to verify the in silico docking results?

      (3) Evidence for Molidustat inhibiting PHD2 activity or stabilising HIF-1α should be provided.

  2. Apr 2026
    1. Reviewer #1 (Public review):

      Summary:

      The study examines human biases in a regime-change task, in which participants have to report the probability of a regime change in the face of noisy data. The behavioral results indicate that humans display systematic biases, in particular, overreaction in stable but noisy environments and underreaction in volatile settings with more certain signals. fMRI results suggest that a frontoparietal brain network is selectively involved in representing subjective sensitivity to noise, while the vmPFC selectively represents sensitivity to the rate of change.

      Strengths:

      - The study relies on a task that measures regime-change detection primarily based on descriptive information about the noisiness and rate of change. This distinguishes the study from prior work using reversal-learning or change-point tasks in which participants are required to learn these parameters from experiences. The authors discuss these differences comprehensively.

      - The study uses a simple Bayes-optimal model combined with model fitting, which seems to describe the data well. The model is comprehensively validated.

      - The authors apply model-based fMRI analyses that provide a close link to behavioral results, offering an elegant way to examine individual biases.

      Weaknesses:

      The authors have adequately addressed my prior concerns.

    2. Reviewer #3 (Public review):

      This study concerns how observers (human participants) detect changes in the statistics of their environment, termed regime shifts. To make this concrete, a series of 10 balls are drawn from an urn that contains mainly red or mainly blue balls. If there is a regime shift, the urn is changed over (from mainly red to mainly blue) at some point in the 10 trials. Participants report their belief that there has been a regime shift as a % probability. Their judgement should (mathematically) depend on the prior probability of a regime shift (which is set at one of three levels) and the strength of evidence (also one of three levels, operationalized as the proportion of red balls in the mostly-blue urn and vice versa). Participants are directly instructed of the prior probability of regime shift and proportion of red balls, which are presented on-screen as numerical probabilities. The task therefore differs from most previous work on this question in that probabilities are instructed rather than learned by observation, and beliefs are reported as numerical probabilities rather than being inferred from participants' choice behaviour (as in many bandit tasks, such as Behrens 2007 Nature Neurosci).

      The key behavioural finding is that participants over-estimate the prior probability of regime change when it is low, and under estimate it when it is high; and participants over-estimate the strength of evidence when it is low and under-estimate it when it is high. In other words participants make much less distinction between the different generative environments than an optimal observer would. This is termed 'system neglect'. A neuroeconomic-style mathematical model is presented and fit to data.

      Functional MRI results how that strength of evidence for a regime shift (roughly, the surprise associated with a blue ball from an apparently red urn) is associated with activity in the frontal-parietal orienting network. Meanwhile at time-points where the probability of a regime shift is high, there is activity in another network including vmPFC. Both networks show individual differences effects, such that people who were more sensitive to strength of evidence and prior probability show more activity in the frontal-parietal and vmPFC-linked networks respectively.

      Strengths

      (1) The study provides a different task for looking at change-detection and how this depends on estimates of environmental volatility and sensory evidence strength, in which participants are directly and precisely informed of the environmental volatility and sensory evidence strength rather than inferring them through observation as in most previous studies

      (2) Participants directly provide belief estimates as probabilities rather than experimenters inferring them from choice behaviour as in most previous studies

      (3) The results are consistent with well-established findings that surprising sensory events activate the frontal-parietal orienting network whilst updating of beliefs about the word ('regime shift') activates vmPFC.

      Weaknesses

      (1) The use of numerical probabilities (both to describe the environments to participants, and for participants to report their beliefs) may be problematic because people are notoriously bad at interpreting probabilities presented in this way, and show poor ability to reason with this information (see Kahneman's classic work on probabilistic reasoning, and how it can be improved by using natural frequencies). Therefore the fact that, in the present study, people do not fully use this information, or use it inaccurately, may reflect the mode of information delivery.

      In the response to this comment the authors have pointed out their own previous work showing that system neglect can occur even when numerical probabilities are not used. This is reassuring but there remains a large body of classic work showing that observers do struggle with conditional probabilities of the type presented in the task,

      (2) Although a very precise model of 'system neglect' is presented, many other models could fit the data.

      For example, you would get similar effects due to attraction of parameter estimates towards a global mean - essentially application of a hyper-prior in which the parameters applied by each participant in each block are attracted towards the experiment-wise mean values of these parameters. For example, the prior probability of regime shift ground-truth values [0.01, 0.05, 0.10] are mapped to subjective values of [0.037, 0.052, 0.069]; this would occur if observers apply a hyper-prior that the probability of regime shift is about 0.05 (the average value over all blocks). This 'attraction to the mean' is a well-established phenomenon and cannot be ruled out with the current data (I suppose you could rule it out by comparing to another dataset in which the mean ground-truth value was different).

      More generally, any model in which participants don't fully use the numerical information they were given would produce apparent 'system neglect'. Four qualitatively different example reasons are: 1. Some individual participants completely ignored the probability values given. 2. Participants did not ignore the probability values given, but combined them with a hyperprior as above. 3. Participants had a reporting bias where their reported beliefs that a regime-change had occurred tend to be shifted towards 50% (rather than reporting 'confident' values such 5% or 95%). 4. Participants underweighted probability outliers resulting in underweighting of evidence in the 'high signal diagnosticity' environment (10.1016/j.neuron.2014.01.020 )

      In summary I agree that any model that fits the data would have to capture the idea that participants don't differentiate between the different environments as much as they should, but I think there are a number of qualitatively different reasons why they might do this - of which the above are only examples.

    1. Reviewer #3 (Public review):

      Summary

      In this manuscript, Zhang et al. investigate the conduction and inhibition mechanisms of the Kv2.1 channel, with a particular focus on the distinct effects of TEA and RY785 on Kv2 potassium channels. Using microsecond-scale molecular dynamics simulations, the authors characterize K⁺ ion permeation and RY785-mediated inhibition within the central pore. Their results reveal an inhibition mechanism that differs from those described for other Kv channel inhibitors.

      Strengths

      The study identifies a distinctive inhibitory mode for RY785, which binds along the channel walls in the open-state structure while still permitting a reduced level of K⁺ conduction. In addition, the authors propose a long-range allosteric coupling between RY785 binding in the central pore and changes in the structural dynamics of Kv2.1. Overall, this is a well-organized and carefully executed study, employing robust simulation and analysis methodologies. The work provides novel mechanistic insights into voltage-gated potassium channel inhibition and may offer useful guidance for future structure-based drug design efforts.

      Weaknesses:

      As noted in the Discussion, this study focuses primarily on the major binding site within the central pore and was not designed to systematically assess other potential allosteric binding sites for RY785. A more comprehensive structural and biophysical evaluation of possible additional binding sites would be a valuable direction for future investigations.

      Comments on revisions:

      The authors have addressed my comments.

    1. Reviewer #1 (Public review):

      Summary:

      In many vertebrates, the neural tube closes by folding, elevation, and fusion of bilateral neural folds. Loss of the actin-binding protein Vinculin causes failed cranial neural tube closure in mice and is associated with neural tube defects in human patients, but it was not known how Vinculin contributes to neural tube closure. Here, Prudhomme and colleagues find that neural fold elevation and the apical constriction that drives it initiate normally in Vinculin-deficient mouse embryos, but both arrest before the neural folds fuse. The time of failure coincides with increased mechanical tension within the cranial neural plate. They find that Vinculin localizes to areas of high mechanical stress in the WT neural plate, including multi-cellular junctions and dividing cells, and in the absence of Vinculin, recruitment of Myosin and Apical junction proteins is reduced at these sites. These data support a model in which Vinculin recruits junctional proteins to high-stress areas to maintain junctional integrity during neural tube closure.

      Strengths:

      The data presented are thorough, rigorous, and convincing. The combination of live imaging and transgenic fluorescent reporters enables direct observation of junctional behaviors within the mouse cranial neural plate and detailed analysis of how these behaviors are disrupted upon loss of Vinculin. The authors make good use of an ESC transplant approach to efficiently generate mutant and transgenic embryos for analysis.

      Weaknesses:

      Although the loss of junctional integrity, especially at multi-cellular junctions, is clearly and convincingly demonstrated in Vinculin-deficient embryos, it is not clear precisely how this disrupts the elevation of the neural folds to cause exencephaly.

    2. Reviewer #2 (Public review):

      Summary

      Using mouse embryos early in development, this excellent paper from Prudhomme et al. shows that Vinculin's recruitment to adherens junctions during mammalian cranial neural tube closure is essential for maintaining junctional integrity in response to increased tension during this process. Previous work had shown that during neural tube elevation, planar polarity of Myosin II and mechanical forces in the tissue are increased. Additionally, mouse embryos lacking Vinculin were known to display neural tube closure failure, and mutations in human Vinculin had been associated with increased risk of neural tube defects, but the mechanism remained unclear. Here, the authors utilize a high-throughput embryonic stem cell (ESC)-based pipeline to generate Vinculin-depleted embryos, complemented by a conditional mutant lacking Vinculin in the embryonic lineages, to investigate this question. The authors show that Vinculin is not required for force generation, but Vinculin is recruited to cell-cell junctions in a tension-dependent manner and is needed to transmit actomyosin-mediated tension to junctions - particularly tricellular and higher-order multicellular junctions - so that apical constriction can happen during neural fold elevation. Furthermore, they find that Vinculin is required to maintain adhesion during high force events (e.g., rosette resolution and cell division) during neural tube closure. The research builds on previous studies about Vinculin's role in mechanotransduction at cell-cell junctions carried out in cultured epithelial cells, zebrafish cardiomyocytes, or early Xenopus embryos, and investigates how physiological forces required for mouse neural tube closure challenge junction integrity and the important role that Vinculin plays in maintenance of junction integrity and translation of mechanical forces into changes in tissue structure during this process.

      Strengths:

      This study stands out for its sophisticated use of laser ablation and live imaging in neurulating mouse embryos, enabling quantification of junctional tension, Vinculin recruitment to multicellular junctions, and assessment of junction integrity during neural tube elevation. The authors' use of both ESC-derived Vinculin mutant embryos complemented by a second conditional mutant of Vinculin convincingly demonstrates that their findings are specific to the loss of Vinculin. Additionally, the authors demonstrated proof-of-principle for their ESC-based pipeline with a Shroom3 mutant known to be important for neural tube closure. The Zallen lab's application of the genetically engineered ESC-derived mouse embryo pipeline to efficiently generate larger numbers of mutant mouse embryos exhibiting neural tube closure defects (compared with traditional genetic crossing strategies) that can be utilized for live imaging and mechanical perturbations like laser ablation will be valuable for future work in the field. The authors show that Vinculin depletion disrupts tricellular and multicellular junctions. Notably, over 75% of higher-order (5+) vertices in Vinculin mutant embryos display gaps, but interestingly, about one third of 5+ cell junctions in Control embryos also display gaps, indicating that transient vertex remodeling events are needed for normal neural tube closure. Overall, this is a well-written paper that places the authors' findings within the context of prior literature; their beautiful data that is robustly analyzed and clear figure presentation will make the authors' exciting findings accessible to readers.

      Weaknesses:

      The criteria for selection of junctions targeted by laser ablation, including specifics of location, Myosin II intensity, and initial junction length, should be more clearly described in the Methods, especially given the use of different reporter strains (MyoIIB-GFP vs. GFP-Plekha7) across figures, which may influence junction selection for laser ablation. Analysis of Myosin II in Vinculin mutant embryos would benefit from staining for active Myosin II (pMRLC), and further examination of actomyosin organization at different stages of neural fold elevation in controls vs. Vinculin mutants would be informative. Although the authors note that ZO-1 gaps are limited to a subset of vertices where adherens junction gaps are detected, the increased frequency of tight junction gaps in Vinculin mutants could have functional significance that should be noted. Finally, inclusion of schematics to detail how the adherens and tight junction gaps were defined and measured at cell vertices, as well as how cell division completion was defined, would improve transparency and strengthen readers' understanding of how the data were quantified.

    3. Reviewer #3 (Public review):

      Summary:

      Prudhomme et al report a detailed analysis of the role of vinculin in maintaining neuroepithelial integrity during cranial neurulation.

      Strengths:

      The authors use complementary experiments involving super-resolution microscopy, laser ablation, and live imaging of conditional knockout and ESC-derived embryos to demonstrate that loss of vinculin produces wide gaps between the adherens junctions of neuroepithelial cells at later stages of cranial neural fold elevation. The data presented are of extremely high quality, logically presented in a compelling story, and represent a very substantial contribution.

      Weaknesses:

      The authors are invited to consider the largely minor questions recommended below.

      (1) The laser ablations reported are a correlate of cell border, or 'junctional' tension. Please avoid broad statements such as 'mechanical forces are upregulated' (abstract), which invoke gene-like regulation of tissue-level forces (in Newtons). Changes in junctional tension are likely to relate to changes in force generated, but their relationship is not simple: higher tensile stress withstood by the shorter length of junctions in cells with smaller apical surfaces does not necessarily translate into greater force being produced by that cell. The junctional tension readout measured is perfectly relevant to the paper, more so than tissue-level forces would have been.

      (2) What is the mechanical mechanism by which loss of vinculin prevents neural fold elevation? The authors present exciting findings about the cellular consequences of losing Vcl at the late elevation stages when the tissue is quantifiably dysmorphic. A clear argument of how Vcl loss could lead to this dysmorphology would strengthen the paper, particularly given that junctional tension defects are excluded and apical non-constriction at the late stage is only mild.

      (3) Can the authors comment on the likely impacts of Vcl deletion on the basal domain of the cell? For example, they could cite live-imaging of distinct behaviours in Williams et al Dev Cell 2014, and the NTD phenotypes of some integrin/focal adhesion mutant mice.

      (4) The apparent uncoupling of apical area (larger in Vcl KO) from junctional tension (equivalent) in this model is noteworthy. Can the authors speculate on its potential basis?

      (5) Live imaging in Figure 7C appears to show a marked reduction in apical area before cleavage furrow formation (T0-18min), suggesting a large apical constriction event (post-mitotic?), as previously reported (e.g., Ampartzidis et al Dev Biol 2023). Do junctional gaps appear during these constrictions?

      (6) The live imaging setup used is clearly sufficient to identify differences between genotypes, so this is only a minor point. The gassing conditions listed in the methods specify 5% CO2, but E8.5 embryos also need low O2 to complete cranial closure. Was the O2 level controlled? Was tissue-level shape change observed to be consistent with ongoing neurulation during live-imaging?

      (7) Neither the multi-cell laser ablations in the pre-print by De La O cited here, nor the narrower junctional ablations in Bocanegra-Moreno et al., Nat Phys, (2023), identified differences in recoil between developmental stages. Why might those results be different from the findings reported here (e.g., analysis region - not specified in the latter paper)? Limitations to interpreting junctional ablations between cells with different junction lengths include more of the recoil being dissipated by retraction of the longer ablated border.

      (8) Is a truncated Vcl expressed in the ESC model, which could bind catenin without an F-actin anchor? The very high-contrast western shown is cropped so it is not clear whether the catenin-binding N-terminus is present. Does the antibody used recognise the head domain (this reviewer could not readily find the information)?

    1. Reviewer #1 (Public review):

      Summary:

      Using electron microscopy, the authors report discontinuities in the plasma membrane of C. elegans embryos. They associate these discontinuities with cell division and speculate that membrane rupture and subsequent resealing contribute to cytokinesis. They further discuss the proximity of these sites to vesicles and propose a role for vesicle-mediated membrane extension.

      Weaknesses:

      (1) The possibility that the membrane discontinuity is an artifact

      Although the authors focus on discontinuities in the plasma membrane, similar discontinuities are also observed in mitochondria, the nuclear envelope, and yolk granules. This raises concerns about whether the electron micrographs presented are suitable for assessing membrane continuity.

      Electron micrographs result from a lengthy sample preparation process, including high-pressure freezing, freeze substitution in acetone containing OsO4, gradual warming, uranyl acetate staining, resin embedding, and ultrathin sectioning. In general, lipids are soluble in acetone at temperatures above −30 {degree sign}C, and preservation of membrane structures relies heavily on efficient OsO4 fixation. Insufficient OsO4 treatment would be expected to reduce membrane contrast.

      C. elegans embryos are encapsulated by an eggshell that forms at fertilization and gradually develops during the first few cell divisions. It is unclear how efficiently OsO4 in acetone penetrates the eggshell during freeze substitution, raising further concern about plasma membrane preservation under the conditions used.

      (2) Lack of evidence linking membrane discontinuity to cell division

      The reported plasma membrane discontinuities are not specific to mitotic cells. If this were a physiological process playing an important role in cytokinesis, it should occur in a temporally and spatially coordinated manner with nuclear division. However, it remains unclear at what stage of the cell cycle the membrane rupture occurs and where it is located relative to chromosomes and the mitotic spindle.

      (3) Lack of evidence for extension of the separated membrane

      Although the authors speculate that resealing of the ruptured membrane occurs via extension of the separated membrane, no direct evidence supporting this mechanism is presented. Proximity to vesicles alone does not demonstrate that membrane extension occurs through vesicle fusion. More direct evidence is required to support this claim.

      (4) Inconsistency with published work

      Numerous studies have examined cell division in developing C. elegans embryos using the GFP::PH(PLC1δ1) marker expressed from the ltIs38 transgene [pAA1; pie-1::GFP::PH(PLC1δ1) + unc-119(+)], generated by the Oegema lab (https://wormbase.org/species/c_elegans/transgene/WBTransgene00000911#01--10 ). To date, no study has reported membrane ruptures of the magnitude described here. The complexity of cell surface morphology from the 8- to 12-cell stages onward has been well documented, for example, by Fu et al. (2016) using light-sheet microscopy and 3D reconstruction (doi:10.1038/ncomms11088).

      Supplementary Movies 5, 6, and 10 of this paper illustrate how single-plane images can easily produce apparent membrane discontinuities, for example, due to membrane orientations nearly parallel to the imaging plane.

      The three single-plane images from only three embryos presented in Figure 6 are insufficient to support the authors' strong conclusions. Raw 3D data should be provided.

    2. Reviewer #2 (Public review):

      Summary:

      Liang et al. explore an unusual observation of membrane discontinuities in dividing C. elegans embryonic cells. This report is the first to demonstrate that, instead of the classical invagination of membranes during cytokinesis, cells in the early embryos of C. elegans exhibit separation of sister membranes that extend independently. TEM images of high-pressure-frozen samples provide strong evidence for the presence of Membrane Openings (MOs) in cells at various stages of the cell cycle, predominantly during mitosis. High-resolution images (x 30,000) clearly show the wrinkled plasma membrane and smooth MOs.<br /> The electron microscopy data are supported by the live cell imaging of strains with fluorescently tagged membrane markers. This study opens up the possibility of tracking MOs at other stages of C. elegans development, and also asks if it might be a common phenomenon in other species that exhibit rapid embryonic growth and divisions.

      Strengths:

      (1) Thorough verification of Membrane Openings (MO) by several methods:

      (a) 4 independent sample batches.

      (b) Examined historical collections.

      (c) Analysed embryos at different stages of development. The absence of MOs in later stages (comma) serves as a negative control and gives confidence that MOs are genuine and not technical artifacts.

      (2) Live cell imaging of strain with fluorescently labelled membranes provides real-time dynamics of membrane rupture.

      (3) After observing the membrane rupture, the next obvious question is - what prevents the cytosol from leaking out? The EM images showing PBL and PEL - extracellular matrix serving as barriers for the cytosol are convincing.

      Weakness:

      (1) The association of membrane discontinuities with cell division is not convincing, as there are 159 cells out of 425 showing MOs, but it is not mentioned clearly how many of these are undergoing cell division. Also, it's not clear whether the 20 dividing cells analysed for MOs are a part of the 159 cells or a separate dataset. A graphical representation of the number of samples and observed frequencies would be helpful to understand the data collection workflow.

      (2) In Figures 3A and 3B, the resolution of the images is not enough to verify 3A as classical membrane invagination and 3B as detached sister membranes.

      (3) Figure 6 lacks controls. How does the classical invagination look in this strain? Also, adding nuclear dye would be informative, in order to correlate the nuclear division with membrane rupture, as claimed.

    3. Reviewer #3 (Public review):

      Summary:

      In this manuscript, the authors challenge a dogma in cell biology, namely that cells are at any time point engulfed by a continuous plasma membrane. Liang et al. find that during C elegans embryogenesis, a high number of cells are not entirely surrounded by a plasma membrane but show membrane openings (MOs). These openings are enriched at the embryo's periphery, towards the eggshell. The authors propose that plasma membrane discontinuities emerge during metaphase of mitosis and that independent extension of "sister membranes" engulfs the daughter cells.

      Strengths:

      On the positive side, the authors find plasma membrane discontinuities not only by electron microscopy but also by fluorescence microscopy and provide information about the dynamics of membrane openings and their emergence. While this is assuring, the authors conclude that MOs emerge during metaphase. From what the authors show, this particular information cannot be deduced, as there is no dynamic capture of a membrane scission event together with a chromatin marker that would indicate mitosis. The authors could, however, attempt to find such events in live movies, given the high incidence of MOs reported from their EM data.

      Weaknesses:

      In order to convincingly demonstrate the absence of any plasma membrane in the respective regions of the embryonic periphery or between cells of the embryo, the authors would have to show consecutive serial TEM sections where MOs are detected over more z-planes, beyond the mere 3D reconstructions. Although the authors state in the methods section that continuous ultrathin sections were cut for the metaphase sample (page 21, line 472), consecutive sections are never shown in TEM. While we do see the 3D reconstructions, better documentation of the underlying TEM data is missing. It would be necessary to show a membrane opening in consecutive z sections. Alternatively, the authors could seek the possibility to convincingly back up their claims with volume imaging by focused ion beam scanning EM (FIBSEM), where cellular volumes can be sectioned in almost isotropic resolution.

      Another critical issue concerns the detection of the membrane discontinuities in electron micrographs, which, in my opinion, is ambiguous. How do the authors reliably discriminate in their TEM images whether there is a plasma membrane or not? The absence - or weak appearance - of the stain of the electron dense material at membranes, which seems to be their criterion for MOs, is also apparent at other, intracellular membranes, like at the NE or at the ER (for example, see Figure 1C). Also, the plasma membrane itself appears unevenly stained in regions that the authors delineate as intact (for example, Figure 1C, 2B/1).

    1. Reviewer #1 (Public review):

      [Editors' note: This version has been assessed by the Reviewing Editor without further input from the original reviewers. The authors have addressed the comments raised in the previous round of review.]

      Summary:

      In their paper entitled "Alpha-Band Phase Modulates Perceptual Sensitivity by Changing Internal Noise and Sensory Tuning," Pilipenko et al. investigate how pre-stimulus alpha phase influences near-threshold visual perception. The authors aim to clarify whether alpha phase primarily shifts the criterion, multiplicatively amplifies signals, or changes the effective variance and tuning of sensory evidence. Six observers completed many thousands of trials in a double-pass Gabor-in-noise detection task while an EEG was recorded. The authors combine signal detection theory, phase-resolved analyses, and reverse correlation to test mechanistic predictions. The experimental design and analysis pipeline provide a clear conceptual scaffold, with SDT-based schematic models that make the empirical results accessible even for readers who are not specialists in classification-image methods.

      Strengths:

      The study presents a coherent and well-executed investigation with several notable strengths. First, the main behavioral and EEG results in Figure 2 demonstrate robust pre-stimulus coupling between alpha phase and d′ across a substantial portion of the pre-stimulus interval, with little evidence that the criterion is modulated to a comparable extent. The inverse phasic relationship between hit and false-alarm rates maps clearly onto the variance-reduction account, and the response-consistency analysis offers an intuitive behavioral complement: when two identical stimuli are both presented at the participant's optimal phase, responses are more consistent than when one or both occur at suboptimal phases. The frontal-occipital phase-difference result suggests a coordinated rather than purely local phase mechanism, supporting the central claim that alpha phase is linked to changes in sensitivity that behave like changes in internal variability rather than simple gain or criterion shifts. Supplementary analyses showing that alpha power has only a limited relationship with d′ and confidence reassure readers that the main effects are genuinely phase-linked rather than a recasting of amplitude differences.

      Second, the reverse-correlation results in Figure 3 extend this story in a satisfying way. The classification images and their Gaussian fits show that at the optimal phase, the weighting of stimulus energy is more sharply concentrated around target-relevant spatial frequencies and orientations, and the bootstrapped parameter distributions indicate that the suboptimal phase is best described by broader tuning and a modest change in gain rather than a pure criterion account. The authors' interpretation that optimal-phase perception reflects both reduced effective internal noise and sharpened sensory tuning is reasonable and well-supported. Overall, the data and figures largely achieve the stated aims, and the work is likely to have an impact both by clarifying the interpretation of alpha-phase effects and by illustrating a useful analytic framework that other groups can adopt.

    2. Reviewer #2 (Public review):

      Summary:

      The study of Pilipenko et al evaluated the role of alpha phase in a visual perception paradigm using the framework of signal detection theory and reverse correlation. Their findings suggest that phase-related modulations in perception are mediated by a reduction in internal noise and a moderate increase in tuning to relevant features of the stimuli in specific phases of the alpha cycle. Interestingly, the alpha phase did not affect the criterion. Criterion was related to modulations in alpha power, in agreement with previous research.

      Strengths:

      The experiment was carefully designed, and the analytical pipeline is original and suited to answer the research question. The authors frame the research question very well and propose several models that account for the possible mechanisms by which the alpha phase can modulate perception. This study can be very valuable for the ongoing discussion about the role of alpha activity in perception.

      Conclusion:

      This study addresses an important and timely question and proposes an original and well-thought-out analytical framework to investigate the role of alpha phase in visual perception. While the experimental design and theoretical motivation are strong, the very limited sample size substantially constrains the strength of the conclusions that can be drawn at the group level.

      Bibliography:

      Button, K., Ioannidis, J., Mokrysz, C. et al. Power failure: why small sample size undermines the reliability of neuroscience. Nat Rev Neurosci 14, 365-376 (2013). https://doi.org/10.1038/nrn3475

      Tamar R Makin, Jean-Jacques Orban de Xivry (2019) Science Forum: Ten common statistical mistakes to watch out for when writing or reviewing a manuscript eLife 8:e48175 https://doi.org/10.7554/eLife.48175

    1. Reviewer #1 (Public review):

      Summary:

      In the manuscript "Heat Shock Factor Regulation of Antimicrobial Peptides Expression Suggests a Conserved Defense Mechanism Induced by Febrile Temperature in Arthropods," Xiao and colleagues examine the role of the shrimp Litopenaeus vannamei HSF1 ortholog (LvHSF1) in the response to viral infection. The authors provide compelling support for their conclusions that the activation of LvHSF1 limits viral load at high temperatures. Specifically, the authors convincingly show that (i) LvHSF1 mRNA and protein are induced in response to viral infection at high temperatures, (ii) increased LvHSF1 levels can directly induce the expression of the nSWD (and directly or indirectly other antibacterial peptides, AMPs), (ii) nSWD's antimicrobial activities can limit viral load, and, (iv) LvHSF1 protects survival at high temperatures following virus infection. These data thus provide a model by which an increase in HSF1 levels limits viral load through the transcription of antimicrobial peptides, and provide a rationale for the febrile response as a conserved response to viral infection.

      Strengths:

      The large body of careful time series experiments, tissue profiling, and validation of RNA-seq data is convincing. Several experimental methodologies are used to support the author's conclusions that nSWD is an LvHSf1 target and increased LvHSF1 alone can explain increased levels of nSWD. Similar carefully conducted experiments also conclusively implicate nSWD protein in limiting WSSV viral loads.

      Weaknesses:

      As with any complex biological phenomenon, several aspects remain incompletely explained. Nevertheless, in their revision, the authors provide additional analyses supporting the authors model that losing LvHSF1 is not detrimental to survival, by more directly altering viral loads. In addition, their revised manuscript clarifies the complex interactions between infection, the role of HSF1, and hormesis. These revisions increase the impact of their findings.

      Comments on revisions:

      The authors have addressed all comments, and the manuscript is very much improved.

    2. Reviewer #3 (Public review):

      In the manuscript titled "Heat Shock Factor Regulation of Antimicrobial Peptides Expression Suggests a Conserved Defense Mechanism Induced by Febrile Temperature in Arthropods", the authors investigate the role of heat shock factor 1 (HSF1) in regulating antimicrobial peptides (AMPs) in response to viral infections, particularly focusing on febrile temperatures. Using shrimp (Litopenaeus vannamei) and Drosophila S2 cells as models, this study shows that HSF1 induces the expression of AMPs, which in turn inhibit viral replication, offering insights into how febrile temperatures enhance immune responses. The study demonstrates that HSF1 binds to heat shock elements (HSE) in AMPs, suggesting a conserved antiviral defense mechanism in arthropods. The findings are informative for understanding innate immunity against viral infections, particularly in aquaculture. However the logical flow of the paper can be improved.

      Comments on revisions:

      Some aspects of the initial study design, regarding the selection of representative candidate genes and the logical flow, raised concerns. However, these issues have been addressed in the revised manuscript through additional validations and clarifications. Most of my comments and concerns were sufficiently addressed in the revised manuscript. The results support the authors' conclusion that HSF1-dependent regulation of AMP expression contributes to antiviral defense under febrile conditions.

    1. Reviewer #1 (Public review):

      [Editors' note: this version has been assessed by the Reviewing Editor without further input from the original reviewers. The authors have addressed the comments raised in the previous round of review.]

      Summary:

      This study provided key experimental evidence for the "Solstice-as-Phenology-Switch Hypothesis" through two temperature manipulation experiments.

      Strengths:

      The research is data-rich, particularly in exploring the effects of pre- and post-solstice cooling, as well as daytime versus nighttime cooling, on bud set timing, showcasing significant innovation. The article is well-written, logically clear, and is likely to attract a wide readership.

    2. Reviewer #2 (Public review):

      In 'Developmental constraints mediate the reversal of temperature effects on the autumn phenology of European beech after the summer solstice', Rebindaine and co-authors report on two experiments on Fagus sylvatica where they manipulated temperatures of saplings between day and night and at different times of year. I think the experiments are interesting, but note that the treatments are extreme compared to natural conditions. Further, given that much of the experiment happened outside, I am not sure how much we can generalize from one year for each experiment, especially when conducted on one population of one species.

    1. Reviewer #1 (Public review):

      Summary:

      The authors use a gambling task with momentary mood ratings from Rutledge et al. and compare computational models of choice and mood to identify markers of decisional and affective impairments underlying risk-prone behavior in adolescents with suicidal thoughts and behaviors (STB). The results show that adolescents with STB show enhanced gambling behavior (choosing the gamble rather than the sure amount), and this is driven by a bias towards the largest possible win rather than insensitivity to possible losses. Moreover, this group shows a diminished effect of receiving a certain reward (in the non-gambling trials) on mood. The results were replicated in a general online sample where participants were divided into groups with or without STB based on their self-report of suicidal ideation on one question in the Beck Depression Inventory self-report instrument. The authors suggest, therefore, that adolescents diagnosed with depression or anxiety with decreased sensitivity to certain rewards may need to be monitored more closely for STB due to their increased propensity to take risky decisions aimed at (expected) gains (such as relief from an unbearable situation through suicide) regardless of the potential losses. However, such a result was only found in the clinical sample and cannot be generalized more broadly based on the current findings.

      Strengths:

      (1) The study uses a previously validated task design and replicates previously found results through well-explained model-free and model-based analyses.

      (2) Sampling of adolescents at high risk can help target early preventative diagnoses and treatments for suicide.

      (3) Replication of the results in an online cohort increases confidence in the findings.

      (4) The models considered for comparison are thorough and well-motivated. The chosen models allow for teasing apart which decision and mood sensitivity parameters relate to risky decision-making across groups based on their hypotheses.

      (5) Novel finding of mood (in)sensitivity to non-risky rewards and its relationship with risk behavior in STB.

      Weaknesses:

      (1) Sample size of 25 for S- group is low-powered, which is explicitly mentioned as a study limitation.

      (2) Modeling in the mediation analysis focused on predicting risk behavior in this task from the model-derived bias for gains and suicidal symptom scores. Thus, the implications of this work are more relevant to a basic-science understanding of the etiology of suicidal behavior than they are useful as a predictor of suicidal behavior, and it is not clear that a psychiatrist or psychologist could use this task to potentially determine who is at higher risk of attempting suicide and must be more closely monitored. Indeed, relationships between task parameters and behavior and suicidal behavior was limited to the clinical sample with a diagnosis of depression or anxiety disorder, and did not extend to the online sample. Therefore, the claim that these findings provide "computational markers for general suicidal tendency among adolescents" is unwarranted.

    2. Reviewer #2 (Public review):

      Summary:

      This article addresses a very pertinent question - what are the computational mechanisms underlying risky behaviour in patients having attempted suicide. In particular, it is impressive how the authors find a broad behavioral effect whose mechanisms they can then explain and refine through computational modeling. This work is important because currently, beyond previous suicide attempts, there has been a lack of predictive measures. This study is the first step towards that: understanding the cognition on a group level. Before then being able to include it in future predictive studies (based on the cross-sectional data, this study by itself cannot assess the predictive validity of the measure).

      Strengths:

      - Large sample size<br /> - Replication of their own findings<br /> - Well-controlled task with measures of behaviour and mood + precise and well-validated computational modeling

      Questions, based on revised manuscript and replies to other reviewers:

      (1) Replies to reviewers in general: Bayes Factors have been added, it would be good to also use common verbal terms to describe them (e.g. 'anecdotal', 'moderate' etc). For example, my reading of table S8 would be that for gambling rate there is only anecdotal evidence that it does not relate to PSWQ, BDI, and moderate evidence it does not relate to TAI.

      (2) Reply to reviewer 1 Q2 (Predicting STB):<br /> For the regression predicting suicidal ideation, it seems to me that what you did was a regression STB ~ gambling behaviour + approach + mood? Could you clarify? I had expected as a test of whether the task can predict STB risk something slightly different - a cross-validation (LOO or maybe 5-fold in the large sample): STB ~ gambling behaviour + approach [parameter from model] + mood [parameter from model]; and then computing in the left out participants: predicted STB. Then checking correlation between STB and predicted STB. This would allow testing whether the diverse task measures together predict STB (with the caveat, that it's cross-validated, rather than hold-out sample, unless you could train on one sample (in lab) and test on the other (online).

      (3) Reply to reviewer 2 Q1 (parameter recovery): I'm looking at S3, it seems to still show only the scatter plots and not the correlation matrices, which are now added as text notes. Can you actually show these matrices? An off-diagonal correlation of 0.63 appears quite high. I think it needs to be discussed exactly which parameters those are, and whether that impacts the interpretation of the results.

      (4) Reply to reviewer 3 Q3 (mood model): I would have imagined that the response would involve changing the mood equations (equation 8 main text) to include a term for whether the participant gambled or not, independent of the gamble value.

    3. Reviewer #3 (Public review):

      This manuscript investigates computational mechanisms underlying increased risk-taking behavior in adolescent patients with suicidal thoughts and behaviors. Using a well-established gambling task that incorporates momentary mood ratings and previously established computational modeling approaches, the authors identify particular aspects of choice behavior (which they term approach bias) and mood responsivity (to certain rewards) that differ as a function of suicidality. The authors replicate their findings on both clinical and large-scale non-clinical samples.

      The main problem, however, is that the results do not seem to support a specific conclusion with regard to suicidality. The S+ and S- groups differ substantially in the severity of symptoms, as can be seen by all symptom questionnaires and the baseline and mean mood, where S- is closer to HC than it is to S+. The main analyses control for illness duration and medication but not for symptom severity. The supplementary analysis in Figure S11 is insufficient as it mistakes the absence of evidence (i.e., p > 0.05) for evidence of absence. Therefore, the results do not adequately deconfound suicidality from general symptom severity.

      The second main issue is that the relationship between an increased approach bias and decreased mood response to CR is conceptually unclear. In this respect, it would be natural to test whether mood responses influence subsequent gambling choices. This could be done either within the model by having mood moderate the approach bias or outside the model using model-agnostic analyses.

      Additionally, there is a conceptual inconsistency between the choice and mood findings that partly results from the analytic strategy. The approach bias is implemented in choice as a categorical value-independent effect, whereas the mood responses always scale linearly with the magnitude of outcomes. One way to make the models more conceptually related would be to include a categorical value-independent mood response to choosing to gamble/not to gamble.

      The manuscript requires editing to improve clarity and precision. The use of terms such as "mood" and "approach motivation" is often inaccurate or not sufficiently specific. There are also many grammatical errors throughout the text.

      Claims of clinical relevance should be toned down, given that the findings are based on noisy parameter estimates whose clinical utility for the treatment of an individual patient is doubtful at best.

      Comments on revisions:'

      The authors adequately addressed my comments and I find the manuscript substantially strengthened.

    1. Reviewer #1 (Public review):

      Summary

      From transcriptomic comparisons of adult mouse cochlear and vestibular hair cells, Xu et al. provide a broad and well-organized overview of differences across 4 established hair cell types (2 cochlear and 2 vestibular). They go on to demonstrate the power of such analyses to provide functional insights by focusing on the differentiated expression of ciliary genes, building to the hypothesis that kinociliary motility occurs in adult vestibular hair cells.

      Background

      Cilia are prominent in sensory receptors, including vertebrate photoreceptors, olfactory neurons and mechanosensitive hair cells of the inner ear and lateral line. Cilia can be motile or nonmotile depending on their axonemal structure: motile cilia require dynein and the inner 2 singlet microtubules of the 9+2 array. Primary cilia, present early in development, are considered to have sensory functions and to be nonmotile (Mill et al., Nature Rev Gen 2023).

      In hair cells, the kinocilium anchors and polarizes the mechanosensitive hair bundle of specialized microvilli. The kinocilium matures from the primary cilium of a newborn hair cell; behind it the bundle of mechanosensory microvilli rises in a descending staircase of rows. During maturation of the mammalian cochlea, all hair cells lose the kinocilium, though not the associated basal body. The consensus for many years has been that most vertebrate kinocilia, and especially mammalian kinocilia, are nonmotile, based largely on the lack of spontaneous motility in excised mammalian vestibular organs, but also on the impression that the rare examples of spontaneous beating motility even in non-mammalian hair cells are associated with deterioration of the preparation (Rüsch & Thurm 1990).

      Strengths

      In comparing RNA expression across the 4 major types of mouse hair cells - 2 cochlear and 2 vestibular - Xu et al. provide rich data sets for exploration of structure-function differences between these highly specialized cell types. The revised paper significantly improves the organization, interpretation and readability of the presentation of overall findings. smFISH and immuno-staining back up key RNA data, and comparisons are made with published data.

      The ciliary motility focus of the rest of the paper is creative and highly interesting. The authors curated the ciliary genes into types associated with different aspects of beating motility, and also investigated the expression of genes typical of primary cilia, which are considered to have sensory and cell signaling functions and to be nonmotile. Their data justify suggesting a role for kinociliary motility (or force generation) in adult mammalian vestibular hair cells, in opposition to a long-held assumption. The results should stimulate investigation of the implications for mechanosensitivity.

      Weaknesses

      Data

      Functional data on kinocilia motility: The technical difficulty in making such measurements in small mouse hair bundles led the authors to work with bullfrog crista bundles. Though not extensively studied here, the ciliary motility shown is convincing. Mouse hair bundle motions are also shown but the evidence connecting the data to kinociliary motion are more suggestive than convincing. But the authors are not dogmatic about these data, and it is reasonable to show them.

      Interpretation

      The authors take the view that kinociliary motility is likely to be normally present but is rare in their observations because conditions are not right. But while others have described some (rare) kinociliary motility in fish organs (Rusch & Thurm 1990), they interpreted its occurrence as a sign of pathology. Indeed, in this paper, it is not clear what role kinociliary motility would play in mature hair bundles. The authors have added a discussion of this question in the revision.

      An underlying rationale for the hypothesis that ciliary motility manifests in mammalian vestibular hair cells seems to rest on the presence of the necessary mRNA and its contrasting absence in cochlear hair cells. Another way to look at this difference could be that evolution acted on cochlear hair cells to shed kinocilia as one of many changes to improve mechanosensitivity at much higher sound frequencies. In vestibular hair cells, kinociliary motion might be useful to enhance mechanostimulation in the developing vestibule (as suggested in this revision) and not so active in maturity. Nevertheless, with their scholarly analysis of the expression of ciliary genes, the authors make a significant argument for further investigation of when and why hair cell kinocilia show active motility.

    2. Reviewer #2 (Public review):

      Summary:

      In this study the authors compared the transcriptomes of the various different types of hair cells contained in the sensory epithelia of the cochlea and vestibular organs of the mouse inner ear. The analysis of their transcriptomic data lead to novel insights into the potential function of the kinocilium.

      Strengths:

      The novel findings for the kinocilium gene expression along with the demonstration that some kinocilia demonstrate rhythmic beating as would be seen for known motile cilia is fascinating. It is possible that perhaps the kinocilium known to play a very important role in the orientation of the stereocilia, may have a gene expression pattern that is more like a primary cilium early in development and later in mature hair cells more like a motile cilium. Since the kinocilium is retained in vestibular hair cells it makes sense that it is playing a different role in these mature cells than its role in the cochlea.

      Another major strength of this study which cannot be overstated is that for the transcriptome analysis they are using mature mice. To date there is a lot of data from many labs for embryonic and neonatal hair cells but very little transcriptomic data on the mature hair cells. They do a nice job in presenting the differences in marker gene expression between the 4 hair cell types. This information is very useful to those labs studying regeneration or generation of hair cells from ES cell cultures. One of the biggest questions these labs confront is what type of hair cell develop in these systems. The more markers available the better. These data will also allow researchers in the field to compare developing hair cells with mature hair cell to see what genes are only required during development and not in later functioning hair cells.

      Comments on revision:

      I am satisfied with the revision, the authors made an effort to incorporate the changes requested.

    1. Reviewer #1 (Public review):

      This revised manuscript by Qin and colleagues delineates an important neural mechanism that suppresses the intake of sugar solution in response to internal glucose level (the "brake" mechanism for sugar consumption). They identified a three-step neuropeptidergic system that downregulates the sensitivity of sweet-sensing gustatory sensory neurons, primarily in response to elevated level of circulating glucose. First, neurons that release a neuropeptide Hugin (which is an insect homolog of vertebrate Neuromedin U (NMU)) are activated by a high concentration of hemolymph glucose, which is directly sensed by Hugin-releasing neurons in a cell-autonomous mechanism. Next, Hugin neuropeptides activate Allatostatin A (AstA)-releasing neurons via one of Hugin receptors, PK2-R1. Finally, the released AstA neuropeptide suppresses sugar response in sweet-sensing Gr5a-expressing gustatory sensory neurons through the AstA-R1 receptor. Suppression of sugar response in Gr5a-expressing neurons reduces fly's sugar intake motivation. They also found that NMU-expressing neurons in the ventromedial hypothalamus (VMH) of mice (which project to the rostal nucleus of the solitary tract (rNST)) are also activated by high concentration of circulating glucose, independent of synaptic transmission, and that injection of NMU reduces the glucose-induced activity in the downstream of NMU-expressing neurons in rNST. These data suggest that the function of Hugin neuropeptides in the fly is analogous to the function of NMU in the mouse.

      The authors have provided multiple lines of compelling evidence generated through rigorous and comprehensive experiments, which spans genetic abrogation, neuronal manipulation, pharmacology, and functional imaging. The authors are also receptive to the critiques and reframed the central message, such that their conclusions are soundly supported by the presented data. Importantly, the parallel study in mice adds a unique comparative perspective that makes the paper of interest to a wide range of readers.

    2. Reviewer #2 (Public review):

      Summary:

      The question of how caloric and taste information interact and consolidate remains both active and highly relevant to human health and cognition. The authors of this work sought to understand how nutrient sensing of glucose modulates sweet sensation. They found that glucose intake activates hugin signaling to AstA neurons to suppress feeding, which contributes to our mechanistic understanding of nutrient sensation. They did this by leveraging the genetic tools of Drosophila to carry out nuanced experimental manipulations, and confirmed the conservation of their main mechanism in a mammalian model. This work builds on previous studies examining sugar taste and caloric sensing, enhancing the resolution of our understanding.

      Strengths:

      Fully discovering neural circuits that connect body state with perception remains central to understanding homeostasis and behavior. This study expands our understanding of sugar sensing, providing mechanistic evidence for a hugin/AstA circuit that is responsive to sugar intake and suppresses feeding. In addition to effectively leveraging the genetic tools of Drosophila, this study further extends their findings into a mammalian model with the discovery that NMU neural signaling is also responsive to sugar intake.

      Weaknesses:

      The effect of Glut1 knockdown on PER in hugin neurons is modest in both fed and starved flies, suggesting that glucose intake through Glut1 may only be part of the mechanism. The authors address this in their discussion.

    1. Reviewer #2 (Public review):

      Summary:

      The authors goals is to be develop a more accurate system that reports TDP-43 activity as a splicing regulator. Prior to this, most methods employed western blotting or QPCR based assays to determine whether targets of TDP-43 were up or down regulated. The problem with that is the sensitivity. This approach uses an ectopic delivered construct containing splicing elements from CFTR and UNC13A (two known splicing targets) fused to a GFP reporter. Not only does it report TDP-43 function well, but it operates at extremely sensitive TDP-43 levels, requiring only picomolar TDP-43 knockdown for detection. This reporter should supersede the use of current TDP-43 activity assays, its cost-effective, its rapid and reliable.

      Strengths:

      In general, the experiments are convincing and well designed. The rigor, number of samples and statistics, and gradient of TDP-43 knockdown were all viewed as strengths. In addition, the use of multiple assays to confirm the splicing changes were viewed as complimentary (ie PCR and GFP-fluorescence) adding additional rigor. The final major strength i'll add is the very clever approach to tether TDP-43 to the loss of function cassette such that when TDP-43 is inactive it would autoregulate and induce wild-type TDP-43. This has many implications for the use of other genes, not just TDP-43, but also other protective factors that may need to be re-established upon TDP-43 loss of function.

      Weaknesses:

      Admittedly, one needs to initially characterize the sensor and the use of cell lines is an obvious advantage, but it begs the question of whether this will work in neurons. Additional future experiments in primary neurons will be needed. The bulk analysis of GFP-positive cells is a bit crude. As mentioned in the manuscript, flow sorting would be an easy and obvious approach to get more accurate homogenous data. This is especially relevant since the GFP signal is quite heterogenous in the image panels, for example Figure 1C, meaning the siRNA is not fully penetrant. Therefore, stating that 1% TDP-43 knockdown achieves the desired sensor regulation might be misleading. Flow sorting would provide a much more accurate quantification of how subtle changes in TDP-43 protein levels track with GFP fluorescence.

      Some panels in the manuscript would benefit from additional clarity to make the data easier to visualize. For example, Figure 2D and 2G could be presented in a more clear manner, possibly split into additional graphs since there are too many outputs. Sup Figure 2A image panels would benefit from being labeled, its difficult to tell what antibodies or fluorophores were used. Same with Figure 4B.

      Figure 3 is an important addition to this manuscript and in general is convincing showing that TDP-43 loss of function mutants can alter the sensor. However, there is still wild-type endogenous TDP-43 in these cells, and its unclear whether the 5FL mutant is acting as a dominant negative to deplete the total TDP-43 pool, which is what the data would suggest. This could have been clarified. Additional treatment with stressors that inactivate TDP-43 could be tested in future studies.

      Overall, the authors definitely achieved their goals by developing a very sensitive readout for TDP-43 function. The results are convincing, rigorous, and support their main conclusions. There are some minor weaknesses listed above, chief of which is the use of flow sorting to improve the data analysis. But regardless, this study will have an immediate impact for those who need a rapid, reliable, and sensitive assessment of TDP-43 activity, and it will be particularly impactful once this reporter can be used in isolated primary cells (ie neurons) and in vivo in animal models. Since TDP-43 loss of function is thought to be a dominant pathological mechanism in ALS/FTD and likely many others disorders, having these type of sensors is a major boost to field and will change our ability to see sub-threshold changes in TDP-43 function that might otherwise not be possible with current approaches.

      Comments on revisions:

      In the revised version, most of the reviewer's comments have been appropriately addressed with the exception of 1) the use of flow sorting to improve the data analysis and 2) testing this sensor in primary neurons. The latter is the focus of an ongoing separate study. Though flow sorting would significantly strengthen this study and help others in the field to use this sensor, it is still an impactful and innovative study without it.

    2. Reviewer #3 (Public review):

      The DNA and RNA binding protein TDP-43 has been pathologically implicated in a number of neurodegenerative diseases including ALS, FTD, and AD. Normally residing in the nucleus, in TDP-43 proteinopathies, TDP-43 mislocalizes to the cytoplasm where it is found in cytoplasmic aggregates. It is thought that both loss of nuclear function and cytoplasmic gain of toxic function are contributors to disease pathogenesis in TDP-43 proteinopathies. Recent studies have demonstrated that depletion of nuclear TDP-43 leads to loss of its nuclear function characterized by changes in gene expression and splicing of target mRNAs. However, to date, most readouts of TDP-43 loss of function events are dependent upon PCR based assays for single mRNA targets. Thus, reliable and robust assays for detection of global changes in TDP-43 splicing events are lacking. In this manuscript, Xie, Merjane, Bergmann and colleagues describe a biosensor that reports on TDP-43 splicing function in real time. Overall, this is a well-described unique resource that would be of high interest and utility to a number of researchers validated in multiple cell types as a sensitive readout of TDP-43 loss of function. Future studies validating the utility of this biosensor in models of TDP-43 loss of function (e.g. disease iPSNs) that do not rely on TDP-43 knockdown will be of further interest.

    1. Reviewer #1 (Public review):

      Summary:

      Planar cell polarity core proteins Frizzled (Fz)/Dishevelled (Dvl) and Van Gogh-like (Vangl)/Prickle (Pk) are localized on opposite sides of the cell and engage in reciprocal repression to modulate cellular polarity within the plane of static epithelium. In this interesting manuscript, the authors explore how the anterior core proteins (Vangl/Pk) inhibit the posterior core protein (Dvl). The authors propose that Pk assists Vangl2 in sequestering both Dvl2 and Ror2, while Ror2 is essential for Dvl to transition from Vangl to Fz in response to non-canonical Wnt signaling.

      Strengths:

      The strengths of the manuscript are found in the very interesting and new concept along with supportive data for a model of how non-canonical Wnt induces Dvl to transition from Vangl to Fz with an opposing role for PK and Vangl2 to suppress Dvl during convergent extension movements. Ror is key player required for the transition and antagonizes Vangl.

      Weaknesses:

      In addition to general whole embryo morphology that is used as evidence for CE defects, two forms of data are presented: co-expression and IP, as well as IF of exogenously expressed proteins. The microscopy would benefit from super-resolution microscopy since in many cases the differences in protein localization are not very pronounced, and Western analysis data often show relatively subtle differences. Thus, future work will determine the strength of the interactions of the model.

      Major points.

      Overexpression conditions

      A possible concern is that most analyses were performed with overexpression conditions. PCP core proteins (Vangl2, Pk, Dvl, and Fz receptors) are known to display polarized subcellular localization in both the neural epithelium and DMZ explants (Ref: PCP and Septins govern the polarized organization of the actin cytoskeleton during convergent extension, Current Biology, 2024). However, in this study, overexpressed PCP core proteins failed to show polarized localization. Thus, one must be careful in interpreting data.

      Subtle effects

      Several of the reported results show quite modest changes in imaging and immunoprecipitation analyses, which are supportive of the proposed molecular model, but future experiments will be needed to robustly test the model.

    1. Reviewer #1 (Public review):

      Summary:

      This manuscript presents an end-to-end pipeline, intended to accelerate EM-based connectomics by combining low-resolution imaging for large volumes with synapse-level imaging only in selected regions of interest. In principle, this strategy can substantially reduce imaging time, computational demands, analysis time, and overall cost.

      General note:

      Overall, I found the manuscript interesting and valuable, particularly as a description of how one laboratory has assembled and applied a practical workflow to reconstruct and analyze the central complex across multiple insect species. In that sense, the work is compelling as an account of a real, functioning strategy for comparative connectomics, and I appreciated reading it. My main reservation is not about the relevance of the biological problem or the utility of the pipeline in the authors' own hands, but about whether the manuscript, in its current form, fully meets the expectations of a paper that is focused on tools and resources. The expectation would be that this paper would be a venue for sharing new techniques, software tools, datasets, and other resources intended to be usable by the community. Here, because much of the pipeline appears to build on existing methods and software, the key value added should be a particularly clear demonstration of how these components were adapted, integrated, validated, and documented for this specific use case in a way that others could realistically reproduce and adopt. At present, that translational and reproducibility-oriented component does not yet seem sufficiently developed, despite the clear promise of the overall approach.

      Major comments:

      (1) The work is valuable as a practical integration and application of multiple existing tools into a coherent pipeline, together with a new multi-resolution imaging strategy. However, the manuscript at times reads as though it introduces an entirely novel workflow. I would encourage the authors to clarify the contribution more explicitly: which components are genuinely new (for example, the acquisition strategy and the end-to-end integration/validation), and which are adaptations of already established methods or software. This would make the scope and novelty of the paper easier to assess.

      (2) The most distinctive element is the multi-resolution acquisition strategy. However, as described, the selection of high-resolution regions seems to be decided a priori based on anatomy (guided by xCT localization of the CX), rather than being determined automatically from the data (i.e., ROI placement is anatomy-driven rather than data-driven). A more data-driven or machine learning-guided ROI strategy would strengthen the methodological contribution and the adaptability to new scenarios, along the lines of approaches such as SmartEM [1].

      (3) The manuscript emphasizes open-source availability and reduced barriers to entry, but the current software release, as referenced, does not yet appear to support straightforward external reuse. Since much of the pipeline builds on existing methods, the main added value lies in how these technologies were adapted, combined, and validated for the present problem. A clear and complete explanation of this adaptation is therefore essential, but is currently missing. I would suggest the following concrete improvements:<br /> a) Provide a single landing page or umbrella repository that links each pipeline step in the paper to the corresponding codebase, including version tags/commits and expected inputs/outputs for each step.<br /> b) Include step-by-step tutorials for each component.<br /> c) Provide an example dataset together with a full reproduction walkthrough in a controlled environment.<br /> d) Clearly explain the required parameters and configuration for each step, including how they should be adjusted for other datasets or scenarios.<br /> e) Follow packaging and distribution best practices (for example, PyPI/conda releases, Docker containers, and version pinning).

      (4) In my own attempt to set up and run parts of the released code, I encountered issues that currently limit reproducibility. For example, when creating an environment for EMalign (https://github.com/Heinze-lab/EMalign), the required Python version is not specified, and installation did not succeed under Python 3.12 due to dependency constraints. Additionally, synful_312 (https://github.com/Heinze-lab/synful_312) and SegToPCG (https://github.com/Heinze-lab/SegToPCG) appear to be empty despite being referenced in the manuscript. These are fixable issues, but addressing them is important if the paper is to deliver on its "low entry cost" claim.

      (5) Table 1 reports acquisition times, which is helpful. However, the multi-resolution approach adds essential processing steps that appear due to the strategy followed (e.g., "XY alignment high-res" and "high-res to low-res alignment"). Please include registration/alignment (and other major post-processing) runtimes and resource requirements, such as storage, in a comparable table so readers can assess true end-to-end cost.

      References:

      [1] Meirovitch, Y., et al. "SmartEM: machine learning-guided electron microscopy." Nature Methods (2025).

    2. Reviewer #2 (Public review):

      Summary:

      The paper proposes a workflow to accelerate EM connectomics by combining multi-scale imaging with image processing and analysis (image alignment, registration, neuron tracing, automated segmentation and synapse prediction, proof-reading) to derive a brain region connectome. The paper argues and (partially) demonstrates that this approach facilitates comparative connectomics.

      The data acquisition pipeline uses a well-established sample preparation protocol, uCT guided acquisition, and SBEM imaging at cellular and synaptic resolution.

      Data processing and analysis combine existing state-of-the-art components and focus on the alignment and complementary analysis of the two SBEM resolution levels. The paper applies the workflow to the central complex of six different insects and performs some preliminary analysis based on this (which is acceptable for a resource/tool).

      Disclaimer for the rest of the review: I am an expert in image analysis and segmentation, so I have mainly focused on these aspects as I am not qualified to analyze the details of image acquisition.

      Strengths:

      The paper addresses an important problem and promises an acceleration and democratization of comparable connectomics. The time savings of the imaging approach are well-motivated and derived. The methods used for image alignment, segmentation, synapse detection, and proofreading are state-of-the-art.

      Weaknesses:

      I see two major weaknesses in the paper:

      (1) The paper introduces the (approximate) equivalence of the projectome and connectome in the insect brain very prominently in the introduction and uses this as a central motivation for the multi-resolution image acquisition protocol. But - to me - it is unclear how this principle is really used in the analysis presented in the last results and if this assumption is evaluated at all. Specifically, Figure 4 a shows the anatomical neuron reconstructions (from cellular resolution SBEM), d-g show connectome-level analysis from the synaptic resolution data. The only link I can see between the two is that the neural processes in the synapse-resolution data can be mapped to the neurons from the cellular resolution data, thanks to the image alignment. This is certainly important, BUT it is only tangentially related to the projectome vs. connectome claim from the introduction. This claim implies that a tentative connectome is derived from projectome-level data (e.g. by assuming a uniform probability of synapse-formation given surface or distance between projections) that is then validated by the "true" connectome data from synaptic resolution. Instead, what is actually solved - to my understanding - is mapping the local connectome to the projectome. While related, these are different things and the current framing of the paper and the quite brief description of the section on comparative connectomics (also no corresponding Methods section) make this claim inadequately supported.

      (2) Reporting on segmentation and proofreading is purely qualitative. Given that this is claimed as a core contribution of the paper (e.g. statement in line 497 and following), I would expect substantially more reporting and evaluation of this claim:<br /> a) Report the actual time needed for proofreading the segmentations in CAVE. I could not find any numbers on this.<br /> b) Report the initial segmentation quality of the model: How many errors does it make? Note: There is a brief mention of VoI-based quantification in Methods (around line 1060), but the results are not reported.

      What should be done: Report the error rates (with an accurate measure such as skeleton VoI) independently for all 6 volumes. Given that the authors have the proofread versions, this is feasible. Only then can the claims be made here be evaluated. Note that the F1-score of synapse prediction is quantified. This is a good starting point, but could also be extended to further species in order to assess the actual transferability. Furthermore, none of the data from the study seems to be available. The training data of the network has to be made available. If possible, high-resolution data should be proofread too.

      Further points:

      (1) Why isn't reconstruction at the cellular level addressed with ML? This is surely possible and should be easier than the full connectome analysis. Similar to before, the actual times needed for tracing with CATMAID are not reported; the manuscript only states that this can be done in minutes for a neuron, but it's unclear if this is the best or average case. It would help to have quantitative numbers to assess whether automation would bring any benefits.

      (2) Finally, regarding the underlying software. I did not try this myself due to time constraints, but did check the repositories. They seem to be in an ok state with some documentation in a README. However, given the central role of the software contribution, I would expect a centralized doc page that explains how to use the different parts of the software, including a full example with sample data. Without this, application by other labs - a central claim - will be difficult.

    1. Reviewer #1 (Public review):

      Summary:

      The authors aimed to determine the neural networks involved in updating behaviour by training mice on a 'go / no go' odour discrimination task, and measuring their brain activity using functional MRI.

      Strengths:

      The use of the translationally relevant 'go / no go' task is a major strength, as this is a task that can be used as readily in humans as in animals such as mice. The use of fMRI in awake, behaving mice is also a major strength, as this allows the activation of multiple brain regions to be measured while behaviour is ongoing, and also facilitates comparison to human studies. The computational modelling approaches further support these translational aims, again being as readily applied to human data as to animal data.

      Weaknesses:

      The major weakness of the paper - and one that is potentially addressable - is that the key analysis of the paper, showing that the periaqueductal gray (PAG) is recruited for reversal learning, is only partially supported by the data presented in the paper as it stands. The authors have used a sophisticated way of analysing the behavioural data using 'signal detection theory', in which they collected behavioural data showing correct 'go' responses ('hits'), correct 'no go' responses ('correct rejections'), missed 'go' responses ('misses') and go responses when mice should have withheld a response ('false alarms'). The data presented showing a double dissociation in the activation of the nucleus accumbens for 'hits' but not 'correct rejections' and the PAG for 'correct rejections' but not 'hits' is very interesting; however, it is confounded by the fact that the nucleus accumbens may activate when the animal makes a response, and the PAG when the animal withholds a response. If the authors also included the analysis of nucleus accumbens and PAG activation for 'misses' and 'false alarms', this would allow them to determine whether the activation of these regions reflects the behavioural response or the expectation of reinforcement from the response.

      Thus, the paper includes very interesting data and is impressive in its approach to analysing behaviour in a manner that is highly translatable between species. The additional analyses would markedly strengthen the paper and would add depth to the finding that the PAG appears to be involved in behavioural flexibility.

    2. Reviewer #2 (Public review):

      Summary:

      In this manuscript, the authors test the hypothesis that whole-brain functional magnetic resonance imaging in behaving mice, coupled with reinforcement-learning modeling, can dissociate neural substrates of initial cue-reward acquisition versus contingency reversal, and potentially reveal underappreciated contributors to cognitive flexibility. Using a head-fixed go/no-go odor discrimination task with subsequent rule reversal in a subset of mice, they model trial-by-trial state-action values with a model-free Q-learning algorithm (hierarchical Bayesian fit) and use the model-derived decision variable as a parametric regressor in whole-brain analyses. They report that acquisition-related signals prominently involve ventral and dorsal striatal regions, whereas reversal learning additionally recruits the periaqueductal gray (negative correlation with the decision variable) and shows an apparent double dissociation between nucleus accumbens and periaqueductal gray responses for hit versus correct-rejection outcomes during reversal.

      Strengths:

      (1) The reversal manipulation is implemented without explicit punishment, targeting suppression of previously rewarded actions under reward omission - an underexplored regime for midbrain contributions beyond canonical threat/pain framing.

      (2) The manuscript provides a credible MR-compatible olfactory/licking platform with synchronized sniff/lick/valve/reward timing and high-field imaging, supporting feasibility and broader utility for mesoscale systems neuroscience in rodents.

      (3) Trial-by-trial value estimates from a Q-learning variant are fit via hierarchical Bayesian inference and explicitly integrated into subject-level general linear models with a mouse hemodynamic response function, which is appropriate for leveraging within-subject dynamics in small-N rodent fMRI.

      (4) The decision-variable maps during acquisition recover expected basal ganglia involvement (including nucleus accumbens and dorsal striatum), providing face validity; the reversal-stage map yields an interpretable set of cortical/striatal/pallidal regions plus periaqueductal gray/hippocampus.

      (5) The finite impulse response analysis stratified by behavioral outcomes (hit, false alarm, correct rejection, miss) adds interpretability beyond the model regressor alone, and the reported crossover interaction between nucleus accumbens and periaqueductal gray is potentially impactful if robust.

      Weaknesses:

      (1) The core claim regarding selective periaqueductal gray engagement rests on a subset of n = 6 mice for reversal. With permutation-based whole-brain inference and very small cluster sizes, the robustness of the periaqueductal gray effect to reasonable analytic perturbations is not yet convincing. I would suggest providing leave-one-animal-out analyses for the periaqueductal gray cluster/ROI effects and reporting how often the key findings survive.

      (2) The authors note that due to temporal resolution and hemodynamics, they cannot separate stimulus, choice, and feedback and therefore model "whole trials." This limitation creates ambiguity about whether periaqueductal gray signals reflect value updating, action inhibition (no-lick), reward omission, autonomic arousal, or motor preparation/withholding, especially given the strong hit versus correct-rejection opponency. I would suggest adding targeted analyses that disambiguate "withholding" from "reversal-related updating".

      (3) ROIs are defined from the whole-brain decision-variable maps and then interrogated by outcome types; the manuscript acknowledges non-independence. This can inflate apparent dissociations. It would be better if the authors define ROIs independently (anatomical periaqueductal gray/nucleus accumbens masks, or split-half ROI definition with held-out data) and repeat the key ROI conclusions.

      (4) The reversal group is a subset of the acquisition cohort and also experiences a different task phase structure and additional sessions; the paper attempts to address exposure differences descriptively. I would suggest that the authors formally test whether periaqueductal gray effects are explained by session count, time-in-scanner, or learning rate differences (e.g., include these as covariates, or match sessions more strictly).

      (5) The platform records sniffing and licking, but the imaging models described include motion, global, and ventricle regressors and do not clearly include trialwise lick/sniff covariates. Given the periaqueductal gray's known autonomic and defensive coordination roles, physiological state confounding is a major concern. Could the authors incorporate sniff and lick metrics (and their derivatives) as nuisance regressors and show whether the periaqueductal gray effects persist?

    1. Reviewer #1 (Public review):

      Summary:

      In their manuscript, Zhou and colleagues present a detailed look at how the JSP functions differently in the various cells of a breast tumor. The authors have effectively shown that the JSP acts as a double-edged sword, as it helps T cells fight cancer but also allows tumor cells to grow and avoid ferroptosis. These findings are important because they identify a useful biomarker to predict how TNBC patients might respond to PD-1 inhibitors.

      Strengths:

      This work is important because it provides a clear explanation for the conflicting roles of the JSP in the tumor environment. The evidence is solid, as it combines data from thousands of patients with single-cell analysis and lab experiments to confirm the role of STAT4 in cancer progression and immunity.

      Weaknesses:

      However, there are areas for improvement in the scope of the review, the depth of analysis, and the potential for broader clinical implications. The authors are encouraged to address these issues to enhance the scientific and clinical impact of the study.

      Major Issues:

      (1) The authors demonstrate that STAT4 upregulates SLC47A1, but this is currently supported only by expression correlation and western blot data. To confirm a direct link, the authors are encouraged to perform ChIP-qPCR or luciferase reporter assays to show that STAT4 binds directly to the SLC47A1 promoter.

      (2) The conclusion that the MIF-CD74 axis drives immunosuppression is based on computational inference. To support this, the authors could consider mining publicly available breast cancer spatial transcriptomics data to show the co-localization of MIF and CD74. Alternatively, performing simple dual-color immunofluorescence staining on a few clinical sections would effectively demonstrate the physical proximity of these cells.

      (3) TNBC is highly heterogeneous and includes subtypes like mesenchymal and immunomodulatory groups. The authors should analyze whether the JSP score or STAT4 levels vary significantly between these subtypes, as this could further refine the selection of patients for JAK1 inhibitors.

      (4) While the JSP score works well in the current datasets, the authors should consider validating its predictive accuracy in additional independent immunotherapy cohorts, such as the TONIC trial, to ensure the biomarker is robust across different treatment settings.

      Minor Issue:

      The manuscript mentions a U-shaped trajectory of JSP activity during tumor transition. A more detailed biological explanation of why the pathway activity initially drops and then rises would add depth to the discussion.

    2. Reviewer #2 (Public review):

      Summary:

      The JAK-STAT pathway (JSP) exhibits cell-type-specific functional heterogeneity in breast cancer. This study investigates the JSP in breast cancer and its response to anti-PD‑1 immunotherapy. JSP displays distinct cell‑type heterogeneity: it promotes malignant phenotypes and immunosuppression in tumor cells, while enhancing cytotoxicity and reducing exhaustion in T cells. Elevated JSP expression correlates with improved immunotherapy responses, especially in triple‑negative breast cancer. These findings highlight the paradoxical roles of JSP, indicating that broad inhibition may compromise anti‑tumor immunity.

      Strengths:

      The major strengths of this study include the comprehensive characterization of JSP heterogeneity across epithelial, tumor, and T cells in breast cancer. The identification of JSP and STAT4 as predictive biomarkers for immunotherapy response, particularly in triple‑negative breast cancer, provides clinically relevant insights for patient stratification.

      Weaknesses:

      The findings rely heavily on public dataset analyses.

    3. Reviewer #3 (Public review):

      Summary:

      This multi-omics study by Zhou et al elucidates the context-dependent roles of the Janus kinase-signal transducer and activator of transcription (JAK-STAT) pathway (JSP) across different cellular compartments in the breast cancer tumor microenvironment. While bulk JSP activity is associated with a favorable prognosis, single-cell analysis reveals a paradoxical landscape: high JSP in T cells drives anti-tumor cytotoxicity and reduces exhaustion, whereas high activity in tumor epithelial cells promotes malignancy and immunosuppression via the MIF-CD74 signaling axis. The JSP score (immune-related) serves as a robust predictive biomarker for response to anti-PD-1 immunotherapy, particularly in triple-negative breast cancer (TNBC). Furthermore, the study identifies the STAT4/SLC47A1 axis as a critical mechanism through which tumor cells resist ferroptosis, facilitating disease progression. These findings suggest that broad JAK-STAT inhibition may be counterproductive in cancer therapeutics; instead, therapeutic success depends on precise modulation and carefully timed interventions to preserve its T-cell-associated functions. This study may inspire future studies to explore specific factors that selectively modulate JAK-STAT activity in immune cells to achieve favorable therapeutic outcomes.

      Strengths:

      Significant therapeutic implications.

      Weaknesses:

      Limited molecular mechanisms.

    1. Reviewer #1 (Public review):

      Summary:

      Fields et al. investigated the heterogeneity and kinetics of human antibody secreting cell (ASC) differentiation by analyzing ex vivo tonsil samples and using in vitro differentiation modeling. They discovered that a CD30+ intermediate subset emerges in transition from B cell to ASC in both contexts, but not from germinal centers, and they identified cytokines that promote this state. They also identified an isoform of CD44, CD44v9, that is expressed on some ASCs.

      Strengths:

      The strengths are the novelty of the findings and the identification of two new markers that may be useful for tracking ASC heterogeneity.

      Weaknesses:

      However, some of this work seems preliminary and would need to be further validated. Some of the data presented was only representative, with limited controls and biological repeats, limiting the interpretation. For example, the role of Mef2c for CD30 expression was not robustly demonstrated. It was not clear if Figure 1 scRNAseq/ATACseq was from multiple donors or just one. Future studies may extend these novel findings and determine the functional relevance of these factors, CD30, and CD44v9 for ASC differentiation and physiology.

    2. Reviewer #2 (Public review):

      Summary:

      Bhattacharya and colleagues here use cell culture, single-cell RNA and ATACseq sequencing of such in vitro cultures and of ex vivo isolated B-lineage cells to infer an ontogeny for extra-germinal centre B cell differentiation. The manuscript presents a useful potential ontogeny for plasma cells, wherein in vitro cultured naïve human B cells enter a CD30+ intermediate state before moving in subsequent days through a CD44v9+ state before ultimately obtaining a 'mature' antibody-secreting plasma cell phenotype. Ex vivo isolated germinal centre B cells obtain the plasma cell state without expressing CD30 in their development. Phenotype analysis of tonsillar B-lineage cells supports the same phenotype conversion in vivo, although the intermediate cell population was smaller in vivo. The link to CD44v9 expression on developing plasma cells is inferred to be for extra-GC (T-independent) responses, but the data presented leave this equivocal, and the functional importance of developing via a CD30+CD44v9+ intermediate is not investigated.

      Strengths:

      The article presents a solid potential ontogeny for PC development, wherein some differentiating B cells acquire a CD30+ state, transition through a CD44v9+CD30+ state, then downmodulate CD30 before obtaining canonical CD38+ 'PC' status. A strength is the integration of in vitro cultured B cell results with tonsillar B-lineage cell data sets, and careful flow cytometry of the in vitro cultures over several days to infer lineage. The data provide reasonable support for the concept. CD30+ cells are shown to develop readily from naïve B cells in culture, but uncommonly from GC B cell cultures. A nice piece of data is Figure 6B, which shows reasonably strong correlative changes in phenotype through the assumed ontogeny, and this fits with the expected trajectory of maturation.

      Weaknesses:

      The most important weakness throughout is the non-absolute nature of the relationship. An example is seen in that the sorted ex vivo GC B cells also give rise to the 'extra-GC' phenotype of plasma cell, suggesting that while the profile is enriched, it is not absolute. There is a further weakness, as while cultures are run for several days, division-associated shifts in PC phenotype are not mapped; such would greatly strengthen the weight of the argument, and show conditional shifts in phenotype associated with division, an uncontrolled parameter in the mix. For example, for the MEF2C A388 inhibition experiments, it would be strong evidence of the pathway/process contributing if a by-division peak increase in CD30+ population was demonstrated in the early days of culture.

      There are some basic sort experiments performed (e.g. 3C-3F), which show that the CD30+ cells do give rise to PC preferentially, but what is missing is the step-wise phenotype shifts in these sorted populations, which should support the trajectory shown in Figure 3B and (the in vitro equivalent of) 6B. It would emphatically support the trajectory to show the cellular phenotypes on the PC with sorting based on CD30, CD44v9, CD27, and CD20 expression, and following outcome phenotypes 24-48 hours later, if the inferred maturation trajectory is true.

      There are also specific weaknesses with the bioinformatics, in that, while the analyses are likely appropriate, unpresented data is necessarily used to shape the argument. For example, Figure 1C shows bubble plots for two plasma cell sets, yet, of archetypal PC-expressed genes, only IRF4 is demonstrated to confirm they are true PC, and the gene is not universally expressed in cells in the clusters. For this figure, it would help to expand the bubble plot to show J-CHAIN, XBP-1, CIITA and PRDM1 or other appropriate PC demarcating molecules. Similarly, in Fig 2B, more evidence of a bifurcation in state is needed than that CD44v9 distinguishes PC1 from PC2 clusters-this is the stated conclusion, but 2A depicts that 50% of PC1 relatively weakly express CD44, while <25% of PC2 express it. Demonstrating additional molecules or genes distinguishing the clusters would improve veracity. Figure 2F shows clonal lineages, but it would be helpful to see somatic hypermutation burdens and learn if they differ between the demarcated subsets. I also find the pseudotime analyses of limited value, as some of the branches follow trajectories that are unrealistic biologically, so less weight should be placed on the pathways to which they do or do not point (i.e., the notion that GC B cells do or do not give rise to particular PC subsets).

      Statistically, some of the experiments are single wells from single donors, so there is a low level of confidence and no reproducibility demonstrated for some aspects of the study, which is a weakness.

      Paradoxical to the argument that it is the TI response process being modelled, it is presented that CpG stimulation, plus proxy T cell help (CD40L), drives the CD30+ phenotype best with the addition of the GC-associated cytokine IL-21. This should be carefully considered and discussed.

      Overall, in addition to presenting more contextual information from the bioinformatics, the best way to solidify the data set, in my vie,w would be to revisit the hypothesis with two additional experimental approaches: (1) to incorporate division tracing into the ontogeny studies and (2) to perform lineage tracing on sort-purified populations at different stages of the maturation process.

    1. Reviewer #1 (Public review):

      Summary:

      Spinal projection neurons in the anterolateral tract transmit diverse somatosensory signals to the brain, including touch, temperature, itch, and pain. This group of spinal projection neurons is heterogeneous in their molecular identities, projection targets in the brain, and response properties. While most anterolateral tract projection neurons are multimodal (responding to more than one somatosensory modality), it has been shown that cold-selective projection neurons exist in lamina I of the spinal cord dorsal horn. Using a combination of anatomical and physiological approaches, the authors discovered that the cold-selective lamina I projection neurons are heavily innervated by Trpm8+ sensory neuron axons, with calb1+ spinal projection neurons primarily capturing these cold-selective lamina I projection neurons. These neurons project to specific brain targets, including the PBNrel and cPAG. This study adds to the ongoing effort in the field to identify and characterize spinal projection neuron subtypes, their physiology, and functions.

      Strengths:

      (1) The combination of anatomical and physiological analyses is powerful and offers a comprehensive understanding of the cold-selective lamina I projection neurons in the spinal cord dorsal horn. For example, the authors used detailed anatomical methods, including EM imaging of Trpm8+ axon terminals contacting the Phox2a+ lamina I projection neurons. Additionally, they recorded stimulus-evoked activity in Trpm8-recipient neurons, carefully selected by visual confirmation of tdTomato and GFP juxtaposition, which is technically challenging.

      (2) This study identifies, for the first time, a molecular marker (calb1) that labels cold-selective lamina I projection neurons. Although calb1+ projection neurons are not entirely specific to cold-selective neurons, using an intersectional strategy combined with other genes enriched in this ALS group or cold-induced FosTRAP may further enhance specificity in the future.

      (3) This study shows that cold-selective lamina I projection neurons specifically innervate certain brain targets of the anterolateral tract, including the NTS, PBNrel, and cPAG. This connectivity provides insights into the role of these neurons in cold sensation, which will be an exciting area for future research.

      Weaknesses:

      (1) The sample size for the ex vivo electrophysiology conducted on the calb1+ lamina I projection neurons (Figure 5) is limited to a total of six recorded neurons. Given the difficulty and complexity of the preparation, this is understandable. Notably, since approximately 87% of lamina I projection neurons heavily innervated by Trpm8+ terminals are calb1+, these six recordings of such neurons in Figure 4E could also be calb1+.

    2. Reviewer #2 (Public review):

      Summary:

      In this study, the authors took advantage of a semi-intact ex vivo somatosensory preparation that includes hindlimb skin to characterize the response of projection neurons in the dorsal horn of the spinal cord to peripheral stimulation, including cold thermal stimuli. The main aim was to characterize the connectivity between peripheral afferents expressing the cold sensing receptor TRPM8 and a set of genetically tagged neurons of the anterolateral system (ALS). These ALS neurons expressed high levels of the calcium binding protein calbindin 1.

      In addition, combining different viral tracing methods, the authors could identify the anatomical targets of this specific subset of projection neurons within the brainstem and diencephalon.

      Strengths:

      The use of a relatively new (seldom used previously) transgenic line to label TRPM8-expressing afferents, combined with the genetic characterization of a previously identified subset of projections neurons add specificity to the characterization. The transgenic line appears to capture well the subpopulation of Trpm8-expressing neurons.

      In addition, the use of electron microscopy techniques makes the interpretation of the structural contacts more compelling

      The writing is clear and the presentation of findings follows a logical flow.

      Overall, this study provides solid, novel information about the brain circuits involved in cold thermosensation.

      Weaknesses:

      In the characterization of recorded neurons in close contact or in the absence of this contact with TRPM8 afferents, the number of recordedd neurons is relatively low. In addition, the strength of thermal stimuli is not very well controlled, preventing a more precise characterization of the connectivity.

      The authors acknowledge that, technically, this is a very difficult preparation with very low yield as far as obtaining successful recordings. Moreover, the tissue needs to be maintained at room temperature which is obviously not ideal when characterizing cold thermoreceptors due to the unavoidable effects of low temperature on cold-activated receptors.

    3. Reviewer #3 (Public review):

      Summary:

      Razlan and colleagues provide a detailed anatomical characterization of lamina I projection neurons in the mouse spinal cord that are densely innervated by primary afferents activated by cooling of the skin. The authors validate a Trpm8-Flp mouse line, show synaptic contacts between Trpm8⁺ boutons and projection neurons at the ultrastructural level, and demonstrate at the physiological level that these neurons specifically respond to cooling stimuli. Next, by taking advantage of previous transcriptomic analysis of ALS neurons, the authors identify calbindin as a marker for cold activatetd lamina I projection neurons and map their ascending projections to the rostral lateral parabrachial area, caudal periaqueductal gray, and ventral posterolateral thalamus, well-known thermosensory and thermoregulatory centers. Altogether, these findings provide strong anatomical and functional evidence for a direct line of transmission from Trpm8⁺ sensory afferents through Calb1⁺ lamina I neurons to key supraspinal centers controlling perception of cold and thermoregulatory responses.

      Strengths:

      The combination of mouse genetics, electron microscopy, ex-vivo physiology, optogenetics and viral tracing provides convincing evidence for a direct cold pathway. The work validates the Trpm8-Flp line by extensive anatomical and molecular characterization. Integration with previous transcriptomic and anatomical data, neatly links the cold-selective lamina I neurons to a molecularly defined cluster of ALS neurons, strengthening the bridge between molecular identity, anatomy, and physiological function.

      Weaknesses:

      The main limitation remains the relatively small number of neurons that could be recorded electrophysiologically. While understandable given the complexity of the preparation, this necessarily limits generalization.

    1. Reviewer #1 (Public review):

      Summary:

      Spinal projection neurons in the anterolateral tract transmit diverse somatosensory signals to the brain, including touch, temperature, itch, and pain. This group of spinal projection neurons is heterogeneous in their molecular identities, projection targets in the brain, and response properties. While most anterolateral tract projection neurons are multimodal (responding to more than one somatosensory modality), it has been shown that cold-selective projection neurons exist in lamina I of the spinal cord dorsal horn. Using a combination of anatomical and physiological approaches, the authors discovered that the cold-selective lamina I projection neurons are heavily innervated by Trpm8+ sensory neuron axons, with calb1+ spinal projection neurons primarily capturing these cold-selective lamina I projection neurons. These neurons project to specific brain targets, including the PBNrel and cPAG. This study adds to the ongoing effort in the field to identify and characterize spinal projection neuron subtypes, their physiology, and functions.

      Strengths:

      (1) The combination of anatomical and physiological analyses is powerful and offers a comprehensive understanding of the cold-selective lamina I projection neurons in the spinal cord dorsal horn. For example, the authors used detailed anatomical methods, including EM imaging of Trpm8+ axon terminals contacting the Phox2a+ lamina I projection neurons. Additionally, they recorded stimulus-evoked activity in Trpm8-recipient neurons, carefully selected by visual confirmation of tdTomato and GFP juxtaposition, which is technically challenging.

      (2) This study identifies, for the first time, a molecular marker (calb1) that labels cold-selective lamina I projection neurons. Although calb1+ projection neurons are not entirely specific to cold-selective neurons, using an intersectional strategy combined with other genes enriched in this ALS group or cold-induced FosTRAP may further enhance specificity in the future.

      (3) This study shows that cold-selective lamina I projection neurons specifically innervate certain brain targets of the anterolateral tract, including the NTS, PBNrel, and cPAG. This connectivity provides insights into the role of these neurons in cold sensation, which will be an exciting area for future research.

      Weaknesses:

      (1) The sample size for the ex vivo electrophysiology is small. Given the difficulty and complexity of the preparation, this is understandable. However, a larger sample size would have strengthened the authors' conclusions.

      (2) The authors used tdTomato expression to identify brain targets innervated by these cold-selective lamina I projection neurons. Since tdTomato is a soluble fluorescent protein that fills the entire cell, using synaptophysin reporters (e.g., synaptophysin-GFP) would have been more convincing in revealing the synaptic targets of these projection neurons.

      (3) The summary cartoon shown in Figure 7 can be misleading because this study did not determine whether these cold-selective lamina I projection neurons have collateral branches to multiple brain targets or if there are anatomical subtypes that may project exclusively to specific targets. For example, a recent study (Ding et al., Neuron, 2025) demonstrated that there are PBN-projecting spinal neurons that do not project to other rostral brain areas. Furthermore, based on the authors' bulk labeling experiments, the three main brain targets are NTS, PBNrel, and cPAG. The VPL projection is very sparse and almost negligible.

    2. Reviewer #2 (Public review):

      Summary:

      In this study, the authors took advantage of a semi-intact ex vivo somatosensory preparation that includes hindlimb skin to characterize the response of projection neurons in the dorsal horn of the spinal cord to peripheral stimulation, including cold thermal stimuli. The main aim was to characterize the connectivity between peripheral afferents expressing the cold-sensing receptor TRPM8 and a set of genetically tagged neurons of the anterolateral system (ALS). These ALS neurons expressed high levels of the calcium-binding protein calbindin 1.

      In addition, combining different viral tracing methods, the authors could identify the anatomical targets of this specific subset of projection neurons within the brainstem and diencephalon.

      Strengths:

      The use of a relatively new (seldom used previously) transgenic line to label TRPM8-expressing afferents, combined with the genetic characterization of a previously identified subset of projection neurons, adds a specificity to the characterization. The transgenic line appears to capture well the subpopulation of Trpm8-expressing neurons

      In addition, the use of electron microscopy techniques makes the interpretation of the structural contacts more compelling.

      The writing is clear, and the presentation of findings follows a logical flow.

      Overall, this study provides solid, novel information about the brain circuits involved in cold thermosensation.

      Weaknesses:

      In the characterization of recorded neurons in close contact or in the absence of this contact with TRPM8 afferents, the number of recorded neurons is relatively low. In addition, the strength of thermal stimuli is not very well controlled, preventing a more precise characterization of the connectivity.

      The authors could provide some sense of the effort needed to record from the 6 cold-activated neurons described. How many preparations were needed, etc?

    3. Reviewer #3 (Public review):

      Summary:

      Razlan and colleagues provide a detailed anatomical characterization of lamina I projection neurons in the mouse spinal cord that are densely innervated by primary afferents activated by cooling of the skin. The authors, building on their previous anatomical work, validate a Trpm8-Flp mouse line, show synaptic contacts between Trpm8⁺ boutons and projection neurons at the ultrastructural level, and demonstrate at the physiological level that these neurons specifically respond to cooling stimuli. Next, by taking advantage of their previous transcriptomic analysis of ALS neurons, they identify calbindin as a marker for cold-activated lamina I projection neurons and map their ascending projections to the rostral lateral parabrachial area, caudal periaqueductal gray, and ventral posterolateral thalamus, well-known thermosensory and thermoregulatory centers. Altogether, these findings provide strong anatomical and functional evidence for a direct line of transmission from Trpm8⁺ sensory afferents through Calb1⁺ lamina I neurons to key supraspinal centers controlling perception of cold and thermoregulatory responses.

      Strengths:

      The combination of mouse genetics, electron microscopy, ex vivo physiology, and viral tracing provides convincing evidence for a direct cold pathway. The work validates the Trpm8-Flp line by extensive anatomical and molecular characterization. Integration with previous transcriptomic and anatomical data neatly links the cold-selective lamina I neurons to a molecularly defined cluster of ALS neurons, strengthening the bridge between molecular identity, anatomy, and physiological function.

      Weaknesses:

      While anatomical evidence for direct synaptic connectivity between Trpm8+ afferents and lamina I projection neurons is compelling, a physiological demonstration of strict monosynaptic transmission is not shown. The conclusion that these inputs are exclusively monosynaptic should be toned down. Similarly, the statement that "Lamina I ALS neurons that are surrounded by Trpm8 afferents are cold-selective" should also be toned down as only a few neurons have been tested and it cannot be excluded that other neurons with similar characteristics may be polymodal.

    1. Reviewer #1 (Public review):

      Summary:

      CCK is the most abundant neuropeptide in the brain, and many studies have investigated the role of CCK and inhibitory CCK interneurons in modulating neural circuits, especially in the hippocampus. The manuscript presents interesting questions regarding the role of excitatory CCK+ neurons in the hippocampus, which has been much less studied compared to the well-known roles of inhibitory CCK neurons in regulating network function. The authors adopt several methods including transgenic mice and viruses, optogenetics, chemogenetics, RNAi, and behavioral tasks to explore these less-studied roles of excitatory CCK neurons in CA3. They find that the excitatory CCK neurons are involved in hippocampal-dependent tasks such as spatial learning and memory formation, and that CCK-knockdown impairs these tasks.

      However, these questions are very dependent on ensuring that the study is properly targeting excitatory CCK neurons (and thus their specific contributions to behavior).

      There needs to be much more characterization of the CCK transgenic mice and viruses to confirm the targeting. Without this, it is unclear whether the study is looking at excitatory CCK neurons or a more general heterogeneous CCK neuron population.

      Strengths:

      This field has focused mainly on inhibitory CCK+ interneurons and their role in network function and activity, and thus this manuscript raises interesting questions regarding the role of excitatory CCK+ neurons, which have been much less studied.

      Weaknesses:

      (1a) This manuscript is dependent on ensuring that the study is indeed investigating the role of excitatory CCK-expressing neurons themselves and their specific contribution to behavior. There needs to be much more characterization of the CCK-expressing mice (crossed with Ai14 or transduced with various viruses) to confirm the excitatory-cell targeting. Without this, it is unclear whether the study is looking at excitatory CCK neurons or a more general heterogeneous CCK neuron population.

      (2) The methods and figure legends are still extremely sparse, still leading to many questions regarding methodology and accuracy. More details would be useful in evaluating the tools and data, and the lack of proper quantification is still prevalent throughout the paper. In many places, only % values are noted, or only images are presented, and the number of cells counted is almost never reported.

    2. Reviewer #2 (Public review):

      Summary:

      In this study, the authors have demonstrated, through a comprehensive approach combining electrophysiology, chemogenetics, fiber photometry, RNA interference, and multiple behavioral tasks, the necessity of projections from CCK+ CAMKIIergic neurons in the hippocampal CA3 region to the CA1 region for regulating spatial memory in mice. Specifically, authors have shown that CA3-CCK CAMKIIergic neurons are selectively activated by novel locations during a spatial memory task. Furthermore, authors have identified the CA3-CA1 pathway as crucial for this spatial working memory function, thereby suggesting a pivotal role for CA3 excitatory CCK neurons in influencing CA1 LTP. The data presented appear to be well-organized and comprehensive.

      Strengths:

      (1) This work combined various methods to validate the excitatory CCK neurons in the CA3 area; these data are convincing and solid.

      (2) This study demonstrated that the CA3-CCK CAMKIIergic neurons are involved in the spatial memory tasks; these are interesting findings, which suggest that these neurons are important targets for manipulating the memory-related diseases.

      (3) This manuscript also measured the endogenous CCK from the CA3-CCK CAMKIIergic neurons; this means that CCK can be released under certain conditions.

      Weaknesses:

      In summary, this work can be formally accepted after the revision. For the limitations of the revision, the distinct neural effects of cholecystokinin (CCK) receptors (CCK-1R, CCK-2R, and CCK-3R) on hippocampal function have not been fully elucidated. Recent studies indicate that CCK-2R can modulate hippocampal activity at CA3-Schaffer collateral synapses; however, the roles of CCK-1R and CCK-3R in hippocampal function remain poorly characterized, with limited experimental evidence supporting their involvement. Overall, this study provides an interesting and novel perspective on the role of excitatory CCK signaling in hippocampus-dependent navigation learning.

    3. Reviewer #3 (Public review):

      Summary:

      Fengwen Huang et al. used multiple neuroscience techniques (transgenetic mouse, immunochemistry, bulk calcium recording, neural sensor, hippocampal-dependent task, optogenetics, chemogenetics, and interfer RNA technique) to elucidate the role of the excitatory cholecystokinin-positive pyramidal neurons in the hippocampus in regulating the hippocampal functions, including navigation and neuroplasticity.

      Strengths:

      (i) The authors provided the distribution profiles of excitatory cholecystokinin in the dorsal hippocampus via the transgenetic mice (Ai14::CCK Cre mice), immunochemistry, and retrograde AAV.

      (ii) The authors used the neural sensor and light stimulation to monitor the CCK release from the CA3 area, indicating that CCK can be secreted by activation of the excitatory CCK neurons.

      (iii) The authors showed that the activity of the excitatory CCK neurons in CA3 is necessary for navigation learning

      (iv) The authors demonstrated that inhibition of the excitatory CCK neurons and knockdown of the CCK gene expression in CA3 impaired the navigation learning and the neuroplasticity of CA3-CA1 projections.

      Weaknesses:

      (i) The causal relationship between navigation learning and CCK secretion remains nebulous; answering this question will require a more sensitive CCK-BR sensor in future work.

    1. Reviewer #1 (Public review):

      Summary:

      The authors test whether the frog buccal ventilatory rhythm generator behaves as a discrete, anatomically localized oscillator or as a distributed, state-dependent network. They combine reduced preparations (segment/subsegment work), systematic extracellular unit surveys over a defined grid, and local AMPA/GABA microinjections in a hemisected brainstem preparation. Based on these approaches, the authors conclude that mild global excitation (bath AMPA) broadens the distribution of rhythmically active units and renders a previously defined "buccal area" functionally non-identifiable as a unique necessary/sufficient locus.

      The central idea is plausible, and the overall experimental strategy is appropriate for the question being asked. However, in its current form, the manuscript overstates the strength of inference supporting the "expansion" and "loss of necessity/sufficiency" conclusions. This is primarily due to (a) statistical treatment of unit-mapping data that does not respect clustering by preparation/animal, (b) inconsistent statistical reporting across sections, and (c) limited interpretability of focal inhibitory perturbations under a globally excited state.

      Strengths:

      (1) The manuscript addresses a clear mechanistic question with broader relevance: whether rhythm generation is best conceptualized as a localized kernel or as an emergent distributed property that changes with excitatory state.

      (2) The authors use convergent approaches (reduced preparations, mapping, and necessity/sufficiency-style pharmacological perturbations), which is appropriate for circuit-level inference.

      (3) A strong element is the within-unit analysis supporting state-dependent changes in phase coupling for a subset of units ("lung" units adopting a buccal-like pattern). The authors' offline PCA-based spike sorting (with cluster-quality selection via silhouette score) provides some reassurance that the reported pre/post injection changes are not simply driven by unit misidentification.

      Weaknesses:

      (1) Pseudoreplication in unit-survey statistics undermines the main mapping inference. The Methods state that "Units were pooled from multiple preparations" and that chi-squared tests were used to compare proportions across conditions (baseline vs 60 nM AMPA). The Results similarly report proportion changes (e.g., 110 units pooled from three preparations vs 137 units pooled from three additional animals) analyzed with chi-squared tests. Because many units come from the same preparation/animal, independence is unlikely to hold; therefore, inference about state-dependent reorganization at the systems level should be made at the preparation/animal level or via hierarchical models that explicitly account for clustering.

      (2) Statistical methods are inconsistently described and need harmonization. In the segment dose-response "Analysis," values are described as compared to zero using a "One-sample t-test." Yet Table 1 is titled as using a "Wilcoxon One-sample Test." These discrepancies must be resolved throughout (Methods, Results, figure legends, and tables), including clear reporting of the unit of n and exact test statistics.

      (3) Unit classification and operational definitions raise interpretational concerns. The unit classification scheme defines "buccal units" as those firing during buccal bursts as well as lung bursts, and explicitly notes that "no units were found which fired only during buccal bursts." This is a consequential result, and it currently reads more like a limitation of detection/classification (or state-space sampled) than a robust biological conclusion. Without additional evidence, it weakens claims about a distinct buccal rhythmogenic module and complicates the interpretation of "buccal identity" changes under excitation.

      (4) Microinjection mapping: high exclusion rate and alternative explanations for 'loss of necessity' under excitation. The manuscript reports that 15 experiments were conducted, but 9 were excluded because the buccal area was not found or the preparation was "overdriven." This exclusion rate is too high to leave implicit; it raises concerns about selection bias and demands transparent accounting. Moreover, under baseline conditions, GABA (or AMPA-GABA) microinjections reliably reduce/abolish buccal bursts, but under bath 60 nM AMPA, the same injections produce no significant change in instantaneous frequency. This pattern can be interpreted as network redistribution, but it can also reflect state-dependent changes in gain, dynamic range, or local pharmacological impact (e.g., inhibition being comparatively underpowered in the globally excited state). Additional controls/analyses are required to distinguish these explanations.

    2. Reviewer #2 (Public review):

      Summary:

      In this manuscript, the authors investigate the response of the amphibian respiratory rhythm generator under varying excitability conditions. They use pharmacological agents to increase and/ or decrease synaptic excitability and demonstrate the resilience of buccal rhythms under different conditions. They employ these results to formulate their primary thesis, that there is no obligatory locus of the buccal respiratory rhythm in the frog, and that their respiratory rhythmogenic mechanisms should be considered diffuse and anatomically distributed across a larger brainstem region.

      Strengths:

      This manuscript is well written, with a sufficiently large number of experiments, for which the authors should be congratulated.

      Weaknesses:

      The presented results don't support the authors' main conclusions, and the interpretation of the data is heavily biased toward their hypothesis. This impregnates an unsubstantiated narrative in the Abstract, Introduction, and Discussion of this manuscript, which must be reexamined with the following points in consideration:

      (1) The authors seem to confuse degeneracy with redundancy. For instance, at line 54, they state, "These findings support the broader hypothesis that respiratory rhythm-generating circuits can switch to being diffuse and redundant, with discrete oscillators quickly drowning in a sea of excitations."

      Redundancy means having the same component repeated multiple times to buffer the failure of any single component, whereas degeneracy means different functional components that compensate for one another under perturbations (Goaillard and Marder, ARN 2021)

      Since the premotor-lung units get converted to buccal units under high excitability, this suggests a degenerate mechanism for respiratory rhythm generation- rather than a redundant mechanism, where there should be multiple buccal units that get recruited under different excitability conditions.

      (2) Line 83, "but the essential requirement for a discrete, rudimentary buccal oscillator is also lost".

      This statement is not supported by the data presented in this study. How does the expansion of the buccal unit imply that the essential requirement for discreteness is lost? Under increased excitability, does the burst/rhythm initiation zone also expand? Or does it still remain centered around the location of buccal units under physiological conditions? Increased excitability can lead to recruitment of a larger area, without a change in the location of the rhythmogenic kernel.

      (3) Line 86, "... oscillators should be viewed as promiscuous flexible functional entities that expand or contract...".

      Oscillators can be regarded as promiscuous only if, under physiological conditions, they switch positions. Under high excitability, only the flexibility argument holds, which has been established in mammals before (e.g., CA Del Negro, K Kam, JA Hayes, JL Feldman, The Journal of physiology 587 (6), 1217-1231; CA Del Negro, C Morgado-Valle, JL Feldman,Neuron 34 (5), 821-830; NA Baertsch, LJ Severs, TM Anderson, JM Ramirez, Proceedings of the National Academy of Sciences 116 (15), 7493-7502; NA Baertsch, HC Baertsch, JM Ramirez Nature communications 9 (1), 843).

      Results:

      (4) Interpretation of data in Figure 6.

      How does the Buccal activity and L2 Power stroke change with 60nm AMPA (in CN5)? Does the increase in the Buccal neurons and decrease in power stroke neurons also reflect in the CN5 activity? Also see comments on Figure 9 data below.

      (5) Interpretation of data in Figure 7.

      Here, classifying buccal neurons solely by spiking may obscure the fact that the 'silent' neurons under baseline conditions were part of the rhythmic network but could not spike due to subthreshold inputs. 60 nM AMPA increased their firing in response to previously subthreshold synchronous inputs during the buccal burst. Intracellular recordings are required to negate this possibility and establish that the neuronal classification is robust.

      (6) Interpretation of data in Figure 8.

      "Lung units can transform into buccal units under excitation".<br /> CN5 buccal and lung bursts need to be compared before and after AMPA injection. From Figure 8 A-D, it is apparent that the example Unit2's activity increases during the buccal bursts, after AMPA injection. However, they are also present in buccal burst pre-AMPA, albeit with less frequency.

      It is striking that the pre-AMPA epoch (panel A) is less than half of the post-AMPA epoch. This would, in itself, lead to a biased estimate of lung units that are active under the baseline condition during the buccal bursts.

      Figure 8G, meta-analysis of lung units spiking during the baseline buccal bursts is warranted to interpret the main claim of this figure. Similarly, analysis of spiking per lung burst for the post-AMPA condition is essential for comparing the lung unit's contribution under high excitability.

      (7) Interpretation of data in Figure 9

      "Buccal area loses importance under increased excitation."

      This interpretation is not fully supported by the data presented in this manuscript. Under 60 nm AMPA, does the ratio of lung burst to buccal burst change in CN5? This analysis is crucial for determining whether the lung units are indeed converted into buccal bursts at the expense of lung activity or whether their appearance during buccal bursts is incidental due to increased excitability. In the baseline, there are 4-5 buccal bursts per lung burst, whereas under high excitability, there are 2-3 buccal bursts per lung burst (Figure 9 A-B). This seems inconsistent with the conclusion that increased excitability converts lung units into buccal units (Figures 6 &7).

      Could the authors comment on the connectivity between the lung and the buccal units? Results in Figure 9A-B indicate that lung units may receive an efference copy of buccal units, and under high excitability, their spikes may generate negative feedback onto the buccal units, terminating their bursts. This could explain the decrease in the buccal-to-lung burst in high-AMPA conditions. This type of circuit interaction resembles the mammalian breathing CPG, in which the parafacial/RTN (which controls the abdominal muscles) and preBötC (which controls the diaphragm) interact and cross-inhibit each other.

      (8) Line 382.

      "Buccal-like bursting produced from two independent slices".

      The two "independent" slices have portions of the same anatomical kernel, the buccal rhythm generator. This experiment is like the sandwich slice preparation of preBötC (Del Negro Lab), in which two thinner slices exhibit rhythmic activity. Thus, the two slices are not independent; they are anatomically adjacent and functionally overlapping.

    3. Reviewer #3 (Public review):

      Summary:

      This study uses isolated frog brainstem preparations to test whether inspiratory rhythm generation is confined to a narrowly defined neural center or instead reflects the activity of a distributed and adaptable network. Building on prior rodent work, the authors examine structural and functional parallels between the frog Buccal Area and the mammalian preBötzinger complex. By increasing excitatory drive, they assess whether a localized rhythmogenic region can expand into a broader network that participates in buccal rhythm generation, providing insight into how respiratory circuits are dynamically reconfigured across physiological states.

      Strengths:

      The work presents compelling evidence that ventilatory rhythm generation is supported by a flexible, state-dependent network rather than a fixed anatomical locus. The experimental preparation is well-suited to address these questions, and the data are generally of high quality. The demonstration that increased excitation recruits a more distributed network parallels observations in mammalian systems and strengthens the translational relevance of the findings. Overall, the analyses are thoughtful, and the interpretations are largely well supported by the results.

      Weaknesses:

      Some issues limit the strength of the conclusions. First, the study does not address the transition from eupnea to gasping in mammals, which could provide important physiological context for the observed AMPA-induced network reorganization. Second, the reported transformation of lung-active neurons into buccal-active neurons would benefit from additional analyses to clarify whether neurons switch identities or acquire dual activity. Finally, the necessity and sufficiency experiments in Figure 9 require further support, particularly through AMPA dose-response analyses and more comprehensive GABA manipulations, to confirm that network expansion does not obscure the continued functional importance of the core buccal region.

    1. Reviewer #1 (Public review):

      Summary:

      Choucri and Treiber have reassessed their previous study on TE-gene chimeric transcripts in neural genes in response to Azad et al (2024). Azad and colleagues argued that, contrary to Choucri and Treiber's findings, chimeric TE-mRNAs are relatively infrequent, and they cautioned that further optimization of bioinformatics pipelines is needed to detect TE insertions from RNAseq accurately. In this short response, Choucri and Treiber clearly demonstrate that differences in the tools used between their study and that of Azad et al. likely account for the contrasting results, along with RT-PCR failure in designing primers that would match the chimeric transcript, and the use of different Drosophila lines. The authors emphasize the need for uniform, standardized criteria in such analysis, which would ultimately strengthen and advance the field.

      Strengths:

      The addition of a ratio to compute the number of splice reads specific to the chimeric transcript and compare to the exon-exon splice reads is really interesting because it opens the door to finally quantify the contribution of chimeric TEs to the overall gene expression, although this is not the scope of the present article. The clear dissection of chimeric transcripts, along with the results from Azad et al, allows us to understand the differences between the two studies confidently. Finally, the discussion on Drosophila lines is indeed essential, given that the lines and even individuals have high TE polymorphism.

      Weaknesses:

      I think it is necessary to add more detail to this article, for instance, the differences between TEchim and Tidal could be laid out more precisely. Regarding the roo example, one of the caveats of this family, along with others, is the presence of simple repeats. It would be important to show that the simple repeats are not interfering with the read mapping. Regarding the experiments, if we are looking for a standardized protocol, then we should have a detailed material and methods section, with every experiment, replicate, and PCR temperature clearly defined. Finally, and in my opinion, more importantly, the use of RT negative controls on the RT PCRs, along with DNA PCRs to show insertion presence, is mandatory for testing the presence of chimeric genes. Of course, water negative PCR controls are also needed, and unfortunately, absent from Figure 3.

    2. Reviewer #2 (Public review):

      Summary:

      This study by Choucri and Treiber aims to directly address a recent critique regarding the role of transposable elements (TEs) in diversifying the neural transcriptome of Drosophila. The authors seek to demonstrate that TEs are not merely genomic "noise" but are frequently and reliably "exonized" into brain-specific mRNA. By introducing an upgraded computational pipeline, TEChim, and conducting precise experimental validations, the authors set out to show that TE-mediated splicing represents a genuine biological phenomenon that expands the molecular repertoire of the nervous system.

      Strengths:

      The study's primary strength lies in its rigorous technical "forensic" analysis of previous failed replication attempts. The authors convincingly demonstrate that the lack of signal in the opposing study stemmed from a fundamental methodological mismatch: the software used by the critics (TIDAL) is logically incapable of detecting splice sites located within TE sequences. Importantly, the authors complement this computational clarification with definitive experimental evidence through an effective "experimental rescue." By employing correctly designed primers and matching the genetic backgrounds of the fly strains, thereby accounting for genomic polymorphisms, they successfully validated all seven loci that were previously reported as undetectable. This dual-pronged strategy, addressing both algorithmic bias and experimental design, establishes a more robust technical benchmark for the detection and validation of TE-derived exons in neural tissues.

      Weaknesses:

      While the technical rebuttal is highly convincing, the scope of the study remains primarily defensive. As a response to a prior critique, the work focuses on establishing the existence and detectability of chimeric TE-derived transcripts rather than exploring their broader functional consequences. As a result, there is limited new insight into how these TE-modified isoforms influence neural circuit function or organismal behavior. In addition, the detection and validation of these events remain technically demanding, requiring deep sequencing and specialized bioinformatic expertise, which may limit broader adoption by laboratories without dedicated computational resources.

    3. Reviewer #3 (Public review):

      Summary:

      This manuscript by Choucri and Treiber responds to a recent paper by Azad et al., which responds to a paper by Treiber and Wadell (Genome Research, 2020). The controversy relates to the detection of transcripts with transposable elements (TEs) spliced into them in the Drosophila brain.

      Strengths:

      The authors now argue convincingly that these transcripts exist using an improved, updated version of their pipeline. They also validate some of their findings using RT-PCR and explain why Azad et al. failed to detect these transcripts due to methodological errors. Overall, I am convinced that these transcripts exist and that the TE-derived transcripts described by Choucri and Treiber are real.

      Weaknesses:

      The authors should mention that combining PCR-amplified cDNA generation with short-read sequencing is suboptimal for detecting TE-fusion transcripts. Recently, direct long-read ONT RNA sequencing, which does not require amplification and spans the entire transcript, has been used to detect similar transcripts in human stem cells and the human brain (PMID: 40848716 & Garza et al, BioRxiv). Had the authors used this technology to validate their findings, there would be no question about these transcripts. If not doing such experiments, then they should at least discuss the possibility and the advantage of the approach.

    1. Reviewer #1 (Public review):

      [Editors' note: this version has been assessed by the Reviewing Editor without further input from the original reviewers. The authors have addressed the minor comments raised in the previous round of review.]

      Summary:

      In this manuscript, Chengjian Zhao et al. focused on the interactions between vascular, biliary, and neural networks in the liver microenvironment, addressing the critical bottleneck that the lack of high-resolution 3D visualization has hindered understanding of these interactions in liver disease.

      Strengths:

      This study developed a high-resolution multiplex 3D imaging method that integrates multicolor metallic compound nanoparticle (MCNP) perfusion with optimized CUBIC tissue clearing. This method enables the simultaneous 3D visualization of spatial networks of the portal vein, hepatic artery, bile ducts, and central vein in the mouse liver. The authors reported a perivascular structure termed the Periportal Lamellar Complex (PLC), which is identified along the portal vein axis. This study clarifies that the PLC comprises CD34⁺Sca-1⁺ dual-positive endothelial cells with a distinct gene expression profile, and reveals its colocalization with terminal bile duct branches and sympathetic nerve fibers under physiological conditions.

      Comments on revisions:

      The authors very nicely addressed all concerns from this reviewer. There are no further concerns and comments.

    2. Reviewer #3 (Public review):

      Xu, Cao and colleagues aimed to overcome the obstacles of high-resolution imaging of intact liver tissue. They report successful modification of the existing CUBIC protocol into Liver-CUBIC, a high-resolution multiplex 3D imaging method that integrates multicolor metallic compound nanoparticle (MCNP) perfusion with optimized liver tissue clearing, significantly reducing clearing time and enabling simultaneous 3D visualization of the portal vein, hepatic artery, bile ducts, and central vein spatial networks in the mouse liver. Using this novel platform, the researchers describe a previously unrecognized perivascular structure they termed Periportal Lamellar Complex (PLC), regularly distributed along the adult liver portal veins.<br /> Using available scRNAseq data, the authors assessed the CD34⁺Sca-1⁺ cells' expression profile, highlighting mRNA presence of genes linked to neurodevelopment, bile acid transport, and hematopoietic niche potential. Different aspects of this analysis were then addressed by protein staining of selected marker proteins in the mouse liver tissue. Next, the authors addressed how the PLC and biliary system react to CCL4-induced liver fibrosis, implying PLC dynamically extends, acting as a scaffold that guides the migration and expansion of terminal bile ducts and sympathetic nerve fibers into the hepatic parenchyma upon injury.

      The work clearly demonstrates the usefulness of the Liver-CUBIC technique and the improvement of both resolution and complexity of the information, gained by simultaneous visualization of multiple vascular and biliary systems of the liver. The identification of PLC and the interpretation of its function represent an intriguing set of observations that will surely attract the attention of liver biologists as well as hepatologists. The importance of the CD34+/Sca1+ endothelial cell population and claims based on transcriptomic re-analysis require future assessment by functional experimental approaches to decipher the functional molecules involved in PLC formation, maintenance, and the involvement in injury response before establishing their role in biliary, arterial, and neural liver systems.

      Strengths:

      The authors clearly demonstrate an improved technique tailored to the visualization of the liver vasulo-biliary architecture in unprecedented resolution.<br /> This work proposes a new morphological feature of adult liver facilitating interaction between the portal vein, hepatic arteries, biliary tree, and intrahepatic innervation, centered at previously underappreciated protrusions of the portal veins - PLCs.

      Weaknesses:

      The importance of CD34+Sca1+ endothelial cell sub-population for PLC formation and function was not tested and warrants further validation.

    1. Reviewer #1 (Public review):

      This manuscript investigates how dentate gyrus (DG) granule cell subregions, specifically suprapyramidal (SB) and infrapyramidal (IB) blades, are differentially recruited during a high cognitive demand pattern separation task. The authors combine TRAP2 activity labeling, touchscreen-based TUNL behavior, and chemogenetic inhibition of adult-born dentate granule cells (abDGCs) or mature granule cells (mGCs) to dissect circuit contributions.

      This manuscript presents an interesting and well-designed investigation into DG activity patterns under varying cognitive demands and the role of abDGCs in shaping mGC activity. The integration of TRAP2-based activity labeling, chemogenetic manipulation, and behavioral assays provides valuable insight into DG subregional organization and functional recruitment. However, several methodological and quantitative issues limit the interpretability of the findings. Addressing the concerns below will greatly strengthen the rigor and clarity of the study.

      Major points:

      (1) Quantification methods for TRAP+ cells are not applied consistently across panels in Figure 1, making interpretation difficult. Specifically, Figure 1F reports TRAP+ mGCs as density, whereas Figure 1G reports TRAP+ abDGCs as a percentage, hindering direct comparison. Additionally, Figure 1H presents reactivation analysis only for mGCs; a parallel analysis for abDGCs is needed for comparison across cell types.

      (2) The anatomical distribution of TRAP+ cells is different between low- and high-cognitive demand conditions (Figure 2). Are these sections from dorsal or ventral DG? Is this specific to dorsal DG, as itis preferentially involved in cognitive function? What happens in ventral DG?

      (3) The activity manipulation using chemogenetic inhibition of abDGCs in AsclCreER; hM4 mice was performed; however, because tamoxifen chow was administered for 4 or 7 weeks, the labeled abDGC population was not properly birth-dated. Instead, it consisted of a heterogeneous cohort of cells ranging from 0 to 5-7 weeks old. Thus, caution should be taken when interpreting these results, and the limitations of this approach should be acknowledged.

      (4) There is a major issue related to the quantification of the DREADD experiments in Figure 4, Figure 5, Figure 6, and Figure 7. The hM4 mouse line used in this study should be quantified using HA, rather than mCitrine, to reliably identify cells derived from the Ascl lineage. mCitrine expression in this mouse line is not specific to adult-born neurons (off-targets), and its expression does not accurately reflect hM4 expression.

      (5) Key markers needed to assess the maturation state of abDGCs are missing from the quantification. Incorporating DCX and NeuN into the analysis would provide essential information about the developmental stage of these cells.

      Minor points:

      (1) The labeling (Distance from the hilus) in Figure 2B is misleading. Is that the same location as the subgranular zone (SGZ)? If so, it's better to use the term SGZ to avoid confusion.

      (2) Cell number information is missing from Figures 2B and 2C; please include this data.

      (3) Sample DG images should clearly delineate the borders between the dentate gyrus and the hilus. In several images, this boundary is difficult to discern.

      (4) In Figure 6, it is not clear how tamoxifen was administered to selectively inhibit the more mature 6-7-week-old abDGC population, nor how this paradigm differs from the chow-based approach. Please clarify the tamoxifen administration protocol and the rationale for its specificity.

      Comments on revisions:

      I appreciate the authors' careful and thorough revisions. They have addressed all of my previous concerns satisfactorily, and the manuscript is now significantly strengthened. I have no further concerns.

    2. Reviewer #2 (Public review):

      In this study, the authors investigate how increasing cognitive demand shapes activity patterns in the dorsal dentate gyrus (DG). Using a touchscreen-based TUNL task combined with TRAP/c-Fos tagging, birth-dating of adult-born granule cells (abDGCs), and chemogenetic inhibition, they show that higher task demand increases mature granule cell (mGC) recruitment and enhances suprapyramidal (SB) versus infrapyramidal (IB) blade bias. Functionally, mGC inhibition reduces overall activity and impairs performance without disrupting blade bias, whereas inhibition of {less than or equal to}7-week-old abDGCs increases mGC activity, abolishes blade bias, and impairs discrimination under high-demand conditions. These findings suggest that effective pattern separation depends not only on overall DG activity levels but also on the spatial organization of recruited ensembles.

      The integration of touchscreen TUNL with temporally controlled activity tagging and birth-dated cohorts is technically strong. Quantification of SB-IB bias and radial/apical distributions adds anatomical precision beyond bulk activity measures. The comparison between mGC and abDGC inhibition is conceptually compelling and supports dissociable functional roles. Overall, the data convincingly demonstrate that increasing cognitive demand amplifies blade-biased DG recruitment and that mGCs and abDGCs differentially contribute to both behavioral performance and network organization.

      However, how abDGCs are integrated into the mGC network under high cognitive demand remains unresolved. Additional experiments are needed to clarify how abDGCs shape spatial recruitment patterns and whether they directly inhibit or indirectly regulate mGC activity to maintain high performance.

      Furthermore, the authors frame "high cognitive demand" as a multidimensional construct encompassing broad behavioral challenge. It would strengthen the work to delineate how local abDGC-mGC circuit interactions regulate specific task components in real time. This will require higher temporal resolution approaches, as TRAP and c-Fos labeling integrate activity over prolonged windows and primarily reflect sustained engagement rather than moment-to-moment computations.<br /> The central conclusion that dentate function depends on coordinated spatial recruitment rather than total activity magnitude is supported by the data, although mechanistic interpretations should be tempered given methodological limitations.<br /> Overall, this work advances models of adult neurogenesis by emphasizing a critical-period modulatory role of abDGCs in organizing DG network activity during high-demand discrimination. The combined behavioral and circuit-level framework is likely to be influential in the field.

    3. Reviewer #3 (Public review):

      This study examines the role of dentate gyrus neuronal populations, reflecting neurogenesis and anatomical location (suprapyramidal vs infrapyramidal blade), in a mnemonic discrimination task that taxes the pattern separation functions of the dentate. The authors measure dentate gyrus activity resulting from cognitive training and test whether adult neurogenesis is required for both the anatomical patterns of activity and performance in the cognitive task. The authors find that more cognitively challenging variants of the task evoked more dentate activity, but also distinct patterns of activity (more activity in the suprapyramidal blade, less in the infdrapyramidal blade). Using chemogenetic approaches they silence mature vs immature dentate gyrus neurons and find that only mature neurons (either the general population or specifically mature adult-born neurons), and not immature adult-born neurons, are required for the difficult version of the task. Inhibition of mature adult-born neurons furthermore increased overall activity in the dentate and reduced the biased pattern of activity across the blades, consistent with evidence that adult-born neurons broadly regulate dentate gyrus activity.

      Comments on revisions:

      I appreciate the efforts the authors have taken to revise this manuscript. I have only minor concerns with this revised version of the manuscript:

      Methods state that significance is defined as P<0.05 but some results are interpreted as significant when P=0.05. Either the alpha value needs to change or the interpretation needs to change.

      I believe the statistical results for group and blade effects for the ANOVAs, in Figs 2,3 & 4, appear to be switched (blade should be significant, not group).

      I appreciate that sometimes there is not a perfect overlap between immunohistochemical signals, but I continue to believe that the spatially-non-overlapping TRAP and EDU signals in Fig 3 is caused by these 2 markers being in different cells. A Z-stack or orthogonal projection could verify/disprove this concern.

    1. Reviewer #1 (Public review):

      Kong et al.'s work describes a new approach that does exactly what the title states, "Correction of local beam-induced sample motion in cryo-EM images using a 3D spline model." It is, therefore, a more elaborate approach than current methods in the field for the "movie alignment" stage. Additionally, the work uses 2DTM (2D Template Matching)-related measurements to quantify the improvement of the new method compared to other methods in the field. I find both parts very compelling (the new method and the testing approach)

      On a "focused" view, the strengths of the work rest on presenting a better approach for motion correction and on measuring their performance very well at the 2D level in a compelling manner

      On a more "general" view, the authors introduce the important notion that even one of the most worked-out steps in the processing workflow can still be done better in a measurable way, and that this could lead to better results beyond the 2DTM metrics used for testing, reflecting in better results along the processing pipeline (although the manuscript does not explore further this notion)

      On the "usability" side, the method is still CPU-based and is slower than standards in the field. This may pose significant limitations in practical work, although the authors are aware of this issue and are working on it.

    2. Reviewer #2 (Public review):

      Summary:

      The authors present a new method, Unbend, for measuring motion in cryo-EM images, with a particular emphasis on more challenging in situ samples such as lamella and whole cells (that can be more prone to overall motion and/or variability in motion across a field of view). Building on their previous approach of full-frame alignment (Unblur), they now perform full-frame alignment followed by patch alignment, and then use these outputs to generate a 3D model of the motion. This model allows them to estimate a continuous, per-pixel shift field for each movie frame that aims to better describe complex motions and so ultimately generate improved motion-corrected micrographs. Performance of Unbend is evaluated using the 2D template matching (2DTM) method developed previously by the lab, and results are compared to using full-frame correction alone and to the leading local motion correction methods. Several different in situ samples are used for evaluation covering a broad range that will be of interest to the rapidly growing in situ cryo-EM community.

      Strengths:

      The method appears an elegant way of describing complex motions in cryo-EM samples and the authors present sound data that Unbend generally improves SNR of aligned micrographs as well as increases detection of particles matching the 60S ribosome template when compared to using full-frame correction alone and since review to the leading local motion correction methods. The authors also give interesting insights into how different areas of a lamella behave with respect to motion by using Unbend on a montage dataset collected previously by the group. There is growing interest in imaging larger areas of in situ samples at high resolution and these insights contribute valuable knowledge. Additionally, the availability of data collected in this study through the EMPIAR repository will be much appreciated by the field.

      Weaknesses:

      A major weakness was comparing this method to full-frame approaches only but this has since been addressed by the authors during review and Unbend is compared to MotionCor2, 3, CryoSPARC and Warp. The improvements here are smaller, generally it seems to perform on par with the above methods, but there are significant gains for certain samples (e.g. the M. pneumoniae sample). A comment from this reviewer about using an adaptive approach to decide if/when to proceed to the full Unbend pipeline, over full-frame alone, has been addressed by the authors.

    3. Reviewer #3 (Public review):

      Summary

      Kong and coauthors describe and implement a method to correct local deformations due to beam induced motion in cryo-EM movie frames. This is done by fitting a 3D spline model to a stack of micrograph frames using cross-correlation-based local patch alignment to describe the deformations across the micrograph in each frame, and then computing the value of the deformed micrograph at each pixel by interpolating the undeformed micrograph at the displacement positions given by the spline model. A graphical interface in cisTEM allows the user to visualise the deformations in the sample, and the method is proved to be successful by showing improvements in 2D template matching (2DTM) results on the corrected micrographs using five in situ samples.

      Impact

      This method has great potential to further streamline the cryo-EM single particle analysis pipeline by shortening the required processing time as a result of obtaining higher quality particles early in the pipeline, and is applicable to both old and new datasets, therefore being relevant to all cryo-EM users.

      Strengths

      (1) The key idea of the paper is that local beam induced motion affects frames continuously in space (in the image plane) as well as in time (along the frame stack), so one can obtain improvements in the image quality by correcting such deformations in a continuous way (deformations vary continuously from pixel to pixel and from frame to frame) rather than based on local discrete patches only. 3D splines are used to model the deformations: they are initialised using local patch alignments and further refined using cross-correlation between individual patch frames and the average of the other frames in the same patch stack.

      (2) Another strength of the paper is using 2DTM to show that correcting such deformations continuously using the proposed method does indeed lead to improvements, as evidenced by the number of particles found and the quality of the detections (measured using 2DTM SNR). This is shown using five in situ datasets, where local motion is quantified using statistics based on the estimated motions of ribosomes. The same analysis is performed using other deformation correction tools, with Unbend showing superior performance in terms of particle detected or 2DTM SNR of the detections.

    1. Reviewer #2 (Public review):

      Summary:

      In this study, Bansal et al examine and characterize feeding behaviour in Anopheles stephensi mosquitoes. While sharing some similarities to the well-studied Aedes aegypti mosquito, the authors demonstrate that mated-females, but not unmated (virgin) females, exhibit suppression in their blood-feeding behaviour after imbibing an initial bloodmeal. Using brain transcriptomic analysis comparing sugar fed, blood fed and starved mosquitoes, several candidate genes potentially responsible for influencing blood-feeding behaviour were identified, including two neuropeptides (short NPF and RYamide) that are known to modulate feeding behaviour in other mosquito species. Using molecular tools including in situ hybridization, the authors map the distribution of cells producing these neuropeptides in the nervous system and in the gut. Further, by implementing systemic RNA interference (RNAi), the study suggests that both neuropeptides (particularly in the brain, but not in the abdomen since knockdown outside the brain did not affect feeding behaviour) appear to promote blood-feeding while having no impact on sugar feeding. Interestingly, when either of these two neuropeptide gene transcripts were reduced independently by RNAi, the proportion of females acquiring a blood meal was not affected, whereas simultaneous knockdown of both sNPF and RYa led to a reduction in blood feeding behaviour but did not impact sugar feeding.

      Given that the expression of both neuropeptide genes was found in mostly in non-overlapping brain neurons, this suggests that these two neuropeptides may elicit at least partially complementary actions promoting blood feeding in A. stephensi. Indeed, their putative receptors appear to be colocalized within several neurons within the brain, which could explain why knockdown of both sNPF and RYa transcripts was required to affect blood feeding behaviour (although authors could not confirm if either of these neuropeptides act independently as only partial knockdown was achieved in the brain). Finally, while sNPF was mapped to brain neurons and midgut enteroendocrine cells, the authors mapped RYa only in the brain while reporting expression in the abdomen by qPCR, but that was not localized to the midgut EECs (like sNPF). Therefore, the source of RYamide in the abdomen remains unknown in this mosquito species, but could involve the abdominal ganglia where this neuropeptide has been localized in Ae. aegypti.

      Strengths and/or weaknesses:

      Overall, the manuscript was effectively communicated. Previous concerns and requested clarifications have been addressed in the revised manuscript. While advanced cell-specific tools are lacking in this mosquito species, one weakness here is that peptides could have been applied ectopically in attempts to rescue the deficit in blood feeding behaviour following knockdown by RNAi. Further insight in this regard may be provided in future studies by this and other research groups.

      Reviewing editor comment:

      Inclusion of a schematic in Supplementary Figure S9B addresses the point raised by reviewer 1 in the previous round.

    1. Reviewer #1 (Public review):

      Summary:

      The authors aimed to investigate how short-term visual deprivation influences tactile processing in the primary somatosensory cortex (S1) of sighted rats. They justify the study based on previous studies that have shown that long-term blindness can enhance tactile perception, and aim to investigate the change in neural representations underlying rapid, short-term cross-modal effects. The authors recorded local field potentials from S1 as rats encountered different tactile textures (smooth and rough sandpaper) under light and dark conditions. They used deep learning techniques to decode the neural signals and assess how tactile representations changed across the four different conditions. Their goal was to uncover whether the absence of visual cues leads to a rapid reorganization of tactile encoding in the brain.

      Strengths:

      The study effectively integrates high-density local field potential (LFP) recordings with convolutional neural network (CNN) analysis. This combination allows for decoding high-dimensional population-level signals, revealing changes in neural representations that traditional analyses (e.g., amplitude measures) failed to detect. The custom treadmill paradigm permits independent manipulation of visual and tactile inputs under stable locomotion conditions. Gait analysis confirms that motor behavior was consistent across conditions, strengthening the conclusion that neural changes are due to sensory input rather than movement artifacts.

      Weaknesses:

      (1) While the study interprets the emergence of more distinct texture representations in the dark as evidence of rapid cross-modal plasticity, the claim rests on correlational data from a short-term manipulation and decoding analysis. The authors show that CNN-derived feature embeddings cluster more clearly by texture in the dark, but this does not directly demonstrate plasticity in the classical sense (e.g., synaptic or circuit-level reorganization). The authors have noted this as a limitation and have clarified that the observed changes reflect functional reorganization rather than structural plasticity.

      (2) Although gait was controlled, changes in arousal or exploratory behavior in light versus dark conditions might play a role in the observed neural differences. The authors have controlled for various factors in relation to locomotion, but future studies would benefit from more direct behavioural readouts of arousal states (e.g., via pupillometry or cortical state indicators).

      (3) It should be noted that the time course of the observed changes (within 10 minutes) is quite rapid, and while intriguing, the study does not include direct evidence that the underlying circuits were reorganized-only that population-level signals become more discriminable. The authors have adequately discussed this as an avenue for more mechanistic future research.

      (4) The authors have adequately discussed that, while these findings are consistent with somatotopy and context-dependent dynamics, they do not provide strong independent evidence for novel spatial or temporal organization.

      (5) The authors have also discussed that, while the neural data suggest enhanced tactile representations, the study does not assess whether rats' actual tactile perception improved. Future studies including an assessment of a behavioral readout (e.g., discrimination accuracy), would be insightful.

      (6) The authors' discussion about the implications for sensory rehabilitation, including Braille training and haptic feedback enhancement was a bit premature, but they have amended this, and it remains an interesting translational potential to be explored in future studies.

      (7) While the CNN showed good performance, more transparent models (e.g., linear classifiers or dimensionality reduction) appear to not exceed chance level. The implications of this are that there is an underlying complex structure in the LFPs that has yet to be fully uncovered, on the mechanistic level. This would be important to push the findings forward in future studies.

      Therefore, while the authors raise interesting hypotheses around rapid plasticity, somatotopic dynamics, and rehabilitation, the evidence for each is indirect. Stronger claims will require future causal experiments, behavioral readouts, and mechanistic specificity beyond what the current data provides. However, the work represents an interesting starting point to a more mechanistic understanding in the future.

    2. Reviewer #2 (Public review):

      Summary:

      Yamashiro et al. investigated how transient absence of visual input (i.e. darkness) impacts tactile neural encoding in the rat primary somatosensory cortex (S1). They recorded local field potentials (LFPs) using a 32-channel array implanted in forelimb and hindlimb primary somatosensory cortex while rats walked on smooth or rough textures under illuminated and dark conditions. Employing a convolutional neural network (CNN), they successfully decoded both texture and lighting conditions from the LFPs. The authors conclude that the subtle differences in LFP patterns underlie tactile representation surface roughness and become more distinct in darkness, suggesting a rapid cross-modal reorganization of the neural code for this sensory feature.

      Strengths:

      • The manuscript addresses a valuable question regarding how sensory cortices dynamically adapt to changes in sensory context.<br /> • The use of machine learning (CNNs) enables the analysis to go beyond conventional amplitude-based metrics, potentially uncovering subtle but meaningful effects.<br /> • The authors have substantially improved the manuscript with clearer figures, additional statistical analyses (including permutation tests and cross-validation), and greater methodological transparency.

      Weaknesses:

      • The new analyses (grand-average LFPs, correlation maps, wavelet decompositions, attribution-score correlations) improve transparency but do not yet clarify which specific neural features the CNN exploits, leaving the central interpretability question unresolved.<br /> • A plausible alternative explanation for the increased discriminability in darkness remains insufficiently ruled out: visually driven activity in the light condition (e.g., ambient illumination changes or self-motion-induced visual input) could contaminate S1 LFPs and account for the effect without reflecting a true neural representational change.<br /> • Behavioural and order controls have been improved but remain somewhat limited in sample size.

      Overall assessment:

      The revised manuscript is clearer, more transparent, and technically strengthened. However, the true nature of the signal changes underlying the observed differences in discriminability remains unclear, limiting the scientific strength of the conclusions. The possibility that visual interference contributes to the observed effects remains a plausible and untested alternative interpretation. Additional experiments or analyses quantifying visually evoked activity in S1 would be required to confirm the claim of genuine reorganization of neural representation depending on the illumination condition.

    1. Reviewer #2 (Public review):

      Summary:

      An abundant literature documents molecular changes in the rodent hypothalamus that occur during the transition from prepubertal to mature reproductive physiology. Equally well documented is the role of sex steroids and their receptors during this important period of reproductive development, as well as the importance of GABAergic and glutamatergic neurons. The medial preoptic area (MPOA) is known to play a central role in expression of sexually dimorphic reproductive function and previously reported sexually dimorphic patterns of gene expression are consistent with this role. The present manuscript extends this knowledge base and reports the results of a detailed evaluation of transcriptional dynamics in the MPOA during the adolescent transition to maturity with a particular focus on the role of the estrogen receptor gene (Esr1). Both single cell RNA sequencing (scRNseq) and multiplex in situ hybridization methods were employed and the results subjected to detailed computational analyses to demonstrate that the transcriptomic structure of MPOA neurons displays both sex and cell type specific expression profiles. In addition, both hormonal and genetic manipulations of Esr1 signaling during puberty altered the transcriptional profiles of MPOA neurons, and these changes aligned with maturation of hormone-dependent reproductive function. The authors provide this evidence to illustrate Esr1-dependent control of gene regulatory networks required for normal expression of reproductive behaviors expressed during the transition from adolescence to adulthood. The results presented in this manuscript are extensive and represent the most comprehensive evaluation of transcriptomic changes during reproductive maturation to date. The methods appear strong and the results provide a rich data set that will support a good deal of future analysis.

      Strengths:

      (1) The major strength of this manuscript is the extensive set of images and graphs that illustrate molecular changes that occur in MPOA neurons during adolescence, although additional spatial detail as to locations of the source neurons would be welcome in order to place the changes in the proper circuitry context.

      (2) Targeting Esr1 deletion to MPOA GABA neurons is a good choice, given how these cells have been implicated in sexual differentiation of reproductive behavior previously, and the lack of comparable responses in glutamatergic neurons is convincing. The AAV-frtFlex-Cre virus created by the investigators is a most useful tool for such studies. Profiling distinct transcriptomic trajectories in GABA and glutamatergic neurons during reproductive maturation is impressive and leads to some of the best supported conclusions in this paper.

      (3) Cellular and molecular resolution of the transcriptomics data appears excellent, however, because the source tissue for the scRNAseq analysis was obtained by bulk dissection of the MPOA anatomical resolution is limited. This problem is addressed to some extent by careful comparison of scRNAseq results with previously published spatial transcriptomics data. The HM-HCR-FISH analysis clearly documents spatially restricted changes in gene expression, but it is hard to discern where these changes occur based on the images presented or the descriptions included in the Results. The anatomical schematic included in Figure 4 suggests that investigators are not familiar with components of the MPOA (see Allen Mouse Brain Atlas).

      Weaknesses:

      (1) A major conceptual flaw is that the authors do not distinguish between genetically determined sex differences in patterns of gene expression and differences caused by the fact that MPOA neurons are exposed to different endocrine environments in adolescent males and females, which can cause different transcriptional trajectories independent of genetic sex. This issue does not render their results invalid, but their terminology should address the issue in the discussion and "limitations" section. At the very least the endocrine status of "intact females" should be included.

      (2) A major technical flaw is that the MPOA is treated as a functionally distinct brain region (block dissections) with uniform distribution of cell types (FISH data are not illustrated or reported with sufficient spatial detail). Thus, an enormous amount of molecular data is provided that cannot be mapped to distinct neural circuits, thereby limiting the neurobiological impact. This is also a weakness of the FISH data, which is presented with only small regions illustrated without anatomical detail. In fact, some images are compared that appear to illustrate different MPOA structures, although it is impossible to be certain of this due to the lack of morphological landmarks. The analysis of how Esr1 orchestrates regulatory gene networks is impressive and interesting, but the fact that many of the observed transcriptional events occur in neural circuits that do not overlap confounds interpretation.

      (3) The locations of the AAV injections should be characterized because deleting Esr1 in multiple distinct parts of the MPOA will likely confound interpretation. This is especially problematic given the limited number of mice used for parts of the RNAscope analysis.

      (4) Although the focus of these experiments on adolescence is welcome, neither the Introduction nor the Discussion do a good job of placing these studies in the context of what is already known about brain maturation during puberty. It is true that this is very much a results-focused manuscript, but the scholarship can be improved. Simply stating that your results are consistent with previous reports places an undue burden on the reader to go figure out what is new.

      (5) Throughout the manuscript, the authors utilize obscure abbreviations, which often makes reading their text overly cumbersome. This is certainly justified in certain instances where complex names of analytical methods are used repeatedly, but the authors are encouraged to try and simply their use of non-standard abbreviations.

      Comments on revisions:

      The authors have considered issues raised during the initial review. Although there do not appear to be significant changes to analyses, figures or conclusions, the authors have added important revisions listing limitations in study design and methodology that impact interpretation.

    2. Reviewer #3 (Public review):

      The paper identifies effects of gonadal hormones within hormone-responsive GABAergic neurons in the MPOA. Although it is not surprising that hormones have effects on neurons that express hormone receptors, the current paper adds insights with higher cellular and spatial resolution than previous work and focuses on adolescence period. The paper also identifies a major role for Esr1-dependent mechanisms on behavior using an intersectional genetic strategy to ablate Esr1 in GABAergic or glutamatergic neurons in the MPOA.

      The authors have thoughtfully addressed the reviews, in particular by focusing quantitative analyses on Vgat+Esr1+ clusters and adding important technical and conceptual considerations in the limitations section.

      I have one remaining minor concern. I appreciate that the text now defines "transcriptional maturation". However, the term seems inappropriate when describing the "minimal transcriptional changes" in Vgat+hormone RLow clusters, which implies that they are transcriptionally immature. Do the authors mean to imply that transcriptional maturation is observed in Vgat+Esr1+ clusters but not Vgat+hormone RLow clusters? The authors also use the term "hormone-dependent transcriptional dynamics", which I think is more appropriate. For example, hormone-dependent transcriptional dynamics are observed in Vgat+Esr1+ clusters but not Vgat+hormone RLow clusters.

    1. Reviewer #1 (Public review):

      Summary:

      Overall, this is an interesting and well-written manuscript on a fascinating question in a "charismatic" model system.

      Strengths:

      1) The Introduction is concise, though it might be helpful to the non-specialist reader to learn a bit more about what is known about the social control of somatic growth across diverse species (including humans), which would help to make this work more generally interesting.

      (2) The experiment is well-designed.

      (3) The data collected are comprehensive.

      (4) The complementary analysis of both feeding and aggression/submission data with and without known social roles is a neat idea and compelling!

      Weaknesses:

      (1) I was surprised that the HPA/stress axis was not considered here at all. Wouldn't we expect that subordinates have increased stress axis activation, which in turn could inhibit their growth and aggressive behavior?

      (2) To what extent are growth, food intake, agonistic behavior, and/or gene expression patterns coordinated across P1 vs P2 pairs? The lack of such an analysis seems like a missed opportunity.

      (3) What was the rationale for using whole bodies for the transcriptome analysis? Given the hypotheses, the forebrain or hypothalamus and certain other organ systems (e.g., liver, gonads, skin, etc.) would have been obvious candidate tissues here. I realize that cost is always a consideration, but maybe a focus on the fore-/midbrain could have been prioritized.

      (4) Given the preceding point, why was a fold-change threshold used for assessing DEGs (supplementary Figure 3)? There is no biological justification to ever use a fold-change threshold, especially in bulk RNA-seq analysis. This is particularly true here, where whole bodies were used for RNA-seq analysis, which is a bit unusual. Relatively small cell populations (such as hypothalamic neurons that regulate growth or food intake) may show substantial gene expression variation across social types, yet will be masked by the masses of other cells in the whole body sample. However, gene expression may still vary significantly, albeit the fold-difference may be small. I therefore suggest a reanalysis that omits any fold-change threshold.

      (5) Why is the analysis of color (hue, saturation) buried in the supplementary materials? Based on the hypotheses that motivated the study, color seems just as relevant as food intake, growth, and agonistic behavior, so even if the results are negative, they should be presented in the main paper.

      (6) The Discussion is sometimes difficult to follow. The authors may want to consider including a conceptual graphic that integrates the different aspects of growth and satiety regulation, etc., into a work-in-progress model of sorts, which would also facilitate clearer hypotheses for future research.

    2. Reviewer #2 (Public review):

      In this manuscript, the authors test growth, behavior, and gene expression in pairs of clownfish as they establish social dominance hierarchies, examining patterns of gene expression in these pairs after dominance has been established. The authors show solid evidence that emerging dominant clownfish show increased growth, aggression, and food consumption compared to their submissive or solitary counterparts, eventually adopting distinct gene expression profiles.

      Major Comments:

      (1) The Introduction is comprehensive, but it could be condensed. Likewise, the discussion could be condensed. There is considerable redundancy between the methods, the results, and the legend in Figure 1. The authors should consolidate and remove the redundancy.

      (2) For Figure 3, the authors are showing PC2 and PC3; why is PC1 not shown? There is so much overlap between the three groups in PC2 vs PC3; it seems unlikely that researchers could conclusively identify any individual as belonging to a group based on the expression profile. The ovals shown do not capture all the points within each of the groups, and particularly the grey S oval seems misaligned with the datapoints shown.

      (3) The authors indicate that the 15 replicates exhibiting the greatest size difference between P1 and P2 were selected for gene profiling. Does this mean that each of the P1 and P2 were pairs with each other? Have the authors tried examining the gene expression patterns in a paired manner? E.g., for the pairs that showed the greatest size differences, do they also show the greatest differences in gene expression? Do the P1s show the most extreme differences from P2s that also show the most extreme P2 differences? Perhaps lines on Figure 3A connecting datapoints from the P1 and P2 pairs would be informative.

      (4) For the specific target pathways that are up- and downregulated in the different backgrounds, I recommend that the authors include boxplots (or heatmaps) showing the actual expression values for these targets. Figure 6 shows a heatmap for appetite-related genes, and it would be great to see a similar graph for the metabolism and glycolysis genes; it would also be informative to see similar graphs for hormonal and sexual maturation pathways as well.

      (5) Particularly given that there is a relatively small number of genes enriched in the different rank conditions, I did not understand the need to do the WGCNA module analysis. I thought that an analysis of GO terms across the dataset would have been more meaningful than the GO term analysis shown in Figure 4, which considers only genes assigned to the "brown WGCNA module". This should be simplified or clarified.

      (6) The authors say that they have identified coordinated changes in behaviors and the "underlying gene expression, leading to the emergence" of social roles. This is a little bit misleading, since the gene expression analysis occurred well after the behavioral and phenotypic differences emerged. Presumably, the hormonal and genetic shifts that actually caused the behavioral and phenotypic difference occurred during the weeks during which the experiment was underway, and earlier capture of the transcriptome would presumably reveal different patterns, and ones that would be considered more causative. The authors acknowledge this in 434-435, but it could be emphasized further.

      (7) The authors have measured a number of differences between the different dominance classes of fish. All these differences were measured relative to the other classes, but in my view, the Solitary group was the closest to a baseline control. So I'm not sure that it is fair to say that "P2 and S individuals showed consistent downregulation of these genes and pathways" (line 401). I encourage the authors to emphasize the differences in gene expression from the "perspective" of the P1 individuals compared to the baseline of P2 and S individuals. Line 474 says that "P2 fish showed significant upregulation" of a number of pathways. It should be very clear what that is compared to (compared to P1, presumably?)

      (8) Along the same lines, the authors say in line 514 that subordinates and solitaries strategically downregulate their growth. I'm not convinced that this is the case: I would consider this growth trajectory to be the default and the baseline. I would interpret that under certain social conditions, a P1 dominant pattern of growth, behavior, and gene expression is allowed to emerge.

    3. Reviewer #3 (Public review):

      Summary:

      The authors tested the hypothesis that interactions among size- and age-matched rivals will lead to the emergence of social roles, accompanied by divergence in four aspects of individual phenotypes: growth, feeding behavior, fighting behaviors, and gene expression in clownfish.

      Strengths:

      The data on growth, feeding rate, and fighting behaviors support the authors' claims.

      Weaknesses:

      Gene analysis conducted in this study is not sufficient to clarify how the relevant genes actually regulate growth and behavior.

      The information obtained from whole-body gene expression analysis is very limited. Various gene expression is associated with the regulation of fighting behaviors, food intake, growth, and metabolism, and these genes are regulated differently across tissues, even within a single individual. Gene expression analysis should be performed separately for each tissue.

      Clownfish undergo sex change depending on social status and body size, as the authors mention in the manuscript. Numerous gene expressions are affected by sex change. It is unclear how this issue was addressed.

    1. Reviewer #1 (Public review):

      Summary:

      In this study, the authors examine how a developmentally regulated cis-regulatory element controls SOX2 expression during neural differentiation of human stem cells. The results suggest that this highly conserved long-range enhancer mediates neural-specific SOX2 regulation and offer insight into the role of promoter-enhancer contacts in this process. Although the findings are interesting, several limitations need to be addressed.

      Strengths:

      A central question in developmental biology is how genes are regulated in a context-dependent manner. SOX2, a major pluripotency factor, is expressed in diverse tissues during development, and therefore understanding the mechanisms that control its spatiotemporal expression is critical. This study addresses this important question by examining the functional relevance of a neural-specific, developmentally regulated SOX2 enhancer and its associated promoter-enhancer contacts in driving gene expression during human neural development. Using multiple model systems and techniques, the authors test the requirement of this enhancer by analyzing SOX2 expression in mutant lines, providing evidence for its role in this process.

      Weaknesses:

      A key limitation of the study is the absence of data from homozygous SOX2 enhancer deletion, which leaves the analysis incomplete and tempers the conclusions that can be drawn. Furthermore, the suitability of teratomas as a model system is questionable, given their limited capacity to recapitulate the spatial patterning, regional specification, and organized developmental processes characteristic of the human forebrain. Finally, the manuscript remains largely descriptive with little mechanistic insight.

    2. Reviewer #2 (Public review):

      Summary:

      The authors use a combination of genomics, genome conformation assays, and CRISPR-mediated deletion to study the transcriptional regulation of the SOX2 gene in human neural stem cells (hNSCs).

      Strengths:

      The authors show that two distal elements, located ~550kb downstream of the SOX2 gene, are important for SOX2 transcription in hNSC. They investigate both the deletion of these elements in established hNSCs and in hNSCs generated by differentiation of human pluripotent stem cells, suggesting these elements are important in both the establishment and maintenance of SOX2 expression in hNSCs.

      Weaknesses:

      Homologous elements have been studied in the mouse genome and have conserved function in mouse NSCs, yet these findings are not mentioned. Inclusion of biological replicates for the scRNA-seq and replicate CRISPR-deleted clones would strengthen the study.

    1. Reviewer #1 (Public review):

      Summary:

      In this work, the authors investigate the mechanisms of low-frequency synaptic depression at cerebellar parallel fiber to interneuron synapses using unitary recordings that allow direct quantification of synaptic vesicle release. They show that sparse stimulation can induce robust synaptic depression even in the absence of substantial vesicle consumption, and that this depressed state is rapidly reversed when stimulation frequency is increased. To account for these observations, the authors propose a model in which low-frequency depression reflects a redistribution of vesicles within the readily releasable pool, in particular, a reduction in docking site occupancy due to vesicle undocking.

      Strengths:

      I found the experimental work to be of high quality throughout. The use of simple synapse recordings to count individual vesicle release events is particularly powerful in this context and allows questions to be addressed that are difficult to approach with more conventional approaches. The demonstration that low-frequency depression can occur independently of prior vesicle release, together with the rapid recovery observed during high-frequency stimulation, places strong constraints on possible underlying mechanisms and represents a clear strength of the study.

      The modeling framework is clearly laid out and helps organize a broad set of observations across stimulation frequencies. Several of the experimental tests appear well-motivated by the model, including the recovery train experiments, the analysis of failures, and the use of doublet stimulation. Taken together, the data provide a coherent phenomenological description of low-frequency depression and its relationship to vesicle availability within the readily releasable pool.

      Weaknesses:

      While the experimental results are strong, the manuscript would benefit from rebalancing the strength of the mechanistic conclusions drawn from the modeling in light of its limitations. The framework is clearly useful and provides a coherent interpretation of the data, but it is not uniquely constrained by the experimental observations, and alternative models or interpretations could plausibly account for the findings. The use of different model regimes concatenated across time, with substantially different parameter values, highlights the abstract nature of the approach. For these reasons, the model seems best presented as one plausible explanatory framework rather than a definitive biological mechanism. Clarifying the distinction between data-driven observations and model-based inferences would help readers assess which conclusions are strongly supported and which remain more speculative.

      The interpretation of the Ca2+-related experiments would benefit from more cautious wording. The absence of detectable changes in presynaptic Ca2+ signals does not exclude more localized or subtle Ca2+-dependent mechanisms, and conclusions regarding Ca2+ independence should therefore be framed accordingly. In addition, while low-frequency depression is still observed at reduced extracellular Ca2+, these experiments appear less diagnostic of the specific model-derived mechanism emphasized elsewhere in the manuscript - namely, a selective reduction in docking-site occupancy - and should be discussed with appropriate qualification in the text.

      Major points:

      (1) Clarify and qualify mechanistic claims derived from the model.

      Throughout the manuscript, changes in model parameters are at times described as if they directly reflected underlying physiological mechanisms. As a result, the conceptual distinction between experimentally observed phenomena, model-derived variables, and biological interpretation is not always clear. Several conclusions in the Results and Discussion are phrased as mechanistic statements, although they rest on assumptions intrinsic to the modeling framework. The authors should systematically review the text and explicitly distinguish between (i) experimentally observed changes in synaptic responses and (ii) inferences about vesicle docking states or transitions within the model.

      In particular, statements implying that vesicle undocking is the mechanism underlying low-frequency depression should be rephrased to reflect that this is an interpretation within the proposed framework rather than a uniquely demonstrated biological process. For example, statements such as "Low-frequency depression is caused by synaptic vesicle undocking" should be replaced with formulations such as "Within the framework of our model, low-frequency depression is accounted for by a redistribution of synaptic vesicles away from docking sites" or "Our results are consistent with a model in which changes in vesicle docking-state occupancy contribute to low-frequency depression."

      A particularly problematic example is the statement that "these experiments further confirm that LFD only involves a decrease in δ, without accompanying changes in ρ or IP size." Here, an experimentally defined phenomenon (LFD) is directly equated with changes in model-derived variables. Such statements should be revised to make clear that δ, ρ, and IP size are inferred quantities within the model, and that the experimental data are interpreted through this framework rather than directly confirming changes in these parameters. Similarly, over-generalizing statements such as "Undocking therefore represents the key mechanism controlling short-term depression across stimulation frequencies" should be softened to reflect that this conclusion emerges from the model rather than from direct experimental evidence.

      (2) Address the biological interpretation of time-dependent model regimes.

      The model relies on distinct parameter regimes applied at different time points, with some transitions effectively suppressed in certain regimes. While this approach captures the data well, its biological interpretation remains unclear. The authors should either (i) expand the discussion to outline plausible biological processes that could give rise to such regime changes (for example, calcium-dependent modulation of transition rates or activity-dependent changes in vesicle state stability), or (ii) more explicitly frame this aspect of the model as a descriptive abstraction rather than a mechanistic proposal. This further underscores the need to clearly separate the descriptive role of the model from claims about underlying biological mechanisms.

      (3) Reframe conclusions drawn from calcium-related experiments.

      The calcium imaging data demonstrate no detectable changes in the measured presynaptic calcium signals under the tested conditions, but they do not rule out that calcium signals contribute in ways undetectable by the assay. Conclusions should therefore be revised to reflect this limitation, avoiding statements that exclude a role for calcium-dependent mechanisms. Wording such as "we did not detect evidence for..." would be more appropriate than conclusions implying the absence of an effect.

      Similarly, while low-frequency depression is still observed at reduced extracellular calcium (1.5 mM Ca²⁺), the specific mechanistic signature emphasized elsewhere in the manuscript - namely a selectively reduced first response during a high-frequency recovery train - is no longer apparent. These experiments should therefore be discussed as consistent with the proposed framework, but not as providing independent support for a selective reduction in docking-site occupancy. Explicitly acknowledging this limitation would improve clarity and avoid over-interpreting these data.

      (4) Soften interpretations based on non-significant comparisons.

      In several places, comparisons that do not reach statistical significance are used to argue for equivalence between conditions (for example, comparisons involving failure versus non-failure trials or different LFD conditions). These conclusions should be revised to emphasize the limits of statistical power and framed as a lack of evidence for a difference rather than evidence of independence.

    2. Reviewer #2 (Public review):

      Summary:

      Silva and co-workers exploit their previously established methods of analyzing release events at single parallel fiber to molecular layer interneuron synapses. They observed synaptic depression at low transmission frequencies (< 5 Hz), which rapidly recovers during high-frequency transmission. Analysis of the time course of low-frequency depression revealed an initial rapid and a slow linearly increasing time course. Strikingly, the initial depression occurred even in the absence of preceding release, arguing against vesicle depletion as the underlying mechanism.

      Strengths:

      The main strength of the study is the careful demonstration of an interesting synaptic phenomenon challenging the classical vesicle-centered interpretation of synaptic depression.

      Weaknesses:

      No major weaknesses were identified by this reviewer.

      The finding of release-independent synaptic depression is important and would have widespread implications. Therefore, some more analyses to increase the confidence in these findings could be performed.

      My concern is whether rundown could explain the findings. If the rate of failures in s1 increases and at the same time the amplitude decreases during the experiments, an apparent depression in s2 could arise. The Supplementary Figure 5A addresses run-down, but the figure is not easy to understand, and, as far as I understood, it does not address the question of whether the release-independent depression could be caused by a rundown. To address this, the analysis of Figure 5 could be repeated by investigating the failure rate and amplitude separately or by analyzing the 1st and 2nd half of the recordings separately.

    3. Reviewer #3 (Public review):

      Summary:

      The manuscript builds on the observation that, at some synapses, low-frequency stimulation causes synaptic depression, which can be reversed by subsequent high-frequency stimulation. Such low-frequency depression (LFD) cannot be easily explained by the depletion of a single vesicle pool. Here, Silva and colleagues propose a model of activity-dependent vesicle trafficking to explain LFD at synapses between cerebellar granule cells and molecular layer interneurons.

      Strengths:

      Overall, LFD is interesting and worthy of examination, and the authors provide new experimental results that are of the high quality expected from this group.

      Weaknesses:

      The study proposes a novel model of vesicle trafficking that is not explained by known biological mechanisms, and the manuscript does not adequately compare or discuss alternative models.

      I have several concerns about how the authors interpret the data. First, the manuscript's primary conceptual advance is the idea that LFD involves vesicle undocking, rather than depletion. However, most experiments were performed under conditions that promote vesicle depletion (3 mM extracellular Ca2+). When experiments were repeated in physiological Ca2+, there appeared to be little or no LFD (stats are not provided). Second, the RS/DS/DU/undocking model, though not outside the realm of possibility, is not readily explained by known mechanisms and is only loosely supported by experimental findings. Third, when simulating LFD, the authors do not compare alternative models and use inappropriate language to imply that a model fit represents the truth (e.g., "the finding of identical experimental and simulated values confirms that the undocking mechanism accounts for LFD"). Finally, the model is presented in an overly complicated manner. The sheer amount of terms and nomenclature makes the manuscript confusing and difficult to read. Overall, the manuscript would benefit from added experiments and more statistics, a better justification and evaluation of the model, and more nuanced language.

      Major concerns:

      (1) Most experiments were performed under conditions that exacerbate depletion

      In order to attribute LFD to vesicle undocking rather than depletion, it is important to show LFD under conditions where depletion is minimal. As mentioned above, the authors only report significant LFD in elevated extracellular Ca2+. In a small number of experiments performed in more physiological Ca2+ (1.5 mM), there is no depression after a single stimulus, and it is not clear that there was statistically significant depression during a low-frequency train. Several studies cited in support of LFD share this problem:

      • Abrahamsson et al., (2007) recorded from Schaffer collaterals in 4 mM Ca, 3-4X physiological Ca2+.

      • Doussau et al., (2010) recorded from aplysia synapses in 3X Ca compared to seawater.

      • Rudolph et al., (2011) is cited as an example of LFD. However, this study performed experiments at high release probability cerebellar climbing fibers, and reported depression that increased monotonically with

      stimulation frequency, so it does not resemble the phenomenon studied in this paper. Lin et al., (2022) also largely describe monotonic depression at the calyx.

      The authors note that their results differ from those of Atluri and Regehr, but do not mention that a possible reason for the difference is the increased release probability in their experiments.

      The authors should provide statistics for the data obtained in 1.5 mM Ca, and discuss why LFD is increased in conditions that also elevate vesicle release probability.

      (2) Lack of biological mechanisms supporting the model

      The model is presented without compelling biological support. The evidence in support of vesicle undocking comes from experiments by the Watanabe lab, which showed fewer-than-expected docked vesicles under EM when cultured synapses were stimulated immediately prior to high-pressure freezing. Kusick et al were careful to note that these vesicles may have been lost to fusion.

      The putative undocking Kusick describes is immediate (< 5 ms after stimulation), and was not shown to be Ca2+ sensitive. This manuscript describes "calcium-dependent undocking" that proceeds from 10 ms - 200 ms. Multiple studies from the Watanabe lab show that a single stimulus lowers the number of docked vesicles, and subsequently, there is a transient redocking of vesicles that can be blocked by EGTA or Syt7 knockout.

      I also question the rationale for the authors' model that 2 vesicles are coupled in series to a single release site. Previous papers from this lab cited EM studies from frog and neuromuscular that showed filamentous connections between vesicles (do these synapses show LFD?). Here, the authors primarily cite their previous models to support their arguments. I encourage them to continue searching for ultrastructural evidence for 2-vesicle-docking-units and to cite such studies.

      (3) Comparison to other vesicle models

      The authors use overly assertive language to suggest that the model proves a mechanism. "Altogether, these results indicate that the slow phase of LFD ... reflects a δ decrease without significant changes in pr, in ρ or in IP size". Simulating data does not conclusively "indicate" the underlying mechanism, but the authors could state their data can be "explained by a model where..".

      However, LFD does not require activity-dependent undocking. Instead, the phenomenon has been explained by high-release probability, paired with an activity-dependent increase in either docking or release probability (Chiu and Carter, 2024; Doussau et al., 2017). Does the new model do a better job of replicating some facet of the data? If multiple models can explain the same data, how can we determine which model is correct? The "Alternative Presynaptic Depression Mechanisms" should be expanded to discuss these issues.

    1. Reviewer #1 (Public review):

      Sensory hair cells of the inner ear convert mechanical sound vibrations into electrical signals through mechano-electrical transduction (MET), a process critically dependent on the specialized organization and lipid composition of their plasma membrane. Although the protein components of the MET complex are relatively well characterized, the role of the lipid environment remains poorly understood and often overlooked. Recent discoveries that core MET proteins TMC1 and TMC2 function as lipid scramblases, disrupting membrane lipid asymmetry, expose a significant gap in our understanding of how lipid homeostasis is regulated in hair cells and how membrane dynamics influence MET function.

      In this study, the authors address this gap by identifying the P4-ATPase ATP8B1 and its chaperone TMEM30B as essential regulators of membrane lipid asymmetry in outer hair cells. They also generated HA-tagged knock-in mice to precisely localize the P4-ATPase ATP8B1 and its chaperone TMEM30B within outer hair cells, demonstrating their enrichment in stereocilia, and convincingly demonstrate that loss of these proteins causes phosphatidylserine externalization, hair cell degeneration, and hearing loss in mouse models, phenocopying defects observed in TMC1 mutant mice with constitutive scrambling activity. While these findings establish lipid flippase pathways as critical for hair cell survival and auditory function, they also raise important questions about the precise mechanisms linking lipid asymmetry disruption to MET dysfunction and hair cell pathology.

      Overall, the data convincingly support the conclusion that ATP8B1-TMEM30B flippase activity is required to maintain stereocilia lipid asymmetry and auditory function. The study substantially advances understanding of how lipid homeostasis intersects with MET. However, several points require clarification to ensure that localization claims and mechanistic interpretations are fully supported by the presented data.

      Revisions considered essential by this reviewer are:

      (1) Figure 1D.<br /> The authors should clarify how the qPCR data were normalized and specify the reference (housekeeping) genes used. This information is necessary to evaluate the robustness and comparability of the gene expression data.

      (2) Figure 1F.<br /> The lack of F-actin staining at the hair cell base raises the possibility that the permeabilization conditions may have limited antibody access to certain membrane regions. This is especially important given that the authors used a gentle permeabilization agent such as saponin to preserve membrane integrity. Because the authors conclude that ATP8B1 and TMEM30B are localized "almost exclusively to OHC bundles and the apical membrane, with minimal staining in the remaining plasma membrane," (line 128). Including co-labeling with a plasma membrane marker or more comprehensive F-actin visualization of lateral and basal regions would help ensure that the restricted localization is biological rather than technical. In the absence of such controls, the localization claim may be somewhat overstated and should be tempered accordingly.

      (3) Figure 7B.<br /> Although quantification of ATP8B1-HA intensity at the bundle appears similar between WT and Cib2 KO samples, the representative image suggests that some bundles lack detectable labeling. To better capture phenotype variability, it would be helpful to include an additional quantification showing the fraction or number of bundles with detectable ATP8B1-HA signal in Cib2 KO mice.

      (4) Lines 346-349.<br /> The manuscript suggests that IHCs lack stereocilia-enriched P4-ATPases. However, this conclusion is not directly supported by the presented data. The authors should either provide supporting localization or expression data for other P4-ATPases or soften the statement to indicate that no stereocilia-enriched P4-ATPases were detected under the conditions examined.

      Recommendations:

      (5) The authors convincingly demonstrate that TMEM30B loss results in ATP8B1 mislocalization. While not essential to the central conclusions, examining TMEM30B localization in ATP8B1 KO hair cells would clarify whether this interdependence is reciprocal, as described for other P4-ATPase-CDC50 complexes.

      (6) Lines 359-374.<br /> The discussion of Annexin V labeling is careful and balanced. This paragraph would benefit from referencing other studies that showed minimal Annexin V labeling in healthy P6 organ of Corti, reinforcing that robust PS externalization in the present study is pathological rather than developmental.

      (7) Lines 392-399.<br /> The proposed feedback model linking MET activity and ATP8B1-TMEM30B localization is compelling. The discussion could be strengthened by noting that in TMC1/2 double knockout hair cells, PS externalization is not observed, consistent with the idea that flippase activity becomes critical specifically when scrambling occurs. The mislocalization observed in Cib2 KO hair cells further supports the coupling between TMC-mediated scrambling and flippase-mediated membrane restoration.

    2. Reviewer #2 (Public review):

      Summary:

      Prior work identified TMEM30B (knockout mice) as well as ATP8B1 (human genetics and mouse model), ATP8A2 (knockout mice), and ATP811A (human genetics) as relevant for hearing. The authors also reasoned that, given the recent discovery of TMC1 and TMC2's dual function as mechanotransduction channels of the inner ear and as lipid scramblases, a counterpart flippase should be in the sensory hair-cell stereocilia bundle where mechanotransduction happens. They use CRISPR/CAS to modify the endogenous mouse genes and add an HA tag at the N-terminus of the ATP8B1, ATP8A1, ATP8A2, and ATP11A proteins. Their experiments with these mice unambiguously localized ATP8B1 at the base of outer hair cell stereocilia bundles. Knockout of ATP8B1 results in loss of outer hair cells, deficient auditory function (ABR), and degeneration of outer hair cell stereocilia bundles. Similarly, hair cells from genetically modified mice with endogenous HA-tagged TMEM30B proteins show localization of this protein to outer hair cell stereocilia bundles. TMEM30B knock-out mice phenocopy the ATP8B1 knock-out model. Interestingly, the authors show that annexing V staining precedes hair cell loss in ATP8B1 and TMEM30B knockout mice and that proper localization of these proteins is lost in mice that lack CIB2, a protein essential for hair cell mechanotransduction.

      Strengths:

      (1) Use of knock-in HA-tagged proteins, rather than antibody staining, to unambiguously localize ATP8B1 and TMEM30B.

      (2) Systematic characterization of auditory function (ABR), hair cell loss, and hair-cell stereocilia bundle morphology.

      (3) Advances our understanding of the role played by lipid homeostasis in auditory function.

      (4) Reports on mouse models that will be helpful to further understand the mechanistic role played by ATP8B1 and TMEM30B in normal hearing and hereditary deafness.

      Weaknesses:

      (1) Are the HA tags causing any functional issues? Function and localization of tagged proteins can sometimes be compromised. It would be good to know, for each knock-in model (TMEM30B, ATP8B1, ATP8A1, ATP8A2, and ATP11A ), whether the HA-tagged protein is causing any issues with the mice and particularly with hearing (ABRs). Are these mice normal? Can they hear? These data are missing.

      (2) Following on the point above, is it possible that ATP8B1-HA is well localized, but localization for the other three flippases (ATP8A1-HA, ATP8A2-HA, and ATP11A-HA) is compromised by the tag? Is this potential mislocalization causing any functional phenotypes? (ABRs of point 1). I find it surprising that there are flippases only in outer hair cells, and only formed by ATP8B1. A possible explanation is that the tag is interfering with trafficking. If so, there should be a phenotype (ABRs), although this might be masked by redundancy among these flippases or caused by systemic issues (admittedly difficult to sort out). Given that this manuscript will likely become foundational, and that there is evidence that at least two of the other flippases are involved in hearing loss, it would be good to provide more information about the mice and HA-tagged proteins in the other knock-ins (ATP8A1-HA, ATP8A2-HA, and ATP11A-HA). Depending on the data available for the knock-ins, the authors may want to discuss these scenarios and soften the statement indicating that inner-hair cells may lack flippase activity altogether.

      (3) Expression of ATP8B1 at P0 (Figure 1D), when there should not be protein in outer hair cells yet, seems high. Does this mean that other cells in the cochlea also express ATP8B1? Is this a concern?

      (4) Fluorescence scales in Figure 6 B and D and Figure 7 B and D are very different. So are the values for WT. One would expect that the WT would be similar in all cases (at least within the same compartments), given that the methods section indicates that "All images were collected using identical acquisition parameters, including zoom and laser power, across genotypes". If WT shows such variability, how can we compare?

    1. Reviewer #1 (Public review):

      Summary:

      This paper examines plasticity in early cortical (V1-V3) areas in an impressively large number of rod monochromats (individuals with achromatopia). The paper examines three things:

      (1) Cortical thickness. It is now well established that early complete blindness leads to increases in cortical thickness. This paper shows increased thickness confined to the foveal projection zone within achromats. This paper replicates work by Molz (2022) and Lowndes (2021), but the detailed mapping of cortical thickness as a function of eccentricity and the inclusion of higher retinotopic areas is particularly elegant.

      (2) Failure to show largescale reorganization of early visual areas using retinotopic mapping. This is a replication of a very recent study of Molz et al. but I believe, given anatomical variability, the larger n in this study, and how susceptible pRF findings are to small changes in procedure, this replication is also of interest.

      (3) Connective field modelling, examining the connections between V3-V1. The paper finds changes in the pattern of connections, and smaller connective fields in individuals with achromatopsia than normally sighted controls, and suggests that these reflect compensatory plasticity, with V3 compensating for the lower resolution V1 signal in individuals with achromatopsia.

      This is a carefully done study (both in terms of data collection and analysis) that is an impressive amount of work.

      *Effects of eye-movements

      The authors have carried out the eye-movement analyses I asked of them. Unfortunately, in 4 individuals they couldn't calibrate the eyetracker (it's impressive they managed in 10). I think this means that 4 of 13 (since a different participant was excluded from head motion) individuals weren't included in correlation analyses. Limiting the correlation analysis to individuals with better fixation has obvious issues. I'd recommend redoing (or additionally including) stats using non-parametric measures while classifying these 4 as having fixation instability of 3 (i.e. greater instability than the participant with the worst fixation who was successfully calibrated).

      *Interpreting pRFs

      The paper would be strengthened by a little more explicit clarity about what pRFs represent and how that affects their interpretation of their findings as plasticity vs. non-plasticity (I know the authors are aware of this, but I think it would be helpful for readers who are less experienced in pRFs). In the introduction it would be helpful to point out that pRFs represent the collective response of a large population of neurons, and as a result pRF estimates can vary depending on which population of neurons that stimulus drives.

      For example, imagine for the sake of argument that rods only project to V1 neurons with larger receptive fields. If one measured pRFs in a control observer under phototopic vs. scotopic conditions one would see smaller pRFs in the photopic conditions. This wouldn't represent 'plasticity' - it would represent the fact that the firing neurons contributing to the pRF signal are a slightly different population because of a change in the stimulus content. This is of course exactly what you see in 2C. And indeed, the authors make this identical point ". In the non-selective condition, the smaller pRFs in controls are in line with the higher spatial resolution of the<br /> cone system, which is not active in the achromat group." But this point would be clearer if more of the conceptual underpinnings were made explicit in the introduction (or at this point in the paper).

      Shifts in which population of neurons drive your pRFs can explain main of the more puzzling results in the paper without detracting from your main conclusions. For example, in 2D, I don't think it's differences in S/N driving your results (pRFs are at least theoretically meant to be robust to S/N). If smaller RFs 'drop out' under low luminance and these smaller RFs also tend to be more central, then one would expect the control results of 1D. And I think a similar argument might even be made for the smaller difference in the rod monochromats.

      It would be possible to make the point of Figure 4B more simply if Figure 4B was replaced by additional Panels in Figure 2 simply showing V3 pRF sizes/eccentricity distributions. That would make the point that you don't see the same expansion in pRF sizes in V3 in a way that is just as clear, and is closer to the data.

      *Interpreting cRFs

      Similarly, I think the paper would be improved with more clarity about the underlying signal in CF modeling. Once again, I appreciate that the authors are familiar with this, but it will help the reader in interpretation. (And I do believe thinking carefully about this may alter your interpretations). CF receptive fields 'find' the region in V1 that best predict the V3 signal in a given voxel. In resting state this likely represents a combination of:

      (1) visually driven signal - correlations that may or may not reflect connectivity but represent the fact that regions that represent the same region of visual space will be active at the same time.

      (2) global bilaterally symmetrical signal consisting of enhanced correlations between iso-eccentric regions (Raemaekers et al., 2014), which may arise from vasculature that symmetrically stems from the posterior cerebral artery (Tong et al., 2013; Tong and Frederick, 2014).

      (3) intrinsic neural fluctuations that are more strongly correlated between connected neurons. These are likely quite weak compared to the other contributions.

      I think if you ignore 2, (which is not likely to differ between rod mono and controls) and model 1 and 3, you might well see shifts in CFs towards the boundary of the scotoma - essentially the CF's location will be biased towards the region of V1 that has stronger correlations - which = the region which has a visual signal.

      I do find convincing the argument that you don't see the same shift in controls in the rod-selective condition. So I think the results of 4A are fine. But a little more clarity about 'what's under the hood' in CF modeling would be nice.

      *Interpreting the relationship between pRFs and cRFs

      So there's something here that confuses me. We are all agreed that V3 pRF sizes are similar across RM and control. V1 pRFs are larger in RM. It feels intuitive that smaller CFs would compensate but I can't make it make sense to myself when I think it through. Each pRF represents a combination of receptive field location scatter and bandwidth. You want to argue that eccentricity mapping looks pretty normal, so there's no reason to think increased rf scatter, and I can believe that (though I do think this assumption should be discussed explictly).

      So far I think we agree.

      But let's think about what drives a CF during visual stimulation ... Specifically lets think about 'the pRF of the CF' (the region of visual space represented by the cluster of voxels in the CF). If pRFs for individual voxels in V1 are big, then the pRF for the CF is also going to be large. But we know that pRFs for V3 are normal size. So, the V3 CF will 'find' a smaller number of voxels in V1, in order to try to find the 'correct sized' CF pRF. Note that this explanation is very similar to yours. But doesn't require ANY 'intrinsic' connectivity. It's really just assuming the whole thing is driven by the visual signal and the CF size is determined by the ratio of the pRF sizes in V3 vs. V1.

      One possible solution would be to regress out the visual stimulus and redo this analysis based on the residuals.

    2. Reviewer #3 (Public review):

      Summary:

      This study addresses a long-standing question in visual neuroscience concerning how the human visual system balances stability and plasticity when sensory input is altered from early in life. Using achromatopsia as a model of lifelong cone deprivation, the authors examine whether early visual cortex undergoes retinotopic reorganization to compensate for the absence of foveal cone input, or whether canonical retinotopic organization is largely preserved. By combining fMRI-based population receptive field (pRF) mapping with connective field (CF) modelling, the authors characterize changes across multiple hierarchical stages of visual processing.

      The main findings indicate that primary visual cortex (V1) shows no systematic remapping of the foveal projection zone, whereas extrastriate cortex, particularly V3, exhibits altered patterns of sampling from V1. The authors interpret these results as evidence for hierarchical adaptation, whereby downstream readout mechanisms adjust to make more efficient use of degraded rod-mediated input while preserving early-stage retinotopic organization.

      Strengths:

      A major strength of this work is the use of silent substitution to generate rod-selective stimuli. This approach enables a principled comparison between achromats and typically sighted controls by isolating rod-driven responses in both groups. In doing so, the study overcomes a key limitation of prior work, where differences in cortical organization could often be confounded by differences in photoreceptor class rather than reflecting neural reorganization per se. The inclusion of a rod-driven baseline in controls provides an important reference for distinguishing long-term adaptation from transient or stimulus-driven effects.

      Another notable strength is the integration of CF modelling alongside conventional pRF mapping. While pRF analyses alone suggest enlarged receptive fields in V1, consistent with reduced spatial resolution, the CF analysis offers a more mechanistic account by revealing changes in how V3 samples information from the V1 surface. This multi-level modelling approach moves beyond descriptive accounts of cortical map structure and provides a framework for interpreting how downstream areas may adjust their integration strategies under conditions of altered input.

      Weaknesses:

      Although the study is methodologically strong, the central claims regarding stability and compensatory plasticity require clearer conceptual framing and stronger empirical support. Stability is primarily defined as the absence of large-scale retinotopic remapping in V1, yet the presence of significantly enlarged V1 pRFs indicates substantial tuning-level plasticity at the input stage; distinguishing topographic stability from functional reorganization would therefore strengthen the interpretation. Moreover, the proposed compensatory mechanism raises a signal-processing concern, as reduced downstream sampling (smaller CFs in V3) cannot restore spatial information lost due to coarse upstream representations, and may instead limit integration. The mechanistic link between altered CF properties and normalization of extrastriate pRFs is not directly tested, as group differences are not shown to covary across individuals or visual field locations. Finally, the interpretation of these changes as compensatory implies functional benefit, yet no behavioral or performance measures are provided to establish that the observed reorganization preserves or enhances visual function, leaving open whether these effects reflect adaptive optimization or passive downstream consequences of altered input.

    1. Reviewer #1 (Public review):

      Summary:

      The paper presents a three-layered hierarchical model for simulating Drosophila larva locomotion, navigation, and learning. The model consists of a basic locomotory layer that generates crawling and turning using a coupled-oscillator framework, incorporating intermittency in movement through alternating runs and pauses. The intermediate layer enables navigation by allowing larvae to actively sense and respond to odor gradients, facilitating chemotaxis. The adaptive learning layer integrates a spiking neural network model of the Mushroom Body, simulating associative learning where larvae modify their behavior based on past experiences. The model is validated through simulations of free exploration, chemotaxis, and odor preference learning, demonstrating close agreement with empirical behavioral data. This modular framework provides a valuable advance for modeling of larva behavior.

      Strengths:

      Every modeling paper requires certain assumptions and abstractions. The main strength of this paper lies in its modular and hierarchical approach to modeling behavior, making connections to influential theories of motor control in the brain. The authors also provide a convincing discussion of the experimental evidence supporting their layered behavioral architecture. This abstraction is valuable, offering researchers a useful conceptual framework and marking a significant step forward in the field. Connections to empirical larval movement are another major strength.

      Weaknesses:

      While the model represents a conceptual advance in the field, some of its assumptions and choices fall behind state-of-the-art approaches. One limitation is the paper's simplified representation of larval neuromechanics, in which the body is reduced to a two-segment structure with basic neural control. Another limitation is the absence of an explicit neuromuscular control system, which would better capture the role of segmental central pattern generators (CPGs) and neuronal circuits in regulating peristalsis and turning in Drosophila larvae. Many detailed neuromechanical models, as cited by the authors, have already been published. These abstractions overlook valuable experimental studies that detail segmental dynamics during crawling and the larval connectome.

      The strength of the model could also be its weakness. The model follows a subsumption architecture, where low-level behaviors operate autonomously while higher layers modulate them. However, this approach may underestimate the complexity of real neural circuits, which likely exhibit more intricate feedback mechanisms between sensory input and motor execution.

    2. Reviewer #2 (Public review):

      The paper proposes a hierarchically layer approach to larval locomotion, chemotaxis and learning. The model consists of a basic locomotor layer with two coupled oscillators, one for crawls and one for turns. The intermediate layer modulates the frequency and amplitude of tunings to enables chemotaxis. The higher layer, integrates a spiking neural network model of the Mushroom Body to modify the door valence in response to experience as during learning.

      The model is compared to experimental data with a good degree of agreement. This modular framework provides a valuable advance for modeling larva behavior.

      Strengths:

      A novel multilayer level model that reflects current thinking of the neuronal organisation of motor control. The model is very useful to investigate the neuronal architecture of central pattern generators<br /> and higher order motor control circuits that could be linked to larval connectome data.

      Weaknesses:

      All the limitations of the model are discussed and therefore the paper perfectly fits its purpose.