10,000 Matching Annotations
  1. Last 7 days
    1. Reviewer #1 (Public review):

      Summary:

      Mazer & Yovel 2025 dissect the inverse problem of how echolocators in groups manage to navigate their surroundings despite intense jamming using computational simulations.

      The authors show that despite the 'noisy' sensory environments that echolocating groups present, agents can still access some amount of echo-related information and use it to navigate their local environment. It is known that echolocating bats have strong small and large-scale spatial memory that plays an important role for individuals. The results from this paper also point to the potential importance of an even lower-level, short-term role of memory in the form of echo 'integration' across multiple calls, despite the unpredictability of echo detection in groups. The paper generates a useful basis to think about the mechanisms in echolocating groups for experimental investigations too.

      Strengths:

      * The paper builds on biologically well-motivated and parametrised 2D acoustics and sensory simulation setup to investigate the various key parameters of interest

      * The 'null-model' of echolocators not being able to tell apart objects & conspecifics while echolocating still shows agents succesfully emerge from groups - even though the probability of emergence drops severely in comparison to cognitively more 'capable' agents. This is nonetheless an important result showing the direction-of-arrival of a sound itself is the 'minimum' set of ingredients needed for echolocators navigating their environment.

      * The results generate an important basis in unraveling how agents may navigate in sensorially noisy environments with a lot of irrelevant and very few relevant cues.

      * The 2D simulation framework is simple and computationally tractable enough to perform multiple runs to investigate many variables - while also remaining true to the aim of the investigation.

      Weaknesses:

      * Authors have not yet provided convincing justification for the use of different echolocation phases during emergence and in cave behaviour. In the previous modelling paper cited for the details - here the bat-agents are performing a foraging task, and so the switch in echolocation phases is understandable. While flying with conspecifics, the lab's previous paper has shown what they call a 'clutter response' - but this is not necessarily the same as going into a 'buzz'-type call behaviour. As pointed out by another reviewer - the results of the simulations may hinge on the fact that bats are showing this echolocation phase-switching, and thus improving their echo-detection. This is not necessarily a major flaw - but something for readers to consider in light of the sparse experimental evidence at hand currently.

      * The decision to model direction-of-arrival with such high angular resolution (1-2 degrees) is not entirely justifiable - and the authors may wish to do simulation runs with lower angular resolution. Past experimental paradigms haven't really separated out target-strength as a confounding factor for angular resolution (e.g. see the cited Simmons et al. 1983 paper). Moreover, to this reviewer's reading of the cited paper - it is not entirely clear how this experiment provides source-data to support the DoA-SNR parametrisation in this manuscript. The cited paper has two array-configurations, both of which are measured to have similar received levels upon ensonification. A relationship between angular resolution and signal-to-noise ratio is understandable perhaps - and one can formulate such a relationship, but here the reviewer asks that the origin/justification be made clear. On an independent line, also see the recent contrasting results of Geberl, Kugler, Wiegrebe 2019 (Curr. Biol.) - who suggest even poorer angular resolution in echolocation.

    2. Reviewer #2 (Public review):

      This manuscript describes a detailed model for bats flying together through a fixed geometry. The model considers elements which are faithful to both bat biosonar production and reception and the acoustics governing how sound moves in air and interacts with obstacles. The model also incorporates behavioral patterns observed in bats, like one-dimensional feature following and temporal integration of cognitive maps. From a simulation study of the model and comparison of the results with the literature, the authors gain insight into how often bats may experience destructive interference of their acoustic signals and those of their peers, and how much such interference may actually negatively effect the groups' ability to navigate effectively. The authors use generalized linear models to test the significance of the effects they observe.

      The work relies on a thoughtful and detailed model which faithfully incorporates salient features, such as acoustic elements like the filter for a biological receiver and temporal aggregation as a kind of memory in the system. At the same time, the authors abstract features that are complicating without being expected to give additional insights, as can be seen in the choice of a two-dimensional rather than three-dimensional system. I thought that the level of abstraction in the model was perfect, enough to demonstrate their results without needless details. The results are compelling and interesting, and the authors do a great job discussing them in the context of the biological literature.

      With respect to the first version of the manuscript, the authors have remedied all my outstanding questions or concerns in the current version. The new supplementary figure 5 is especially helpful in understanding the geometry.

    1. Reviewer #1 (Public review):

      Summary:

      The authors provide a detailed ultrastructural analysis of the larval pharyngeal sensory organs, including the dorsal pharyngeal sensilla, dorsal pharyngeal organ, ventral pharyngeal sensilla, and posterior pharyngeal sensilla. Using electron microscopy and 3D reconstruction, Richter et al., present a comprehensive mapping and classification of pharyngeal sensory structures, defining mthe orphological type of pharyngeal sensilla based on ultrastructure and generating a neuron-to-sensillum map. These findings significantly advance our understanding of internal larval sensory systems and establish a robust framework for future functional studies in coordination with external sensory systems.

      Strengths:

      The application of high-resolution electron microscopy and 3D imaging analysis successfully overcomes technical challenges associated with visualizing deep internal structures. This enables an unprecedented level of anatomical detail of the larval pharyngeal sensory system. Thus, the study complements and completes existing maps of larval sensory circuits, contributing a comprehensive neuroanatomical characterization of larval sensory input pathways. These insights will inform future studies on larval behavior, sensory processing, and may also have applied relevance for insect control strategies.

      Weaknesses:

      While the manuscript is concise, clearly written, and methodologically rigorous, it primarily addresses a specialized readership with expertise in insect neuroanatomy.

    2. Reviewer #2 (Public review):

      Summary:

      This manuscript documents the structure of the pharyngeal nervous system of the Drosophila larva. The authors wanted to achieve a detailed ultrastructural reconstruction of the gustatory sensory organs in the Drosophila pharynx. Using serial EM and the associated bioinformatics tools, they have achieved their goal. The paper is written clearly and illustrated beautifully with 3D models and annotated sections. The data will significantly enrich the field of Drosophila neurobiology.

      Strengths:

      Given the dataset, the findings presented are solid and will be an important work of reference for the future.

      Weaknesses:

      Previous work, including EM, on the pharyngeal sensory organ is not sufficiently referenced and used for comparison with the data presented in this study.

    1. Reviewer #1 (Public review):

      Summary:

      This work provides important new evidence of the cognitive and neural mechanisms that give rise to feelings of shame and guilt, as well as their transformation into compensatory behavior. The authors use a well-designed interpersonal task to manipulate responsibility and harm, eliciting varying levels of shame and guilt in participants. The study combines behavioral, computational, and neuroimaging approaches to offer a comprehensive account of how these emotions are experienced and acted upon. Notably, the findings reveal distinct patterns in how harm and responsibility contribute to guilt and shame and how these factors are integrated into compensatory decision-making.

      Strengths:

      (1) Investigating both guilt and shame in a single experimental framework allows for a direct comparison of their behavioral and neural effects while minimizing confounds.

      (2) The study provides a novel contribution to the literature by exploring the neural bases underlying the conversion of shame into behavior.

      (3) The task is creative and ecologically valid, simulating a realistic social situation while retaining experimental control.

      (4) Computational modeling and fMRI analysis yield converging evidence for a quotient-based integration of harm and responsibility in guiding compensatory behavior.

      Weaknesses:

      (1) Post-experimental self-reports rely both on memory and on the understanding of the conceptual difference between the two emotions. Additionally, it is unclear whether the 16 scenarios were presented in random order; sequential presentation could have introduced contrast effects or demand characteristics.

      (2) In the neural analysis of emotion sensitivity, the authors identify brain regions correlated with responsibility-driven shame sensitivity and then use those brain regions as masks to test whether they were more involved in the responsibility-driven shame sensitivity than the other types of emotion sensitivity. I wonder if this is biasing the results. Would it be better to use a cross-validation approach? A similar issue might arise in "Activation analysis (neural basis of compensatory sensitivity)."

      Additional comments and questions:

      (1) Regarding the traits of guilt and shame, I appreciate using the scores from the subscales (evaluations and action tendencies) separately for the analyses (instead of a composite score). An issue with using the actions subscales when measuring guilt and shame proneness is that the behavioral tendencies for each emotion get conflated with their definitions, risking circularity. It is reassuring that the behavior evaluation subscale was significantly correlated with compensatory behavior (not only the action tendencies subscale). However, the absence of significant neural correlates for the behavior evaluation subscale raises questions: Do the authors have thoughts on why this might be the case, and any implications?

      (2) Regarding the computational model finding that participants seem to disregard self-interest, do the authors believe it may reflect the relatively small endowment at stake? Do the authors believe this behavior would persist if the stakes were higher? Additionally, might the type of harm inflicted (e.g., electric shock vs. less stigmatized/less ethically charged harm like placing a hand in ice-cold water) influence the weight of self-interest in decision-making?

      Taken together, the conclusions of the paper are well supported by the data. It would be valuable for future studies to validate these findings using alternative tasks or paradigms to ensure the robustness and generalizability of the observed behavioral and neural mechanisms.

    2. Reviewer #2 (Public review):

      Summary:

      The authors combined behavioral experiments, computational modeling, and functional magnetic resonance imaging (fMRI) to investigate the psychological and neural mechanisms underlying guilt, shame, and the altruistic behaviors driven by these emotions. The results revealed that guilt is more strongly associated with harm, whereas shame is more closely linked to responsibility. Compared to shame, guilt elicited a higher level of altruistic behavior. Computational modeling demonstrated how individuals integrate information about harm and responsibility. The fMRI findings identified a set of brain regions involved in representing harm and responsibility, transforming responsibility into feelings of shame, converting guilt and shame into altruistic actions, and mediating the effect of trait guilt on compensatory behavior.

      Strengths:

      This study offers a significant contribution to the literature on social emotions by moving beyond prior research that typically focused on isolated aspects of guilt and shame. The study presents a comprehensive examination of these emotions, encompassing their cognitive antecedents, affective experiences, behavioral consequences, trait-level characteristics, and neural correlates. The authors have introduced a novel experimental task that enables such a systematic investigation and holds strong potential for future research applications. The computational modeling procedures were implemented in accordance with current field standards. The findings are rich and offer meaningful theoretical insights. The manuscript is well written, and the results are clearly and logically presented.

      Weaknesses:

      In this study, participants' feelings of guilt and shame were assessed retrospectively, after they had completed all altruistic decision-making tasks. This reliance on memory-based self-reports may introduce recall bias, potentially compromising the accuracy of the emotion measurements.

      In many behavioral economic models, self-interest plays a central role in shaping individual decision-making, including moral decisions. However, the model comparison results in this study suggest that models without a self-interest component (such as Model 1.3) outperform those that incorporate it (such as Model 1.1 and Model 1.2). The authors have not provided a satisfactory explanation for this counterintuitive finding.

      The phrases "individuals integrate harm and responsibility in the form of a quotient" and "harm and responsibility are integrated in the form of a quotient" appear in the Abstract and Discussion sections. However, based on the results of the computational modeling, it is more accurate to state that "harm and the number of wrongdoers are integrated in the form of a quotient." The current phrasing misleadingly suggests that participants represent information as harm divided by responsibility, which does not align with the modeling results. This potentially confusing expression should be revised for clarity and accuracy.

      In the Discussion, the authors state: "Since no brain region associated with social cognition showed significant responses to harm or responsibility, it appears that the human brain encodes a unified measure integrating harm and responsibility (i.e., the quotient) rather than processing them as separate entities when both are relevant to subsequent emotional experience and decision-making." However, this interpretation overstates the implications of the null fMRI findings. The absence of significant activation in response to harm or responsibility does not necessarily imply that the brain does not represent these dimensions separately. Null results can arise from various factors, including limitations in the sensitivity of fMRI. It is possible that more fine-grained techniques, such as intracranial electrophysiological recordings, could reveal distinct neural representations of harm and responsibility. The interpretation of these null findings should be made with greater caution.

    3. Reviewer #3 (Public review):

      Summary:

      Zhu et al. set out to elucidate how the moral emotions of guilt and shame emerge from specific cognitive antecedents - harm and responsibility - and how these emotions subsequently drive compensatory behavior. Consistent with their prediction derived from functionalist theories of emotion, their behavioral findings indicate that guilt is more influenced by harm, whereas shame is more influenced by responsibility. In line with previous research, their results also demonstrate that guilt has a stronger facilitating effect on compensatory behavior than shame. Furthermore, computational modeling and neuroimaging results suggest that individuals integrate harm and responsibility information into a composite representation of the individual's share of the harm caused. Brain areas such as the striatum, insula, temporoparietal junction, lateral prefrontal cortex, and cingulate cortex were implicated in distinct stages of the processing of guilt and/or shame. In general, this work makes an important contribution to the field of moral emotions. Its impact could be further enhanced by clarifying methodological details, offering a more nuanced interpretation of the findings, and discussing their potential practical implications in greater depth.

      Strengths:

      First, this work conceptualizes guilt and shame as processes unfolding across distinct stages (cognitive appraisal, emotional experience, and behavioral response) and investigates the psychological and neural characteristics associated with their transitions from one stage to the next.

      Second, the well-designed experiment effectively manipulates harm and responsibility - two critical antecedents of guilt and shame.

      Third, the findings deepen our understanding of the mechanisms underlying guilt and shame beyond what has been established in previous research.

      Weaknesses:

      (1) Over the course of the task, participants may gradually become aware of their high error rate in the dot estimation task. This could lead them to discount their own judgments and become inclined to rely on the choices of other deciders. It is unclear whether participants in the experiment had the opportunity to observe or inquire about others' choices. This point is important, as the compensatory decision-making process may differ depending on whether choices are made independently or influenced by external input.

      (2) Given the inherent complexity of human decision-making, it is crucial to acknowledge that, although the authors compared eight candidate models, other plausible alternatives may exist. As such, caution is warranted when interpreting the computational modeling results.

      (3) I do not agree with the authors' claim that "computational modeling results indicated that individuals integrate harm and responsibility in the form of a quotient" (i.e., harm/responsibility). Rather, the findings appear to suggest that individuals may form a composite representation of the harm attributable to each individual (i.e., harm/the number of people involved). The explanation of the modeling results ought to be precise.

      (4) Many studies have reported positive associations between trait gratitude, social value orientation, and altruistic behavior. It would be helpful if the authors could provide an explanation about why this study failed to replicate these associations.

      (5) As the authors noted, guilt and shame are closely linked to various psychiatric disorders. It would be valuable to discuss whether this study has any implications for understanding or even informing the treatment of these disorders.

    1. Reviewer #1 (Public review):

      Summary

      The authors previously published a study of RGC boutons in the dLGN in developing wild-type mice and developing mutant mice with disrupted spontaneous activity. In the current manuscript, they have broken down their analysis of RGC boutons according to the number of Homer/Bassoon puncta associated with each vGlut3 cluster.

      The authors find that, in the first post-natal week, RGC boutons with multiple active zones (mAZs) are about a third as common as boutons with a single active zone (sAZ). The size of the vGluT2 cluster associated with each bouton was proportional to the number of active zones present in each bouton. Within the author's ability to estimate these values (n=3 per group, 95% of results expected to be within ~2.5 standard deviations), these results are consistent across groups: 1) dominant eye vs. non-dominant eye, 2) wild-type mice vs. mice with activity blocked, and at 3) ages P2, P4, and P8. The authors also found that mAZs and sAZs also have roughly the same number (about 1.5) of sAZs clustered around them (within 1.5 um).

      However, the authors do not interpret this consistency between groups as evidence that active zone clustering is not a specific marker or driver of activity dependent synaptic segregation. Rather, the authors perform a large number of tests for statistical significance and cite the presence or absence of statistical significance as evidence that "Eye-specific active zone clustering underlies synaptic competition in the developing visual system (title)". I don't believe this conclusion is supported by the evidence.

      Strengths

      The source dataset is high resolution data showing the colocalization of multiple synaptic proteins across development. Added to this data is labeling that distinguishes axons from the right eye from axons from the left eye. The first order analysis of this data showing changes in synapse density and in the occurrence of multi-active zone synapses is useful information about the development of an important model for activity dependent synaptic remodeling.

      Weaknesses

      In my previous review I argued that it was not possible to determine, from their analysis, whether the differences they were reporting between groups was important to the biology of the system. The authors have made some changes to their statistics (paired t-tests) and use some less derived measures of clustering. However, they still fail to present a meaningfully quantitative argument that the observed group differences are important. The authors base most of their claims on small differences between groups. There are two big problems with this practice. First, the differences between groups appear too small to be biologically important. Second, the differences between groups that are used as evidence for how the biology works are generally smaller than the precision of the author's sampling. That is, the differences are as likely to be false positives as true positives.

      (1) Effect size. The title claims: "Eye-specific active zone clustering underlies synaptic competition in the developing visual system". Such a claim might be supported if the authors found that mAZs are only found in dominant-eye RGCs and that eye-specific segregation doesn't begin until some threshold of mAZ frequency is reached. Instead, the behavior of mAZs is roughly the same across all conditions. For example, the clear trend in Figure 4C and D is that measures of clustering between mAZ and sAZ are as similar as could reasonably be expected by the experimental design. However, some of the comparisons of very similar values produced p-values < 0.05. The authors use this fact to argue that the negligible differences between mAZ and sAZs explain the development of the dramatic differences in the distribution of ipsilateral and contralateral RGCs.

      (2) Sample size. Performing a large number of significance tests and comparing p-values is not hypothesis testing and is not descriptive science. At best, with large sample sizes and controls for multiple tests, this approach could be considered exploratory. With n=3 for each group, many comparisons of many derived measures, among many groups, and no control for multiple testing, this approach constitutes a random result generator.

      The authors argue that n=3 is a large sample size for the type of high resolution / large volume data being used. It is true that many electron microscopy studies with n=1 are used to reveal the patterns of organization that are possible within an individual. However, such studies cannot control individual variation and are, therefore, not appropriate for identifying subtle differences between groups.<br /> In response to previous critiques along these lines, the authors argue they have dealt with this issue by limiting their analysis to within-individual paired comparisons. There are several problems with their thinking in this approach. The main problem is that they did not change the logic of their arguments, only which direction they pointed the t-tests. Instead of claiming that two groups are different because p < 0.05, they say that two groups are different because one produced p < 0.05 and the other produced p > 0.05. These arguments are not statistically valid or biologically meaningful.

      To the best of my understanding, the results are consistent with the following model:

      • RGCs form mAZs at large boutons (known)

      • About a quarter of week-one RGC boutons are mAZs (new observation)

      • Vesicle clustering is proportional to active zone number (~new observation)

      • RGC synapse density increases during the first post-week (known)

      • Blocking activity reduces synapse density (known)

      • Contralateral eye RGCs for more and larger synapses in the lateral dLGN (known)

      • With n=3 and effect sizes smaller than 1 standard deviation, a statistically significant result is about as likely to be a false positive as a true positive.

      • A true-positive statistically significant result does is not evidence of a meaningful deviation from a biological model.

      Providing plots that show the number of active zones present in boutons across these various conditions is useful. However, I could find no compelling deviation from the above default predictions that would influence how I see the role of mAZs in activity dependent eye-specific segregation.

      Below are critiques of most of the claims of the manuscript.

      Claim (abstract): individual retinogeniculate boutons begin forming multiple nearby presynaptic active zones during the first postnatal week.

      Confirmed by data.

      Claim (abstract): the dominant-eye forms more numerous mAZ contacts,

      Misleading: The dominant-eye (by definition) forms more contacts than the non-dominant eye. That includes mAZ.

      Claim (abstract): At the height of competition, the non-dominant-eye projection adds many single active zone (sAZ) synapses

      Weak: While the individual observation is strong, it is a surprising deviation based on a single n=3 experiment in a study that performed twelve such experiments (six ages, mutant/wildtype, sAZ/mAZ)

      Claim (abstract): Together, these findings reveal eye-specific differences in release site addition during synaptic competition in circuits essential for visual perception and behavior.

      False: This claim is unambiguously false. The above findings, even if true, do not argue for any functional significance to active zone clustering.

      Claim (line 84): "At the peak of synaptic competition midway through the first postnatal week, the non-dominant-eye formed numerous sAZ inputs, equalizing the global synapse density between the two eyes"

      Weak: At one of twelve measures (age, bouton type, genotype) performed with 3 mice each, one density measure was about twice as high as expected.

      Claim (line 172): "In WT mice, both mAZ (Fig. 3A, left) and sAZ (Fig. 3B, left) inputs showed significant eye-specific volume differences at each age."

      Questionable: There appears to be a trend, but the size and consistency is unclear.

      Claim (line 175): "the median VGluT2 cluster volume in dominant-eye mAZ inputs was 3.72 fold larger than that of non-dominant-eye inputs (Fig. 3A, left)."

      Cherry picking. Twelve differences were measured with an n of 3, 3 each time. The biggest difference of the group was cited. No analysis is provided for the range of uncertainty about this measure (2.5 standard deviations) as an individual sample or as one of twelve comparisons.

      Claim (line 174): "In the middle of eye-specific competition at P4 in WT mice, the median VGluT2 cluster volume in dominant-eye mAZ inputs was 3.72 fold larger than that of non-dominant-eye inputs (Fig. 3A, left). In contrast, β2KO mice showed a smaller 1.1 fold difference at the same age (Fig. 3A, right panel). For sAZ synapses at P4, the magnitudes of eye-specific differences in VGluT2 volume were smaller: 1.35-fold in WT (Fig. 3B, left) and 0.41-fold in β2KO mice (Fig. 3B, right). Thus, both mAZ and sAZ input size favors the dominant eye, with larger eye-specific differences seen in WT mice (see Table S3)."

      No way to judge the reliability of the analysis and trivial conclusion: To analyze effect size the authors choose the median value of three measures (whatever the middle value is). They then make four comparisons at the time point where they observed the biggest difference in favor of their hypothesis. There is no way to determine how much we should trust these numbers besides spending time with the mislabeled scatter plots. The authors then claim that this analysis provides evidence that there is a difference in vGluT2 cluster volume between dominant and non-dominant RGCs and that that difference is activity dependent. The conclusion that dominant axons have bigger boutons and that mutants that lack the property that would drive segregation would show less of a difference is very consistent with the literature. Moreover, there is no context provided about what 1.35 or 1.1 fold difference means for the biology of the system.

      Claim (189): "This shows that vesicle docking at release sites favors the dominant-eye as we previously reported but is similar for like eye type inputs regardless of AZ number."

      Contradicts core claim of manuscript: Consistent with previous literature, there is an activity dependent relative increase in vGlut2 clustering of dominant eye RGCs. The new information is that that activity dependence is more or less the same in sAZ and mAZ. The only plausible alternative is that vGlut2 scaling only increases in mAZ which would be consistent with the claims of their paper. That is not what they found. To the extent that the analysis presented in this manuscript tests a hypothesis, this is it. The claim of the title has been refuted by figure 3.

      Claim (line 235): "For the non-dominant eye projection, however, clustered mAZ inputs outnumbered clustered sAZ inputs at P4 (Fig. 4C, bottom left panel), the age when this eye adds sAZ synapses (Fig. 2C)."

      Misleading: The overwhelming trend across 24 comparisons is that the sAZ clustering looks like mAZ clustering. That is the objective and unambiguous result. Among these 24 underpowered tests (n=3), there were a few p-values < 0.05. The authors base their interpretation of cell behavior on these crossings.

      Claim (line 328): "The failure to add synapses reduced synaptic clustering and more inputs formed in isolation in the mutants compared to controls."

      Trivially true: Density was lower in mutant.

      Claim (line 332): "While our findings support a role for spontaneous retinal activity in presynaptic release site addition and clustering..."

      Not meaningfully supported by evidence: I could not find meaningful differences between WT and mutant beside the already known dramatic difference in synapse density.

    2. Reviewer #2 (Public review):

      Summary:

      In this manuscript, Zhang and Speer examine changes in the spatial organization of synaptic proteins during eye specific segregation, a developmental period when axons from the two eyes initially mingle and gradually segregate into eye-specific regions of the dorsal lateral geniculate. The authors use STORM microscopy and immunostain presynaptic (VGluT2, Bassoon) and postsynaptic (Homer) proteins to identify synaptic release sites. Activity-dependent changes of this spatial organization are identified by comparing the β2KO mice to WT mice. They describe two types of synapses based on Bassoon clustering: the multiple active zone (mAZ) synapse and single active zone (sAZ) synapse. In this revision, the authors have added EM data to support the idea that mAZ synapses represent boutons with multiple release sites. They have also reanalyzed their data set with different statistical approaches.

      Strengths:

      The data presented is of good quality and provides an unprecedented view at high resolution of the presynaptic components of the retinogeniculate synapse during active developmental remodeling. This approach offers an advance to the previous mouse EM studies of this synapse because of the CTB label allows identification of the eye from which the presynaptic terminal arises.

      Weaknesses:

      While the interpretation of this data set is much more grounded in this second revised submission, some of the authors' conclusions/statements still lack convincing supporting evidence. In particular, the data does not support the title: "Eye-specific active zone clustering underlies synaptic competition in the developing visual system". The data show that there are fewer synapses made for both contra- and ipsi- inputs in the β2KO mice-- this fact alone can account for the differences in clustering. There is no evidence linking clustering to synaptic competition. Moreover, the findings of differences in AZ# or distance between AZs that the authors report are quite small and it is not clear whether they are functionally meaningful.

    3. Reviewer #3 (Public review):

      This study is a follow-up to a recent study of synaptic development based on a powerful data set that combines anterograde labeling, immunofluorescence labeling of synaptic proteins, and STORM imaging (Cell Reports, 2023). Specifically, they use anti-Vglut2 label to determine the size of the presynaptic structure (which they describe as the vesicle pool size), anti-Bassoon to label active zones with the resolution to count them, and anti-Homer to identify postsynaptic densities. Their previous study compared the detailed synaptic structure across the development of synapses made with contra-projecting vs. ipsi-projecting RGCs and compared this developmental profile with a mouse model with reduced retinal waves. In this study, they produce a new detailed analysis on the same data set in which they classify synapses into "multi-active zone" vs. "single-active zone" synapses and assess the number and spacing of these synapses. The authors use measurements to make conclusions about the role of retinal waves in the generation of same-eye synaptic clusters. The authors interpret these results as providing insight into how neural activity drives synapse maturation, the strength of their conclusions is not directly tested by their analysis.

      Strengths:

      This is a fantastic data set for describing the structural details of synapse development in a part of the brain undergoing activity-dependent synaptic rearrangements. The fact that they can differentiate the eye of origin is what makes this data set unique over previous structural work. The addition of example images from the EM dataset provides confidence in their categorization scheme.

      Weaknesses:

      Though the descriptions of single vs multi-active zone synapses are important and represent a significant advance, the authors continue to make unsupported conclusions regarding the biological processes driving these changes. Although this revision includes additional information about the populations tested and the tests conducted, the authors do not address the issue raised by previous reviews. Specifically, they provide no assessment of what effect size represents a biologically meaningful result. For example, a more appropriate title is "The distribution of eye-specific single vs multi-active zone is altered in mice with reduced spontaneous activity" rather than concluding that this difference in clustering is somehow related to synaptic competition. Of course, the authors are free to speculate, but many of the conclusions of the paper are not supported by their results.

    1. Reviewer #1 (Public review):

      Summary:

      This computational study investigates the physical mechanisms underlying enhancer-promoter (E-P) interactions across genomic distances in Drosophila chromosomes, motivated by a previously published study that revealed unexpectedly frequent long-range contacts challenging classical polymer models. The authors performed coarse-grained polymer simulations testing three chromatin organization models: ideal polymers, loop extrusion, and compartmental segregation, comparing their predictions to experimental Hi-C contact maps, mean E-P distances, and two-locus mean-squared displacement dynamics. They found that compartmental segregation best captured both the structural and dynamic features observed experimentally, while neither ideal chains nor loop extrusion alone could reproduce all experimental observables. The combination of compartmental segregation with loop extrusion further improved agreement with experimental data, suggesting these mechanisms might be involved in Drosophila chromatin organization.

      Strengths:

      The paper has two primary strengths:

      (1) The simulations are based on biologically interpretable mechanisms (compartmentalization and loop extrusion), which may facilitate making specific experimentally testable predictions.

      (2) The work uses a systematic approach to increase model complexity by directly fitting to data, first establishing that simple models fail to capture the data until arriving at a more complex model that does capture the data.

      Weaknesses:

      I have two major concerns (detailed below) and multiple minor concerns.

      Major concerns:

      (1) While the upside of the mechanistic simulations is that they are interpretable, the downside is that specific choices for the considered mechanism were made, and conclusions drawn from it are necessarily biased by the initial choices. In this paper, only two mechanisms were considered: loop extrusion and compartmentalization. Yet, it is not clear why these are the most likely underlying mechanisms that might determine the chromosome dynamics. Indeed, previous work (not cited in this paper) showed that Drosophila chromosome structure is not determined by loop extrusion: https://elifesciences.org/articles/94070.

      This should be acknowledged, and the main reasons for choosing these particular mechanisms should be laid out. The conclusions of the paper must then necessarily always be seen under the caveat that only these two mechanisms were considered.

      (2) Even within the framework of the approach, insufficient evidence is given to support the title of the paper "Criticality-driven enhancer-promoter dynamics in Drosophila chromosomes" for two reasons:

      (a) The fact that the best-fit parameters are near a coil-globule transition does not mean that the resulting dynamics are criticality-driven. To claim criticality, one would usually expect much more direct evidence, such as diverging correlation lengths. Furthermore, it would need to be shown that the key features of the dynamics (which should be defined, presumably the static and dynamic exponents) indeed depend on the parameters being at this transition. i.e., when tuning the simulations away from this parameter point, does the behaviour disappear? Only in this case can it be claimed that the behaviour is driven by this phenomenon.

      (b) The results section actually contains no mention of the coil-globule transition, and it is not clear in what way the parameters are close to this transition.

      Thus, three things are necessary:

      (i) How the parameters are close to the transition needs to be explained in detail.

      (ii) The divergence of observed dynamics whenever the parameters are tuned away from the transition needs to be demonstrated.

      (iii) Even if 1 and 2 are fulfilled, a more careful title should be chosen, such as "Polymer simulations near the coil-globule transition are consistent with enhancer-promoter dynamics in Drosophila chromosomes."

      Many of the results in the figures and results section are rather repetitive and could be compressed. The main result of Figure 1 - that the data are not described by an ideal chain - was already fully shown and established in the original paper from which the data are taken. Figure 2 is a negative result with near-identical panels to Figure 3. Figure 4B is hard to interpret.

      The paper makes no concrete suggestions for new experiments to test the hypotheses formulated. Since the paper can only claim that the simulations are consistent with the data, it would significantly strengthen the paper if testable predictions could be made.

    2. Reviewer #2 (Public review):

      Summary:

      In this work, Ganesh and colleagues use experimental data from Hi-C and from live-cell imaging to evaluate different polymer models of 3D genome organization in Drosophila based on both structural and dynamic properties. The authors consider several leading hypotheses, which are examined sequentially in increasing level of complexity - from the minimal Rouse polymer, to a model combining sequence-specific compartmentalization and loop-extrusion without extrusion blockers. They conclude that the combination of both compartmentalization and loop-extrusion gives the best agreement with the data. Their analysis also leads to concrete predictions about the processivity of cohesin loop extrusion in Drosophila, and a conclusion that the compartmental interaction strength is poised near criticality in the coil-globule phase space.

      Strengths:

      There is considerable interest in the field in understanding the mechanisms responsible for the 3D spatial organization genome and the dynamic movement of the genome, which has major implications for our understanding of long-range transcriptional regulation and other genome behaviors. The live-cell experimental work on which this study draws highlights the limitations of existing models to explain even the dynamic behaviors observed in the data, further exciting interest in further exploration. Therefore, this paper seeks to address an important gap in the field. The work is written in a well-organized, well-illustrated fashion. The text and figures are nicely integrated, easy to read, and explain challenging concepts with elegance and brevity in a manner that will be accessible to a broad audience.

      Weaknesses:

      The validity and utility of these conclusions are, in my view, substantially undermined by what appears to be unappreciated peculiarities of the live-cell data set that was used to constrain the model. The live-cell data comes from embryos were edited in a way that intentionally substantively changed both the 3D genome structure and dynamics specifically at the loci which are imaged, a case which is not at all explained by any of the models suggested nor acknowledged in the current work, nor compatible with the Hi-C data that simultaneously used to explain these models. As these ignored synthetic alterations have been previously shown to be determinative of transcriptional activity, the relevance of the author's work to transcriptional control (a prime motivation in the introduction) is unclear.

      The agreement in 3D organization, as represented in chromosome-scale contact frequency heatmaps, is substantially less impressive than the agreement seen in prior work with similar models. This discrepancy appears to be due in part to the unappreciated effects of the mentioned in the previous limitation, as well as inappropriate choices in metrics used to evaluate agreement. It is also not particularly surprising that combining more models, with more free parameters, results in an improvement in the quality of fit.

      Some major results, including both theoretical works and experimental ones, are ignored, despite their relevance to the stated objective of the work. The current manuscript and analysis could be improved substantially by a consideration of these works.

      I describe these issues in more detail below.

      Major issues:

      (1) The genetic element "homie" is present in a subset of the data: The experimental data used in this analysis come from different fly lines, half of which have been edited explicitly to alter genome structure and consequent transcriptional behavior, yet the authors are trying to fit with a common model - a problem which substantially undermines the utility of the analysis.

      Specifically, the authors evaluate the various models/simulations by comparing them to Hi-C from wildtype Drosophila embryos on the chromosome scale and 3D distances and dynamics from live cell imaging in genetically edited embryos, to a series of models in turn. The exercise fatally overlooks a critical fact, (admittedly not easily noticed in the work from Bruckner et al), that the fly embryos used for nearly all their analyses contain not only fluorescent labels, but also contain two copies of a powerful genetic sequence, "homie", known for its ability to dramatically change the 3D organization and dynamics of the genome. Whether or not the fluorescent labels themselves used in the study further alter structure and dynamics is not entirely clear (and will require further work beyond the scope of either study), but at least these fluorescent labels aren't known to dramatically affect 3D structure and dynamics the way homie is. The critical problem is that adding or removing the "homie", as shown in a collection of prior works I describe below in more detail, dramatically affects structure, dynamics, and gene expression. Whether or not the genome contains two distal cis-linked copies of homie fundamentally changes genome structure and dynamics, so to use one dataset which has this edit (the live-cell data) and one dataset which lacks it (the Hi-C data) is, in some sense, to guarantee failure of any model to match all the data.

      If the authors had chosen instead to focus exclusively on the 'no homie' genetic lines in the Brukner data, they would have a much smaller dataset (just 2 distances), which would not cover all the length scales of interest, but it would at least be a dataset not known to be contradictory to the Hi-C. The two 'no homie' lines make much more plausible candidates for the sort of generalizable polymer dynamics these authors seek to explain, as will hopefully be made more clear by a brief review of what is known about homie. I next describe the published data that support these conclusions about how homie affects 3D genome spatial organization and dynamics:

      What is "homie" and how does it affect 3D genome distances, dynamics, and gene expression?

      The genetic element "homie" was named by James Jaynes' lab ( Fujioka...Jaynes 2009) in reference to its remarkable "homing" ability - a fascinating and still poorly understood biological observation that some genetic sequences from Drosophila, when cloned on plasmids and reintegrated into the genome with p-elements, had a remarkable propensity to re-integrate near their endogenous sequence, (Hama et al., 1990; Kassis, 2002; Taillebourg and Dura, 1999; Bender and Hudson, 2000; Fujioka...Jaynes 2009). By contrast, most genetic elements tend to incorporate at random across the genome in such assays (with some bias for active chromatin).

      The Jaynes lab subsequently showed that flies carrying two copies of homie, one integrated in cis, ~140 kb distal from the endogenous element, formed preferential cis contacts with one another. Indeed, if a promoter and reporter gene were included at this distal integration site, the reporter gene would activate gene expression in the pattern normally seen by the gene, even-skipped. The endogenous copy of homie marks one border of ~16 kb mini-TAD which contains the even-skipped gene, (eve), and its developmental enhancers, so this functional interaction provides further evidence of physical proximity (as was also shown by 3C by Jaynes (Fujioka..., Schedl, Jaynes 2016), and later with elegant live imaging, by Jaynes and Gregor (Chen 2018)).

      Critically, if either copy of homie is deleted or substantially mutated, the 3D proximity is lost (Fujioka 2016, Chen 2018, Bruckner 2023), and the expression of the transgene is dramatically reduced (at 58 kb) or lost. Given the author's motivation of understanding "E-P" interactions, the fact that the increased 3D proximity provided by homie is as essential for transcription as the promoter itself at the ~150 kb distance, underscores that these are not negligible changes.

      These effects can be seen by plotting the data from Bruckner 2023, which includes data from labels with separations of 58 kb and ~150 kb "no homie" as well as homie. Unfortunately, the authors don't plot this data in the manuscript in the comparison of 3D distances, though the two-point MSD can be seen in Figure S13C, and laudably, the data is made public in a well-annotated repository on Zenodo, noted in the study. Note that the distance data in Figure S13 were filtered to exclude the transcriptionally off state, and are thus not the quantity the current authors are interested in. If they plot the published data for no homie, they will see the clear effect on the average 3D distance, R(s), and a somewhat stronger effect on the contact frequency P(s), which causes significant deviation from the trend-line followed by the homie-containing data.

      (2) The agreement between the "best performing" simulations for all models and the Hi-C data is not on par with prior studies using similar approaches, apparently due to some erroneous choices in how the optimization is carried out:

      Hi-C-comparison

      The 'best fit' simulation Hi-C looks strikingly different from the biological data in all comparisons, with clearly lower agreement than other authors have shown using highly similar methods (e.g., Shi and Thirumalai 2023; Di Pierro et al. 2017; Nuebler et al. 2018; Esposito et al. 2022; Conte et al. 2022), among many others. I believe this results from a few issues with how the current authors select and evaluate the data in their work:

      (a) Most works have used Pearson's correlation rather than Spearman's correlation when comparing simulation and Hi-C contact frequencies. Pearson's correlation is more appropriate when we expect the values to be linearly related, which they should be in this case, as they are constructed indeed to be measuring the same thing (contact frequency), just derived from two different methods. Spearman's correlation would have been justifiable for comparing how transcription output correlates with contact frequency. This may fix the bafflingly low correlations reported at lower adhesion values in Figure S2C.

      (b) Choice of adhesion strengths - The Hi-C map comparison in Figure 3 strongly suggests that a much more striking visual agreement would have been achieved if much weaker (but still non-zero) homotypic monomer affinity had been selected. In the authors' simulation, the monomer state (A/B identity) strongly dominates polymer position, resulting in the visual appearance of an almost black-and-white checkerboard. The data, meanwhile, look like a weak checkerboard superimposed on the polymer.

      (c) A further confounding problem is the aforementioned issue that the Hi-C data don't come from the edited cell lines, and that the interaction of the two Homie sites is vastly stronger than the compartment interactions of this region of the genome.

      (3) Some important concepts from the field are ignored:

      The crumpled/fractal globule model is widely discussed in the literature (including the work containing the data used in this study) - its exclusion from this analysis thus appears as a substantial gap/oversight:

      A natural alternative to the much-discussed Rouse polymer model is the "crumpled polymer" (Grosberg et al. 1988; Grosberg 2016; Halverson et al. 2011; Halverson et al. 2011), also known as the "fractal globule" (Lieberman-Aiden et al. 2009; Mirny 2011; Dekker and Mirny 2016; Boettiger et al. 2016), much discussed for the way it captures the â…“ scaling of R(s), found for much of the genome (or, equivalently, the -1 exponent of the probability of contact as a function of genome separation, P(s)). Given the 1/3rd scaling in the data, and the fact that the original authors highlighted the crumpled model in addition to the Rouse model, it seems that this comparison would be instructive and the lack of discussion an oversight. Moreover, while prior works (e.g., Buckner, Gregor, 2023) used some traditional simplifying assumptions to estimate the MSD and relaxation time scaling of this model, I believe a more rigorous analysis with explicit simulations (as in Figure 1 for the Rouse model) would be instructive for the crumpled polymer simulations. Note the crumpled globule is not necessarily the same as the globule in the coil-globule transition discussed here - it requires some assumptions about non-entanglement to stay trapped in the meta-stable state which has the 1/3rd R(s) scaling that is indicative of this model, and not the 1/2 exhibited by equilibrium globules (for s<< length of the polymer) and dilute polymers alike.

      While the fit in Figure 2 appears to get closer to the 1/3rd exponent (B= 0.32), this appears to be a largely coincidental allusion of agreement - the simulation data in truth shows a systematic deviation, returning to the 1/2 scaling for distances from 500 kb to whole chromosomes. This feature is not very evident as the authors restrict the analysis to only the few points available in the experimental data, though had they tested intervening distances I expect they would show log-log P(s) is nonlinear (non-powerlaw) for distances less than the typical loop length up to a few fold larger than the loop length, and thereafter returns to the scaling provided by the 'base' polymer behavior. This appears to be Rouse-like in these authors' model, with R(s) going like 1/2, even though the data are closer to 1/3rd, as indeed most published simulated P(s) curves based on loop extrusion - e.g., (Fudenberg et al. 2016; Nuebler et al. 2018). In this vein, it would be instructive to the readers if the authors would include additional predictions from the simulation on the plot that lie at genomic separation distances not tested in the data, to better appreciate the predictions.

      Minor issues

      (1) I think it is too misleading to only describe the experimental data from Brukner as "E-P" interactions from Drosophila. It is important to note somewhere that this is not an endogenous interaction with a functional role in Drosophila - it is a synthetic interaction between enhancers in the vicinity of the eve gene and a synthetic promoter placed at a variable distance away. The uniformity is elegant - (it is the same pair of elements being studied at all distances), but also provides limited scope for generalization as suggested by the current text. Moreover, the enhancers were not directly labeled; rather, the 3D position of nascent RNA transcribed from eve was tracked with an RNA-binding protein and used as a proxy for the 3D position of the enhancers. There is not an individual enhancer at the eve locus that interacts with the transgene, but rather a collection of enhancers is distributed at different positions throughout the entire TAD, which contains eve, and must form separate loops to reach eve. Indeed, it was previously reported that differences in the local position of these enhancers, relative to eve, affect their ability to interact with the distal reporter gene and the endogenous eve gene (Chen 2018). There is also reported competition between these enhancers and the distal gene, which further complicates the analysis (especially since the state of eve and of its enhancers varies among the different cells as a function of stripe position) - see Chen 2018. All of this is ignored in the current work, despite the assertion of the application to understanding E-P interaction. A detailed discussion of these issues is not necessary, but I fear that ignoring them entirely is to invite further confusion and error.

      (2) I believe this sentence is overstated, given available data: " TAD borders are characterized by transitions between epigenetic states rather than by preferentially-bound CTCF [4, 23, 24]." Indeed, this claim has been repeatedly made in the literature as cited here. However, other data clearly demonstrate a strong enrichment of CTCF at TAD borders (and at epigenetic borders, which in Drosophila have a high correspondence with TAD borders, as the authors have already appropriately noted). See, for example, Figure 4 of Sexton Cell 2012, and compare to Figure 2 of Dixon 2012. Of minor note, CTCF peaks co-occupied by the Zinc Finger TF CP190 are more likely to be TAD borders than CTCF alone. How big a species-specific difference this is remains unclear, as it appears some mammalian CTCF-marked TAD boundaries may be co-occupied by additional ZNFs. While plenty of Drosophila TAD boundaries indeed lack CTCF, many are marked by CTCF, this is enriched relative to what would be expected by chance (or relative to the alignment of other TFs, like Twist or Eve with TAD boundaries), and it has been shown that CTCF loss is sufficient to remove a subset of these, see for example Figure 5 of (Kaushal et al. 2021) (though it is possible, most will require mutation of the all the border-associated factors that collectively bind many of the borders, dCTCF, CP190, mod(mdg4) and others).

      (3) This assertion is overstated given available data: "Although TAD boundaries in Drosophila are often associated with insulator proteins [20], there is no direct evidence that these elements block LEFs in vivo. Therefore, we did not impose boundary constraints in our simulations; LEFs were allowed to move freely unless stalled by collisions with other LEFs, with the possibility of crossover.". Deletion of insulator in Drosophila that lie within a common epigenetic state leads to fusion of TADs (e.g., Mateo et al., 2019 - deletion of the CTCF-marked Fub insulator, in posterior tissues where both flanks of Fub are active; Kaushal, 2021, has examples as well). Loss of CTCF causes a small number of TADs to fuse as measured by Hi-C. This is far from 'direct evidence that insulators block LEFs' - as the authors have already noted, even the idea that cohesin extrudes loops in Drosophila in the first place is indeed controversial. However, LEF activity and stalling at insulators would provide a very natural explanation of why chromatin in a shared epigenetic state should form distinct TADs, and why these TADs should fuse upon insulator deletion. Justifying the lack of stalling sites based on empirical data is thus not very convincing to this reviewer. I believe it would be more apt to simply describe this as a simplifying assumption, rather than the above phrase, which may be misleading.

    1. Reviewer #1 (Public review):

      In this revised submission from Kapustin et al., the authors have made significant changes to the manuscript. Namely, the authors have addressed several of the major issues with the original submission, providing a more concrete link between fibronectin and the secretion of extracellular vesicles. Additionally, the authors have moderated some of the conclusions to better suit the rigor of the experimental results and limitations of their approach. Generally, the findings convey an interesting cell autonomous pathway in which smooth muscle cells sense fibronectin, which canonically is a proinflammatory substrate with activating properties in many tissues. Fibronectin-mediated integrin signaling stimulates secretion of small extracellular vesicles containing collagen VI which is deposited into the surrounding extracellular matrix. Collagen VI itself gleaned from extracellular vesicle secretion seems to further alter smooth muscle cell morphodynamics. For this later finding, much of the mechanism behind collagen VI vesicle loading and secretion has yet to be worked out. The authors provide evidence of extracellular vesicles containing collagen VI trapped in fibronectin in atherosclerotic plaques providing a nice validation of their in vitro findings in a diseased human cohort. Some limitations do still exist in the manuscript in its current form such as the assessment of the vesicle origins, contents and their association with the actin cytoskeleton; however, the rigor and execution are much improved from the preceding version. Overall, the pathobiology underlying vascular smooth muscle remodeling in disease states is a critical area of research that warrants further exploration.

    2. Reviewer #2 (Public review):

      The findings in the current manuscript are interesting and valuable contributions to the fields of vascular biology and extracellular vesicle-related mechanisms. They suggest a potential role for smooth muscle cell-derived extracellular vesicles in presenting Type VI collagen to cells to orchestrate their migration, with proposed relevance to aberrant smooth muscle cell movements in the progression of atherosclerotic lesions. A wide range of assays are utilized to test various aspects of this working model, with the resulting data being largely solid and supporting several of the interpretations articulated by the authors. The revised manuscript has adequately addressed key weaknesses.

      The authors present data suggesting a working model in which vascular smooth muscle cells (vSMCs) are stimulated by fibronectin (FN) to generate small extracellular vesicles (sEVs) that harbor Type VI Collagen (collagen VI). These collagen VI-associated sEVs are suggested to accumulate in the extracellular matrix (ECM) and influence cell migration and adhesion dynamics, potentially contributing to disease progression in atherosclerosis. Majors strengths of this manuscript include robust imaging data and the inclusion of human-derived samples in their analysis. The authors also make a reasonable attempt to provide data to support the potential existence of these mechanistic connections, though some minor questions remain regarding data interpretation. The authors largely achieved their aims of finding evidence consistent with their interpretations, and they have presented logical support for their conclusions while acknowledging important limitations and caveats to their current study. This work will likely have a sustained impact on the field of sEV biology and potential intersections with vascular biology, including their methodology e.g., imaging approaches. As biologists continue to explore the role of sEVs in physiological and pathological processes, this work raises an interesting aspect that must be considered more broadly, and that is, what is the role of sEVs that are ECM-associated and not necessarily internalized by recipient cells? Are there discrete mechanisms that govern their role in maintaining and/or disrupting normal physiological processes? This manuscript makes an attempt to address these unresolved yet critical questions.

    1. Reviewer #1 (Public review):

      Summary:

      This study examines how two common psychiatric treatments, antidepressant medication and cognitive distancing, influence baseline levels and moment-to-moment changes in happiness, confidence, and engagement during a reinforcement learning task. Combining a probabilistic selection task, trial-by-trial affect ratings, psychiatric questionnaires, and computational modeling, the authors demonstrate that each treatment has distinct effects on affective dynamics. Notably, the results highlight the key role of affective biases in how people with mental health conditions experience and update their feelings over time, and suggest that interventions like cognitive distancing and antidepressant medication may work, at least in part, by shifting these biases.

      Strengths:

      (1) Addresses an important question: how common psychiatric treatments impact affective biases, with potential translational relevance for understanding and improving mental health interventions.

      (2) The introduction is strong, clear, and accessible, making the study approachable for readers less familiar with the underlying literature.

      (3) Utilizes a large sample that is broadly representative of the UK population in terms of age and psychiatric symptom history, enhancing generalizability.

      (4) Employs a theory-driven computational modeling framework that links learning processes with subjective emotional experiences.

      (5) Uses cross-validation to support the robustness and generalizability of model comparisons and findings.

      Weaknesses:

      The authors acknowledge the limitations in the discussion section.

      Additional questions:

      (1) Group Balance & Screening for Medication Use: How many participants in the cognitive distancing and control groups were taking antidepressant medication? Why wasn't medication use included as part of the screening to ensure both groups had a similar number of participants taking medication?

      (2) Assessment of the Practice of Cognitive Distancing: Is there a direct or more objective method to evaluate whether participants actively engaged in cognitive distancing during the task, and to what extent? Currently, the study infers engagement indirectly through the outcomes, but does not include explicit measures of participants' use of the technique. Would including self-report check-ins throughout the task, asking participants whether they were actively engaging in cognitive distancing, have been useful? However, including frequent self-report check-ins would increase procedural differences between groups, making perhaps the tasks less comparable beyond the intended treatment manipulation. Maybe incorporating a question at the end of the task, asking how much they engaged in cognitive distancing, could offer a useful measure of subjective engagement without overly disrupting the task flow.

      Conclusion:

      This study advances our understanding of the mechanisms underlying mental health interventions. The combination of computational modeling with behavioral and affective data offers a powerful framework for understanding how treatments influence affective biases and dynamics. These findings are of broad interest across clinical and mental health sciences, cognitive and affective research, and applied translational fields focused on improving psychological well-being.

    2. Reviewer #2 (Public review):

      In this paper, Dercon and colleagues report on affective changes related to components of reinforcement learning and on the effects of brief training in psychological distancing and participants' self-reported antidepressant use. About 1,000 participants were assessed online, with half randomized to a brief training in psychological distancing with reminders to distance during the subsequent reinforcement learning (RL) task. Participants completed a battery of psychiatric questionnaires and answered questions about medication use, with about 14% of participants reporting current antidepressant use. All participants completed the RL task and rated their happiness, confidence, engagement, and (at the end of each block of trials) fatigue throughout the task. Computational models were used to estimate trial-by-trial values of expected value and prediction error and to assess the effects of these values on self-reported affect. Participants' affect ratings decreased over time, and participants with higher psychiatric symptoms (particularly anxiety/depressive symptoms) showed lower baseline affect and greater decreases in affect. Participants randomized to the distancing intervention and who reported antidepressant use differed in their affective ratings: distancing reduced the reductions in happiness over time, while antidepressant use was related to higher baseline happiness. Distancing also reduced the effects of trial-level expected value on happiness, while antidepressant use was related to a more enduring effect of trial-level values on happiness.

      Overall, this is an interesting paper with strong methods and an interesting approach. That psychiatric symptoms and cognitive distancing are related to affective ratings is not terribly novel; the relationship with antidepressant use is a bit more novel. The extension of the mood model to an RL task is a new contribution, as is the relationship of these effects with psychologically related manipulations.

      One major concern is the inference that can be drawn from the two "treatments": one is a brief instruction in a component of psychotherapy, and one is ongoing use of medication. The former is not a treatment in and of itself, but a (presumably) active ingredient of one. How to interpret antidepressant use as measured is unclear, e.g., are the residual symptoms in these participants an early indicator of treatment resistance? Are these participants with better access to health care? Are they receiving antidepressants for a mental health issue?

      There are some clarifications needed in the affect model as well.

    3. Reviewer #3 (Public review):

      Summary:

      The present manuscript investigates and proposes different mechanisms for the effects of two therapeutic approaches - cognitive distancing technique and use of antidepressants - on subjective ratings of happiness, confidence, and task engagement, and on the influence of such subjective experiences on choice behavior. Both approaches were found to link to changes in affective state dynamics in a choice task, specifically reduced drift (cognitive distancing) and increased baseline (antidepressant use). Results also suggest that cognitive distancing may reduce the weighing of recent expected values in the happiness model, while antidepressant use may reduce forgetting of choices and outcomes.

      Strengths:

      This is a timely topic and a significant contribution to ongoing efforts to improve our mechanistic understanding of psychopathology and devise effective novel interventions. The relevance of the manuscript's central question is clear, and the links to previous literature and the broader field of computational psychiatry are well established. The modelling approaches are thoughtful and rigorously tested, with appropriate model checks and persuasive evidence that modelling complements the theoretical argument and empirical findings.

      Weaknesses:

      Some vagueness and lack of clarity in theoretical mechanisms and interpretation of results leave outstanding questions regarding (a) the specific links drawn between affective biases, therapies aimed at mitigating them, and mental health function, and (b) the structure and assumptions of the modelling, and how they support the manuscript's central claims. Broadly, I do not fully understand the distinction between how choice behavior vs. affect are impacted separately or together by cognitive distancing. Clarification on this point is needed, possibly through a more explicit proposal of a mechanism (or several alternative mechanisms?) in the introduction and more explicit interpretation of the modelling results in the context of the cyclical choice-affect mechanism.

      (1) Theoretical framework and proposed mechanisms

      The link between affective biases and negative thinking patterns is a bit unclear. The authors seem to make a causal claim that "affective biases are precipitated and maintained by negative thinking patterns", but it is unclear what precisely these negative patterns are; earlier in the same paragraph, they state that affective biases "cause low mood" and possibly shift choices toward those that maintain low mood. So the directionality of the mechanism here is unclear - possibly explaining a bit more of the cyclic nature of this mechanism, and maybe clarifying what "negative thinking patterns" refer to will be helpful.

      More generally, this link between affect and choices, especially given the modelling results later on, should be clarified further. What is the mechanism by which these two impact each other? How do the models of choice and affect ratings in the RL task test this mechanism? I'm not quite sure the paper answers these questions clearly right now.

      The authors also seem to implicitly make the claim that symptoms of mental ill-health are at least in part related to choice behavior. I find this a persuasive claim generally; however, it is understated and undersupported in the introduction, to the point where a reader may need to rely on significant prior knowledge to understand why mitigating the impact of affective biases on choice behavior would make sense as the target of therapeutic interventions. This is a core tenet of the paper, and it would be beneficial to clarify this earlier on.

      It would be helpful to interpret a bit more clearly the findings from 3.4. on decreased drift in all three subjective assessments in the cognitive distancing group. What is the proposed mechanism for this? The discussion mentions that "attenuated declines [...] over time, [add] to our previously reported findings that this psychotherapeutic technique alters aspects of reward learning" - but this is vague and I do not understand, if an explanation for how this happens is offered, what that explanation is. Given the strong correlation of the drift with fatigue, is the explanation that cognitive distancing mitigates affect drift under fatigue? Or is this merely reporting the result without an interpretation around potential mechanisms?

      (Relatedly, aside from possibly explaining the drift parameter, do the fatigue ratings link with choice behavior in any way? Is it possible that the cognitive distancing was helping participants improve choices under fatigue?)

      (2) Task Structure and Modelling

      It is unclear what counted as a "rewarding" vs. "unrewarding" trial in the model. From my understanding of the task description, participants obtained positive or no reward (no losses), and verbal feedback, Correct/Incorrect. But given the probabilistic nature of the task, it follows that even some correct choices likely had unrewarding results. Was the verbal feedback still "Correct" in those cases, but with no points shown? I did not see any discussion on whether it is the #points earned or the verbal feedback that is considered a reward in the model. I am assuming the former, but based on previous literature, likely both play a role; so it would be interesting - and possibly necessary to strengthen the paper's argument - to see a model that assigns value to positive/negative feedback and earned points separately.

      From a theory perspective, it's interesting that the authors chose to assume separate learning rates for rewarding and non-rewarding trials. Why not, for example, separate reward sensitivity parameters? E.g., rather than a scaling parameter on the PE, a parameter modifying the r term inside the PE equation to, perhaps, assign different values to positive and zero points? (While I think overall the math works out similarly at the fitting time, this type of model should be less flexible on scaling the expected value and more flexible on scaling the actual #points / the subjective experience of the obtained verbal feedback, which seems more in line with the theoretical argument made in the introduction). The introduction explicitly states that negative biases "may cause low mood by making outcomes appear less rewarding" - which in modelling equations seems more likely to translate to different reward-perception biases, and not different learning rates. Alternatively, one might incorporate a perseveration parameter (e.g., similar to Collins et al. 2014) that would also accomplish a negative bias. Either of these two mechanisms seems perhaps worth testing out in a model - especially in a model that defines more clearly what rewarding vs. unrewarding may mean to the participant.

      If I understand correctly, the affect ratings models assume that the Q-value and the PE independently impact rating (so they have different weights, w2 and w3), but there is no parameter allowing for different impact for perceived rewarding and unrewarding outcomes? (I may be misreading equations 4-5, but if not, Q-value and PE impact the model via static rather than dynamic parameters.) Given the joint RL-affect fit, this seems to carry the assumption that any perceptual processing differences leading to different subjective perceptions of reward associated with each outcome only impact choice behavior, but not affect? (whereas affect is more broadly impacted, if I'm understanding this correctly, just by the magnitude of the values and PEs?) This is an interesting assumption, and the authors seem to have tested it a bit more in the Supplementary material, as shown in Figure S4. I'm wondering why this was excluded from the main text - it seems like the more flexible model found some potentially interesting differences which may be worth including, especially as they might shed additional insight into the influence of cognitive distancing on the cyclical choice-affect mechanisms proposed.

      Minor comments:

      If fatigue ratings were strongly associated with drift in the best-fitting model (as per page 13), I wonder if it would make sense to use those fatigue ratings as a proxy rather than allow the parameter to vary freely? (This does not in any way detract from the winning model's explanatory power, but if a parameter seems to be strongly explained by a variable we have empirical data for, it's not clear what extra benefit is earned by having that parameter in the model).

    1. Reviewer #1 (Public review):

      Allodynia is commonly measured in the pain field using von Frey filaments, which are applied to a body region (usually hindpaw if studying rodents) by a human. While humans perceive themselves as being objective, as the authors noted, humans are far from consistent when applying these filaments. Not to mention, odors from humans, including of different sexes, can influence animal behavior. There is thus a major unmet need for a way to automate this tedious von Frey testing process, and to remove humans from the experiment. I have no major scientific concerns with the study, as the authors did an outstanding job of comparing this automated system to human experimenters in a rigorous and quantitative manner. They even demonstrated that their automated system can be used in conjunction with in vivo imaging techniques.

      While it is somewhat unclear how easy and inexpensive this device will be, I anticipate everyone in the pain field will be clamoring to get their hands on a system like this. And given the mechanical nature of the device, and propensity for mice to urinate on things, I also wonder how frequently the device breaks/needs to be repaired. Perhaps some details regarding cost and reliability of the device would be helpful to include, as these are the two things that could make researchers hesitant to adopt immediately.

      The only major technical concern, which is easy to address, is whether the device generates ultrasounic sounds that rodents can hear when idle or operational, across the ultrasonic frequencies that are of biological relevance (20-110 kHz). These sounds are generally alarm vocalizations and can create stress in animals, and/or serve as cues of an impending stimulus (if indeed they are produced by the device).

      Comments on revisions:

      Was Fig. 1 updated with the new apparatus design? i.e. to address issue of animal waste affecting function over time?

      I have no further comments.

    2. Reviewer #2 (Public review):

      Summary:

      Burdge, Juhmka et al describe the development and validation of a new automated system for applying plantar stimuli in rodent somatosensory behavior tasks. This platform allows the users to run behavior experiments remotely, removing experimenter effects on animals and reducing variability in manual application of stimuli. The system integrates well with other automated analysis programs that the lab has developed, providing a complete package for standardizing behavior data collection and analysis. The authors present extensive validations of the system against manual stimulus application. Proof of concept studies also show how the system can be used to better understand the effect of experimenters on behavior and the effects of how stimuli are presented on the micro features of the animal withdrawal response.

      Strengths:

      If widely adopted, ARM has the potential to reduce variability in plantar behavior studies across and within labs and provide a means to standardize results. It provides a way to circumvent the confounds that humans bring into performing sensitive plantar behavior tests (e.g. experimenter odors, experince, physical abilities, variation in stimulus application, sex). Furthermore, it can be integrated with other automated platforms, allowing for quicker analysis and potentially automated stimulus delivery. The manuscript also presents some compelling evidence on the effects of stimulus application time and height on withdrawals, which can potentially help labs that are manually applying stimuli standardize applications. The system is well validated and the results are clear and convincingly presented. Claims are well supported by experimental evidence.

      Weaknesses:

      ARM seems like a fantastic system that could be widely adopted, a primary weakness is that it is not currently available to other labs. This will eventually be remedied as it is commercialised.

    3. Reviewer #3 (Public review):

      Summary:

      This report describes the development and initial applications of the ARM (Automated Reproducible Mechano-stimulator), a programmable tool that delivers various mechanical stimuli to a select target (most frequently, a rodent hindpaw). Comparisons to traditional testing methods (e.g., experimenter application of stimuli) reveal that the ARM reduces variability in the anatomical targeting, height, velocity, and total time of stimulus application. Given that the ARM can be controlled remotely, this device was also used to assess effects of experimenter presence on reflexive responses to mechanical stimulation. Although not every experimenter had notable sex-dependent effects on animal behavior, use of the ARM never had this effect (for obvious reasons!). Lastly, the ARM was used to stimulate rodent hindpaws while measuring neuronal activity in the basolateral nucleus of the amygdala (BLA), a brain region that is associated with the negative affect of pain. This device, and similar automated devices, will undoubtedly reduce experimenter-related variability in reflexive mechanical behavior tests; this may increase experimental reproducibility between laboratories who are able to invest in this type of technology.

      Strengths:

      Clear examples of variability in experimenter stimulus application are provided and then contrasted with uniform stimulus application that is inherent to the ARM.

      The ARM is able to quickly oscillate between delivery of various mechanical stimuli; this is advantageous for experimental efficiency.

      New additions to the ARM and PAWS platforms have been methodically tested to ensure reproducibility and reliability.

    1. Reviewer #1 (Public review):

      Summary:

      Ferreiro et al. present a method to simulate protein sequence evolution under a birth-death model where sequence evolution is guided by structural constraints on protein stability. The authors then use this model to explore the predictability of sequence evolution in several viral proteins. In principle, this work is of great interest to molecular evolution and phylodynamics, which has struggled to couple non-neutral models of sequence evolution to phylodynamic models like birth-death processes. Unfortunately, though, the model shows little improvement over neutral models in predicting protein sequence evolution, although it can predict protein stability better than models assuming neutral evolution. It appears that more work is needed to determine exactly what aspects of protein sequence evolution are predictable under such non-neutral phylogenetic models.

      Major concerns:

      (1) The authors have clarified the mapping between birth-death model parameters and fitness, but how fitness is modeled still appears somewhat problematic. The authors assume the death rate = 1 - birth rate. So a variant with a birth rate b = 1 would have a death rate d = 0 and so would be immortal and never die, which does not seem plausible. Also I'm not sure that this would "allow a constant global (birth-death) rate" as stated in line 172, as selection would still act to increase the population mean growth rate r = b - d. It seems more reasonable to assume that protein stability affects only either the birth or death rate and assume the other rate is constant, as in the Neher 2014 model.

      (2) It is difficult to evaluate the predictive performance of protein sequence evolution. This is in part due to the fact that performance is compared in terms of percent divergence, which is difficult to compare across viral proteins and datasets. Some protein sequences would be expected to diverge more because they are evolving over longer time scales, under higher substitution rates or under weaker purifying selection. It might therefore help to normalize the divergence between predicted and observed sequences by the expected or empirically observed amount of divergence seen over the timescale of prediction.

      (3) Predictability may also vary significantly across different sites in a protein. For example, mutations at many sites may have little impact on structural stability (in which case we would expect poor predictive performance) while even conservative changes at other sites may disrupt folding. I therefore feel that there remains much work to be done here in terms of figuring out where and when sequence evolution might be predictable under these types of models, and when sequence evolution might just be fundamentally unpredictable due to the high entropy of sequence space.

    2. Reviewer #2 (Public review):

      In this study, the authors aim to forecast the evolution of viral proteins by simulating sequence changes under a constraint of folding stability. The central idea is that proteins must retain a certain level of structural stability (quantified by folding free energy, ΔG) to remain functional, and that this constraint can shape and restrict the space of viable evolutionary trajectories. The authors integrate a birth-death population model with a structurally constrained substitution (SCS) model and apply this simulation framework to several viral proteins from HIV-1, SARS-CoV-2, and influenza.

      The motivation to incorporate biophysical constraints into evolutionary models is scientifically sound, and the general approach aligns with a growing interest in bridging molecular evolution and structural biology. The authors focus on proteins where immune pressure is limited and stability is likely to be a dominant constraint, which is conceptually appropriate. The method generates sequence variants that preserve folding stability, suggesting that stability-based filtering may capture certain evolutionary patterns.

      However, the study does not substantiate its central claim of forecasting. The model does not predict future sequences with measurable accuracy, nor does it reproduce observed evolutionary paths. Validation is limited to endpoint comparisons in a few datasets. While KL divergence is used to compare amino acid distributions, this analysis is only applied to a single protein (HIV-1 MA), and there is no assessment of mutation-level predictive accuracy or quantification of how well simulated sequences recapitulate real evolutionary paths. No comparison is made to real intermediate variants available from extensive viral sequencing datasets which gather thousands of sequences with detailed collection date annotation (SARS-CoV-2, Influenza, RSV).

      The selection of proteins is narrow and the rationale for including or excluding specific proteins is not clearly justified.

      The analyzed datasets are also under-characterized: we are not given insight into how variable the sequences are or how surprising the simulated sequences might be relative to natural diversity. Furthermore, the use of consensus sequences to represent timepoints is problematic, particularly in the context of viral evolution, where divergent subclades often coexist - a consensus sequence may not accurately reflect the underlying population structure.

      The fitness function used in the main simulations is based on absolute ΔG and rewards increased stability without testing whether real evolutionary trajectories tend to maintain, increase, or reduce folding stability over time for the particular systems (proteins) that are studied. While a variant of the model does attempt to center selection around empirical ΔG values, this more biologically plausible version is underutilized and not well validated.

      Ultimately, the model constrains sequence evolution to stability-compatible trajectories but does not forecast which of these trajectories are likely to occur. It is better understood as a filter of biophysically plausible outcomes than as a predictive tool. The distinction between constraint-based plausibility and sequence-level forecasting should be made clearer. Despite these limitations, the work may be of interest to researchers developing simulation frameworks or exploring the role of protein stability in viral evolution, and it raises interesting questions about how biophysical constraints shape sequence space over time.

    1. Reviewer #1 (Public review):

      Summary:

      This study is an evaluation of patient variants in the kidney isoform of AE1 linked to distal renal tubular acidosis. Drawing on observations in the mouse kidney, this study extends findings to autophagy pathways in a kidney epithelial cell line.

      Strengths:

      Experimental data are convincing and nicely done.

      Weaknesses:

      Some data are lacking or not explained clearly. Mutations are not consistently evaluated throughout the study, which makes it difficult to draw meaningful conclusions.

    2. Reviewer #2 (Public review):

      Context and significance:

      Distal renal tubular acidosis (dRTA) can be caused by mutations in a Cl-/HCO3- exchanger (kAE1) encoded by the SLC4A1 gene. The precise mechanisms underlying the pathogenesis of the disease due to these mutations are unclear, but it is thought that loss of the renal intercalated cells (ICs) that express kAE1 and/or aberrant autophagy pathway function in the remaining ICs may contribute to the disease. Understanding how mutations in SLC4A1 affect cell physiology and cells within the kidney, a major goal of this study, is an important first step to unraveling the pathophysiology of this complex heritable kidney disease.

      Summary:

      The authors identify a number of new mutations in the SLC4A1 gene in patients with diagnosed dRTA that they use for heterologous experiments in vitro. They also use a dRTA mouse model with a different SLC4A1 mutation for experiments in mouse kidneys. Contrary to previous work that speculated dRTA was caused mainly by trafficking defects of kAE1, the authors observe that their new mutants (with the exception of Y413H, which they only use in Figure 1) traffic and localize at least partly to the basolateral membrane of polarized heterologous mIMCD3 cells, an immortalized murine collecting duct cell line. They go on to show that the remaining mutants induce abnormalities in the expression of autophagy markers and increased numbers of autophagosomes, along with an alkalinized intracellular pH. They also reported that cells expressing the mutated kAE1 had increased mitochondrial content coupled with lower rates of ATP synthesis. The authors also observed a partial rescue of the effects of kAE1 variants through artificially acidifying the intracellular pH. Taken together, this suggests a mechanism for dRTA independent of impaired kAE1 trafficking and dependent on intracellular pH changes that future studies should explore.

      Strengths:

      The authors corroborate their findings in cell culture with a well-characterized dRTA KI mouse and provide convincing quantification of their images from the in vitro and mouse experiments.

      Weaknesses:

      The data largely support the claims as stated, with some minor suggestions for improving the clarity of the work. Some of the mutants induce different strengths of effects on autophagy and the various assays than others, and it is not clear why this is from the present manuscript, given that they propose pHi and the unifying mechanism.

    3. Reviewer #3 (Public review):

      Summary:

      The authors have identified novel dRTA causing SLC4A1 mutations and studied the resulting kAE1 proteins to determine how they cause dRTA. Based on a previous study on mice expressing the dRTA kAE1 R607H variant, the authors hypothesize that kAE1 variants cause an increase in intracellular pH, which disrupts autophagic and degradative flux pathways. The authors clone these new kAE1 variants and study their transport function and subcellular localization in mIMCD cells. The authors show increased abundance of LC3B II in mIMCD cells expressing some of the kAE1 variants, as well as reduced autophagic flux using eGFP-RFP-LC3. These data, as well as the abundance of autophagosomes, serve as the key evidence that these kAE1 mutants disrupt autophagy. Furthermore, the authors demonstrate that decreasing the intracellular pH abrogates the expression of LC3B II in mIMCD cells expressing mutant SLC4A1. Lastly, the authors argue that mitochondrial function, and specifically ATP synthesis, is suppressed in mIMCD cells expressing dRTA variants and that mitochondria are less abundant in AICs from the kidney of R607H kAE1 mice. While the manuscript does reveal some interesting new results about novel dRTA causing kAE1 mutations, the quality of the data to support the hypothesis that these mutations cause a reduction in autophagic flux can be improved. In particular, the precise method of how the western blots and the immunofluorescence data were quantified, with included controls, would enhance the quality of the data and offer more supportive evidence of the authors' conclusions.

      Strengths:

      The authors cloned novel dRTA causing kAE1 mutants into expression vectors to study the subcellular localization and transport properties of the variants. The immunofluorescence images are generally of high quality, and the authors do well to include multiple samples for all of their western blots.

      Weaknesses:

      Inconsistent results are reported for some of the variants. For example, R295H causes intracellular alkalinization but also has no effect on intracellular pH when measured by BCECF. The authors also appear to have performed these in vitro studies on mIMCD cells that were not polarized, and therefore, the localization of kAE1 to the basolateral membrane seems unlikely, based upon images included in the manuscript. Additionally, there is no in vivo work to demonstrate that these kAE1 variants alter intracellular pH, including the R607H mouse, which is available to the authors. The western blots are of varying quality, and it is often unclear which of the bands are being quantified. For example, LAMP1 is reported at 100kDa, the authors show three bands, and it is unclear which one(s) are used to quantify protein abundance. Strikingly, the authors report a nonsensical value for their quantification of LCRB II in Figure 2, where the ratio of LCRB II to total LCRB (I + II) is greater than one. The control experiments with starvation and bafilomyocin are not supportive and significantly reduce enthusiasm for the authors' findings regarding autophagy. There are labeling errors between the manuscript and the figures, which suggest a lack of vigilance in the drafting process.

    1. Reviewer #1 (Public review):

      Summary:

      Lysosomal damage is commonly found in many diseases including normal aging and age-related disease. However, the transcriptional programs activated by lysosomal damage has not been thoroughly characterized. This study aims to investigate lysosome damage-induced major transcriptional responses and the underlying signaling basis. The authors have convincingly shown that lysosomal damage activates a ubiquitination-dependent signaling axis involving TAB, TAK1, and IKK, which culminate in the activation of NF-kB and subsequent transcriptional upregulation of pro-inflammatory genes and pro-survival genes. Overall, the major aims of this study are successfully achieved.

      Strengths:

      This study is well-conceived and strictly executed, leading to clear and well-supported conclusions. Through unbiased transcriptomics and proteomics screens, the authors identifies NF-kB as a major transcriptional program activated upon lysosome damage. TAK1 activation by lysosome damage-induced ubiquitination is found to be essential for NF-kB activation and MAP kinase signaling. The transcriptional and proteomic changes are shown to be largely driven by TAK1 signaling. Finally, the TAK1-IKK signaling is shown to provide resistance to apoptosis during lysosomal damage response. The main signaling axis of this pathway has been convincingly demonstrated.

      Overall, this study identifies major transcriptional responses following lysosomal damage through unbiased approaches. It is important to consider the impact of these pathways in disease settings where lysosomal integrity is compromised.

      Comments on revisions:

      The authors have adequately addressed all previous comments. I have no further recommendations.

    2. Reviewer #2 (Public review):

      Summary:

      Endo et al. investigate the novel role of ubiquitin response upon lysosomal damage in activating cellular signaling for cell survival. The authors provide a comprehensive transcriptome and proteome analysis of aging-related cells experiencing lysosomal damage, identifying transcription factors involved in transcriptome and proteome remodeling with a focus on the NF-κB signaling pathway. They further characterized the K63-ubiquitin-TAB-TAK1-NF-κB signaling axis in controlling gene expression, inflammatory responses, and apoptotic processes.

      Strengths:

      In the aging-related model, the authors provide a comprehensive transcriptome and characterize the K63-ubiquitin-TAB-TAK1-NF-κB signaling axis. Through compelling experiments and advanced tools, they elucidate its critical role in controlling gene expression, inflammatory responses, and apoptotic processes.

      Weaknesses:

      The study lacks deeper connections with previous research, particularly:

      • The established role of TAB-TAK1 in AMPK activation during lysosomal damage

      • The potential significance of TBK1 in NF-κB signaling pathways

      Comments on revisions:

      The authors have successfully addressed all the raised questions and the manuscript is now significantly improved.

    3. Reviewer #3 (Public review):

      Summary:

      The response to lysosomal damage is a fast-moving and timely field. Besides repair and degradation pathways, increasing interest has been focusing on damaged-induced signaling. The authors conducted both transcriptomics and proteomics to characterize the cellular response to lysosomal damage. They identify a signaling pathway leading to activation of NFkappaB. Based on this and supported by Western blot and microscopy data, the authors nicely show that TAB2/3 and TAK1 are activated at damaged lysosomes and kick off the pathway to alter gene expression, which induces cytokines and protect from cell death. TAB2/3 activation is proposed to occur through K63 ubiquitin chain formation. Generally, this is a careful and well conducted study that nicely delineates the pathway under lysosomal stress. The "omics" data serves a valuable resource for the field. More work should be invested into how TAB2/3 are activated at the damaged lysosomes, also to increase novelty in light of previous reports.

      Strengths:

      Generally, this is a careful and well-conducted study that nicely delineates how the NFkB pathway is activated under lysosomal stress and modulates cell behavior. The "omics" data serves as a valuable resource for the field.

      Weaknesses:

      While activation of TAB2/3 by K63-linked Ub chains is convincing, more work needs to be done on how they are recruited by distinct damage types to probe relevance for different pathophysiological conditions."

      Comments on revisions:

      The authors have addressed much of my criticism. Specifically, they have put (with new experiments) the data on the TAB2/3-TAK1 pathway in perspective to the previously reported LUBAC-mediated activation of NFkB. They also addressed the question about the significance of K63-linked chains for TAB2/3 activation with new complementation experiments (a K63-specific NZF mutant failed to rescue).

      The third point (types of damage as triggers) raises more questions, though. The authors find that, in contrast to LLOMe, GPN or DC661-induced damage does not activate TAK1 (consistent with lower damage levels). However, the authors still observe K63 ubiquitylation. This goes along with their finding that TAB2 is recruited in the absence of any ubiquitylation (blocked by TAK-243). It argues that TAB2 is recruited by an unknown cue (that may be damage-specific) and then activated by K63. The authors need to clarify whether TAB2 is or is not recruited in the GPN/DC661 conditions (in which K63 occurs, but TAK1 is not activated). The point about the effects of other damage types was also raised by reviewer #1 and should be solved. The fact that TAB2 is recruited independently of K63 should also be visualized in the model. The manuscript will then be an important contribution to the field.

    1. Reviewer #1 (Public review):

      Summary:

      This study by Bushey et al., focuses on two newly released red-shifted anion-Channelrhodopsins (A1ACR and HfACR, referred as Ruby-ACRs) in Drosophila. Here, the authors use a combination of electrophysiology, calcium imaging, and behavioral analyses to demonstrate the advantages of Ruby-ACRs over previous optogenetic silencers like the green-shifted GtACR1 and the blue-shifted GtACR2: higher photocurrent, faster kinetics, and operating at a light spectrum range that prevents unwanted behavioral effects in the fly. The availability of these new red-shifted silencers constitutes a great addition to the Drosophila genetic toolkit.

      Strengths:

      (1) The authors generate both UAS and LexAop RubyACR reagents and test them in a variety of preparations (electrophysiological recordings, calcium imaging, different behavioral paradigms) that cover the breadth of the fly research environment.

      (2) The optical stimulation parameters are carefully measured and characterized. Especially impressive is that they managed to titrate over both wavelength and intensity across their various assays. This provides a comprehensive dataset to the community.

      (3) Tools are made available to the community through the stock center.

      Weaknesses:

      (1) The authors could better describe their construct and choice of parameters for the chosen construct. I am specifically wondering about the following points:

      a) Why use that particular backbone (not the most commonly used one across recent literature (pJFRC7 is more common).

      b) Why do the CsChrimson and GTACR1 have a Kir sequence in it, and why did the authors not put this in the RubyACRs? I would also prefer if authors don't refer to GtACR1 as GTACR-Kir in text (e.g., in line 72); instead, they should either refer to it as GtACR1 or GtACR1-kir-mVenus (based on the full genotype mentioned in their table at the end). Same for CsChrimson-kir. From what I understand, this is just a Kir trafficking sequence and not the entire Kir sequence, which can confuse the readers.

      c) Finally, I would also encourage authors to deposit plasmids on Addgene.

      (2) Figure 2 is interesting, but it is a bit unfortunate that there is a YFP baseline in most of the samples here (except Chrimson88; this should also be mentioned). I wonder how the YFP baseline impacts this data. Could the high intensity stimulation (red light) lead to bleaching of YFP or tdTomato that reduces the baseline in the green channel? All this also makes me wonder if authors tried tagging the RubyACRs with other fluorophores or non-fluorescent tags and how that impacted their functioning. Non-YFP-tagged versions would be more useful for applications involving GCaMP imaging.

      (3) Another point for Figure 2: Since RubyACRs seem to have such a broad activation range, I wonder how much the imaging light (920nm) impacts the baseline in these experiments. If there were plots without the red light stimulation and just varying imaging light intensity, that could be useful to the research community.

      (4) Also, for Figures 2C - D, in the methods authors indicate that the stimulation light intensities were progressively increased. Could this lead to desensitization of opsin? Wouldn't randomized intensities be a better way to do this? Perhaps it should be mentioned as a caveat.

      (5) In Figure 3E the bottom middle panel Vglut-Gal4,GtACR1 shows a major increase in walking at light onset. This seems very different than all other conditions, and I could not find any discussion of this. It would help if some explanation were provided for this.

    2. Reviewer #2 (Public review):

      Summary:

      Bushey et al. investigate the feasibility of using RubyACRs, specifically A1ACR1 and HfACR1 (described previously in (Govorunova et al., 2020)) as red-shifted inhibitory opsins in Drosophila melanogaster. The study employs a wide range of techniques to demonstrate successful neuronal inhibition. Electrophysiology experiments established that HfACR1 was most effective at hyperpolarizing cells, compared to A1ACR1 and GtACR1; both RubyACRs also appeared to be more effective than GtACR1 when the latter was actuated by green light. The authors further demonstrate successful neuronal inhibition using calcium imaging. RubyACRs were also shown to be useful in in vivo behavioral setups, specifically in spontaneous locomotion, associative learning, and courtship paradigms. In the courtship assay, in particular, the authors test multiple wavelengths of light at various light intensities, thus providing a rigorous analysis of the RubyACRs' efficacy under different light conditions.

      Strengths:

      The work provides the Drosophila field with a promising new tool. Red-shifted opsins are particularly advantageous in behavioral assays as red light penetrates the cuticle better than green or blue light, and provides less visual stimulation to the fly. It is also ideal for imaging as it allows for simultaneous optogenetic stimulation and GCamp imaging. A particular strength of the paper is the direct demonstration of RubyACR's capacity to inhibit neurons via electrophysiology and calcium imaging. Furthermore, inhibition effects in the three behavioral assays are strong and convincing. Given the apparent efficacy of RubyACRs and the advantages of a red-sensitive anion channelrhodopsin, this tool has great potential.

      Weaknesses:

      This work convincingly demonstrates the efficacy and potential utility of RubyACRs in Drosophila for imaging and behavior. However, the lethality/toxicity of RubyACRs is a relevant concern that should be addressed in-depth rather than glossed over, as it may pose a major obstacle to use. Discussing this issue in the present study will also help guide potential users and will set the stage for potential future efforts to ameliorate RubyACRs as optogenetic inhibitors.

      Major concerns:

      (1) Table 1 demonstrates high lethality in the RubyACRs compared to GtACR1. For example, in the MI04979-VGlut driver, GtACR1 expression resulted in 32.9% lethality, while HfACR1 expression resulted in 98.7% lethality. This lethality presents an obstacle to the potential adoption of this tool, and should be discussed in detail, rather than in passing. The authors might like to present "% lethality" rather than "% survived", as the former is more relevant when discussing the relative yield and health of flies that can be used in experiments.

      (2) In Figure 3D, driver>opsin flies have lower locomotion during the baseline (i.e., dark) phase, compared to opsin-only controls or GtACR1 flies. For some comparisons, flies are walking around 10-fold slower. For example, in the case of VGlut-GAL4>HfACR1, test flies are walking at <1 mm/s, while "Empty" test flies are walking at ~10 mm/s. This suggests that, for these drivers, neuronal and/or network function is affected. It opens the possibility that the lethality and locomotor defects could be due to cell-autonomous toxicity. We ask the authors to provide a description of this effect in the Results and to discuss it in the Discussion. Relatedly, VGlut-GAL4>GtACR1 flies in red light exhibit a locomotion increase, but this data is not mentioned in the text. The use of differing scales for the Y-axes in these panels can be confusing when the reader is expected to compare velocity across different panels. It would be best if the y-axes were set to a single range, e.g., 0 to 12 mm/s.

      (3) Lethality in broad drivers could result from cell-autonomous toxicity or neuronal dysfunction resulting from RubyACR expression. Ideally, the authors would address or even investigate the possible mechanisms of toxicity of the RubyACRs. Do cells and/or synapses expressing RubyACRs have normal morphology and function? For example, the authors could compare cell survival between flies with RubyACR expression and flies with a fluorescent protein with no opsin. The authors may also want to present lethality data for other, less broad drivers (such as MB320C, which was used for the associative memory assay) in order to demonstrate whether this problem is confined to broad drivers such as VGlut-GAL4, or if this is a problem with narrow drivers as well. If new experiments are not possible, these issues should at least be mentioned in the Discussion.

      Minor concerns

      (1) The specific method used for quantifying lethality is mentioned briefly in Table 1 but is not detailed in the Methods. The authors derive lethality by comparing to a sibling control group with either the opsin or the driver alone, but the opsin alone or driver alone may cause some lethality by themselves. We suggest the use of a viability assay, e.g. (Rockwell et al., 2019), which would give potential users a clearer picture of which developmental stage is most affected by opsin expression, as well as allow opsin-only, driver-only and experimental groups to be assessed separately (lethality would then be reported as the % of embryos that reach each stage of development, and eventually enclosure).

      (2) For the calcium imaging analysis in Figure 2, the U-shaped curve observed for mean ΔF/F0 for A1ACR1 and HfACR1 may not be due to actual desensitization for the channels, as the authors suggest (lines 143-145), but may be due simply to a shifting baseline. The authors use the 5-s period preceding stimulation onset as F0, but in some cases (e.g., HfACR1 at 250 uW/mm2), calcium fluorescence rises above baseline and remains high post-stimulation (ΔF/F0 of +0.5, which we observe is the same magnitude as the ΔF/F0 of -0.5 observed during inhibition), thus affecting the ΔF/F0 for subsequent trials. The authors should discuss this incomplete recovery in the text, or (if available) use a static channel instead to provide a stable F0 for calculating ΔF/F0. Alternatively, if the authors wish to rigorously test the hypothesis that high light intensity indeed results in desensitization of these channels, they may consider using different flies for each light intensity or longer inter-stimulus intervals.

      (3) For Figure 3C (Flybowl assay), the authors mention that "simply expressing the opsins decreased baseline locomotor activity compared to empty driver lines". However, the "Empty" controls in 3C appear to refer to opsin-only controls, not driver-only controls. The driver-only controls are not presented in the figure. The use of "empty" differs between the text and the figure, as the text refers to "empty" driver lines, while the figure uses "empty" to apparently refer to opsin-only controls. We recommend changing the terminology across all figures to be unambiguous, e.g., by using "opsin-only" or "driver-only" as opposed to the ambiguous "empty". In addition, the fact that opsin-only controls move less than driver-only controls may suggest some toxicity as a result of the opsin-only construct; this should be discussed further.

      (4) Figures 4 and 5 lack the reporting of driver-only controls.

      (5) Figures 3 and 4 lack positive controls; that is, the benchmarking of the efficacy of RubyACRs in their respective behavioral paradigms against a known inhibitor, e.g., GtACR1 with green light. To confirm that this GtACR1 transgene is functional, the authors could include GtACR1 with green light as a positive control for these two figures, as they have done for Figure 5-supplement 2 and 3.

      (6) Several citations are missing. In their discussion, the authors highlight that shorter wavelengths of light are more attenuated by tissue (lines 278-281); this should be accompanied by the relevant citations (Inagaki et al., 2014). Similarly, the claim that behavioral experiments exhibit greater sensitivity to shorter wavelengths should be substantiated (lines 281-283).

      References:

      Govorunova EG, Sineshchekov OA, Li H, Wang Y, Brown LS, Spudich JL. 2020. RubyACRs, nonalgal anion channelrhodopsins with highly red-shifted absorption. Proc Natl Acad Sci U S A 117:22833-22840.

      Inagaki HK, Jung Y, Hoopfer ED, Wong AM, Mishra N, Lin JY, Tsien RY, Anderson DJ. 2014. Optogenetic control of Drosophila using a red-shifted channelrhodopsin reveals experience-dependent influences on courtship. Nat Methods 11:325-332.

      Rockwell AL, Beaver I, Hongay CF. 2019. A direct and simple method to assess Drosophila melanogaster's viability from embryo to adult. J Vis Exp e59996.

    3. Reviewer #3 (Public review):

      Summary:

      This study by Bushey et al. adapts and evaluates two newly developed red-shifted optogenetic inhibitors, A1ACR1 and HfACR1, collectively referred to as RubyACRs, for neuronal silencing in Drosophila melanogaster. Traditional optogenetic inhibitors such as GtACR1 and GtACR2 are activated by green (~515 nm) and blue (~470 nm) light, respectively, which poses several limitations in Drosophila. Specifically, shorter-wavelength light suffers from reduced tissue penetration and increased absorption, and is visible to flies, potentially confounding behavioral assays, particularly those involving visual processing. In contrast, RubyACRs are activated by red light (~610-660 nm), which penetrates the cuticle more effectively and thus can be more potent in manipulating fly behavior. In the current manuscript, the authors first demonstrate that both A1ACR1 and HfACR1 can be robustly expressed in fly neurons and are properly trafficked to the plasma membrane. Upon red-light stimulation, both opsins produce strong and sustained hyperpolarization in larval motor neurons, outperforming GtACR1 in both magnitude and temporal dynamics. Next, using two-photon calcium imaging in the visual system, the authors further demonstrate that activation of RubyACRs significantly reduces GCaMP6s signal, indicating that they can reliably inhibit neuronal activity. Importantly, unlike reported in some mammalian studies, RubyACRs do not appear to trigger paradoxical depolarization at axon terminals in the fly visual system, as no evidence of aberrant depolarization is observed in motion-detecting Mi1 neurons.

      In the second part of the manuscript, the authors characterize the effects of RubyACRs on fly behavior (walking, learning, and courtship song). Using the inhibition of genetically labelled neurons that regulate these behaviors, the authors demonstrate that stimulation of RubyACRs leads to potent suppression of locomotion, courtship song, or dopamine-dependent associative learning.

      Strengths:

      Altogether, the experiments conducted in this manuscript demonstrate that RubyACRs are powerful tools for optogenetic inhibition in Drosophila, with advantages in spectral compatibility, behavioral specificity, and potential applications in vivo two-photon calcium imaging.

      Weaknesses:

      The manuscript is strong, but it can be further improved with a few additional analyses and minor revisions. Especially, a more detailed evaluation of RubyACRs with two-photon excitation will help clarify to what extent these opsins can be simultaneously used together with green GECIs, such as GCaMPs.

    1. Reviewer #1 (Public review):

      I applaud the authors' for providing a thorough response to my comments from the first round of review. The authors' have addressed the points I raised on the interpretation of the behavioral results as well as the validation of the model (fit to the data) by conducting new analyses, acknowledging the limitations where required and providing important counterpoints. As a result of this process, the manuscript has considerably improved. I have no further comments and recommend this manuscript for publication.

    2. Reviewer #2 (Public review):

      Summary:

      This manuscript proposes that the use of a latent cause model for assessment of memory-based tasks may provide improved early detection in Alzheimer's Disease as well as more differentiated mapping of behavior to underlying causes. To test the validity of this model, the authors use a previously described knock-in mouse model of AD and subject the mice to several behaviors to determine whether the latent cause model may provide informative predictions regarding changes in the observed behaviors. They include a well-established fear learning paradigm in which distinct memories are believed to compete for control of behavior. More specifically, it's been observed that animals undergoing fear learning and subsequent fear extinction develop two separate memories for the acquisition phase and the extinction phase, such that the extinction does not simply 'erase' the previously acquired memory. Many models of learning require the addition of a separate context or state to be added during the extinction phase and are typically modeled by assuming the existence of a new state at the time of extinction. The Niv research group, Gershman et al. 2017, have shown that the use of a latent cause model applied to this behavior can elegantly predict the formation of latent states based on a Bayesian approach, and that these latent states can facilitate the persistence of the acquisition and extinction memory independently. The authors of this manuscript leverage this approach to test whether deficits in production of the internal states, or the inference and learning of those states, may be disrupted in knock-in mice that show both a build-up of amyloid-beta plaques and a deterioration in memory as the mice age.

      Strengths:

      I think the authors' proposal to leverage the latent cause model and test whether it can lead to improved assessments in an animal model of AD is a promising approach for bridging the gap between clinical and basic research. The authors use a promising mouse model and apply this to a paradigm in which the behavior and neurobiology are relatively well understood - an ideal situation for assessing how a disease state may impact both the neurobiology and behavior. The latent cause model has the potential to better connect observed behavior to underlying causes and may pave a road for improved mapping of changes in behavior to neurobiological mechanisms in diseases such as AD.<br /> The authors also compare the latent cause model to the Rescorla-Wagner model and a latent state model allowing for better assessment of the latent cause model as a strong model for assessing reinstatement.

      Weaknesses:

      I have several substantial concerns which I've detailed below. These include important details on how the behavior was analyzed, how the model was used to assess the behavior, and the interpretations that have been made based on the model.<br /> (1) There is substantial data to suggest that during fear learning in mice separate memories develop for the acquisition and extinction phases, with the acquisition memory becoming more strongly retrieved during spontaneous recovery and reinstatement. The Gershman paper, cited by the authors, shows how the latent causal model can predict this shift in latent causes by allowing for the priors to decay over time, thereby increasing the posterior of the acquisition memory at the time of spontaneous recovery. In this manuscript, the authors suggest a similar mechanism of action for reinstatement, yet the model does not appear to return to the acquisition memory after reinstatement, at least based on the simulation and examples shown in figures 1 and 3. More specifically, in figure 1, the authors indicate that the posterior probability of the latent cause, zA (the putative acquisition memory), increases, partially leading to reinstatement. This does not appear to be the case as test 3 (day 36) appears to have similar posterior probabilities for zA as well as similar weights for the CS as compared to the last days of extinction. Rather, the model appears to mainly modify the weights in the most recent latent cause, zB - the putative the 'extinction state', during reinstatement. The authors suggest that previous experimental data have indicated that spontaneous recovery or reinstatement effects are due to an interaction of the acquisition and extinction memory. These studies have shown that conditioned responding at a later time point after extinction is likely due to a balance between the acquisition memory and the extinction memory, and that this balance can shift towards the acquisition memory naturally during spontaneous recovery, or through artificial activation of the acquisition memory or inhibition of the extinction memory (see Lacagnina et al. for example). Here the authors show that the same latent cause learned during extinction, zB, appears to dominate during the learning phase of reinstatement, with rapid learning to the context - the weight for the context goes up substantially on day 35 - in zB. This latent cause, zB, dominates at the reinstatement test, and due to the increased associative strength between the context and shock, there is a strong CR. For the simulation shown in figure 1, it's not clear why a latent cause model is necessary for this behavior. This leads to the next point.

      (2) The authors compared the latent cause model to the Rescorla-Wagner model. This is very commendable, particularly since the latent cause model builds upon the RW model, so it can serve as an ideal test for whether a more simplified model can adequately predict the behavior. The authors show that the RW model cannot successfully predict the increased CR during reinstatement (Appendix figure 1). Yet there are some issues with the way the authors have implemented this comparison:<br /> (2A) The RW model is a simplified version of the latent cause model and so should be treated as a nested model when testing, or at a minimum, the number of parameters should be taken into account when comparing the models using a method such as the Bayesian Information Criterion, BIC.<br /> (2B) The RW model provides the associative strength between stimuli and does not necessarily require a linear relationship between V and the CR. This is the case in the original RW model as well as in the LCM. To allow for better comparison between the models, the authors should be modeling the CR in the same manner (using the same probit function) in both models. In fact, there are many instances in which a sigmoid has been applied to RW associative strengths to predict CRs. I would recommend modeling CRs in the RW as if there is just one latent cause. Or perhaps run the analysis for the LCM with just one latent cause - this would effectively reduce the LCM to RW and keep any other assumptions identical across the models.<br /> (2C) In the paper, the model fits for the alphas in the RW model are the same across the groups. Were the alphas for the two models kept as free variables? This is an important question as it gets back to the first point raised. Because the modeling of the reinstatement behavior with the LCM appears to be mainly driven by latent cause zB, the extinction memory, it may be possible to replicate the pattern of results without requiring a latent cause model. For example, the 12-month-old App NL-G-F mice behavior may have a deficit in learning about the context. Within the RW model, if the alpha for context is set to zero for those mice, but kept higher for the other groups, say alpha_context = 0.8, the authors could potentially observe the same pattern of discrimination indices in figure 2G and 2H at test. Because the authors don't explicitly state which parameters might be driving the change in the DI, the authors should show in some way that their results cannot simply be due to poor contextual learning in the 12 month old App NL-G-F mice, as this can presumably be predicted by the RW model. The authors' model fits using RW don't show this, but this is because they don't consider this possibility that the alpha for context might be disrupted in the 12-month-old App NL-G-F mice. Of course, using the RW model with these alphas won't lead to as nice of fits of the behavior across acquisition, extinction, and reinstatement as the authors' LCM, the number of parameters are substantially reduced in the RW model. Yet the important pattern of the DI would be replicated with the RW model (if I'm not mistaken), which is the important test for assessment of reinstatement.

      (3) As stated by the authors in the introduction, the advantage of the fear learning approach is that the memory is modified across the acquisition-extinction-reinstatement phases. Although perhaps not explicitly stated by the authors, the post-reinstatement test (test 3) is the crucial test for whether there is reactivation of a previously stored memory, with the general argument being that the reinvigorated response to the CS can't simply be explained by relearning the CS-US pairing, because re-exposure the US alone leads to increase response to the CS at test. Of course there are several explanations for why this may occur, particularly when also considering the context as a stimulus. This is what I understood to be the justification for the use of a model, such as the latent cause model, that may better capture and compare these possibilities within a single framework. As such, it is critical to look at the level of responding to both the context alone and to the CS. It appears that the authors only look at the percent freezing during the CS, and it is not clear whether this is due to the contextual-US learning during the US re-exposure or to increased responding to the CS - presumably caused by reactivation of the acquisition memory. The authors do perform a comparison between the preCS and CS period, but it is not clear whether this is taken into account in the LCM. For example, the instance of the model shown in figure 1 indicates that the 'extinction cause', or cause z6, develops a strong weight for the context during the reinstatement phase of presenting the shock alone. This state then leads to increased freezing during the final CS probe test as shown in the figure. If they haven't already, I think the authors must somehow incorporate these different phases (CS vs ITI) into their model, particularly since this type of memory retrieval that depends on assessing latent states is specifically why the authors justified using the latent causal model. In more precise terms, it's not clear whether the authors incorporate a preCS/ITI period each day the cue is presented as a vector of just the context in addition to the CS period in which the vector contains both the context and the CS. Based on the description, it seemed to me that they only model the CRs during the CS period on days when the CS is presented, and thereby the context is only ever modeled on its own (as just the context by itself in the vector) on extinction days when the CS is not presented. If they are modeling both timepoints each day that the CS I presented, then I would recommend explicitly stating this in the methods section.

      (4) The authors fit the model using all data points across acquisition and learning. As one of the other reviewers has highlighted, it appears that there is a high chance for overfitting the data with the LCM. Of course, this would result in much better fits than models with substantially fewer free parameters, such as the RW model. As mentioned above, the authors should use a method that takes into account the number of parameters, such as the BIC.

      (5) The authors have stated that they do not think the Barnes maze task can be modeled with the LCM. Whether or not this is the case, if the authors do not model this data with the LCM, the Barnes maze data doesn't appear valuable to the main hypothesis. The authors suggest that more sophisticated models such as the LCM may be beneficial for early detection of diseases such as Alzheimer's, so the Barnes maze data is not valuable for providing evidence of this hypothesis. Rather, the authors make an argument that the memory deficits in the Barnes maze mimic the reinstatement effects providing support that memory is disrupted similarly in these mice. Although, the authors state that the deficits in memory retrieval are similar across the two tasks, the authors are not explicit as to the precise deficits in memory retrieval in the reinstatement task - it's a combination of overgeneralizing latent causes during acquisition, poor learning rate, over differentiation of the stimuli.

    3. Reviewer #3 (Public review):

      Summary:

      This paper seeks to identify underlying mechanisms contributing to memory deficits observed in Alzheimer's disease (AD) mouse models. By understanding these mechanisms, they hope to uncover insights into subtle cognitive changes early in AD to inform interventions for early-stage decline.

      Strengths:

      The paper provides a comprehensive exploration of memory deficits in an AD mouse model, covering early and late stages of the disease. The experimental design was robust, confirming age-dependent increases in Aβ plaque accumulation in the AD model mice and using multiple behavior tasks that collectively highlighted difficulties in maintaining multiple competing memory cues, with deficits most pronounced in older mice.

      In the fear acquisition, extinction, and reinstatement task, AD model mice exhibited a significantly higher fear response after acquisition compared to controls, as well as a greater drop in fear response during reinstatement. These findings suggest that AD mice struggle to retain the fear memory associated with the conditioned stimulus, with the group differences being more pronounced in the older mice.

      In the reversal Barnes maze task, the AD model mice displayed a tendency to explore the maze perimeter rather than the two potential target holes, indicating a failure to integrate multiple memory cues into their strategy. This contrasted with the control mice, which used the more confirmatory strategy of focusing on the two target holes. Despite this, the AD mice were quicker to reach the target hole, suggesting that their impairments were specific to memory retrieval rather than basic task performance.

      The authors strengthened their findings by analyzing their data with a leading computational model, which describes how animals balance competing memories. They found that AD mice showed somewhat of a contradiction: a tendency to both treat trials as more alike than they are (lower α) and similar stimuli as more distinct than they are (lower σx) compared to controls.

      Weaknesses:

      While conceptually solid, the model struggles to fit the data and to support the key hypothesis about AD mice's inability to retain competing memories. These issues are evident in Figure 3:

      (1) The model misses trends in the data, including the gradual learning of fear in all groups during acquisition, the absence of a fear response at the start of the experiment, and the faster return of fear during reinstatement compared to the gradual learning of fear during acquisition. It also underestimates the increase in fear at the start of day 2 of extinction, particularly in controls.

      (2) The model explains the higher fear response in controls during reinstatement largely through a stronger association to the context formed during the unsignaled shock phase, rather than to any memory of the conditioned stimulus from acquisition (as seen in Figure 3C). In the experiment, however, this memory does seem to be important for explaining the higher fear response in controls during reinstatement (as seen in Author Response Figure 3). The model does show a necessary condition for memory retrieval, which is that controls rely more on the latent causes from acquisition. But this alone is not sufficient, since the associations within that cause may have been overwritten during extinction. The Rescorla-Wagner model illustrates this point: it too uses the latent cause from acquisition (as it only ever uses a single cause across phases) but does not retain the original stimulus-shock memory, updating and overwriting it continuously. Similarly, the latent cause model may reuse a cause from acquisition without preserving its original stimulus-shock association.

      These issues lead to potential overinterpretation of the model parameters. The differences in α and σx are being used to make claims about cognitive processes (e.g., overgeneralization vs. over differentiation), but the model itself does not appear to capture these processes accurately.

      The authors could benefit from a model that better matches the data and captures the retention and retrieval of fear memories across phases. While they explored alternatives, including the Rescorla-Wagner model and a latent state model, these showed no meaningful improvement in fit. This highlights a broader issue: these models are well-motivated but may not fully capture observed behavior.

      Conclusion:

      Overall, the data support the authors' hypothesis that AD model mice struggle to retain competing memories, with the effect becoming more pronounced with age. While I believe the right computational model could highlight these differences, the current models fall short in doing so.

    1. Reviewer #1 (Public review):

      Summary:

      Review of the manuscript titled " Mycobacterial Metallophosphatase MmpE acts as a nucleomodulin to regulate host gene expression and promotes intracellular survival".

      The study provides an insightful characterization of the mycobacterial secreted effector protein MmpE, which translocates to the host nucleus and exhibits phosphatase activity. The study characterizes the nuclear localization signal sequences and residues critical for the phosphatase activity, both of which are required for intracellular survival.

      Strengths:

      (1) The study addresses the role of nucleomodulins, an understudied aspect in mycobacterial infections.

      (2) The authors employ a combination of biochemical and computational analyses along with in vitro and in vivo validations to characterize the role of MmpE.

      Weaknesses:

      (1) While the study establishes that the phosphatase activity of MmpE operates independently of its NLS, there is a clear gap in understanding how this phosphatase activity supports mycobacterial infection. The investigation lacks experimental data on specific substrates of MmpE or pathways influenced by this virulence factor.

      (2) The study does not explore whether the phosphatase activity of MmpE is dependent on the NLS within macrophages, which would provide critical insights into its biological relevance in host cells. Conducting experiments with double knockout/mutant strains and comparing their intracellular survival with single mutants could elucidate these dependencies and further validate the significance of MmpE's dual functions.

      (3) The study does not provide direct experimental validation of the MmpE deletion on lysosomal trafficking of the bacteria.

      (4) The role of MmpE as a mycobacterial effector would be more relevant using virulent mycobacterial strains such as H37Rv.

    2. Reviewer #2 (Public review):

      Summary:

      In this paper, the authors have characterized Rv2577 as a Fe3+/Zn2+ -dependent metallophosphatase and a nucleomodulin protein. The authors have also identified His348 and Asn359 as critical residues for Fe3+ coordination. The authors show that the proteins encode for two nuclease localization signals. Using C-terminal Flag expression constructs, the authors have shown that the MmpE protein is secretory. The authors have prepared genetic deletion strains and show that MmpE is essential for intracellular survival of M. bovis BCG in THP-1 macrophages, RAW264.7 macrophages, and a mouse model of infection. The authors have also performed RNA-seq analysis to compare the transcriptional profiles of macrophages infected with wild-type and MmpE mutant strains. The relative levels of ~ 175 transcripts were altered in MmpE mutant-infected macrophages and the majority of these were associated with various immune and inflammatory signalling pathways. Using these deletion strains, the authors proposed that MmpE inhibits inflammatory gene expression by binding to the promoter region of a vitamin D receptor. The authors also showed that MmpE arrests phagosome maturation by regulating the expression of several lysosome-associated genes such as TFEB, LAMP1, LAMP2, etc. These findings reveal a sophisticated mechanism by which a bacterial effector protein manipulates gene transcription and promotes intracellular survival.

      Strength:

      The authors have used a combination of cell biology, microbiology, and transcriptomics to elucidate the mechanisms by which Rv2577 contributes to intracellular survival.

      Weakness:

      The authors should thoroughly check the mice data and show individual replicate values in bar graphs.

    3. Reviewer #3 (Public review):

      Summary:

      In this manuscript titled "Mycobacterial Metallophosphatase MmpE Acts as a Nucleomodulin to Regulate Host Gene Expression and Promote Intracellular Survival", Chen et al describe biochemical characterisation, localisation and potential functions of the gene using a genetic approach in M. bovis BCG and perform macrophage and mice infections to understand the roles of this potentially secreted protein in the host cell nucleus. The findings demonstrate the role of a secreted phosphatase of M. bovis BCG in shaping the transcriptional profile of infected macrophages, potentially through nuclear localisation and direct binding to transcriptional start sites, thereby regulating the inflammatory response to infection.

      Strengths:

      The authors demonstrate using a transient transfection method that MmpE when expressed as a GFP-tagged protein in HEK293T cells, exhibits nuclear localisation. The authors identify two NLS motifs that together are required for nuclear localisation of the protein. A deletion of the gene in M. bovis BCG results in poorer survival compared to the wild-type parent strain, which is also killed by macrophages. Relative to the WT strain-infected macrophages, macrophages infected with the ∆mmpE strain exhibited differential gene expression. Overexpression of the gene in HEK293T led to occupancy of the transcription start site of several genes, including the Vitamin D Receptor. Expression of VDR in THP1 macrophages was lower in the case of ∆mmpE infection compared to WT infection. This data supports the utility of the overexpression system in identifying potential target loci of MmpE using the HEK293T transfection model. The authors also demonstrate that the protein is a phosphatase, and the phosphatase activity of the protein is partially required for bacterial survival but not for the regulation of the VDR gene expression.

      Weaknesses:

      (1) While the motifs can most certainly behave as NLSs, the overexpression of a mycobacterial protein in HEK293T cells can also result in artefacts of nuclear localisation. This is not unprecedented. Therefore, to prove that the protein is indeed secreted from BCG, and is able to elicit transcriptional changes during infection, I recommend that the authors (i) establish that the protein is indeed secreted into the host cell nucleus, and (ii) the NLS mutation prevents its localisation to the nucleus without disrupting its secretion.

      Demonstration that the protein is secreted: Supplementary Figure 3 - Immunoblotting should be performed for a cytosolic protein, also to rule out detection of proteins from lysis of dead cells. Also, for detecting proteins in the secreted fraction, it would be better to use Sauton's media without detergent, and grow the cultures without agitation or with gentle agitation. The method used by the authors is not a recommended protocol for obtaining the secreted fraction of mycobacteria.

      Demonstration that the protein localises to the host cell nucleus upon infection: Perform an infection followed by immunofluorescence to demonstrate that the endogenous protein of BCG can translocate to the host cell nucleus. This should be done for an NLS1-2 mutant expressing cell also.

      (2) In the RNA-seq analysis, the directionality of change of each of the reported pathways is not apparent in the way the data have been presented. For example, are genes in the cytokine-cytokine receptor interaction or TNF signalling pathway expressed more, or less in the ∆mmpE strain?

      (3) Several of these pathways are affected as a result of infection, while others are not induced by BCG infection. For example, BCG infection does not, on its own, produce changes in IL1β levels. As the authors did not compare the uninfected macrophages as a control, it is difficult to interpret whether ∆mmpE induced higher expression than the WT strain, or simply did not induce a gene while the WT strain suppressed expression of a gene. This is particularly important because the strain is attenuated. Does the attenuation have anything to do with the ability of the protein to induce lysosomal pathway genes? Does induction of this pathway lead to attenuation of the strain? Similarly, for pathways that seem to be downregulated in the ∆mmpE strain compared to the WT strain, these might have been induced upon infection with the WT strain but not sufficiently by the ∆mmpE strain due to its attenuation/ lower bacterial burden.

      (4) CHIP-seq should be performed in THP1 macrophages, and not in HEK293T. Overexpression of a nuclear-localised protein in a non-relevant line is likely to lead to several transcriptional changes that do not inform us of the role of the gene as a transcriptional regulator during infection.

      (5) I would not expect to see such large inflammatory reactions persisting 56 days post-infection with M. bovis BCG. Is this something peculiar for an intratracheal infection with 1x107 bacilli? For images of animal tissue, the authors should provide images of the entire lung lobe with the zoomed-in image indicated as an inset.

      (6) For the qRT-PCR based validation, infections should be performed with the MmpE-complemented strain in the same experiments as those for the WT and ∆mmpE strain so that they can be on the same graph, in the main manuscript file. Supplementary Figure 4 has three complementary strains. Again, the absence of the uninfected, WT, and ∆mmpE infected condition makes interpretation of these data very difficult.

      (7) The abstract mentions that MmpE represses the PI3K-Akt-mTOR pathway, which arrests phagosome maturation. There is not enough data in this manuscript in support of this claim. Supplementary Figure 5 does provide qRT-PCR validation of genes of this pathway, but the data do not indicate that higher expression of these pathways, whether by VDR repression or otherwise, is driving the growth restriction of the ∆mmpE strain.

      (8) The relevance of the NLS and the phosphatase activity is not completely clear in the CFU assays and in the gene expression data. Firstly, there needs to be immunoblot data provided for the expression and secretion of the NLS-deficient and phosphatase mutants. Secondly, CFU data in Figure 3A, C, and E must consistently include both the WT and ∆mmpE strain.

    1. Reviewer #1 (Public review):

      From my reading, this study aimed to achieve two things:

      (1) A neurally-informed account of how Pieron's and Fechner's laws can apply in concert at distinct processing levels.

      (2) A comprehensive map in time and space of all neural events intervening between stimulus and response in an immediately-reported perceptual decision.

      I believe that the authors achieved the first point, mainly owing to a clever contrast comparison paradigm, but with good help also from a new topographic parsing algorithm they created. With this, they found that the time intervening between an early initial sensory evoked potential and an "N2" type process associated with launching the decision process varies inversely with contrast according to Pieron's law. Meanwhile, the interval from that second event up to a neural event peaking just before response increases with contrast, fitting Fechner's law, and a very nice finding is that a diffusion model whose drift rates are scaled by Fechner's law, fit to RT, predicts the observed proportion of correct responses very well. These are all strengths of the study.

      The second, generally stated aim above is, in the opinion of this reviewer, unconvincing and ill-defined. Presumably, the full sequence of neural events is massively task-dependent, and surely it is more in number than just three. Even the sensory evoked potential typically observed for average ERPs, even for passive viewing, would include a series of 3 or more components - C1, P1, N1, etc. So are some events being missed? Perhaps the authors are identifying key events that impressively demarcate Pieron- and Fechner-adherent sections of the RT, but they might want to temper the claim that they are finding ALL events. In addition, the propensity for topographic parsing algorithms to potentially lump together distinct processes that partially co-evolve should be acknowledged.

      To take a salient example, the last neural event seems to blend the centroparietal positivity with a more frontal midline negativity, some of which would capture the CNV and some motor-execution related components that are more tightly time-locked to, of course, the response. If the authors plotted the traditional single-electrode ERP at the frontal focus and centroparietal focus separately, they are likely to see very different dynamics and contrast- and SAT-dependency. What does this mean for the validity of the multivariate method? If two or more components are being lumped into one neural event, wouldn't it mean that properties of one (e.g., frontal burstiness at response) are being misattributed to the other (centroparietal signal that also peaks but less sharply at response)?

      Also related to the method, why must the neural events all be 50 ms wide, and what happens if that is changed? Is it realistic that these neural events would be the same duration on every trial, even if their duration was a free parameter? This might be reasonable for sensory and motor components, but unlikely for cognitive.

      In general, I wonder about the analytic advantage of the parsing method - the paradigm itself is so well-designed that the story may be clear from standard average event-related potential analysis, and this might sidestep the doubts around whether the algorithm is correctly parsing all neural events.

      In particular, would the authors consider plotting CPP waveforms in the traditional way, across contrast levels? The elegant design is such that the C1 component (which has similar topography) will show up negative and early, giving way to the CPP, and these two components will show opposite amplitude variations (not just temporal intervals as is this paper's main focus), because the brighter the two gratings, the stronger the aggregate early sensory response but the weaker the decision evidence due to Fechner. I believe this would provide a simple, helpful corroborating analysis to back up the main functional interpretation in the paper.

      The first component is picking up on the C1 component (which is negative for these stimulus locations), not a "P100". Please consult any visual evoked potential study (e.g., Luck, Hillyard, etc).

      It is unexpected that this does not vary in latency with contrast - see, for example. Gebodh et al (2017, Brain Topography) - and there is little discussion of this. Could it be that nonlinear trends were not correctly tested for?

      There is very little analysis or discussion of the second stage linked to attention orientation - what would the role of attention orientation be in this task? Is it spatial attention directed to the higher contrast grating (and if so, should it lateralise accordingly?), or is it more of an alerting function the authors have in mind here?

    2. Reviewer #2 (Public review):

      Summary:

      The authors decomposed response times into component processes and manipulated the duration of these processes in opposing directions by varying contrast, and overall by manipulating speed-accuracy tradeoffs. They identify different processes and their durations by identifying neural states in time and validate their functional significance by showing that their properties vary selectively as expected with the predicted effects of the contrast manipulation. They identify 3 processes: stimulus encoding, attention orienting, and decision. These map onto classical event-related potentials. The decision-making component matched the CPP, and its properties varied with contrast and predicted decision-accuracy, while also exhibiting a burst not characteristic of evidence accumulation.

      Strengths:

      The design of the experiment is remarkable and offers crucial insights. The analysis techniques are beyond state-of-the-art, and the analyses are well motivated and offer clear insights.

      Weaknesses:

      It is not clear to me that the results confirm that there are only 3 processes, since e.g., motor preparation and execution were not captured. While the authors discuss this, this is a clear weakness of the approach, as other components may also have been missed. It is also unclear to what extent topographies map onto processes, since, e.g., different combinations of sources can lead to the same scalp topography.

    3. Reviewer #3 (Public review):

      Summary:

      In this manuscript, the authors examine the processing stages involved in perceptual decision-making using a new approach to analysing EEG data, combined with a critical stimulus manipulation. This new EEG analysis method enables single-trial estimates of the timing and amplitude of transient changes in EEG time-series, recurrent across trials in a behavioural task. The authors find evidence for three events between stimulus onset and the response in a two-spatial-interval visual discrimination task. By analysing the timing and amplitude of these events in relation to behaviour and the stimulus manipulation, the authors interpret these events as related to separable processing stages for stimulus encoding, attention orientation, and decision (deliberation). This is largely consistent with previous findings from both event-related potentials (across trials) and single-trial estimates using decoding techniques and neural network approaches.

      Strengths:

      This work is not only important for the conceptual advance, but also in promoting this new analysis technique, which will likely prove useful in future research. For the broader picture, this work is an excellent example of the utility of neural measures for mental chronometry.

      Weaknesses:

      The manuscript would benefit from some conceptual clarifications, which are important for readers to understand this manuscript as a stand-alone work. This includes clearer definitions of Piéron's and Fechner's laws, and a fuller description of the EEG analysis technique. The manuscript, broadly, but the introduction especially, may be improved by clearly delineating the multiple aims of this project: examining the processes for decision-making, obtaining single-trial estimates of meaningful EEG-events, and whether central parietal positivity reflects ramping activity or steps averaged across trials. A fuller discussion of the limitations of the work, in particular, the absence of motor contributions to reaction time, would also be appreciated.

      At times, the novelty of the work is perhaps overstated. Rather, readers may appreciate a more comprehensive discussion of the distinctions between the current work and previous techniques to gauge single-trial estimates of decision-related activity, as well as previous findings concerning distinct processing stages in decision-making. Moreover, a discussion of how the events described in this study might generalise to different decision-making tasks in different contexts (for example, in auditory perception, or even value-based decision-making) would also be appreciated.

  2. resu-bot-bucket.s3.ca-central-1.amazonaws.com resu-bot-bucket.s3.ca-central-1.amazonaws.com
    1. Implemented an LLM chatbox for AI-assisted debugging, fulfilling the client's priority and enhancing the tool's functionality.

      Quantify the enhancement. How much did functionality improve? Provide metrics if available.

    2. Collaborated within a 6-person team in an Agile environment, delivering project milestones over 5 sprints and incorporating peer feedback through 360-degree reviews.

      Specify the outcomes of the project milestones. What was the impact on the client or team?

    1. Reviewer #2 (Public review):

      McDougal et al. describe the surprising finding that IFIT1 proteins from different mammalian species inhibit replication of different viruses, indicating that evolution of IFIT1 across mammals has resulted in host species-specific antiviral specificity. Before this work, research into the antiviral activity and specificity of IFIT1 had mostly focused on the human ortholog, which was described to inhibit viruses including vesicular stomatitis virus (VSV) and Venezuelan equine encephalitis virus (VEEV) but not other viruses including Sindbis virus (SINV) and parainfluenza virus type 3 (PIV3). In the current work, the authors first perform evolutionary analyses on IFIT1 genes across a wide range of mammalian species and reveal that IFIT1 genes have evolved under positive selection in primates, bats, carnivores, and ungulates. Based on these data, they hypothesize that IFIT1 proteins from these diverse mammalian groups may show distinct antiviral specificities against a panel of viruses. By generating human cells that express IFIT1 proteins from different mammalian species, the authors show a wide range of antiviral activities of mammalian IFIT1s. Most strikingly, they find several IFIT1 proteins that have completely different antiviral specificities relative to human IFIT1, including IFIT1s that fail to inhibit VSV or VEEV, but strongly inhibit PIV3 or SINV. These results indicate that there is potential for IFIT1 to inhibit a much wider range of viruses than human IFIT1 inhibits. Electrophoretic mobility shift assays (EMSAs) suggest that some of these changes in antiviral specificity can be ascribed to changes in direct binding of viral RNAs. Interestingly, they also find that chimpanzee IFIT1, which is >98% identical to human IFIT1, fails to inhibit any tested virus. Replacing three residues from chimpanzee IFIT1 with those from human IFIT1, one of which has evolved under positive selection in primates, restores activity to chimpanzee IFIT1. Together, these data reveal a vast diversity of IFIT1 antiviral specificity encoded by mammals, consistent with an IFIT1-virus evolutionary "arms race".

      Overall, this is a very interesting and well-written manuscript that combines evolutionary and functional approaches to provide new insight into IFIT1 antiviral activity and species-specific antiviral immunity. The conclusion that IFIT1 genes in several mammalian lineages are evolving under positive selection is supported by the data. The virology results, which convincingly show that IFIT1s from different species have distinct antiviral specificity, are the most surprising and exciting part of the paper. As such, this paper will be interesting for researchers studying mechanisms of innate antiviral immunity, as well as those interested in species-specific antiviral immunity. Moreover, it may prompt others to test a wide range of orthologs of antiviral factors beyond those from humans or mice, which could further the concept of host-specific innate antiviral specificity. Additional areas for improvement, which are mostly to clarify the presentation of data and conclusions, are described below.

      Strengths:

      (1) This paper is a very strong demonstration of the concept that orthologous innate immune proteins can evolve distinct antiviral specificities. Specifically, the authors show that IFIT1 proteins from different mammalian species are able to inhibit replication of distinct groups of viruses, which is most clearly illustrated in Figure 4G. This is an unexpected finding, as the mechanism by which IFIT1 inhibits viral replication was assumed to be similar across orthologs. While the molecular basis for these differences remains unresolved, this is a clear indication that IFIT1 evolution functionally impacts host-specific antiviral immunity and that IFIT1 has the potential to inhibit a much wider range of viruses than previously described.

      (2) By revealing these differences in antiviral specificity across IFIT1 orthologs, the authors highlight the importance of sampling antiviral proteins from different mammalian species to understand what functions are conserved and what functions are lineage- or species-specific. These results might therefore prompt similar investigations with other antiviral proteins, which could reveal a previously undiscovered diversity of specificities for other antiviral immunity proteins.

      (3) The authors also surprisingly reveal that chimpanzee IFIT1 shows no antiviral activity against any tested virus despite only differing from human IFIT1 by eight amino acids. By mapping this loss of function to three residues on one helix of the protein, the authors shed new light on a region of the protein with no previously known function.

      (4) Combined with evolutionary analyses that indicate that IFIT1 genes are evolving under positive selection in several mammalian groups, these functional data indicate that IFIT1 is engaged in an evolutionary "arms race" with viruses, which results in distinct antiviral specificities of IFIT1 proteins from different species.

      Weaknesses:

      (1) Some of the results and discussion text could be more focused on the model of evolution-driven changes in IFIT1 specificity. In particular, the majority of the residue mapping is on the chimpanzee protein, where it would appear that this protein has lost all antiviral function, rather than changing its antiviral specificity like some other examples in this paper. As such, the connection between the functional mapping of individual residues with the positive selection analysis and changes in antiviral specificity is not present. While the model that changes in antiviral specificity have been positively selected for is intriguing, it is not supported by data in the paper.

      (2) The strength of the differences in antiviral specificity could be highlighted to a greater degree. Specifically, the text describes a number of interesting examples of differences in inhibition of viruses from Figure 3C and 3D, and 4C-F. The revised version has added some clarity by at least providing raw data for 3C and 3D for the reader to make their own comparisons, but it is still difficult to quickly assess which are the most interesting comparisons to make (e.g. for future mapping of residues that might be important).

    2. Reviewer #3 (Public review):

      Summary:

      This manuscript by McDougal et al, demonstrates species-specific activities of diverse IFIT1 orthologs, and seeks to utilize evolutionary analysis to identify key amino acids under positive selection that contribute to antiviral activity of this host factor. While the authors identify amino acid residues important for antiviral activity of some orthologs, and propose a possible mechanism by which these residues may function, the significance or applicability of these findings to other orthologs is unclear. However, the subject matter is of interest to the field, and these findings contribute to the body of knowledge regarding IFIT1 evolution.

      Strengths:

      Assessment of multiple IFIT1 orthologs shows the wide variety of antiviral activity of IFIT1, and identification of residues outside of the known RNA binding pocket in the protein suggests additional novel mechanisms which may regulate IFIT1 activity.

      Weaknesses:

      Given that there appears to be very little overlap observed in orthologs that inhibited the viruses tested, it's possible that other amino acids may be key drivers of antiviral activity in these other orthologs. Thus, it's difficult to conclude whether the findings that residues 362/4/6 are important for IFIT1 activity can be broadly applied to other orthologs, or whether these are unique to human and chimpanzee IFIT1. While additional molecular studies of the impact of these mutations on IFIT1 function (e.g. impact on IFIT complex formation) would lend further insight, as it stands, these findings demonstrate a role for these residues in IFIT1 activity.

    1. Reviewer #1 (Public review):

      Summary:

      Weiss and co-authors presented a versatile probabilistic tool. aTrack helps in classifying tracking behaviors and understanding important parameters for different types of single particle motion types: Brwonian, Confined, or Directed motion. The tool can be used further to analyze populations of tracks and the number of motion states. This is a stand-alone software package, making it user-friendly for a broad group of researchers.

      Strengths:

      This manuscript presents a novel method for trajectory analysis.

      Comments on revisions:

      The authors have strengthened and improved the manuscript

    2. Reviewer #2 (Public review):

      Summary:

      The authors present a software package "aTrack" for identification of motion types and parameter estimation in single-particle tracking data. The software is based on maximum likelihood estimation of the time-series data given an assumed motion model and likelihood ratio tests for model selection. They characterized the performance of the software mostly on simulated data and showed that it is applicable to experimental data.

      Strengths:

      Although many tools exist in the single-particle tracking (SPT) field, this particular software package is developed using an innovative mathematical model and a probabilistic approach. It also provide inference of motion types, which are critical to answer biological questions in SPT experiments.

      (1) The authors adopt a novel mathematical framework, which is unique in the SPT field.

      (2) The authors have validated their method extensively using simulated tracks and compared to existing methods when appropriate.

      (3) The code is freely available

      Weaknesses:

      The authors did a good job during the revision to address most of the weaknesses in my (as well as other reviewer's) first round of review. Nevertheless, the following issue is still not fully addressed.<br /> The hypothesis testing method presented here lacks rigorous statistical foundation. The authors improved on this point after the revision, but in their newly added SI section "Statistical Test", only justified their choices using "hand-waving" arguments (i.e. there is not a single reference to proper statistical textbooks or earlier works in this important section). I understand that sometimes mathematical rigor comes later after some intuition-guided choices of critical parameters seems to work, but nevertheless need to point it out as a remaining weakness.

    1. Reviewer #2 (Public review):

      Summary:

      The authors report that Arabidopsis short HSFs S-HsfA2, S-HsfA4c, and S-HsfB1 confer extreme heat. They have truncated DNA binding domains that bind to a new heat-regulated element. Considering Short HSFA2, the authors have highlighted the molecular mechanism by which S-HSFs prevent HSR hyperactivation via negative regulation of HSP17.6B. The S-HsfA2 protein binds to the DNA binding domain of HsfA2, thus preventing its binding to HSEs, eventually attenuating HsfA2-activated HSP17.6B promoter activity. This report adds insights to our understanding of heat tolerance and plant growth.

      Strengths:

      (1) The manuscript represents ample experiments to support the claim.

      (2) The manuscript covers a robust number of experiments and provides specific figures and graphs to in support of their claim.

      (3) The authors have chosen a topic to focus on stress tolerance in changing environment.

      (4) The authors have summarized the probable mechanism using a figure.

      Weaknesses:

      Quite minimum

      (1) Fig. 3. the EMSA to reveal binding

      (2) Alignment of supplementary figures 6-7.

    1. Reviewer #1 (Public review):

      Summary:

      This manuscript by Kolb and Hasseman et al. introduces a significantly improved GABA sensor, building on the pioneering work of the Janelia team. Given GABA's role as the main inhibitory neurotransmitter and the historical lack of effective optical tools for real-time in vivo GABA dynamics, this development is particularly impactful. The new sensor boasts an enhanced signal-to-noise ratio (SNR) and appropriate kinetics for detecting GABA dynamics in both in vitro and in vivo settings. The study is well-presented, with convincing and high-quality data, making this tool a valuable asset for future research into GABAergic signaling.

      Strengths:

      The core strength of this work lies in its significant advancement of GABA sensing technology. The authors have successfully developed a sensor with higher SNR and suitable kinetics, enabling the detection of GABA dynamics both in vitro and in vivo. This addresses a critical gap in neuroscience research, offering a much-needed optical tool for understanding the most important inhibitory neurotransmitter. The clear representation of the work and the convincing, high-quality data further bolster the manuscript's strengths, indicating the sensor's reliability and potential utility. We anticipate this tool will be invaluable for further investigation of GABAergic signaling.

      Weaknesses:

      Despite the notable progress, a key limitation is that the current generation of GABA sensors, including the one presented here, still exhibits inferior performance compared to state-of-the-art glutamate sensors. While this work is a substantial leap forward, it highlights that further improvements in GABA sensors would still be highly beneficial for the field to match the capabilities seen with glutamate sensors.

    2. Reviewer #2 (Public review):

      Summary:

      This manuscript presents the development and characterization of iGABASnFR2, a genetically encoded GABA sensor with markedly improved performance over its predecessor, iGABASnFR1. The study is comprehensive and methodologically rigorous, integrating high-throughput mutagenesis, functional screening, structural analysis, biophysical characterization, and in vivo validation. iGABASnFR2 represents a significant advancement in GABA sensor engineering and application in imaging GABA transmission in slice and in vivo. This is a timely and technically strong contribution to the molecular toolkit for neuroscience.

      Strengths:

      The authors apply a well-established sensor optimization pipeline and iterative engineering strategy from single-site to combinatorial mutants to engineer iGABASnFR2. The development of both positive and negative going variants (iGABASnFR2 and iGABASnFR2n) offers experimental flexibility. The structure and interpretation of the key mutations provide insights into the working mechanism of the sensor, which also suggest optimization strategies. Although individual improvements in intrinsic properties are incremental, their combined effect yields clear functional gains, enabling detection of direction-selective GABA release in the retina and volume-transmitted GABA signaling in somatosensory cortex, which were challenging or missed using iGABASnFR1.

      Weaknesses:

      With minor revisions and clarifications, especially regarding membrane trafficking, this manuscript will be a valuable resource for probing inhibitory transmission.

    1. Reviewer #1 (Public review):

      Summary:

      Hosack and Arce-McShane investigate how the 3D movement direction of the tongue is represented in the orofacial part of the sensory-motor cortex and how this representation changes with the loss of oral sensation. They examine the firing patterns of neurons in the orofacial parts of the primary motor cortex (MIo) and somatosensory cortex (SIo) in non-human primates (NHPs) during drinking and feeding tasks. While recording neural activity, they also tracked the kinematics of tongue movement using biplanar video-radiography of markers implanted in the tongue. Their findings indicate that many units in both MIo and SIo are directionally tuned during the drinking task. However, during the feeding task, directional turning was more frequent in MIo units and less prominent in SIo units. Additionally, in some recording sessions, they blocked sensory feedback using bilateral nerve block injections, which seemed to result in fewer directionally tuned units and changes in the overall distribution of the preferred direction of the units.

      Strengths:

      The most significant strength of this paper lies in its unique combination of experimental tools. The author utilized a video-radiography method to capture 3D kinematics of the tongue movement during two behavioral tasks while simultaneously recording activity from two brain areas. This specific dataset and experimental setup hold great potential for future research on the understudied orofacial segment of the sensory-motor area.

      Weaknesses:

      A substantial portion of the paper is dedicated to establishing directional tuning in individual neurons, followed by an analysis of how this tuning changes when sensory feedback is blocked. While such characterizations are valuable, particularly in less-studied motor cortical areas and behaviors, the discrepancies in tuning changes across the two NHPs, coupled with the overall exploratory nature of the study, render the interpretation of these subtle differences somewhat speculative. At the population level, both decoding analyses and state space trajectories from factor analysis indicate that movement direction (or spout location) is robustly represented. However, as with the single-cell findings, the nuanced differences in neural trajectories across reach directions and between baseline and sensory-block conditions remain largely descriptive. To move beyond this, model-based or hypothesis-driven approaches are needed to uncover mechanistic links between neural state space dynamics and behavior.

    2. Reviewer #2 (Public review):

      Summary:

      This manuscript by Hosack and Arce-McShane examines the directional tuning of neurons in macaque primary motor (MIo) and somatosensory (SIo) cortex. The neural basis of tongue control is far less studied than, for example, forelimb movements, partly because the tongue's kinematics and kinetics are difficult to measure. A major technical advantage of this study is using biplanar video-radiography, processed with modern motion tracking analysis software, to track the movement of the tongue inside the oral cavity. Compared to prior work, the behaviors are more naturalistic behaviors (feeding and licking water from one of three spouts), although the animals were still head-fixed.

      The study's main findings are that:

      • A majority of neurons in MIo and a (somewhat smaller) percentage of SIo modulated their firing rates during tongue movements, with different modulation depending on the direction of movement (i.e., exhibited directional tuning). Examining the statistics of tuning across neurons, there was anisotropy (e.g., more neurons preferring anterior movement) and a lateral bias in which tongue direction neurons preferred that was consistent with the innervation patterns of tongue control muscles (although with some inconsistency between monkeys).<br /> • Consistent with this encoding, tongue position could be decoded with moderate accuracy even from small ensembles of ~28 neurons.<br /> • There were differences observed in the proportion and extent of directional tuning between the feeding and licking behaviors, with stronger tuning overall during licking. This potentially suggests behavioral context-dependent encoding.<br /> • The authors then went one step further and used a bilateral nerve block to the sensory inputs (trigeminal nerve) from the tongue. This impaired the precision of tongue movements and resulted in an apparent reduction and change in neural tuning in Mio and SIo.

      Strengths:

      The data are difficult to obtain and appear to have been rigorously measured, and provide a valuable contribution to this under-explored subfield of sensorimotor neuroscience. The analyses adopt well-established methods especially from the arm motor control literature, and represent a natural starting point for characterizing tongue 3D direction tuning.

      Weaknesses:

      There are alternative explanations from some of the interpretations, but those interpretations are described in a way that clearly distinguishes results from interpretations, and readers can make their own assessments. Some of these limitations are described in more detail below.

      One weakness of the current study is that there is substantial variability in results between monkeys.

      This study focuses on describing directional tuning using the preferred direction (PD) / cosine tuning model popularized by Georgopoulous and colleagues for understanding neural control of arm reaching in the 1980s. This is a reasonable starting point and a decent first order description of neural tuning. However, the arm motor control field has moved far past that viewpoint, and in some ways an over-fixation on static representational encoding models and PDs held that field back for many years. The manuscript benefit from drawing the readers' attention (perhaps in their Discussion) that PDs are a very simple starting point for characterizing how cortical activity relates to kinematics, but that there is likely much richer population-level dynamical structure and that a more mechanistic, control-focused analytical framework may be fruitful. A good review of this evolution in the arm field can be found in Vyas S, Golub MD, Sussillo D, Shenoy K. 2020. Computation Through Neural Population Dynamics. Annual Review of Neuroscience. 43(1):249-75. A revised version of the manuscript incorporates more population-level analyses, but with inconsistent use of quantifications/statistics and without sufficient contextualization of what the reader is to make of these results.

      The described changes in tuning after nerve block could also be explained by changes in kinematics between these conditions, which temper the interpretation of these interesting results.

      I am not convinced of the claim that tongue directional encoding fundamentally changes between drinking and feeding given the dramatically different kinematics and the involvement of other body parts like the jaw (e.g., the reference to Laurence-Chasen et al. 2023 just shows that there is tongue information independent of jaw kinematics, not that jaw movements don't affect these neurons' activities). I also find the nerve block results inconsistent (more tuning in one monkey, less in the other?) and difficult to really learn something fundamental from, besides that neural activity and behavior both change - in various ways - after nerve block (not at all surprising but still good to see measurements of).

      The manuscript states that "Our results suggest that the somatosensory cortex may be less involved than the motor areas during feeding, possibly because it is a more ingrained and stereotyped behavior as opposed to tongue protrusion or drinking tasks". An alternative explanation be more statistical/technical in nature: that during feeding, there will be more variability in exactly what somatosensation afferent signals are being received from trial to trial (because slight differences in kinematics can have large differences in exactly where the tongue is and the where/when/how of what parts of it are touching other parts of the oral cavity)? This variability could "smear out" the apparent tuning using these types of trial-averaged analyses. Given how important proprioception and somatosensation are for not biting the tongue or choking, the speculation that somatosensory cortical activity is suppressed during feedback is very counter-intuitive to this reviewer. In the revised manuscript the authors note these potential confounds and other limitations in the Discussion.

    3. Reviewer #3 (Public review):

      Summary

      In this study, the authors aim to uncover how 3D tongue direction is represented in the Motor (M1o) and Somatosensory (S1o) cortex. In non-human primates implanted with chronic electrode arrays, they use X-ray based imaging to track the kinematics of the tongue and jaw as the animal is either chewing food or licking from a spout. They then correlate the tongue kinematics with the recorded neural activity. They perform both single-unit and population level analyses during feeding and licking. Then, they recharacterize the tuning properties after bilateral lidocaine injections in the two sensory branches of the trigeminal nerve. They report that their nerve block causes a reorganization of the tuning properties and population trajectories. Overall, this paper concludes that M1o and S1o both contain representations of the tongue direction, but their numbers, their tuning properties and susceptibility to perturbed sensory input are different.

      Strengths

      The major strengths of this paper are in the state-of-the-art experimental methods employed to collect the electrophysiological and kinematic data. In the revision, the single-unit analyses of tuning direction are robustly characterized. The differences in neural correlations across behaviors, regions and perturbations are robust. In addition to the substantial amount of largely descriptive analyses, this paper makes two convincing arguments 1) The single-neuron correlates for feeding and licking in OSMCx are different - and can't be simply explained by different kinematics and 2) Blocking sensory input alters the neural processing during orofacial behaviors. The evidence for these claims is solid.

      Weaknesses

      The main weakness of this paper is in providing an account for these differences to get some insight into neural mechanisms. For example, while the authors show changes in neural tuning and different 'neural trajectory' shapes during feeding and drinking - their analyses of these differences are descriptive and provide limited insight for the underlying neural computations.

    1. Reviewer #4 (Public review):

      Summary:

      Several behavioral experiments and one TMS experiment were performed to examine adaptation to room reverberation for speech intelligibility in noise. This is an important topic that has been extensively studied by several groups over the years. And the study is unique in that it examines one candidate brain area, dlPFC, potentially involved in this learning, and finds that disrupting this area by TMS results in a reduction in the learning. The behavioral conditions are in many ways similar to previous studies. However, they find results that do not match previous results (e.g., performance in anechoic condition is worse than in reverberation), making it difficult to assess the validity of the methods used. One unique aspect of the behavioral experiments is that Ambisonics was used to simulate the spaces, while headphone simulation was mostly used previously. The main behavioral experiment was performed by interleaving 3 different rooms and measuring speech intelligibility as a function of the number of words preceding the target in a given room on a given trial. The findings are that performance improves on the time scale of seconds (as the number of words preceding the target increases), but also on a much larger time scale of tens to hundreds of seconds (corresponding to multiple trials), while for some listeners it is degraded for the first couple of trials. The study also finds that the performance is best in the room that matches the T60 most commonly observed in everyday environments. These are potentially interesting results. However, there are issues with the design of the study and analysis methods that make it difficult to verify the conclusions based on the data.

      Strengths:

      (1) Analysis of the adaptation to reverberation on multiple time scales, for multiple reverberant and anechoic environments, and also considering contextual effects of one environment interleaved with the other two environments.

      (2) TMS experiment showing reduction of some of the learning effects by temporarily disabling the dlPFC.

      Weaknesses:

      While the study examines the adaptation for different carrier lengths, it keeps multiple characteristics (mainly talker voice and location) fixed in addition to reverberation. Therefore, it is possible that the subjects adapt to other aspects of the stimuli, not just to reverberation. A condition in which only reverberation would switch for the target would allow the authors to separate these confounding alternatives. Now, the authors try to address the concerns by indirect evidence/analyses. However, the evidence provided does not appear sufficient.

      The authors use terms that are either not defined or that seem to be defined incorrectly. The main issue then is the results, which are based on analysis of what the authors call d', Hit Rate, and Final Hit rate. First of all, they randomly switch between these measures. Second, it's not clear how they define them, given that their responses are either 4-alternative or 8-alternative forced choice. d', Hit Rate, and False Alarm Rate are defined in Signal detection theory for the detection of the presence of a target. It can be easily extended to a 2-alternative forced choice. But how does one define a Hit, and, in particular, a False Alarm, in a 4/8-alternative? The authors do not state how they did it, and without that, the computation of d' based on HR and FAR is dubious. Also, what the authors call Hit Rate, is presumably the percent correct performance (PCC), but even that is not clear. Then they use FHR and act as if this was the asymptotic value of their HR, even though in many conditions their learning has not ended, and randomly define a variable of +-10 from FHR, which must produce different results depending on whether the asymptote was reached or not. Other examples of usage of strange usage of terms: they talk about "global likelihood learning" (L426) without a definition or a reference, or about "cumulative hit rate" (L1738), where it is not clear to me what "cumulative" means there.

      There are not enough acoustic details about the stimuli. The authors find that reverberant performance is overall better than anechoic in 2 rooms. This goes contrary to previous results. And the authors do not provide enough acoustic details to establish that this is not an artefact of how the stimuli were normalized (e.g., what were the total signal and noise levels at the two ears in the anechoic and reverberant conditions?).

      There are some concerns about the use of statistics. For example, the authors perform two-way ANOVA (L724-728) in which one factor is room, but that factor does not have the same 3 levels across the two levels of the other factor. Also, in some comparisons, they randomly select 11 out of 22 subjects even though appropriate test correct for such imbalances without adding additional randomness of whether the 11 selected subjects happened to be the good or the bad ones.

      Details of the experiments are not sufficiently described in the methods (L194-205) to be able to follow what was done. It should be stated that 1 main experiment was performed using 3 rooms, and that 3 follow-ups were done on a new set of subjects, each with the room swapped.

    2. Reviewer #3 (Public review):

      Summary:

      This manuscript presents a well-designed and insightful behavioural study examining human adaptation to room acoustics, building on prior work by Brandewie & Zahorik. The psychophysical results are convincing and add incremental but meaningful knowledge to our understanding of reverberation learning. However, I find the transcranial magnetic stimulation (TMS) component to be over-interpreted. The TMS protocol, while interesting, lacks sufficient anatomical specificity and mechanistic explanation to support the strong claims made regarding a unique role of the dorsolateral prefrontal cortex (dlPFC) in this learning process. More cautious interpretation is warranted, especially given the modest statistical effects, the fact that the main TMS result of interest is a null result, the imprecise targeting of dlPFC (which is not validated), and the lack of knowledge about the timescale of TMS effects in relation to the behavioural task. I recommend revising the manuscript to shift emphasis toward the stronger behavioural findings and to present a more measured and transparent discussion of the TMS results and their limitations.

      Strengths:

      (1) Well-designed acoustical stimuli and psychophysical task.

      (2) Comparisons across room combinations are well conducted.

      (3) The virtual acoustic environment is impressive and applied well here.

      (4) A timely study with interesting behavioural results.

      Weaknesses:

      (1) Lack of hypotheses, particularly for TMS.

      (2) Lack of evidence for targeting TMS in [brain] space and time.

      (3) The most interesting effect of TMS is a null result compared to a weak statistical effect for "meta adaptation"

    3. Reviewer #2 (Public review):

      Summary:

      This study investigated how listeners adapt to and utilize statistical properties of different acoustic spaces to improve speech perception. The researchers used repetitive TMS to perturb neural activity in DLPFC, inhibiting statistical learning compared to sham conditions. The authors also identified the most effective room types for the effective use of reverberations in speech in noise perception, with regular human-built environments bringing greater benefits than modified rooms with lower or higher reverberation times.

      Strengths:

      The introduction and discussion sections of the paper are very interesting and highlight the importance of the current study, particularly with regard to the use of ecologically valid stimuli in investigating statistical learning. However, they could be condensed into parts. TMS parameters and task conditions were well-considered and clearly explained.

      Weaknesses

      (1) The Results section is difficult to follow and includes a lot of detail, which could be removed. As such, it presents as confusing and speculative at times.

      (2) The hypotheses for the study are not clearly stated.

      (3) Multiple statistical models are implemented without correcting the alpha value. This leaves the analyses vulnerable to Type I errors.

      (4) It is confusing to understand how many discrete experiments are included in the study as a whole, and how many participants are involved in each experiment.

      (5) The TMS study is significantly underpowered and not robust. Sample size calculations need further explanation (effect sizes appear to be based on behavioural studies?). I would caution an exploratory presentation of these data, and calculate a posteriori the full sample size based on effect sizes observed in the TMS data.

    4. Reviewer #1 (Public review):

      Summary:

      This manuscript describes the results of an experiment that demonstrates a disruption in statistical learning of room acoustics when transcranial magnetic stimulation (TMS) is applied to the dorsolateral prefrontal cortex in human listeners. The work uses a testing paradigm designed by the Zahorik group that has shown improvement in speech understanding as a function of listening exposure time in a room, presumably through a mechanism of statistical learning. The manuscript is comprehensive and clear, with detailed figures that show key results. Overall, this work provides an explanation for the mechanisms that support such statistical learning of room acoustics and, therefore, represents a major advancement for the field.

      Strengths:

      The primary strength of the work is its simple and clear result, that the dorsolateral prefrontal cortex is involved in human room acoustic learning.

      Weaknesses:

      A potential weakness of this work is that the manuscript is quite lengthy and complex.

    1. Reviewer #1 (Public review):

      Summary:

      In this manuscript, the authors argue that defining higher visual areas (HVAs) based on reversals of retinotopic tuning has led to an over-parcellation of secondary visual cortices. Using retinotopic models, they propose that the HVAs are more parsimoniously mapped as a single area V2, which encircles V1 and exhibits complex retinotopy. They reanalyze functional data to argue that functional differences between HVAs can be explained by retinotopic coverage. Finally, they compare the classification of mouse visual cortex to that of other species to argue that our current classification is inconsistent with those used in other model species.

      Strengths:

      This manuscript is bold and thought-provoking, and is a must-read for mouse visual neuroscientists. The authors take a strong stance on combining all HVAs, with the possible exception of area POR, into a single V2 region. Although I suspect many in the field will find that their proposal goes too far, many will agree that we need to closely examine the assumptions of previous classifications to derive a more accurate areal map. The authors' supporting analyses are clear and bolster their argument. Finally, they make a compelling argument for why the classification is not just semantic, but has ramifications for the design of experiments and analysis of data.

      Weaknesses:

      Although I enjoyed the polemic nature of the manuscript, there are a few issues that weaken their argument.

      (1) Although the authors make a compelling argument that retinotopic reversals are insufficient to define distinct regions, they are less clear about what would constitute convincing evidence for distinct visual regions. They mention that a distinct area V3 has been (correctly) defined in ferrets based on "cytoarchitecture, anatomy, and functional properties", but elsewhere argue that none of these factors are sufficient to parcellate any of the HVAs in mouse cortex, despite some striking differences between HVAs in each of these factors. It would be helpful to clearly define a set of criteria that could be used for classifying distinct regions.

      (2) On a related note, although the authors carry out impressive analyses to show that differences in functional properties between HVAs could be explained by retinotopy, they glossed over some contrary evidence that there are functional differences independent of retinotopy. For example, axon projections to different HVAs originating from a single V1 injection - presumably including neurons with similar retinotopy - exhibit distinct functional properties (Glickfeld LL et al, Nat Neuro, 2013). As another example, interdigitated M2+/M2- patches in V1 show very different HVA connectivity and response properties, again independent of V1 location/retinotopy (Meier AM et al., bioRxiv). One consideration is that the secondary regions might be considered a single V2 with distinct functional modules based on retinotopy and connectivity (e.g., V2LM, V2PM, etc).

      (3) Some of the HVAs-such as AL, AM, and LI-appear to have redundant retinotopic coverage with other HVAS, such as LM and PM. Moreover, these regions have typically been found to have higher "hierarchy scores" based on connectivity (Harris JA et al., Nature, 2019; D'Souza RD et al., Nat Comm, 2022), though unfortunately, the hierarchy levels are not completely consistent between studies. Based on existing evidence, there is a reasonable argument to be made for a hybrid classification, in which some regions (e.g., LM, P, PM, and RL) are combined into a single V2 (though see point #2 above) while other HVAs are maintained as independent visual regions, distinct from V2. I don't expect the authors to revise their viewpoint in any way, but a more nuanced discussion of alternative classifications is warranted.

    2. Reviewer #2 (Public review):

      Summary:

      The study by Rowley and Sedigh-Sarvestani presents modeling data suggesting that map reversals in mouse lateral extrastriate visual cortex do not coincide with areal borders, but instead represent borders between subregions within a single area V2. The authors propose that such an organization explains the partial coverage in higher-order areas reported by Zhuang et al., (2017). The scheme revisits an organization proposed by Kaas et al., (1989), who interpreted the multiple projection patches traced from V1 in the squirrel lateral extrastriate cortex as subregions within a single area V2. Kaas et al's interpretation was challenged by Wang and Burkhalter (2007), who used a combination of topographic mapping of V1 connections and receptive field recordings in mice. Their findings supported a different partitioning scheme in which each projection patch mapped a specific topographic location within single areas, each containing a complete representation of the visual field. The area map of mouse visual cortex by Wang and Burkhalter (2007) has been reproduced by hundreds of studies and has been widely accepted as ground truth (CCF) (Wang et al., 2020) of the layout of rodent cortex. In the meantime, topographic mappings in marmoset and tree shew visual cortex made a strong case for map reversals in lateral extrastriate cortex, which represent borders between functionally diverse subregions within a single area V2. These findings from non-rodent species raised doubts about whether during evolution, different mammalian branches have developed diverse partitioning schemes of the cerebral cortex. Rowley and Sedigh-Sarvestani favor a single master plan in which, across evolution, all mammalian species have used a similar blueprint for subdividing the cortex.

      Strengths:

      The story illustrates the enduring strength of science in search of definitive answers.

      Weaknesses:

      To me, it remains an open question whether Rowley and Sedigh-Sarvestani have written the final chapter of the saga. A key reason for my reservation is that the areas the maps used in their model are cherry-picked. The article disregards published complementary maps, which show that the entire visual field is represented in multiple areas (i.e. LM, AL) of lateral extrastriate cortex and that the map reversal between LM and AL coincides precisely with the transition in m2AChR expression and cytoarchitecture (Wang and Burkhalter, 2007; Wang et al., 2011). Evidence from experiments in rats supports the gist of the findings in the mouse visual cortex (Coogan and Burkhalter, 1993).

      (1) The selective use of published evidence, such as the complete visual field representation in higher visual areas of lateral extrastriate cortex (Wang and Burkhalter, 2007; Wang et al., 2011) makes the report more of an opinion piece than an original research article that systematically analyzes the area map of mouse visual cortex we have proposed. No direct evidence is presented for a single area V2 with functionally distinct subregions.

      (2) The article misrepresents evidence by commenting that m2AChR expression is mainly associated with the lower field. This is counter to published findings showing that m2AChR spans across the entire visual field (Gamanut et al., 2018; Meier et al., 2021). The utility of markers for delineating areal boundaries is discounted, without any evidence, in disregard of evidence for distinct areal patterns in early development (Wang et al., 2011). Pointing out that markers can be distributed non-uniformly within an area is well-familiar. m2AChR is non-uniformly expressed in mouse V1, LM and LI (Ji et al., 2015; D'Souza et al., 2019; Meier et al., 2021). Recently, it has been found that the patchy organization within V1 plays a role in the organization of thalamocortical and intracortical networks (Meier et al., 2025). m2AChR-positive patches and m2AChR-negative interpatches organize the functionally distinct ventral and dorsal networks, notably without obvious bias for upper and lower parts of the visual field.

      (3) The study has adopted an area partitioning scheme, which is said to be based on anatomically defined boundaries of V2 (Zhuang et al., 2017). The only anatomical borders used by Zhuang et al. (2017) are those of V1 and barrel cortex, identified by cytochrome oxidase staining. In reality, the partitioning of the visual cortex was based on field sign maps, which are reproduced from Zhuang et al., (2017) in Figure 1A. It is unclear why the maps shown in Figures 2E and 2F differ from those in Figure 1A. It is possible that this is an oversight. But maintaining consistent areal boundaries across experimental conditions that are referenced to the underlying brain structure is critical for assigning modeled projections to areas or sub-regions. This problem is evident in Figure 2F, which is presented as evidence that the modeling approach recapitulates the tracings shown in Figure 3 of Wang and Burkhalter (2007). The dissimilarities between the modeling and tracing results are striking, unlike what is stated in the legend of Figure 2F.

      (4) The Rowley and Sedigh-Sarvestani find that the partial coverage of the visual field in higher order areas shown by Zhuang et al (2017) is recreated by the model. It is important to caution that Zhuang et al's (2017) maps were derived from incomplete mappings of the visual field, which was confined to -25-35 deg of elevation. This underestimates the coverage we have found in LM and AL. Receptive field mappings show that LM covers 0-90 deg of azimuth and -30-80 elevation (Wang and Burkhalter, 2007). AL covers at least 0-90 deg of azimuth and -30-50 deg of elevation (Wang and Burkhalter, 2007; Wang et al., 2011). These are important differences. Partial coverage in LM and AL underestimates the size of these areas and may map two projection patches as inputs to subregions of a single area rather than inputs to two separate areas. Complete, or nearly complete, visual representations in LM and AL support that each is a single area. Importantly, both areas are included in a callosal-free zone (Wang and Burkhalter, 2007). The surrounding callosal connections align with the vertical meridian representation. The single map reversal is marked by a transition in m2AChR expression and cytoarchitecture (Wang et al., 2011).

      (5) The statement that the "lack of visual field overlap across areas is suggestive of a lack of hierarchical processing" is predicated on the full acceptance of the mappings by Zhuang et al (2017). Based on the evidence reviewed above, the reclassification of visual areas proposed in Figure 1C seems premature.

      (6) The existence of lateral connections is not unique to rodent cortex and has been described in primates (Felleman and Van Essen, 1991).

      (7) Why the mouse and rat extrastriate visual cortex differ from those of many other mammals is unclear. One reason may be that mammals with V2 subregions are strongly binocular.

    3. Reviewer #3 (Public review):

      Summary:

      The authors review published literature and propose that a visual cortical region in the mouse that is widely considered to contain multiple visual areas should be considered a single visual area.

      Strengths:

      The authors point out that relatively new data showing reversals of visual-field sign within known, single visual areas of some species require that a visual field sign change by itself should not be considered evidence for a border between visual areas.

      Weaknesses:

      The existing data are not consistent with the authors' proposal to consolidate multiple mouse areas into a single "V2". This is because the existing definition of a single area is that it cannot have redundant representations of the visual field. The authors ignore this requirement, as well as the data and definitions found in published manuscripts, and make an inaccurate claim that "higher order visual areas in the mouse do not have overlapping representations of the visual field". For quantification of the extent of overlap of representations between 11 mouse visual areas, see Figure 6G of Garrett et al. 2014. [Garrett, M.E., Nauhaus, I., Marshel, J.H., and Callaway, E.M. (2014). Topography and areal organization of mouse visual cortex. The Journal of neuroscience 34, 12587-12600. 10.1523/JNEUROSCI.1124-14.2014.

    1. Reviewer #1 (Public review):

      Summary:

      Parise presents another instantiation of the Multisensory Correlation Detector model that can now accept stimulus-level inputs. This is a valuable development as it removes researcher involvement in the characterization/labeling of features and allows analysis of complex stimuli with a high degree of nuance that was previously unconsidered (i.e. spatial/spectral distributions across time). The author demonstrates the power of the model by fitting data from dozens of previous experiments including multiple species, tasks, behavioral modality, and pharmacological interventions.

      Strengths:

      One of the model's biggest strengths, in my opinion, is its ability to extract complex spatiotemporal co-relationships from multisensory stimuli. These relationships have typically been manually computed or assigned based on stimulus condition and often distilled to a single dimension or even single number (e.g., "-50 ms asynchrony"). Thus, many models of multisensory integration depend heavily on human preprocessing of stimuli and these models miss out on complex dynamics of stimuli; the lead modality distribution apparent in figure 3b and c are provocative. I can imagine the model revealing interesting characteristics of the facial distribution of correlation during continuous audiovisual speech that have up to this point been largely described as "present" and almost solely focused on the lip area.

      Another aspect that makes the MCD stand out among other models is the biological inspiration and generalizability across domains. The model was developed to describe a separate process - motion perception - and in a much simpler organism - drosophila. It could then describe a very basic neural computation that has been conserved across phylogeny (which is further demonstrated in the ability to predict rat, primate, and human data) and brain area. This aspect makes the model likely able to account for much more than what has already been demonstrated with only a few tweaks akin to the modifications described in this and previous articles from Parise.

      What allows this potential is that, as Parise and colleagues have demonstrated in those papers since our (re)introduction of the model in 2016, the MCD model is modular - both in its ability to interface with different inputs/outputs and its ability to chain MCD units in a way that can analyze spatial, spectral, or any other arbitrary dimension of a stimulus. This fact leaves wide-open the possibilities for types of data, stimuli, and tasks a simplistic neutrally inspired model can account for.

      And so it's unsurprising (but impressive!) that Parise has demonstrated the model's ability here to account for such a wide range of empirical data from numerous tasks (synchrony/temporal order judgement, localization, detection, etc.) and behavior types (manual/saccade responses, gaze, etc.) using only the stimulus and a few free parameters. This ability is another of the model's main strengths that I think deserves some emphasis: it represents a kind of validation of those experiments - especially in the context of cross-experiment predictions.

      Finally, what is perhaps most impressive to me is that the MCD (and the accompanying decision model) does all this with very few (sometimes zero) free parameters. This highlights the utility of the model and the plausibility of its underlying architecture, but also helps to prevent extreme overfitting if fit correctly.

      Weaknesses:

      The model boasts an incredible versatility across tasks and stimulus configurations and its overall scope of the model is to understand how and what relevant sensory information is extracted from a stimulus. We still need to exercise care when interpreting its parameters, especially considering the broader context of top-down control of perception and that some multisensory mappings may not be derivable purely from stimulus statistics (e.g., the complementary nature of some phonemes/visemes).

    2. Reviewer #2 (Public review):

      Summary:

      Building on previous models of multisensory integration (including their earlier correlation-detection framework used for non-spatial signals), the author introduces a population-level Multisensory Correlation Detector (MCD) that processes raw auditory and visual data. Crucially, it does not rely on abstracted parameters, as is common in normative Bayesian models," but rather works directly on the stimulus itself (i.e., individual pixels and audio samples). By systematically testing the model against a range of experiments spanning human, monkey, and rat data - the authors show that their MCD population approach robustly predicts perception and behavior across species with a relatively small (0-4) number of free parameters.

      Strengths:

      (1) Unlike prior Bayesian models that used simplified or parameterized inputs, the model here is explicitly computable from full natural stimuli. This resolves a key gap in understanding how the brain might extract "time offsets" or "disparities" from continuously changing audio-visual streams.

      (2) The same population MCD architecture captures a remarkable range of multisensory phenomena, from classical illusions (McGurk, ventriloquism) and synchrony judgments, to attentional/gaze behavior driven by audio-visual salience. This generality strongly supports the idea that a single low-level computation (correlation detection) can underlie many distinct multisensory effects.

      (3) By tuning model parameters to different temporal rhythms (e.g., faster in rodents, slower in humans), the MCD explains cross-species perceptual data without reconfiguring the underlying architecture.

      (4) The authors frame their model as a plausible algorithmic account of the Bayesian multisensory-integration models in Marr's levels of hierarchy.

      Weaknesses:

      What remains unclear is how the parameters themselves relate to stimulus quantities (like stimulus uncertainty), as is often straightforward in Bayesian models. A theoretical missing link is the explicit relationship between the parameters of the MCD models and those of a cue combination model, thereby bridging Marr's levels of hierarchy.

      Likely Impact and Usefulness

      The work offers a compelling unification of multiple multisensory tasks-temporal order judgments, illusions, Bayesian causal inference, and overt visual attention-under a single, fully stimulus-driven framework. Its success with natural stimuli should interest computational neuroscientists, systems neuroscientists, and machine learning scientists. This paper thus makes an important contribution to the field by moving beyond minimalistic lab stimuli, illustrating how raw audio and video can be integrated using elementary correlation analyses.

    1. Reviewer #1 (Public review):

      Summary:

      This is a wonderful and landmark study in the field of human embryo modeling. It uses patterned human gastruloids and conducts a functional screen on neural tube closure, and identifies positive and negative regulators, and defines the epistasis among them.

      Strengths:

      The above was achieved following optimization of the micro-pattern-based gastruloid protocol to achieve high efficiency, and then optimized to conduct and deliver CRISPRi without disrupting the protocol. This is a technical tour de force as well as one of the first studies to reveal new knowledge on human development through embryo models, which has not been done before.

      The manuscript is very solid and well-written. The figures are clear, elegant, and meaningful. The conclusions are fully supported by the data shown. The methods are well-detailed, which is very important for such a study.

      Weaknesses:

      This reviewer did not identify any meaningful, major, or minor caveats that need addressing or correcting.

      A minor weakness is that one can never find out if the findings in human embryo models can be in vitro revalidated in humans in vivo. This is for obvious and justified ethical reasons. However, the authors acknowledge this point in the section of the manuscript detailing the limitations of their study.

    2. Reviewer #2 (Public review):

      Summary:

      This manuscript is a technical report on a new model of early neurogenesis, coupled to a novel platform for genetic screens. The model is more faithful than others published to date, and the screening platform is an advance over existing ones in terms of speed and throughput.

      Strengths:

      It is novel and useful.

      Weaknesses:

      The novelty of the results is limited in terms of biology, mainly a proof of concept of the platform and a very good demonstration of the hierarchical interactions of the top regulators of GRNs.

      The value of the manuscript could be enhanced in two ways:

      (1) by showing its versatility and transforming the level of neural tube to midbrain and hindbrain, and looking at the transcriptional hierarchies there.

      (2) by relating the patterning of the organoids to the situation in vivo, in particular with the information in reference 49. The authors make a statement "To compare our findings with in vivo gene expression patterns, we applied the same approach to published scRNA-seq data from 4-week-old human embryos at the neurula stage" but it would be good to have a more nuanced reference: what stage, what genes are missing, what do they add to the information in that reference?

    1. Reviewer #1 (Public review):

      Summary:

      Gao et al. has demonstrated that the the pesticide emamectin benzoate (EB) treatment of brown plathopper (BPH) leads to increased egg laying in the insect, which is a common agricultural pest. The authors hypothesize that EB upregulates JH titer resulting in increased fecundity.

      Strengths:

      The finding that a class of pesticide increases fecundity of brown planthopper is interesting.

      Comments on revisions:

      All my concerns have been addressed to reasonable level of satisfaction.

    1. Reviewer #1 (Public review):

      In this manuscript, Roy et al. used the previously published deep transfer learning tool, DEGAS, to map disease associations onto single-cell RNA-seq data from bulk expression data. The authors performed independent runs of DEGAS using T2D or obesity status and identified distinct β-cell subpopulations. β-cells with high obese-DEGAS scores contained two subpopulations derived largely from either non-diabetic or T2D donors. Finally, immunostaining using human pancreas sections from healthy and T2D donors validated the heterogeneous expression and depletion of DLK1 in T2D islets.

      Strengths:

      (1) This meta-analysis of previously published scRNA-seq data uses a deep transfer learning tool.

      (2) Identification of novel beta cell subclusters.

      (3) Identified a relatively innovative role of DLK1 in T2D disease progression.

      Comments on revisions:

      All previous concerns have been addressed.

    2. Reviewer #2 (Public review):

      Summary:

      The manuscript by Gitanjali Roy et al. applies deep transfer learning (DEGAS) to assign patient-level disease attributes (metadata) to single cells of T2D and non-diabetic patients, including obese patients. This led to the identification of a singular cluster of T2D-associated β-cells; and two subpopulations of obese- β-cells derived from either non-diabetic or T2D donors. The objective was to identify novel and established genes implicated in T2D and obesity. Their final goal is to validate their findings at the protein level using immunohistochemistry of pancreas tissue from non-diabetic and T2D organ donors.

      Strengths:

      This paper is well-written, and the findings are relevant for β-cell heterogeneity in T2D and obesity.

      Weaknesses:

      The validation they provide is not sufficiently strong: no DLK1 immunohistochemistry is shown of obese patient-derived sections. Additional presumptive relevant candidates from this transcriptomic analysis should be screened for, at the protein level.

      Comments on revisions:

      The authors have largely addressed my comments. No further experiments are requested.

    1. Reviewer #1 (Public review):

      This study established a C921Y OGT-ID mouse model, systematically demonstrating in mammals the pathological link between O-GlcNAc metabolic imbalance and neurodevelopmental disorders (cortical malformation, microcephaly) as well as behavioral abnormalities (hyperactivity, impulsivity, learning/memory deficits). However, critical flaws in the current findings require resolution to ensure scientific rigor.

      The most concerning finding appears in Figure S12. While Supplementary Figure S12 demonstrates decreased OGA expression without significant OGT level changes in C921Y mutants via Western blot/qPCR, previous reports (Florence Authier, et al., Dis Model Mech. 2023) described OGT downregulation in Western blot and an increase in qPCR in the same models. The opposite OGT expression outcomes in supposedly identical mouse models directly challenge the model's reliability. This discrepancy raises serious concerns about either the experimental execution or the interpretation of results. The authors must revalidate the data with rigorous controls or provide a molecular biology-based explanation.

      A few additional comments to the author may be helpful to improve the study.

      Major

      (1) While this study systematically validated multi-dimensional phenotypes (including neuroanatomical abnormalities and behavioral deficits) in OGT C921Y mutant mice, there is a lack of relevant mechanisms and intervention experiments. For example, the absence of targeted intervention studies on key signaling pathways prevents verification of whether proteomics-identified molecular changes directly drive phenotypic manifestations.

      (2) Although MRI detected nodular dysplasia and heterotopia in the cingulate cortex, the cellular basis remains undefined. Spatiotemporal immunofluorescence analysis using neuronal (NeuN), astrocytic (GFAP), and synaptic (Synaptophysin) markers is recommended to identify affected cell populations (e.g., radial glial migration defects or intermediate progenitor differentiation abnormalities).

      (3) While proteomics revealed dysregulation in pathways including Wnt/β-catenin and mTOR signaling, two critical issues remain unresolved: a) O-GlcNAc glycoproteomic alterations remain unexamined; b) The causal relationship between pathway changes and O-GlcNAc imbalance lacks validation. It is recommended to use co-immunoprecipitation or glycosylation sequencing to confirm whether the relevant proteins undergo O-GlcNAc modification changes, identify specific modification sites, and verify their interactions with OGT.

      (4) Given that OGT-ID neuropathology likely originates embryonically, we recommend serial analyses from E14.5 to P7 to examine cellular dynamics during critical corticogenesis phases.

      (5) The interpretation of Figure 8A constitutes overinterpretation. Current data fail to conclusively demonstrate impairment of OGT's protein interaction network and lack direct evidence supporting the proposed mechanisms of HCF1 misprocessing or OGA loss.

    2. Reviewer #2 (Public review):

      Summary:

      The authors are trying to understand why certain mutants of O-GlcNAc transferase (OGT) appear to cause developmental disorders in humans. As an important step towards that goal, the authors generated a mouse model with one of these mutations that disrupts OGT activity. They then go on to test these mice for behavioral differences, finding that the mutant mice exhibit some signs of hyperactivity and differences in learning and memory. They then examine alterations to the structure of the brain and skull, and again find changes in the mutant mice that have been associated with developmental disorders. Finally, they identify proteins that are up- or down-regulated between the two mice as potential mechanisms to explain the observations.

      Strengths:

      The major strength of this manuscript is the creation of this mouse model, as a key step in beginning to understand how OGT mutants cause developmental disorders. This line will prove important for not only the authors but other investigators as well, enabling the testing of various hypotheses and potentially treatments. The experiments are also rigorously performed, and the conclusions are well supported by the data.

      Weaknesses:

      The only weakness identified is a lack of mechanistic insight. However, this certainly may come in the future through more targeted experimentation using this mouse model.

    1. Reviewer #1 (Public review):

      Summary:

      The manuscript presents a robust set of experiments that provide new fundamental insights into the role of STN neurons during active and passive avoidance tasks. These forms of avoidance have received comparatively less attention in the literature than the more extensively studied escape or freezing responses, despite being extremely relevant to human behaviour and more strongly influenced by cognitive control.

      Strengths:

      Understanding the neural infrastructure supporting avoidance behaviour would be a fundamental milestone in neuroscience. The authors employ sophisticated methods, including calcium imaging and optogenetics, to delineate the functions of STN neurons during avoidance behaviours. The work is extremely thorough, and the evidence presented is compelling. Experiments are carefully constructed, well-controlled, and the statistical analyses are appropriate.

      Points for Authors' Consideration:

      (1) Motoric role of STN:<br /> The authors interpret their findings within the context of active avoidance, a cognitively demanding process. An alternative interpretation is that STN activation enhances global motoric tone, facilitating general movement rather than specifically encoding cautious avoidance. Experimentally, this could be evaluated by examining STN-induced motoric tone in non-avoidance contexts, such as open field tests with bilateral stimulations. Alternatively, or additionally, the authors could explicitly discuss evidence for and against the possibility that increased motoric tone may account for aspects of the observed behaviours.

      (2) Temporal Dynamics in Calcium Imaging (AA2 vs. AA1):<br /> Based on previous work by this group, a delay (~1-2 sec) in neuronal response onset was anticipated in AA2 compared to AA1. Although a delay in peak response is observed, there is no clear evidence of a significant delay in response onset or changes in slope of neural activity. The authors could quantify calcium onset latencies and slopes and statistically compare these parameters across conditions.

      (3) Speed Differences (AA2 vs. AA1):<br /> Given the increased latency in AA2, and based on previous work from the group, one would expect faster movements following initiation. However, such differences are not evident in the presented data. The authors might want to discuss the absence of an expected speed increase and clarify whether this absence is consistent with previous findings.

      (4) Behavioural Differences Across Neuronal Classes (Figure 7):<br /> The manuscript currently does not compare responses of neuronal classes I, II, and III between AA1 and AA2 conditions separately or provide information regarding their activity during AA3.

      (5) Streamlining Narrative and Figures:<br /> Given the extensive amount of material presented, the manuscript and figures would benefit from streamlining. Many data points and graphs could be moved to supplementary materials without affecting the core interpretation and simplifying the reading of the work by a non-expert audience. Similarly, the main text could be refined to more clearly emphasise the key findings, which would improve both readability and impact. At the same time, certain aspects would benefit from additional clarification. For example, it would be helpful to explain the key features of the AA1-AA3 tasks at the point of introduction, rather than referring readers to previous literature. Overall, enhancing clarity and accessibility would serve the authors well and broaden the impact of the work.

    2. Reviewer #2 (Public review):

      Summary:

      Zhou, Sajid et al. present a study investigating the STN involvement in signaled movement. They use fiber photometry, implantable lenses, and optogenetics during active avoidance experiments to evaluate this. The data are useful for the scientific community, and the overall evidence for their claims is solid, but many aspects of the findings are confusing and seemingly contradictory. For example, STN activity increases with contraversive turning in the fiber photometry experiments, but optogenetic stimulation of the STN evokes ipsiversive turning. While the authors present a huge collection of data, it is somewhat difficult to extract the key information and the meaningful implications resulting from this data.

      Strengths:

      The study is comprehensive in using many techniques, stimulation powers, frequencies, and configurations.

      Weaknesses:

      Here are the specific weaknesses of the paper.

      (1) Vglut2 isn't a very selective promoter for the STN. Did the authors verify every injection across brain slices to ensure the para-subthalamic nucleus, thalamus, lateral hypothalamus, and other Vglut2-positive structures were never infected?

      (2) The authors say in the methods that the high vs low power laser activation for optogenetic experiments was defined by the behavioral output. This is misleading, and the high vs low power should be objectively stated and the behavioral results divided according to the power used, not according to the behavioral outcome.

      (3) In the fiber photometry experiments exposing mice to the range of tones, it is impossible to separate the STN response to the tone from the STN response to the movement evoked by the tone. The authors should expose the mouse to the tones in a condition that prevents movement, such as anesthetized or restrained, to separate out the two components.

      (4) The claim 'STN activation is ideally suited to drive active avoids' needs more explanation. This claim comes after the fiber photometry experiments during active avoidance tasks, so there has been no causality established yet.

      (5) The statistical comparisons in Figure 7E need some justification and/or clarification. The 9 neuron types are originally categorized based on their response during avoids, then statistics are run showing that they respond differently during avoids. It is no surprise that they would have significantly different responses, since that is how they were classified in the first place. The authors must explain this further and show that this is not a case of circular reasoning.

      (6) The authors show that neurons that have strong responses to orientation show reduced activity during avoidance. What are the implications of this? The author should explain why this is interesting and important.

      (7) It is not clear which conditions each mouse experienced in which order. This is critical to the interpretation of Figure 9 and the reduction of passive avoids during STN stimulation. Did these mice have the CS1+STN stimulation pairing or the STN+US pairing prior to this experiment? If they did, the stimulation of the STN could be strongly associated with either punishment or with the CS1 that predicts punishment. If that is the case, stimulating the STN during CS2 could be like presenting CS1+CS2 at the same time and could be confusing.

      (8) The experiments in Figure 10 are used to say that STN stimulation is not aversive, but they only show that STN stimulation cannot be used as punishment in place of a shock. This doesn't mean that it is not aversive; it just means it is not as aversive as a shock. The authors should do a simpler aversion test, such as conditioned or real-time place preference, to claim that STN stimulation is not aversive. This is particularly surprising as previous work (Serra et al., 2023) does show that STN stimulation is aversive.

      (9) In the discussion, the idea that the STN encodes 'moving away' from contralateral space is pretty vague and unsupported. It is puzzling that the STN activates more strongly to contraversive turns, but when stimulated, it evokes ipsiversive turns; however, it seems a stretch to speculate that this is related to avoidance. In the last experiments of the paper, the axons from the STN to the GPe and to the midbrain are selectively stimulated. Do these evoke ipsiversive turns similarly?

      (10) In the discussion, the authors claim that the STN is essential for modulating action timing in response to demands, but their data really only show this in one direction. The STN stimulation reliably increases the speed of response in all conditions (except maximum speed conditions such as escapes). It seems to be over-interpreting the data to say this is an inability to modulate the speed of the task, especially as clear learning and speed modulation do occur under STN lesion conditions, as shown in Figure 12B. The mice learn to avoid and increase their latency in AA2 vs AA1, though the overall avoids and latency are different from controls. The more parsimonious conclusion would be that STN stimulation biases movement speed (increasing it) and that this is true in many different conditions.

      (11) In the discussion, the authors claim that the STN projections to the midbrain tegmentum directly affect the active avoidance behavior, while the STN projections to the SNr do not affect it. This seems counter to their results, which show STN projections to either area can alter active avoidance behavior. What is the laser power used in these terminal experiments? If it is high (3mW), the authors may be causing antidromic action potentials in the STN somas, resulting in glutamate release in many brain areas, even when terminals are only stimulated in one area. The authors could use low (0.25mW) laser power in the terminals to reduce the chance of antidromic activation and spatially restrict the optical stimulation.

      (12) Was normality tested for data prior to statistical testing?

      (13) Why are there no error bars on Figure 5B, black circles and orange triangles?

    3. Reviewer #3 (Public review):

      Summary:

      The authors use calcium recordings from STN to measure STN activity during spontaneous movement and in a multi-stage avoidance paradigm. They also use optogenetic excitation, optogenetic inhibition, and lesion approaches to increase or decrease the activity of STN during the avoidance paradigm. The paper reports a large amount of data and makes many claims, some seem well supported to this Reviewer, others not so much.

      Strengths:

      Well-supported claims include data showing that during spontaneous movements, especially contraversive ones, STN calcium activity is increased using bulk photometry measurements. Single-cell measures back this claim but also show that it is only a modest minority of STN cells that respond strongly, with most showing no response during movement, and a similar number showing smaller inhibitions during movement.

      Similar data during cued active avoidance procedures show that STN calcium activity sharply increases in response to auditory cues, and during cued movements to avoid a footshock. Optogenetic and lesion experiments are consistent with an important role for STN in generating cue-evoked avoidance. And a strength of these results is that multiple bi-directional approaches were used.

      Weaknesses:

      I found the experimental design and presentation convoluted and the results over-interpreted.

      (1) I really don't understand or accept this idea that delayed movement is necessarily indicative of cautious movements. Is the distribution of responses multi-modal in a way that might support this idea, or do the authors simply take a normal distribution and assert that the slower responses represent 'caution'? Even if responses are multi-modal and clearly distinguished by 'type', why should readers think this that delayed responses imply cautious responding instead of say: habituation or sensitization to cue/shock, variability in attention, motivation, or stress; or merely uncertainty which seems plausible given what I understand of the task design where the same mice are repeatedly tested in changing conditions. This relates to a major claim (i.e., in the work's title).

      (2) Related to the last, I'm struggling to understand the rationale for dividing cells into 'types' based the their physiological responses in some experiments (e.g., Figure 7).

      (3) The description and discussion of orienting head movements were not well supported, but were much discussed in the avoidance datasets. The initial speed peaks to cue seem to be the supporting data upon which these claims rest, but nothing here suggests head movement or orientation responses.

      (4) Similar to the last, the authors note in several places, including abstract, the importance of STN in response timing, i.e., particularly when there must be careful or precise timing, but I don't think their data or task design provides a strong basis for this claim.

      (5) I think that other reports show that STN calcium activity is recruited by inescapable foot shock as well. What do these authors see? Is shock, independent of movement, contributing to sharp signals during escapes?

      (6) In particular, and related to the last point, the following work is very relevant and should be cited: https://elifesciences.org/reviewed-preprints/104643#tab-content. Note that the focus of this other paper is on a subset of VGLUT2+ Tac1 neurons in paraSTN, but using VGLUT2-Cre to target STN will target both STN and paraSTN.

      (7) In multiple other instances, claims that were more tangential to the main claims were made without clearly supporting data or statistics. E.g., claim that STN activation is related to translational more than rotational movement; claim that GCaMP and movement responses to auditory cues were small; claims that 'some animals' responded differently without showing individual data.

      (8) In several figures, the number of subjects used was not described. This is necessary. Also necessary is some assessment of the variability across subjects. The only measure of error shown in many figures relates to trial-to-trial or event variability, which is minimal because, in many cases, it appears that hundreds of trials may have been averaged per animal, but this doesn't provide a strong view of biological variability. When bar/line plots are used to display data, I recommend showing individual animals where feasible.

      (9) Can the authors consider the extent to which calcium imaging may be better suited to identify increases compared to decreases and how this may affect the results, particularly related to the GRIN data when similar numbers of cells show responses in both directions (e.g., Figure 3)?

      (10) Raw example traces are not provided.

      (11) The timeline of the spontaneous movement and avoidance sessions was not clear, nor was the number of events or sessions per animal nor how this was set. It is not clear if there was pre-training or habituation, if many or variable sessions were combined per animal, or what the time gaps between sessions were, or if or how any of these parameters might influence interpretation of the results.

      (12) It is not clear if or how the spread of expression outside of the target STN was evaluated, and if or how many mice were excluded due to spread or fiber placements.

    1. Reviewer #1 (Public review):

      Summary:

      Previous studies have shown that treatment with 17α-estradiol (a stereoisomer of the 17β-estradiol) extends lifespan in male mice but not in females. The current study by Li et al, aimed to identify cell-specific clusters and populations in the hypothalamus of aged male rats treated with 17α-estradiol (treated for 6 months). This study identifies genes and pathways affected by 17α-estradiol in the aged hypothalamus.

      Strengths:

      Using single-nucleus transcriptomic sequencing (snRNA-seq) on hypothalamus from aged male rats treated with 17α-estradiol they show that 17α-estradiol significantly attenuated age-related increases in cellular metabolism, stress, and decreased synaptic activity in neurons.

      Moreover, sc-analysis identified GnRH as one of the key mediators of 17α-estradiol's effects on energy homeostasis. Furthermore, they show that CRH neurons exhibited a senescent phenotype, suggesting a potential side effect of the 17α-estradiol. These conclusions are supported by supervised clustering by neuropeptides, hormones, and their receptors.

      Weaknesses:

      However, the study has several limitations that reduce the strength of the key claims in the manuscript. In particular:

      (1) The study focused only on males and did not include comparisons with females. However, previous studies have shown that 17α-estradiol extends lifespan in a sex-specific manner in mice, affecting males but not females. Without the comparison with the female data, it's difficult to assess its relevance to the lifespan.

      (2) Its not known whether 17α-estradiol leads to lifespan extension in male rats similar to male mice. Therefore, it is not possible to conclude that the observed effects in the hypothalamus, are linked to the lifespan extension. The manuscript cited in the introduction does not include lifespan data on rats.

      (3) The effect of 17α-estradiol on non-neuronal cells such as microglia and astrocytes is not well described (Fig.1). Previous studies demonstrated that 17α-estradiol reduces microgliosis and astrogliosis in the hypothalamus of aged male mice. Current data suggest that the proportion of oligo, and microglia were increased by the drug treatment, while the proportions of astrocytes were decreased. These data might suggest possible species differences, differences in the treatment regimen, or differences in drug efficiency. This has to be discussed.

      A more detailed analysis of glial cell types within the hypothalamus in response to drug should be provided.

      (4) The conclusion that CRH neurons are going into senescence is not clearly supported by the data. A more detailed analysis of the hypothalamus such as histological examination to assess cellular senescence markers in CRH neurons, is needed to support this claim.

      Revised submission:

      Some of the concerns were addressed in this revised version, and the authors responded and addressed study design limitations in both sexes/ages.

      However, there are still some concerns that were not sufficiently addressed:<br /> While the term "senescent" was changed to "stressed," some histological/ cellular validation of this phenotype is still needed.

      Some discussion on the sex-specific effects of 17α-estradiol in the hypothalamus is still required. Previous studies in mice demonstrated that 17α-estradiol reduced hypothalamic microgliosis and astrogliosis in male but not female UM-HET3 mice.

      Additionally, the provided analysis on astrocytes and microglia is superficial.

    2. Reviewer #2 (Public review):

      Summary:

      Li et al. investigated the potential anti-ageing role of 17α-Estradiol on the hypothalamus of aged rats. To achieve this, they employed a very sophisticated method for single-cell genomic analysis that allowed them to analyze effects on various groups of neurons and non-neuronal cells. They were able to sub-categorize neurons according to their capacity to produce specific neurotransmitters, receptors, or hormones. They found that 17α-Estradiol treatment led to an improvement in several factors related to metabolism and synaptic transmission by bringing the expression levels of many of the genes of these pathways closer or to the same levels to those of young rats, reversing the ageing effect. Interestingly, among all neuronal groups, the proportion of Oxytocin-expressing neurons seems to be the one most significantly changing after treatment with 17α-Estradiol, suggesting an important role of these neurons on mediating its anti-ageing effects. This was also supported by an increase in circulating levels of oxytocin. It was also found that gene expression of corticotropin-releasing hormone neurons was significantly impacted by 17α-Estradiol even though it was not different between aged and young rats, suggesting that these neurons could be responsible for side effects related to this treatment. This article revealed some potential targets that should be further investigated in future studies regarding the role of 17α-Estradiol treatment in aged males.

      Strengths:

      • The single nucleus mRNA sequencing is a very powerful method for gene expression analysis and clustering. The supervised clustering of neurons was very helpful in revealing otherwise invisible differences between neuronal groups and helped identify specific neuronal populations as targets.

      • There is a variety of functions used that allowed the differential analysis of a very complex type of data. This led to a better comparison between the different groups in many levels.

      • There were some physiological parameters measured such as circulating hormone levels that helped the interpretation of the effects of the changes in hypothalamic gene expression.

      Weaknesses:

      • One main control group is missing from the study, the young males treated with 17α-Estradiol.

      • Even though the technical approach is a sophisticated one, analyzing the whole rat hypothalamus instead of specific nuclei or subregions makes the study weaker.

      • Although the authors claim to have several findings, the data fail to support these claims.

      • The study is about improving ageing but no physiological data from the study demonstrated such claim with the exception of the testes histology which was not properly analyzed and was not even significantly different between the groups.

      • Overall, the study remains descriptive with no physiological data to demonstrate that any of the effects on hypothalamic gene expression is related to metabolic, synaptic or other function.

      Comments on revisions:

      The authors revised part of the manuscript to address some of the reviewers' comments. This improved the language and the text flow to a certain extent. They also added an additional analysis including glial cells. However, they failed to address the main weaknesses brought up by the reviewers and did not add any experimental demonstration of their claims on lifespan expansion induced by 17α-estradiol in rats (the cited study does not include lifespan in rats). In addition, they insisted i keeping parts in the discussion that are not directly linked to any of the papers' findings.

    1. Reviewer #1 (Public review):

      In this manuscript, Dillard and colleagues integrate cross-species genomic data with a systems approach to identify potential driver genes underlying human GWAS loci and establish the cell type(s) within which these genes act and potentially drive disease.

      Specifically, they utilize a large single cell RNA-seq (scRNA-seq) dataset from an osteogenic cell culture model - bone marrow-derived stromal cells cultured under osteogenic conditions (BMSC-OBs) - from a genetically diverse outbred mouse population called the Diversity Outbred (DO) stock to discover network driver genes that likely underlie human bone mineral density (BMD) GWAS loci. The DO mice segregate over 40M single nucleotide variants, many of which affect gene expression levels, therefore making this an ideal population for systems genetic and co-expression analyses.

      The current study builds on previous published work from the same group that used co-expression analysis to identify co-expressed "modules" of genes that were enriched for BMD GWAS associations. In this study, the authors utilized a much larger scRNA-seq dataset from 80 DO BMSC-OBs, inferred co-expression based on Bayesian networks for each identified mesenchymal cell type, focused on networks with dynamic expression trajectories that are most likely driving differentiation of BMSC-OBs, and then prioritized genes ("differentiation driver genes" or DDGs) in these osteogenic differentation networks that had known expression or splicing QTLs (eQTL/sQTLs) in any GTEx tissue that co-localized with human BMD GWAS loci. The systems analysis is impressive, the experimental methods are described in detail, and the experiments appear to be carefully done. The computational analysis of the single cell data is comprehensive and thorough, and the evidence presented in support of the identified DDGs, including Tpx2 and Fgfrl1, is for the most part convincing. Some limitations in the data resources and methods hamper enthusiasm somewhat and are discussed below.

      Overall, while this study will no doubt be valuable to the BMD community, the cross-species data integration and analytical framework may be more valuable and generally applicable to the study of other diseases, especially for diseases with robust human GWAS data but for which robust human genomic data in relevant cell types is lacking.

      Specific strengths of the study include the large scRNA-seq dataset on BMSC-OBs from 80 DO mice, the clustering analysis to identify specific cell types and sub-types, the comparison of cell type frequencies across the DO mice, and the CELLECT analysis to prioritize cell clusters that are enriched for BMD heritability (Figure 1). The network analysis pipeline outlined in Figure 2 is also a strength, as is the pseudotime trajectory analysis (results in Figure 3).

      Potential drawbacks of the authors' approach include their focus on genes that were previously identified as having an eQTL or sQTL in any GTEx tissue. The authors rightly point out that the GTEx database does not contain data for bone tissue, but reason that eQTLs can be shared across many tissues - this assumption is valid for many cis-eQTLs, but it could also exclude many genes as potential DDGs with effects that are specific to bone/osteoblasts. Indeed, the authors show that important BMD driver genes have cell-type specific eQTLs. Another issue concerns potential model overfitting in the iterativeWGCNA analysis of mesenchymal cell type-specific co-expression, which identified an average of 76 co-expression modules per cell cluster (range 26-153). Based on the limited number of genes that are detected as expressed in a given cell due to sparse per cell read depth (400-6200 reads/cell) and drop outs, it's surprising that as many as 153 co-expression modules could be distinguished within any cell cluster. I would suspect some degree of model overfitting is responsible for these results.

      Overall, though, these concerns are minor relative to the many strengths of the study design and results. Indeed, I expect the analytical framework employed by the authors here will be valuable to -- and replicated by -- researchers in other disease areas.

      Comments on revisions:

      Thank you for addressing my concerns. This is an impressive study and manuscript that you should be proud of.

    2. Reviewer #2 (Public review):

      Summary:

      In this manuscript, Farber and colleagues have performed single cell RNAseq analysis on bone marrow derived stem cells from DO Mice. By performing network analysis, they look for driver genes that are associated with bone mineral density GWAS associations. They identify two genes as potential candidates to showcase the utility of this approach.

      Strengths:

      The study is very thorough and the approach is innovative and exciting. The manuscript contains some interesting data relating to how cell differentiation is occurring and the effects of genetics on this process. The section looking for genes with eQTLs that differ across the differentiation trajectory (Figure 4) was particularly exciting.

      Weaknesses:

      The manuscript is, in parts, hard to read due to the use of acronyms and there are some questions about data analysis that still need to be addressed.

      Comments on revisions:

      Dillard et al have made several improvements to their manuscript.

      (1) We previously asked the authors to determine whether any cell types were enriched for BMD-related traits since the premise of the paper is that 'many genes impacting BMD do so by influencing osteogenic differentiation or ... adipogenic differentiation'. Given the potential for the cell culture method to skew the cell type distribution non-physiologically, it is important to establish which cell types in their assay are most closely associated with BMD traits. The new CELLECT analysis and Figure 1E address this point nicely. However, it would still be nice to see the correlations between these cell types and BMD traits in the mice as this would provide independent evidence to support their physiological importance more broadly.

      (2) Shortening the introduction.

      (3) Addressing limitations that arise from not accounting for founder genome SNPs when aligning scRNA-seq data.

      (4) The main take-away of this paper is, to us, the development of a single cell approach to studying BMD-related traits. It is encouraging that the cells post-culture appear to be representative of those pre-culture (supplemental figure 3).

      However, the authors seem to have neglected several comments made by both reviewers. While we share the authors' enthusiasm for the single cell analytical approach, we do not understand their reluctance to perform further statistical tests. We feel that the following comments have still not been addressed:

      (1) The manuscript still contains the following:

      "To provide further support that tradeSeq-identified genes are involved in differentiation, we performed a cell type-specific expression quantitative trait locus (eQTL) analysis for each mesenchymal cell type from the 80 DO mice. We identified 563 genes (eGenes) regulated by a significant cis-eQTL in specific cell types of the BMSC-OB scRNA-seq data (Supplementary Table S14). In total, 73 eGenes were also tradeSeq-identified genes in one or more cell type boundaries along their respective trajectories (Supplementary Table S9)."

      The purpose of this paragraph is to convince readers that the eGenes approach aligns with the tradeSeq approach (and that their approach can therefore be trusted). It is essential that such claims are supported by statistical reasoning. Given that it would be very simple to perform permutation/enrichment analyses to address this point, and both reviewers requested similar analyses, we do not understand the author's reluctance here. Otherwise, this section should be rewritten so that it does not imply that the identification of these genes provides support for their approach.

      (2) Given that a central purpose of this manuscript is to establish a systematic workflow for identifying candidate genes, the manuscript could still benefit from more explanation as to why the authors chose to highlight Tpx2 and Fgfrl1. Tpx2 does already have a role in bone physiology through the IMPC. The authors should comment on why they did not explore Kremen1, for instance, as this gene seems important for the transition to both OB1 and 2.

      A final minor comment is that it would be very helpful if the authors could indicate if the DDGs in Table 1 are also eGenes for the relevant cell type. This is much more meaningful than looking through GTEx.

    1. Reviewer #1 (Public review):

      Summary:

      The authors introduce a densely-sampled dataset where 6 participants viewed images and sentence descriptions derived from the MS Coco database over the course of 10 scanning sessions. The authors further showcase how image and sentence decoders can be used to predict which images or descriptions were seen, using pairwise decoding across a set of 120 test images. The authors find decodable information widely distributed across the brain, with a left-lateralized focus. The results further showed that modality-agnostic models generally outperformed modality-specific models, and that data based on captions was not explained better by caption-based models but by modality-agnostic models. Finally, the authors decoded imagined scenes.

      Strengths:

      (1) The dataset presents a potentially very valuable resource for investigating visual and semantic representations and their interplay.

      (2) The introduction and discussion are very well written in the context of trying to understand the nature of multimodal representations and present a comprehensive and very useful review of the current literature on the topic.

      Weaknesses:

      (1) The paper is framed as presenting a dataset, yet most of it revolves around the presentation of findings in relation to what the authors call modality-agnostic representations, and in part around mental imagery. This makes it very difficult to assess the manuscript, whether the authors have achieved their aims, and whether the results support the conclusions.

      (2) While the authors have presented a potential use case for such a dataset, there is currently far too little detail regarding data quality metrics expected from the introduction of similar datasets, including the absence of head-motion estimates, quality of intersession alignment, or noise ceilings of all individuals.

      (3) The exact methods and statistical analyses used are still opaque, making it hard for a reader to understand how the authors achieved their results. More detail in the manuscript would be helpful, specifically regarding the exact statistical procedures, what tests were performed across, or how data were pooled across participants.

      (4) Many findings (e.g., Figure 6) are still qualitative but could be supported by quantitative measures.

      (5) Results are significant in regions that typically lack responses to visual stimuli, indicating potential bias in the classifier. This is relevant for the interpretation of the findings. A classification approach less sensitive to outliers (e.g., 70-way classification) could avoid this issue. Given the extreme collinearity of the experimental design, regressors in close temporal proximity will be highly similar, which could lead to leakage effects.

      (6) The manuscript currently lacks a limitations section, specifically regarding the design of the experiment. This involves the use of the overly homogenous dataset Coco, which invites overfitting, the mixing of sentence descriptions and visual images, which invites imagery of previously seen content, and the use of a 1-back task, which can lead to carry-over effects to the subsequent trial.

      (7) I would urge the authors to clarify whether the primary aim is the introduction of a dataset and showing the use of it, or whether it is the set of results presented. This includes the title of this manuscript. While the decoding approach is very interesting and potentially very valuable, I believe that the results in the current form are rather descriptive, and I'm wondering what specifically they add beyond what is known from other related work. This includes imagery-related results. This is completely fine! It just highlights that a stronger framing as a dataset is probably advantageous for improving the significance of this work.

    2. Reviewer #2 (Public review):

      Summary:

      This study introduces SemReps-8K, a large multimodal fMRI dataset collected while subjects viewed natural images and matched captions, and performed mental imagery based on textual cues. The authors aim to train modality-agnostic decoders--models that can predict neural representations independently of the input modality - and use these models to identify brain regions containing modality-agnostic information. They find that such decoders perform comparably or better than modality-specific decoders and generalize to imagery trials.

      Strengths:

      (1) The dataset is a substantial and well-controlled contribution, with >8,000 image-caption trials per subject and careful matching of stimuli across modalities - an essential resource for testing theories of abstract and amodal representation.

      (2) The authors systematically compare unimodal, multimodal, and cross-modal decoders using a wide range of deep learning models, demonstrating thoughtful experimental design and thorough benchmarking.

      (3) Their decoding pipeline is rigorous, with informative performance metrics and whole-brain searchlight analyses, offering valuable insights into the cortical distribution of shared representations.

      (4) Extension to mental imagery decoding is a strong addition, aligning with theoretical predictions about the overlap between perception and imagery.

      Weaknesses:

      While the decoding results are robust, several critical limitations prevent the current findings from conclusively demonstrating truly modality-agnostic representations:

      (1) Shared decoding ≠ abstraction: Successful decoding across modalities does not necessarily imply abstraction or modality-agnostic coding. Participants may engage in modality-specific processes (e.g., visual imagery when reading, inner speech when viewing images) that produce overlapping neural patterns. The analyses do not clearly disambiguate shared representational structure from genuinely modality-independent representations. Furthermore, in Figure 5, the modality-agnostic encoder did not perform better than the modality-specific decoder trained on images (in decoding images), but outperformed the modality-specific decoder trained on captions (in decoding captions). This asymmetry contradicts the premise of a truly "modality-agnostic" encoder. Additionally, given the similar performance between modality-agnostic decoders based on multimodal versus unimodal features, it remains unclear why neural representations did not preferentially align with multimodal features if they were truly modality-independent.

      (2) The current analysis cannot definitively conclude that the decoder itself is modality-agnostic, making "Qualitative Decoding Results" difficult to interpret in this context. This section currently provides illustrative examples, but lacks systematic quantitative analyses.

      (3) The use of mental imagery as evidence for modality-agnostic decoding is problematic. Imagery involves subjective, variable experiences and likely draws on semantic and perceptual networks in flexible ways. Strong decoding in imagery trials could reflect semantic overlap or task strategies rather than evidence of abstraction.

      The manuscript presents a methodologically sophisticated and timely investigation into shared neural representations across modalities. However, the current evidence does not clearly distinguish between shared semantics, overlapping unimodal processes, and true modality-independent representations. A more cautious interpretation is warranted. Nonetheless, the dataset and methodological framework represent a valuable resource for the field.

    3. Reviewer #3 (Public review):

      Summary:

      The authors recorded brain responses while participants viewed images and captions. The images and captions were taken from the COCO dataset, so each image has a corresponding caption, and each caption has a corresponding image. This enabled the authors to extract features from either the presented stimulus or the corresponding stimulus in the other modality. The authors trained linear decoders to take brain responses and predict stimulus features. "Modality-specific" decoders were trained on brain responses to either images or captions, while "modality-agnostic" decoders were trained on brain responses to both stimulus modalities. The decoders were evaluated on brain responses while the participants viewed and imagined new stimuli, and prediction performance was quantified using pairwise accuracy. The authors reported the following results:

      (1) Decoders trained on brain responses to both images and captions can predict new brain responses to either modality.

      (2) Decoders trained on brain responses to both images and captions outperform decoders trained on brain responses to a single modality.

      (3) Many cortical regions represent the same concepts in vision and language.

      (4) Decoders trained on brain responses to both images and captions can decode brain responses to imagined scenes.

      Strengths:

      This is an interesting study that addresses important questions about modality-agnostic representations. Previous work has shown that decoders trained on brain responses to one modality can be used to decode brain responses to another modality. The authors build on these findings by collecting a new multimodal dataset and training decoders on brain responses to both modalities.

      To my knowledge, SemReps-8K is the first dataset of brain responses to vision and language where each stimulus item has a corresponding stimulus item in the other modality. This means that brain responses to a stimulus item can be modeled using visual features of the image, linguistic features of the caption, or multimodal features derived from both the image and the caption. The authors also employed a multimodal one-back matching task, which forces the participants to activate modality-agnostic representations. Overall, SemReps-8K is a valuable resource that will help researchers answer more questions about modality-agnostic representations.

      The analyses are also very comprehensive. The authors trained decoders on brain responses to images, captions, and both modalities, and they tested the decoders on brain responses to images, captions, and imagined scenes. They extracted stimulus features using a range of visual, linguistic, and multimodal models. The modeling framework appears rigorous, and the results offer new insights into the relationship between vision, language, and imagery. In particular, the authors found that decoders trained on brain responses to both images and captions were more effective at decoding brain responses to imagined scenes than decoders trained on brain responses to either modality in isolation. The authors also found that imagined scenes can be decoded from a broad network of cortical regions.

      Weaknesses:

      The characterization of "modality-agnostic" and "modality-specific" decoders seems a bit contradictory. There are three major choices when fitting a decoder: the modality of the training stimuli, the modality of the testing stimuli, and the model used to extract stimulus features. However, the authors characterize their decoders based on only the first choice-"modality-specific" decoders were trained on brain responses to either images or captions, while "modality-agnostic" decoders were trained on brain responses to both stimulus modalities. I think that this leads to some instances where the conclusions are inconsistent with the methods and results.

      First, the authors suggest that "modality-specific decoders are not explicitly encouraged to pick up on modality-agnostic features during training" (line 137) while "modality-agnostic decoders may be more likely to leverage representations that are modality-agnostic" (line 140). However, whether a decoder is required to learn modality-agnostic representations depends on both the training responses and the stimulus features. Consider the case where the stimuli are represented using linguistic features of the captions. When you train a "modality-specific" decoder on image responses, the decoder is forced to rely on modality-agnostic information that is shared between the image responses and the caption features. On the other hand, when you train a "modality-agnostic" decoder on both image responses and caption responses, the decoder has access to the modality-specific information that is shared by the caption responses and the caption features, so it is not explicitly required to learn modality-agnostic features. As a result, while the authors show that "modality-agnostic" decoders outperform "modality-specific" decoders in most conditions, I am not convinced that this is because they are forced to learn more modality-agnostic features.

      Second, the authors claim that "modality-specific decoders can be applied only in the modality that they were trained on, while "modality-agnostic decoders can be applied to decode stimuli from multiple modalities, even without knowing a priori the modality the stimulus was presented in" (line 47). While "modality-agnostic" decoders do outperform "modality-specific" decoders in the cross-modality conditions, it is important to note that "modality-specific" decoders still perform better than expected by chance (figure 5). It is also important to note that knowing about the input modality still improves decoding performance even for "modality-agnostic" decoders, since it determines the optimal feature space-it is better to decode brain responses to images using decoders trained on image features, and it is better to decode brain responses to captions using decoders trained on caption features.

    1. Reviewer #1 (Public review):

      In this manuscript, the authors describe a new method to more accurately estimate the fitness advantage of new SARS-CoV-2 variants when they emerge. This was a key public health question during the pandemic and drove a number of important policy choices during the latter half of the acute phase of the pandemic. They attempt to link fitness to expected wave size. The analyses are tested on data from 33 different US states for which the data were considered sufficient. The main novelty of the method is that it links the frequency of variants to the number of cases and thus estimates fitness in terms of the reproduction number.

      The results with the new method appear to be more consistent estimates of fitness advantage over time, suggesting that the methods suggested are more accurate than the comparator methods.

      Given that the paper presents a methodological advancement, the absence of a simulation study is a weakness. I am satisfied that the trends estimated via the different approaches suggest a useful advancement for a difficult problem. However, the work would have been considerably stronger if synthetic data had been used to illustrate without doubt how the revised method better captures underlying, pre-specified differences in fitness.

    2. Reviewer #2 (Public review):

      Summary:

      This study develops a joint epidemiological and population genetic model to infer variant-specific effective reproduction numbers Rt and growth advantages of SARS-CoV-2 variants using US case counts and sequence data (Jan 2021-Mar 2022). For this, they use the commonly used renewal equation framework, observation models (negative binomial with zero inflation and Dirichlet-multinomial likelihoods, both to account for overdispersion). For the parameterization of Rt, again, they used a classic cubic spline basis expansion. Additionally, they use Bayesian Inference, specifically SVI. I was reassured to see the sensitivity analysis on the generation time to check effects on Rt.

      This is an incredibly robust study design. Integrating case and sequence data enables estimation of both absolute and relative variant fitness, overcoming limitations of frequency-only or case-only models. This reminds me of https://www.medrxiv.org/content/10.1101/2023.01.02.23284123v4.full

      I also really appreciated the flexible and interpretable parameterization of the renewal equations with splines. But I may be biased since I really like splines!

      The approach is justified, however, it has some big limitations. Specifically, there are some notable weaknesses, that I detail below.

      (1) The model does not account for demographic stochasticity or transmission overdispersion (superspreading), which are known to affect SARS-CoV-2 dynamics and can bias Rt, especially in low incidence or early introduction phases.

      (2) While the authors explore the sensitivity of generation time, the reliance on fixed generation time parameters (with some adjustments for Delta/Omicron) may still bias results

      (3) There is no explicit adjustment for population immunity, which limits the ability to disentangle intrinsic variant fitness (even though the model allows for inclusion of covariates - this to me is one of two major flaws in the study.

      (4) The second major flaw in my opinion is that there is no hierarchical pooling across states - each state is modeled independently. A hierarchical Bayesian model could borrow strength across states, improving estimates for states with sparse data and enabling more robust inference of shared variant effects.

      I would strongly recommend the following things in order of priority, where the first two points I consider critical.

      (1) Implement a hierarchical model for variant growth advantages and Rt across states.

      (2) Include time-varying covariates for vaccination rates, prior infection, and non-pharmaceutical interventions directly. This would help disentangle intrinsic variant transmissibility from changes in population susceptibility and behavior.

      (3) Extend the renewal model to a stochastic or branching process framework that explicitly models overdispersed transmission.

      (4) It would be good to allow for multiple seeding events per variant and per state. This can be informed by phylogeography in a minimum effort way and would improve the accuracy of Rt.

      (5) By now, I don't think it will be a surprise that addressing sampling bias is standard, reweighting sequence data or comparing results with independent surveillance data to assess the impact of non-representative sequencing.

    1. Reviewer #1 (Public review):

      This is a well-designed and very interesting study examining the impact of imprecise feedback on outcomes on decision-making. I think this is an important addition to the literature and the results here, which provide a computational account of several decision-making biases, are insightful and interesting.

      I do not believe I have substantive concerns related to the actual results presented; my concerns are more related to the framing of some of the work. My main concern is regarding the assertion that the results prove that non-normative and non-Bayesian learning is taking place. I agree with the authors that their results demonstrate that people will make decisions in ways that demonstrate deviations from what would be optimal for maximizing reward in their task under a strict application of Bayes rule. I also agree that they have built reinforcement learning models which do a good job of accounting for the observed behavior. However, the Bayesian models included are rather simple- per the author descriptions, applications of Bayes' rule with either fixed or learned credibility for the feedback agents. In contrast, several versions of the RL models are used, each modified to account for different possible biases. However more complex Bayes-based models exist, notably active inference but even the hierarchical gaussian filter. These formalisms are able to accommodate more complex behavior, such as affect and habits, which might make them more competitive with RL models. I think it is entirely fair to say that these results demonstrate deviations from an idealized and strict Bayesian context; however, the equivalence here of Bayesian and normative is I think misleading or at least requires better justification/explanation. This is because a great deal of work has been done to show that Bayes optimal models can generate behavior or other outcomes that are clearly not optimal to an observer within a given context (consider hallucinations for example) but which make sense in the context of how the model is constructed as well as the priors and desired states the model is given.

      As such, I would recommend that the language be adjusted to carefully define what is meant by normative and Bayesian and to recognize that work that is clearly Bayesian could potentially still be competitive with RL models if implemented to model this task. An even better approach would be to directly use one of these more complex modelling approaches, such as active inference, as the comparator to the RL models, though I would understand if the authors would want this to be a subject for future work.

      Abstract:

      The abstract is lacking in some detail about the experiments done, but this may be a limitation of the required word count? If word count is not an issue, I would recommend adding details of the experiments done and the results. One comment is that there is an appeal to normative learning patterns, but this suggests that learning patterns have a fixed optimal nature, which may not be true in cases where the purpose of the learning (e.g. to confirm the feeling of safety of being in an in-group) may not be about learning accurately to maximize reward. This can be accommodated in a Bayesian framework by modelling priors and desired outcomes. As such the central premise that biased learning is inherently non-normative or non-Bayesian I think would require more justification. This is true in the introduction as well.

      Introduction:

      As noted above the conceptualization of Bayesian learning being equivalent to normative learning I think requires either further justification. Bayesian belief updating can be biased an non-optimal from an observer perspective, while being optimal within the agent doing the updating if the priors/desired outcomes are set up to advantage these "non-optimal" modes of decision making.

      Results:

      I wonder why the agent was presented before the choice - since the agent is only relevant to the feedback after the choice is made. I wonder if that might have induced any false association between the agent identity and the choice itself. This is by no means a critical point but would be interesting to get the authors' thoughts.

      The finding that positive feedback increases learning is one that has been shown before and depends on valence, as the authors note. They expanded their reinforcement learning model to include valence; but they did not modify the Bayesian model in a similar manner. This lack of a valence or recency effect might also explain the failure of the Bayesian models in the preceding section where the contrast effect is discussed. It is not unreasonable to imagine that if humans do employ Bayesian reasoning that this reasoning system has had parameters tuned based on the real world, where recency of information does matter; affect has also been shown to be incorporable into Bayesian information processing (see the work by Hesp on affective charge and the large body of work by Ryan Smith). It may be that the Bayesian models chosen here require further complexity to capture the situation, just like some of the biases required updates to the RL models. This complexity, rather than being arbitrary, may be well justified by decision making in the real world.

      The methods mention several symptom scales- it would be interesting to have the results of these and any interesting correlations noted. It is possible that some of individual variability here could be related to these symptoms, which could introduce precision parameter changes in a Bayesian context and things like reward sensitivity changes in an RL context.

      Discussion:

      (For discussion, not a specific comment on this paper): One wonders also about participant beliefs about the experiment or the intent of the experimenters. I have often had participants tell me they were trying to "figure out" a task or find patterns even when this was not part of the experiment. This is not specific to this paper, but it may be relevant in the future to try and model participant beliefs about the experiment especially in the context of disinformation, when they might be primed to try and "figure things out".

      As a general comment, in the active inference literature, there has been discussion of state-dependent actions, or "habits", which are learned in order to help agents more rapidly make decisions, based on previous learning. It is also possible that what is being observed is that these habits are at play, and that they represent the cognitive biases. This is likely especially true given, as the authors note, the high cognitive load of the task. It is true that this would mean that full-force Bayesian inference is not being used in each trial, or in each experience an agent might have in the world, but this is likely adaptive on the longer timescale of things, considering resource requirements. I think in this case you could argue that we have a departure from "normative" learning, but that is not necessarily a departure from any possible Bayesian framework, since these biases could potentially be modified by the agent or eschewed in favor of more expensive full-on Bayesian learning when warranted. Indeed in their discussion on the strategy of amplifying credible news sources to drown out low-credibility sources, the authors hint to the possibility of longer term strategies that may produce optimal outcomes in some contexts, but which were not necessarily appropriate to this task. As such, the performance on this task- and the consideration of true departure from Bayesian processing- should be considered in this wider context. Another thing to consider is that Bayesian inference is occurring, but that priors present going in produce the biases, or these biases arise from another source, for example factoring in epistemic value over rewards when the actual reward is not large. This again would be covered under an active inference approach, depending on how the priors are tuned. Indeed, given the benefit of social cohesion in an evolutionary perspective, some of these "biases" may be the result of adaptation. For example, it might be better to amplify people's good qualities and minimize their bad qualities in order to make it easier to interact with them; this entails a cost (in this case, not adequately learning from feedback and potentially losing out sometimes), but may fulfill a greater imperative (improved cooperation on things that matter). Given the right priors/desired states, this could still be a Bayes-optimal inference at a social level and as such may be ingrained as a habit which requires effort to break at the individual level during a task such as this.

      The authors note that this task does not relate to "emotional engagement" or "deep, identity-related, issues". While I agree that this is likely mostly true, it is also possible that just being told one is being lied to might elicit an emotional response that could bias responses, even if this is a weak response.

      Comments on revisions:

      In their updated version the authors have made some edits to address my concerns regarding the framing of the 'normative' bayesian model, clarifying that they utilized a simple bayesian model which is intended to adhere in an idealized manner to the intended task structure, though further simulations would have been ideal.

      The authors, however, did not take my recommendation to explore the symptoms in the symptom scales they collected as being a potential source of variability. They note that these were for hypothesis generation and were exploratory, fair enough, but this study is not small and there should have been sufficient sample size for a very reasonable analysis looking at symptom scores.

      However, overall the toned down claims and clarifications of intent are adequate responses to my previous review.

    2. Reviewer #2 (Public review):

      This important paper studies the problem of learning from feedback given by sources of varying credibility. The convincing combination of experiment and computational modeling helps to pin down properties of learning, while opening unresolved questions for future research.

      Summary:

      This paper studies the problem of learning from feedback given by sources of varying credibility. Two bandit-style experiments are conducted in which feedback is provided with uncertainty, but from known sources. Bayesian benchmarks are provided to assess normative facets of learning, and alternative credit assignment models are fit for comparison. Some aspects of normativity appear, in addition to possible deviations such as asymmetric updating from positive and negative outcomes.

      Strengths:

      The paper tackles an important topic, with a relatively clean cognitive perspective. The construction of the experiment enables the use of computational modeling. This helps to pinpoint quantitatively the properties of learning and formally evaluate their impact and importance. The analyses are generally sensible, and advanced parameter recovery analyses (including cross-fitting procedure) provide confidence in the model estimation and comparison. The authors have very thoroughly revised the paper in response to previous comments.

      Weaknesses:

      The authors acknowledge the potential for cognitive load and the interleaved task structure to play a meaningful role in the results, though leave this for future work. This is entirely reasonable, but remains a limitation in our ability to generalize the results. Broadly, some of the results obtain in cases where the extent of generalization is not always addressed and remains uncertain.

    3. Reviewer #3 (Public review):

      Summary

      This paper investigates how disinformation affects reward learning processes in the context of a two-armed bandit task, where feedback is provided by agents with varying reliability (with lying probability explicitly instructed). They find that people learn more from credible sources, but also deviate systematically from optimal Bayesian learning: They learned from uninformative random feedback, learned more from positive feedback, and updated too quickly from fully credible feedback (especially following low-credibility feedback). Overall, this study highlights how misinformation could distort basic reward learning processes, without appeal to higher order social constructs like identity.

      Strengths

      • The experimental design is simple and well-controlled; in particular, it isolates basic learning processes by abstracting away from social context
      • Modeling and statistics meet or exceed standards of rigor
      • Limitations are acknowledged where appropriate, especially those regarding external validity
      • The comparison model, Bayes with biased credibility estimates, is strong; deviations are much more compelling than e.g. a purely optimal model
      • The conclusions are of substantial interest from both a theoretical and applied perspective

      Weaknesses

      The authors have addressed most of my concerns with the initial submission. However, in my view, evidence for the conclusion that less credible feedback yields a stronger positivity bias remains weak. This is due to two issues.

      Absolute or relative positivity bias?

      The conclusion of greater positivity bias for lower credible feedback (Fig 5) hinges on the specific way in which positivity bias is defined. Specifically, we only see the effect when normalizing the difference in sensitivity to positive vs. negative feedback by the sum. I appreciate that the authors present both and add the caveat whenever they mention the conclusion. However, without an argument that the relative definition is more appropriate, the fact of the matter is that the evidence is equivocal.

      There is also a good reason to think that the absolute definition is more appropriate. As expected, participants learn more from credible feedback. Thus, normalizing by average learning (as in the relative definition) amounts to dividing the absolute difference by increasingly large numbers for more credible feedback. If there is a fixed absolute positivity bias (or something that looks like it), the relative bias will necessarily be lower for more credible feedback. In fact, the authors own results demonstrate this phenomenon (see below). A reduction in relative bias thus provides weak evidence for the claim.

      It is interesting that the discovery study shows evidence of a drop in absolute bias. However, for me, this just raises questions. Why is there a difference? Was one a just a fluke? If so, which one?

      Positivity bias or perseveration?

      Positivity bias and perseveration will both predict a stronger relationship between positive (vs. negative) feedback and future choice. They can thus be confused for each other when inferred from choice data. This potentially calls into question all the results on positivity bias.

      The authors clearly identify this concern in the text and go to considerable lengths to rule it out. However, the new results (in revision 1) show that a perseveration-only model can in fact account for the qualitative pattern in the human data (the CA parameters). This contradicts the current conclusion:

      Critically, however, these analyses also confirmed that perseveration cannot account for our main finding of increased positivity bias, relative to the overall extent of CA, for low-credibility feedback.

      Figure 24c shows that the credibility-CA model does in fact show stronger positivity bias for less credible feedback. The model distribution for credibility 1 is visibly lower than for credibilities 0.5 and 0.75.

      The authors need to be clear that it is the magnitude of the effect that the perseveration-only model cannot account for. Furthermore, they should additionally clarify that this is true only for models fit to data; it is possible that the credibility-CA model could capture the full size of the effect with different parameters (which could fit best if the model was implemented slightly differently).

      The authors could make the new analyses somewhat stronger by using parameters optimized to capture just the pattern in CA parameters (for example by MSE). This would show that the models are in principle incapable of capturing the effect. However, this would be a marginal improvement because the conclusion would still rest on a quantitative difference that depends on specific modeling assumptions.

      New simulations clearly demonstrate the confound in relative bias

      Figure 24 also speaks to the relative vs. absolute question. The model without positivity bias shows a slightly stronger absolute "positivity bias" for the most credible feedback, but a weaker relative bias. This is exactly in line with the logic laid out above. In standard bandit tasks, perseveration can be quite well-captured by a fixed absolute positivity bias, which is roughly what we see in the simulations (I'm not sure what to make of the slight increase; perhaps a useful lead for the authors). However, when we divide by average credit assignment, we now see a reduction. This clearly demonstrates that a reduction in relative bias can emerge without any true differences in positivity bias.

      Given everything above, I think it is unlikely that the present data can provide even "solid" evidence for the claim that positivity bias is greater with less credible feedback. This confound could be quickly ruled out, however, by a study in which feedback is sometimes provided in the absence of a choice. This would empirically isolate positivity bias from choice-related effects, including perseveration.

    1. Reviewer #1 (Public review):

      Summary:

      The authors use a gambling task with momentary mood ratings from Rutledge et al. and compare computational models of choice and mood to identify markers of decisional and affective impairments underlying risk-prone behavior in adolescents with suicidal thoughts and behaviors (STB). The results show that adolescents with STB show enhanced gambling behavior (choosing the gamble rather than the sure amount), and this is driven by a bias towards the largest possible win rather than insensitivity to possible losses. Moreover, this group shows a diminished effect of receiving a certain reward (in the non-gambling trials) on mood. The results were replicated in an undifferentiated online sample where participants were divided into groups with or without STB based on their self-report of suicidal ideation on one question in the Beck Depression Inventory self-report instrument. The authors suggest, therefore, that adolescents with decreased sensitivity to certain rewards may need to be monitored more closely for STB due to their increased propensity to take risky decisions aimed at (expected) gains (such as relief from an unbearable situation through suicide), regardless of the potential losses.

      Strengths:

      (1) The study uses a previously validated task design and replicates previously found results through well-explained model-free and model-based analyses.

      (2) Sampling choice is optimal, with adolescents at high risk; an ideal cohort to target early preventative diagnoses and treatments for suicide.

      (3) Replication of the results in an online cohort increases confidence in the findings.

      (4) The models considered for comparison are thorough and well-motivated. The chosen models allow for teasing apart which decision and mood sensitivity parameters relate to risky decision-making across groups based on their hypotheses.

      (5) Novel finding of mood (in)sensitivity to non-risky rewards and its relationship with risk behavior in STB.

      Weaknesses:

      (1) The sample size of 25 for the S- group was justified based on previous studies (lines 181-183); however, all three papers cited mention that their sample was low powered as a study limitation.

      (2) Modeling in the mediation analysis focused on predicting risk behavior in this task from the model-derived bias for gains and suicidal symptom scores. However, the prediction of clinical interest is of suicidal behaviors from task parameters/behavior - as a psychiatrist or psychologist, I would want to use this task to potentially determine who is at higher risk of attempting suicide and therefore needs to be more closely watched rather than the other way around (predicting behavior in the task from their symptom profile). Unfortunately, the analyses presented do not show that this prediction can be made using the current task. I was left wondering: is there a correlation between beta_gain and STB? It is also important to test for the same relationships between task parameters and behavior in the healthy control group, or to clarify that the recommendations for potential clinical relevance of these findings apply exclusively to people with a diagnosis of depression or anxiety disorder. Indeed, in line 672, the authors claim their results provide "computational markers for general suicidal tendency among adolescents", but this was not shown here, as there were no models predicting STB within patient groups or across patients and healthy controls.

      (3) The FDR correction for multiple comparisons mentioned briefly in lines 536-538 was not clear. Which analyses were included in the FDR correction? In particular, did the correlations between gambling rate and BSI-C/BSI-W survive such correction? Were there other correlations tested here (e.g., with the TAI score or ERQ-R and ERQ-S) that should be corrected for? Did the mediation model survive FDR correction? Was there a correction for other mediation models (e.g., with BSI-W as a predictor), or was this specific model hypothesized and pre-registered, and therefore no other models were considered? Did the differences in beta_gain across groups survive FDR when including comparisons of all other parameters across groups? Because the results were replicated in the online dataset, it is ok if they did not survive FDR in the patient dataset, but it is important to be clear about this in presenting the findings in the patient dataset.

      (4) There is a lack of explicit mention when replication analyses differ from the analyses in the patient sample. For instance, the mediation model is different in the two samples: in the patient sample, it is only tested in S+ and S- groups, but not in healthy controls, and the model relates a dimensional measure of suicidal symptoms to gambling in the task, whereas in the online sample, the model includes all participants (including those who are presumably equivalent to healthy controls) and the predictor is a binary measure of S+ versus S- rather than the response to item 9 in the BDI. Indeed, some results did not replicate at all and this needs to be emphasized more as the lack of replication can be interpreted not only as "the link between mood sensitivity to CR and gambling behavior may be specifically observable in suicidal patients" (lines 582-585) - it may also be that this link is not truly there, and without a replication it needs to be interpreted with caution.

      (5) In interpreting their results, the authors use terms such as "motivation" (line 594) or "risk attitude" (line 606) that are not clear. In particular, how was risk attitude operationalized in this task? Is a bias for risky rewards not indicative of risk attitude? I ask because the claim is that "we did not observe a difference in risk attitude per se between STB and controls". However, it seems that participants with STB chose the risky option more often, so why is there no difference in risk attitude between the groups?

    2. Reviewer #2 (Public review):

      Summary:

      This article addresses a very pertinent question: what are the computational mechanisms underlying risky behaviour in patients who have attempted suicide? In particular, it is impressive how the authors find a broad behavioural effect whose mechanisms they can then explain and refine through computational modeling. This work is important because, currently, beyond previous suicide attempts, there has been a lack of predictive measures. This study is the first step towards that: understanding the cognition on a group level. This is before being able to include it in future predictive studies (based on the cross-sectional data, this study by itself cannot assess the predictive validity of the measure).

      Strengths:

      (1) Large sample size.

      (2) Replication of their own findings.

      (3) Well-controlled task with measures of behaviour and mood + precise and well-validated computational modeling.

      Weaknesses:

      I can't really see any major weakness, but I have a few questions:

      (1) I can see from the parameter recovery that the parameters are very well identified. Is it surprising that this is the case, given how many parameters there are for 90 trials? Could the authors show cross-correlations? I.e., make a correlation matrix with all real parameters and all fitted parameters to show that not only the diagonal (i.e., same data is the scatter plots in S3) are high, but that the off-diagonals are low.

      (2) Could the authors clarify the result in Figure 2B of a correlation between gambling rate and suicidal ideation score, is that a different result than they had before with the group main effect? I.e., is your analysis like this: gambling rate ~ suicide ideation + group assignment? (or a partial correlation)? I'm asking because BSI-C is also different between the groups. [same comment for later analyses, e.g. on approach parameter].

      (3) The authors correlate the impact of certain rewards on mood with the % gambling variable. Could there not be a more direct analysis by including mood directly in the choice model?

      (4) In the large online sample, you split all participants into S+ and S-. I would have imagined that instead, you would do analyses that control for other clinical traits. Or, for example, you have in the S- group only participants who also have high depression scores, but low suicide items.

    3. Reviewer #3 (Public review):

      This manuscript investigates computational mechanisms underlying increased risk-taking behavior in adolescent patients with suicidal thoughts and behaviors. Using a well-established gambling task that incorporates momentary mood ratings and previously established computational modeling approaches, the authors identify particular aspects of choice behavior (which they term approach bias) and mood responsivity (to certain rewards) that differ as a function of suicidality. The authors replicate their findings on both clinical and large-scale non-clinical samples.

      The main problem, however, is that the results do not seem to support a specific conclusion with regard to suicidality. The S+ and S- groups differ substantially in the severity of symptoms, as can be seen by all symptom questionnaires and the baseline and mean mood, where S- is closer to HC than it is to S+. The main analyses control for illness duration and medication but not for symptom severity. The supplementary analysis in Figure S11 is insufficient as it mistakes the absence of evidence (i.e., p > 0.05) for evidence of absence. Therefore, the results do not adequately deconfound suicidality from general symptom severity.

      The second main issue is that the relationship between an increased approach bias and decreased mood response to CR is conceptually unclear. In this respect, it would be natural to test whether mood responses influence subsequent gambling choices. This could be done either within the model by having mood moderate the approach bias or outside the model using model-agnostic analyses.

      Additionally, there is a conceptual inconsistency between the choice and mood findings that partly results from the analytic strategy. The approach bias is implemented in choice as a categorical value-independent effect, whereas the mood responses always scale linearly with the magnitude of outcomes. One way to make the models more conceptually related would be to include a categorical value-independent mood response to choosing to gamble/not to gamble.

      The manuscript requires editing to improve clarity and precision. The use of terms such as "mood" and "approach motivation" is often inaccurate or not sufficiently specific. There are also many grammatical errors throughout the text.

      Claims of clinical relevance should be toned down, given that the findings are based on noisy parameter estimates whose clinical utility for the treatment of an individual patient is doubtful at best.

    1. Reviewer #1 (Public review):

      Summary:

      In the present study, Chen et al. investigate the role of Endophilin A1 in regulating GABAergic synapse formation and function. To this end, the authors use constitutive or conditional knockout of Endophilin A1 (EEN1) to assess the consequences on GABAergic synapse composition and function, as well as the outcome for PTZ-induced seizure susceptibility. The authors show that EEN1 KO mice show a higher susceptibility to PTZ-induced seizures, accompanied by a reduction in the GABAergic synaptic scaffolding protein gephyrin as well as specific GABAAR subunits and eIPSCs. The authors then investigate the underlying mechanisms, demonstrating that Endophilin A1 binds directly to gephyrin and GABAAR subunits, and identifying the subdomains of Endophilin A1 that contribute to this effect. Overall, the authors state that their study places Endophilin A1 as a new regulator of GABAergic synapse function.

      Strengths:

      Overall, the topic of this manuscript is very timely, since there has been substantial recent interest in describing the mechanisms governing inhibitory synaptic transmission at GABAergic synapses. The study will therefore be of interest to a wide audience of neuroscientists studying synaptic transmission and its role in disease. The manuscript is well written and contains a substantial quantity of data. In the revised version of the manuscript, the authors have increased the number of samples analyzed and have significantly improved the statistical analysis, thereby substantially strengthening the conclusions of their study.

    2. Reviewer #2 (Public review):

      Summary:

      The function of neural circuits relies heavily on the balance of excitatory and inhibitory inputs. Particularly, inhibitory inputs are understudied when compared to their excitatory counterparts due to the diversity of inhibitory neurons, their synaptic molecular heterogeneity, and their elusive signature. Thus, insights into these aspects of inhibitory inputs can inform us largely on the functions of neural circuits and the brain.

      Endophilin A1, an endocytic protein heavily expressed in neurons, has been implicated in numerous pre- and postsynaptic functions, however largely at excitatory synapses. Thus, whether this crucial protein plays any role in inhibitory synapse, and whether this regulates functions at the synaptic, circuit, or brain level remains to be determined.

      The three remaining concerns are:

      (1) The use of one-way ANOVA is not well justified.

      (2) The use of superplots to show culture to culture variability would make it more transparent.

      (3) Change EEN1 in Figure 8B to EndoA1.

      Comments on revised version:

      The authors addressed the concerns adequately.

    3. Reviewer #3 (Public review):

      Chen et al. identify endophilin A1 as a novel component of the inhibitory postsynaptic scaffold. Their data show impaired evoked inhibitory synaptic transmission in CA1 neurons of mice lacking endophilin A1, and an increased susceptibility to seizures. Endophilin can interact with the postsynaptic scaffold protein gephyrin and promotes assembly of the inhibitory postsynaptic element. Endophilin A1 is known to play a role in presynaptic terminals and in dendritic spines, but a role for endophilin A1 at inhibitory postsynaptic densities has not yet been described, providing a valuable addition to the field.

      To investigate the role of endophilin A1 at inhibitory postsynapses, the authors used a broad array of experimental approaches, including tests of seizure susceptibility, electrophysiology, biochemistry, neuronal culture and image analysis. The authors have addressed the remaining concerns in their revision. Taken together, their results expand the synaptic role of endophilin-A1 to include the inhibitory post synaptic element.

    1. Reviewer #1 (Public review):

      Summary:

      This manuscript uses a diverse isolate collection of Streptococcus pneumoniae from hospital patients in the Netherlands to understand the population-level genetic basis of growth rate variation in this pathogen, which is a key determinant of S. pneumoniae within-host fitness. Previous efforts have studied this phenomenon in strain-specific comparisons, which can lack the statistical power and scope of population-level studies. The authors collected a rigorous set of in vitro growth data for each S. pneumoniae isolate and subsequently paired growth curve analysis with whole-genome analyses to identify how phylogenetics, serotype and specific genetic loci influence in vitro growth. While there were noticeable correlations between capsular serotype and phylogeny with growth metrics, they did not identify specific loci associated with altered in vitro growth, suggesting that these phenotypes are controlled by the collective effect of the entire genetic background of a strain. This is an important finding that lays the foundation for additional, more highly-powered studies that capture more S. pneumoniae genetic diversity to identify these genetic contributions.

      Strengths:

      The authors were able to completely control the experimental and genetic analyses to ensure all isolates underwent the same analysis pipeline to enhance the rigor of their findings.

      The isolate collection captures an appreciable amount of S. pneumoniae diversity and, importantly, enables disentangling the contributions of the capsule and phylogenetic background to growth rates.

      This study provides a population-level, rather than strain-specific, view of how genetic background influences growth rate in S. pneumoniae. This is an advance over previous studies that have only looked at smaller sets of strains.

      The methods used are well-detailed and robust to allow replication and extension of these analyses. Moreover, the manuscript is very well written and includes a thoughtful and thorough discussion of the strengths and limitations of the current study.

      Weaknesses:

      As acknowledged by the authors, the genetic diversity and sample size of this newly collected isolate set is still limited relative to the known global diversity of S. pneumoniae, which evidently limits the power to detect loci with smaller/combinatorial contributions to growth rate (and ultimately infection).

      The in vitro growth data is limited to a single type of rich growth medium, which may not fully reflect the nutritional and/or selective pressures present in the host.

      The current study does not use genetic manipulation or in vitro/in vivo infection models to experimentally test whether alteration of growth rates as observed in this study is linked to virulence or successful infection. The availability of a naturally diverse collection with phylogenetic and serotype combinations already identified as interesting by the authors provides a strong rationale for wet-lab studies of these phenotypes.

      Update on first revision:

      The authors have responded to all of my initial comments as well as those of the other reviewers, and I have no further concerns to be addressed.

    2. Reviewer #2 (Public review):

      The study by Chaguza et al. presents a novel perspective on pneumococcal growth kinetics, suggesting that the overall genetic background of Streptococcus pneumoniae, rather than specific loci, plays a more dominant role in determining growth dynamics. Through a genome-wide association study (GWAS) approach, the authors propose a shift in how we understand growth regulation, differing from earlier findings that pinpointed individual genes, such as wchA or cpsE, as key regulators of growth kinetics. This study highlights the importance of considering the cumulative impact of the entire genetic background rather than focusing solely on individual genetic loci.

      The study emphasizes the cumulative effects of genetic variants, each contributing small individual impacts, as the key drivers of pneumococcal growth. This polygenic model moves away from the traditional focus on single-gene influences. Through rigorous statistical analyses, the authors persuasively advocate for a more holistic approach to understanding bacterial growth regulation, highlighting the complex interplay of genetic factors across the entire genome. Their findings open new avenues for investigating the intricate mechanisms underlying bacterial growth and adaptation, providing fresh insights into bacterial pathogenesis.

      Strengths:

      This study exemplifies a holistic approach to unraveling key factors in bacterial pathogenesis. By analyzing a large dataset of whole-genome sequences and employing robust statistical methodologies, the authors provide strong evidence to support their main findings. Which is a leap forward from previous studies focused on a relatively smaller number of strains. Their integration of genome-wide association studies (GWAS) highlights the cumulative, polygenic influences on pneumococcal growth kinetics, challenging the traditional focus on individual loci. This comprehensive strategy not only advances our understanding of bacterial growth regulation but also establishes a foundation for future research into the genetic underpinnings of bacterial pathogenesis and adaptation. The amount of data generated and corresponding approaches to analyze the data are impressive as well as convincing. The figures are convincing and comprehensible too. The revised version of the manuscript, after the addition and including explanations, is more convincing and acceptable.

      Weaknesses:

      This study suggests evidence that the genetic background significantly influences bacterial growth kinetics. However, the absence of experimental validation remains a critical limitation. Although the authors acknowledge in their response to reviewers that bench-experiments were beyond the scope of this work and are planned, this gap of experimental validation weakens the current conclusions. Demonstrable validation will be essential to corroborate the associations identified through the GWAS approach. Future experimental efforts will be critical to substantiate these findings and to deepen our understanding of the genetic determinants governing bacterial growth dynamics.

    3. Reviewer #3 (Public review):

      This study provides insights into the growth kinetics of a diverse collection of Streptococcus pneumoniae, identifying capsule and lineage differences. It was not able to identify any specific loci from the GWAS that were associated with the growth features. It does provide a useful study linking phenotypic data with large scale genomic population data.

      In the revised version, the authors have addressed the points raised by the reviewers. The authors have provided additional detail in the Introduction and Methods that both improves the general accessibility for the broad readership of eLife, and the ability of other researchers to reproduce the approaches used in this study. They have expanded the Results and Discussion text in some sections to provide greater clarity and accuracy in reporting their data.

      The inclusion of a Data Availability statement was a useful addition and will help ensure the manuscript adheres to eLife's publishing policies.

    1. Reviewer #1 (Public review):

      Summary:

      This work by Govorunova et al. identified three naturally blue-shifted channelrhodopsins (ChRs) from ancyromonads, namely AnsACR, FtACR, and NlCCR. The phylogenetic analysis places the ancyromonad ChRs in a distinct branch, highlighting their unique evolutionary origin and potential for novel applications in optogenetics. Further characterization revealed the spectral sensitivity, ionic selectivity, and kinetics of the newly discovered AnsACR, FtACR, and NlCCR. This study also offers valuable insights into the molecular mechanism underlying the function of these ChRs, including the roles of specific residues in the retinal-binding pocket. Finally, this study validated the functionality of these ChRs in both mouse brain slices (for AnsACR and FtACR) and in vivo in Caenorhabditis elegans (for AnsACR), demonstrating the versatility of these tools across different experimental systems.<br /> In summary, this work provides a potentially valuable addition to the optogenetic toolkit by identifying and characterizing novel blue-shifted ChRs with unique properties.

      Strengths:

      This study provides a thorough characterization of the biophysical properties of the ChRs' properties and demonstrated the versatility of these tools in different ex vivo and in vivo experimental systems. The authors also explored the potential of AnsACR for multiplexed optogenetics. Finally, the mutagenesis experiments revealed the roles of key residues in the photoactive site that can affect the spectral and kinetic properties of the channelrhodopsins.

      Weaknesses:

      The revised manuscript has addressed most of the previous major weaknesses.

    2. Reviewer #2 (Public review):

      Summary:

      Govorunova et al present three new anion opsins that have potential applications silencing neurons. They identify new opsins by scanning numerous databases for sequence homology to known opsins, focusing on anion opsins. The three opsin identified, are uncommonly fast, potent, and are able to silence neuronal activity. The authors characterize numerous parameters of the opsins and compare these opsins to the existing and widely used GtACR opsins.

      Strengths:

      This paper follows the tradition of the Spudich lab, presenting and rigorously characterizing potentially valuable opsins. Furthermore, they explore several mutations of the identified opsin that may make these opsins even more useful for the broader community. The opsins AnsACR and FtACR are particularly notable having extraordinarily fast onset kinetics that could have utility in many domains. Furthermore, the authors show AnsACR is useable in multiphoton experiments having a peak photocurrent in a commonly used wavelength. Overall, the author's detailed measurements and characterization make for an important resource - both presenting new opsins that may be important for future experiment, and providing characterizations to expand our understanding of opsin biophysics in general.

    3. Reviewer #3 (Public review):

      Summary:

      The authors aimed to develop Channelrhodopsins (ChRs), light-gated ion channels, with high potency and blue action spectra for use in multicolor (multiplex) optogenetics applications. To achieve this, they performed a bioinformatics analysis to identify ChR homologues in several protist species, focusing on ChRs from ancyromonads, which exhibited the highest photocurrents and the most blue-shifted action spectra among the tested candidates. Within the ancyromonad clade, the authors identified two new anion-conducting ChRs and one cation-conducting ChR. These were characterized in detail using a combination of manual and automated patch-clamp electrophysiology, absorption spectroscopy, and flash photolysis. The authors also explored sequence features that may explain the blue-shifted action spectra and differences in ion selectivity among closely related ChRs.

      Strengths:

      A key strength of this study is the high-quality experimental data, which were obtained using well-established techniques such as manual patch-clamp and absorption spectroscopy, complemented by modern automated patch-clamp approaches. These data convincingly support most of the claims. The newly characterized ChRs expand the optogenetics toolkit and will be of significant interest to researchers working with microbial rhodopsins, those developing new optogenetic tools, as well as neuro- and cardioscientists employing optogenetic methods.

      Weaknesses:

      This study does not exhibit major methodological weaknesses.

    1. Reviewer #1 (Public review):

      The authors report on a thorough investigation of the interaction of megakaryocytes (MK) with their associated ECM during maturation. They report convincing evidence to support the existence of a dense cage-like pericellular structure containing laminin γ1 and α4 and collagen IV, which interacts with integrins β1 and β3 on MK and serve to fix the perisinusoidal localization of MK and prevent their premature intravasation. As with everything in nature, the authors support a Goldilocks range of MK-ECM interactions - inability to digest the ECM via inhibition of MMPs leads to insufficient MK maturation and development of smaller MK. This important work sheds light into the role of cell-matrix interactions in MK maturation, and suggests that higher-dimensional analyses are necessary to capture the full scope of cellular biology in the context of their microenvironment. The authors have responded appropriately to the majority of my previous comments.

    2. Reviewer #2 (Public review):

      Summary:

      This study makes a significant contribution to understanding the microenvironment of megakaryocytes (MKs) in the bone marrow, identifying an extracellular matrix (ECM) cage structure that influences MK localization and maturation. The authors provide compelling evidence for the presence of this ECM cage and its role in MK homeostasis, employing an array of sophisticated imaging techniques and molecular analyses.

      The authors have addressed most of the concerns raised in the previous review, providing clarifications and additional data that strengthen their conclusions

      More broadly, this work adds to a growing recognition of the ECM as an active participant in haematopoietic cell regulation in the bone marrow microenvironment. This work could pave the way to future studies investigating how the megakaryocytes' ECM cage affects their function as part of the haematopoietic stem cell niche, and by extension, influences global haematopoiesis.

    1. Reviewer #1 (Public review):

      Summary:

      This study shows a novel role for SCoR2 in regulating metabolic pathways in the heart to prevent injury following ischemia/reperfusion. It combines a new multi-omics method to determine SCoR2 mediated metabolic pathways in the heart. This paper would be of interest to cardiovascular researchers working on cardioprotective strategies following ischemic injury in the heart.

      Strengths:

      (1) Use of SCoR2KO mice subjected to I/R injury.

      (2) Identification of multiple metabolic pathways in the heart by a novel multi-omics approach.

      Comments on revisions:

      Authors have addressed all concerns raised in the previous round of review. Substantial modifications have been made in response to those concerns. There are no further comments.

    2. Reviewer #2 (Public review):

      Summary:

      This manuscript addresses the gap in knowledge related to the cardiac function of the S-denitrosylase SNO-CoA Reductase 2 (SCoR2; product of the Akr1a1 gene). Genetic variants in SCoR2 have been linked to cardiovascular disease, yet its exact role in heart remains unclear. This paper demonstrates that mice deficient in SCoR2 show significant protection in a myocardial infarction (MI) model. SCoR2 influenced ketolytic energy production, antioxidant levels, and polyol balance through the S-nitrosylation of crucial metabolic regulators.

      Strengths:

      Addresses a well-defined gap in knowledge related to the cardiac function of SNO-CoA Reductase 2. Besides the in-depth case for this specific player, the manuscripts sheds more light on the links between S-nytrosylation and metabolic reprogramming in heart.

      Rigorous proof of requirement through the combination of gene knockout and in vivo myocardial ischemia/reperfusion

      Identification of precise Cys residue for SNO-modification of BDH1 as SCoR2 target in cardiac ketolysis

      Weaknesses:

      The experiments with BDH1 stability were performed in mutant 293 cells. Was there a difference in BDH1 stability in myocardial tissue or primary cardiomyocytes from SCoR2-null vs -WT mice? Same question extends to PKM2.

      In the absence of tracing experiments, the cross-sectional changes in ketolysis, glycolysis or polyol intermediates presented in Figures 4 and 5 are suggestive at best. This needs to be stressed while describing and interpreting these results.

      The findings from human samples with ischemic and non-ischemic cardiomyopathy do not seem immediately or linearly in line with each other and with the model proposed from the KO mice. While the correlation holds up in the non-ischemic cardiomyopathy (increased SNO-BDH1, SNO-PKM2 with decreased SCoR2 expression), how do the Authors explain the decreased SNO-BDH1 with preserved SCoR2 expression in ischemic cardiomyopathy? This seems counterintuitive as activation of ketolysis is a quite established myocardial response to the ischemic stress. It may help the overall message clarity to focus the human data part on only NICM patients.

      (partially linked to the point above) an important proof that is lacking at present is the proof of sufficiency for SCoR2 in S-Nytrosylation of targets and cardiac remodeling. Does SCoR2 overexpression in heart or isolated cardiomyocytes reduce S-nitrosylation of BDH1 and other targets, undermining heart function at baseline or under stress?

      Comments on revisions:

      Some of my points have been addressed. However, the points related to 1) BDH1 stability effect in cardiomyocytes; 2) human relevance of SNO-BDH1; 3) SCoR2 sufficiency remain unclear. That said, this manuscript will provide useful information to the field as such.

    3. Reviewer #3 (Public review):

      Summary:

      This manuscript demonstrates that mice lacking the denitrosylase enzyme SCoR2/AKR1A1 demonstrate a robust cardioprotection resulting from reprogramming of multiple metabolic pathways, revealing<br /> widespread, coordinated metabolic regulation by SCoR2.

      Strengths:

      The extensive experimental evidence provided the use of the knockout model

      Weaknesses:

      No direct evidence for the underlying mechanism.

      The mouse model used is not a tissue-specific knock-out.

    1. Reviewer #1 (Public review):

      Summary:

      There is growing appreciation for the important of luminal (apical) ECM in tube development, but such matrices are much less well understood than basal ECMs. Here the authors provide insights into the aECM that shapes the Drosophila salivary gland (SG) tube and the importance of PAPSS-dependent sulfation in its organization and function.

      The first part of the paper focuses on careful phenotypic characterization of papss mutants, using multiple markers and TEM. This revealed reduced markers of sulfation and defects in both apical and basal ECM organization, Golgi (but not ER) morphology, number and localization of other endosomal compartments, plus increased cell death. The authors focus on the fact that papss mutants have an irregular SG lumen diameter, with both narrowed regions and bulged regions. They address the pleiotropy, showing that preventing the cell death and resultant gaps in the tube did not rescue the SG luminal shape defects and discussing similarities and differences between the papss mutant phenotype and those caused by more general trafficking defects. The analysis uses a papss nonsense mutant from an EMS screen - I appreciate the rigorous approach the authors took to analyze transheterozygotes (as well as homozygotes) plus rescued animals in order to rule out effects of linked mutations. Importantly, the rescue experiments also demonstrated that sulfation enzymatic activity is important.

      The 2nd part of the paper focuses on the SG aECM, showing that Dpy and Pio ZP protein fusions localize abnormally in papss mutants and that these ZP mutants (and Np protease mutants) have similar SG lumen shaping defects to the papss mutants. A key conclusion is that SG lumen defects correlate with loss of a Pio+Dpy-dependent filamentous structure in the lumen. These data suggest that ZP protein misregulation could explain this part of the papss phenotype.

      Overall, the text is very well written and clear. Figures are clearly labeled. The methods involve rigorous genetic approaches, microscopy, and quantifications/statistics and are documented appropriately. The findings are convincing.

      Significance:

      This study will be of interest to researchers studying developmental morphogenesis in general and specifically tube biology or the aECM. It should be particularly of interest to those studying sulfation or ZP proteins (which are broadly present in aECMs across organisms, including humans).

      This study adds to the literature demonstrating the importance of luminal matrix in shaping tubular organs and greatly advances understanding of the luminal matrix in the Drosophila salivary gland, an important model of tubular organ development and one that has key matrix differences (such as no chitin) compared to other highly studied Drosophila tubes like the trachea.

      The detailed description of the defects resulting from papss loss suggests that there are multiple different sulfated targets, with a subset specifically relevant to aECM biology. A limitation is that specific sulfated substrates are not identified here (e.g. are these the ZP proteins themselves or other matrix glycoproteins or lipids?); therefore, it's not clear how direct or indirect the effects of papss are on ZP proteins. However, this is clearly a direction for future work and does not detract from the excellent beginning made here.

    2. Reviewer #2 (Public review):

      Summary

      This study provides new insights into organ morphogenesis using the Drosophila salivary gland (SG) as a model. The authors identify a requirement for sulfation in regulating lumen expansion, which correlates with several effects at the cellular level, including regulation of intracellular trafficking and the organization of Golgi, the aECM and the apical membrane. In addition, the authors show that the ZP proteins Dumpy (Dpy) and Pio form an aECM regulating lumen expansion. Previous reports already pointed to a role for Papss in sulfation in SG and the presence of Dpy and Pio in the SG. Now this work extends these previous analyses and provides more detailed descriptions that may be relevant to the fields of morphogenesis and cell biology (with particular focus on ECM research and tubulogenesis). This study nicely presents valuable information regarding the requirements of sulfation and the aECM in SG development.

      Strengths:

      - The results supporting a role for sulfation in SG development are strong. In addition, the results supporting the involvement of Dpy and Pio in the aECM of the SG, their role in lumen expansion, and their interactions, are also strong.

      - The authors have made an excellent job in revising and clarifying the many different issues raised by the reviewers, particularly with the addition of new experiments and quantifications. I consider that the manuscript has improved considerably.

      - The authors generated a catalytically inactive Papss enzyme, which is not able to rescue the defects in Papss mutants, in contrast to wild type Papss. This result clearly indicates that the sulfation activity of Papss is required for SG development.

    1. Reviewer #1 (Public review):

      Summary:

      The authors of this study propose a model in which NPY family regulators antagonize the activity of the pid mutation in the context of floral development and other auxin-related phenotypes. This is hypothesized to occur through regulation of or by PID and its action on the PIN1 auxin transporter.

      Strengths:

      The findings are intriguing.

      Weaknesses and Major Comments:

      (1) While the findings are indeed intriguing, the mechanism of action and interaction among these components remains poorly understood. The study would benefit from significantly more thorough and focused experimental analyses to truly advance our understanding of pid phenotypes and the interplay among PID, NPYs, and PIN1.

      (2) The manuscript appears hastily assembled, with key methodological and conceptual details either missing or inconsistent. Although issues with figure formatting and clarity (e.g., lack of scale bars and inconsistent panel layout) may alone warrant revision, the content remains the central concern and must take precedence over presentation.

      (3) Given that fertile progeny are obtained from pid-TD pin1/PIN1 and pid NPY OE lines, it would be important to analyze whether mutations and associated phenotypes are heritable. This is especially relevant since CRISPR lines can be mosaic. Comprehensive genotyping and inheritance studies are required.

      (4) The Materials and Methods section lacks essential information on how the lines were generated, genotyped, propagated, and scored. There is also generally no mention of how reproducible the observations were. These genetic experiments need to be described in detail, including the number of lines analyzed and consistency across replicates.

      (5) The nature of the pid alleles used in the study is not described. This is essential for interpretation.

      (6) The authors measure PIN1 phosphorylation in response to NPY overexpression and conclude that the newly identified phosphorylation sites are inhibitory because they do not overlap with known activating sites. This conclusion is speculative without functional validation. Functional assays are available and must be included to substantiate this claim.

      (7) Figure 5 implies that NPY1 acts downstream of PID, but there is no biochemical evidence supporting this hierarchy. Additional experiments are needed to demonstrate the epistatic or regulatory relationship.

      (8) The authors should align their genetic observations with cell biological data on PIN1, PIN2, and PID localization and distribution.

    2. Reviewer #2 (Public review):

      Summary:

      The study is well-conducted, revealing that NPY1, with previously less-characterized molecular functions, can suppress pid mutant phenotypes with a phosphorylation-based mechanism. Overexpression of NPY1 (NPY1-OE) results in PIN phosphorylation at unique sites and bypasses the requirement for PID for this event. Conversely, a C-terminal deleted form of NPY1 (NPY1-dC) fails to rescue pid despite promoting a certain phospho-profile in PIN proteins.

      Strengths:

      (1) The careful genetic analyses of pid suppression by NPY1-OE and the inability of NPY1dC to do the same.

      (2) Phospho-proteomics approaches reveal that NPY1-OE induces phosphorylation of PINs at non-canonical sites, independent of PID.

      Weaknesses:

      (1) The native role of NPY1 is not tested by phospho-proteomics in loss-of-function npy1 mutants. Such analysis would be crucial to demonstrate that NPY1 is required for the observed phosphorylation events.

      (2) The functional consequences of the newly identified phosphorylation sites in PINs remain speculative. Site-directed mutagenesis (phospho-defective and phospho-mimetic) would help clarify their physiological roles.

      (3) The kinase responsible for NPY1-mediated phosphorylation remains unidentified. Since NPY1 is a non-kinase protein, a model involving recruitment of partner kinases (e.g., PIN-phosphorylating kinases other than PID) should be considered or discussed.

    3. Reviewer #3 (Public review):

      Summary:

      This manuscript from Mudgett et al. explores the relative roles of PID and NPY1 in auxin-dependent floral initiation in Arabidopsis. Micro vectorial auxin flows directed by PIN1 are essential to flower initiation, and loss of PIN1 or two of its regulators, PID and NPY1 (in a yucca-deficient background) phenocopies the pinformed phenotype. This group has previously shown that PID-PIN1 interactions and function are dosage-dependent. The authors pick up this thread by demonstrating that a heterozygote containing a CRISPR deletion of one copy of PIN1 can restore quasi-wild type floral initiation to pid.

      The authors then show that overexpression of NPY1 is sufficient to more or less restore wild-type floral initiation to the pid mutant. The authors claim that this result demonstrates that NPY1 functions downstream of PID, as this ectopic abundance of NPY1 resulted in phosphorylation of PIN1 at sites that differ from sites of action of PID. The authors pursue evidence that PID action via NPY1 is analogous to the mode of action by which phot1/2 act on NPH3 in seedling phototropism. Such a model is supported by the evidence presented herein that the C terminus of NPY1, which has abundant Ser/Thr content, is phosphorylated, and that the deletion of this domain prevents overexpression compensation of the pinformed phenotype.<br /> While the results presented support evidence in the literature that PID acts on NPY1 to regulate PIN1 function, it is also possible that NPY1 overexpression results in limited expansion of phosphorylation targets observed with other AGC kinases. And if the phot model is any indication, there may be other PID targets that modulate PIN1-dependent floral initiation.

      However, overexpression of the NPY1 C-terminal deletion construct resulted in phosphorylation of both PIN1 and PIN2 and agravitropic root growth similar to what is observed in pin2 mutants. This suggests that direct PID phosphorylation of PINs and action via NPY1 can be distinguished by phosphorylation sites and by growth phenotypes.

      Strengths:

      A very important effort that places NPY1 downstream of PID in floral initiation.

      Weaknesses:

      As PID has been shown to act on sites that regulate PIN protein polarity as well as PIN protein function, it would be useful if the authors consider how their results would fit/not fit with a model where combinatorial function of NPY1 and PID regulate PIN1 in a manner similar to the way that PID appears to function combinatorially with D6PK on PIN3.

    1. Reviewer #1 (Public review):

      The authors use inducible Fz::mKate2-sfGFP to explore "cell-scale signaling" in PCP. They reach several conclusions. First, they conclude that cell-scale signaling does not depend on limiting pools of core components (other than Fz). Second, they conclude that cell-scale signaling does not depend on microtubule orientation, and third, they conclude that cell-scale signaling is strong relative to cell to cell coupling of polarity.

      There are some interesting inferences that can be drawn from the manuscript, but there are also some significant challenges in interpreting the results and conclusions from the work as presented. I suggest that the authors 1) define "cell-scale signaling," as the precise meaning must be inferred, 2) reconsider some premises upon which some conclusions depend, 3) perform an essential assay validation, and 4) explain some other puzzling inconsistencies.

      Major concerns:

      The exact meaning of cell-scale signaling is not defined, but I infer that the authors use this term to describe how what happens on one side of a cell affects another side. The remainder of my critique depends on this understanding of the intended meaning.

      The authors state that any tissue wide directional information comes from pre-existing polarity and its modification by cell flow, such that the de novo signaling paradigm "bypasses" these events and should therefore not be responsive to any further global cues. It is my understanding that this is not a universally accepted model, and indeed, the authors' data seem to suggest otherwise. For example, the image in Fig 5B shows that de novo induction restores polarity orientation to a predominantly proximal to distal orientation. If no global cue is active, how is this orientation explained? The 6 hr condition, that has only partial polarity magnitude, is quite disordered. Do the patterns at 8 and 10 hrs become more proximally-distally oriented? It is stated that they all show swirls, but please provide adult wing images, and the corresponding orientation outputs from QuantifyPolarity to help validate the notion that the global cues are indeed bypassed by this paradigm.

      It is implicit that, in the de novo paradigm, polarization is initiated immediately or shortly after heat shock induction. However, the results should be differently interpreted if the level of available Fz protein does not rise rapidly and then stabilize before the 6 hr time point, and instead continues to rise throughout the experiment. Western blots of the Fz::mKate2-sfGFP at time points after induction should be performed to demonstrate steady state prior to measurements. Otherwise, polarity magnitude could simply reflect the total available pool of Fz at different times after induction. Interpreting stability is complex, and could depend on the same issue, as well as the amount of recycling that may occur. Prior work from this lab using FRAP suggested that turnover occurs, and could result from recycling as well as replenishment from newly synthesized protein.

      From the Fig 3 results, the authors claim that limiting pools of core proteins do not explain cell-scale signaling, a result expected based on the lack of phenotypes in heterozygotes, but of course they do not test the possibility that Fz is limiting. They do note that some other contributing protein could be.

      In Fig 3, it is unclear why the authors chose to test dsh1/+ rather than dsh[null]/+. In any case, the statistically significant effect of Dsh dose reduction is puzzling, and might indicate that the other interpretation is correct. Ideally, a range including larger and smaller reductions would be tested. As is, I don't think limiting Dsh is ruled out.

      The data in Fig 5 are somewhat internally inconsistent, and inconsistent with the authors' interpretation. In both repolarization conditions, the authors claim that repolarization extends only to row 1, and row 1 is statistically different from non-repolarized row 1, but so too is row 3. Row 2 is not. This makes no sense, and suggests either that the statistical tests are inappropriate and/or the data is too sparse to be meaningful. For the related boundary intensity data in Fig 6, the authors need to describe exactly how boundaries were chosen or excluded from the analysis. Ideally, all boundaries would be classified as either meido-lateral (meaning anterior-posterior) or proximal-distal depending on angle.

      If the authors believe their Fig 5 and 6 analyses, how do they explain that hairs are reoriented well beyond where the core proteins are not? This would be a dramatic finding, because as far as I know, when core proteins are polarized, prehair orientation always follows the core protein distribution. Surprisingly, the authors do not so much as comment about this. The authors should age their wings just a bit more to see whether the prehair pattern looks more like the adult hair pattern or like that predicted by their protein orientation results.

    2. Reviewer #2 (Public Review):

      This paper aims to dissect the relative importance of the various cues that establish PCP in the wing disc of Drosophila, which remains a prominent and relevant model for PCP. The authors suggest that one must consider cues at three scales (molecular, cell and tissue) and specifically design tests for the importance of cell-level cues, which they call non-local cell scale signalling. They develop clever experimental approaches that allow them to track complex stability and also to induce polarity at experimentally defined times. In a first set of experiments, they restore PCP after the global cues have disappeared (de novo polarisation) and conclude from the results that another (cell scale) cue must exist. In another set of experiments, they show that de novo repolarization is robust to the dosage of various components of core PCP, leading them to conclude that there must be an underlying cell scale polarity, which, apparently, has nothing to do with microtubule or cell shape polarity. They then describe nice evidence that de novo polarisation is relatively short range both in a polarised and unpolarised field. They conclude that there is a strong cell-intrinsic polarity that remains to be characterised.

      Major concerns:

      (1) The first set of repolarisation experiments is performed after the global cell rearrangements that have been shown to act as global signals. However, this approach does not exclude the possible contribution of an unknown diffusible global signal.

      (2) The putative non-local cell scale signal must be more precisely defined (maybe also given a better name). It is not clear to me that one can separate cell-scale from molecular-scale signal. Local signals can redistribute within a cell (or membrane) so local signals are also cell-scale. Without a clear definition, it is difficult to interpret the results of the gene dosage experiments. The link between gene dosage and cell-scale signal is not rigorously stated. Related to this, the concluding statement of the introduction is too cryptic.

      Critique:

      The experiments described in this paper are of high quality with a sophisticated level of design and analysis. However, there needs to be some recalibration of the extent of the conclusions that can be drawn. Moreover, a limitation of this paper is that, despite the quality of their data, they cannot give a molecular hint about the nature of their proposed cell-scale signal.

    3. Reviewer #3 (Public Review):

      The manuscript by Carayon and Strutt addresses the role of cell-scale signaling during the establishment of planar cell polarity (PCP) in the Drosophila pupal wing. The authors induce locally the expression of a tagged core PCP protein, Frizzled, and observe and analyze the de novo establishment of planar cell polarity. Using this system, the authors show that PCP can be established within several hours, that PCP is robust towards variation in core PCP protein levels, that PCP proteins do not orient microtubules, and that PCP is robust towards 'extrinsic' re-polarization. The authors conclude that the polarization at the cell-scale is strongly intrinsic and only weakly affected by the polarity of neighboring cells.

      Major comments:

      The data are clearly presented and the manuscript is well written. The conclusions are well supported by the data. 

      (1) The authors use a system to de novo establish PCP, which has the advantage of excluding global cues orienting PCP and thus to focus on the cell-intrinsic mechanisms. At the same time, the system has the limitation that it is unclear to what extent de novo PCP establishment reflects 'normal' cell scale PCP establishment, in particular because the Gal4/UAS expression system that is used to induce Fz expression will likely result in much higher Fz levels compared with the endogenous levels. The authors should briefly discuss this limitation.

      (2) Fig. 3. The authors use heterozygous mutant backgrounds to test the robustness of de novo PCP establishment towards (partial) depletion in core PCP proteins. The authors conclude that de novo polarization is 'extremely robust to variation in protein level'. Since the authors (presumably) lowered protein levels by 50%, this conclusion appears to be somewhat overstated. The authors should tune down their conclusion.

      Significance: 

      The manuscript contributes to our understanding of how planar cell polarity is established. It extends previous work by the authors (Strutt and Strutt, 2002,2007) that already showed that induction of core PCP pathway activity by itself is sufficient to induce de novo PCP. This manuscript further explores the underlying mechanisms. The authors test whether de novo PCP establishment depends on an 'inhibitory signal', as previously postulated (Meinhardt, 2007), but do not find evidence. They also test whether core PCP proteins help to orient microtubules (which could enhance cell intrinsic polarization of core PCP proteins), but, again, do not find evidence, corroborating previous work (Harumoto et al, 2010). The most significant finding of this manuscript, perhaps, is the observation that local de novo PCP establishment does not propagate far through the tissue. A limitation of the study is that the mechanisms establishing intrinsic cell scale polarity remain unknown. The work will likely be of interest to specialists in the field of PCP.

  3. Aug 2025
    1. Reviewer #1 (Public review):

      Summary:

      In this manuscript, Liu et al have tried to dissect the neural and molecular mechanisms that C. elegans use to avoid the digestion of harmful bacterial food. Liu et al show that C. elegans use ON-OFF state of AWC olfactory neurons to regulate the digestion of harmful gram-positive bacteria S. saprophyticus (SS). Authors show that when C. elegans are fed on SS food, AWC neurons switch to OFF fate, which prevents the digestion of S. saprophyticus, and this helps C. elegans avoid these harmful bacteria. Using genetic and transcriptional analysis as well as making use of previously published findings, Liu et al implicate p38 MAPK pathway (in particular, NSY-1, the C. elegans homolog of MAPKKK ASK1) and insulin signaling in this process.

      Strengths:

      The revised manuscript has improved significantly. The authors have addressed almost all the comments that I had in my initial review.

      Weaknesses:

      None.

    2. Reviewer #2 (Public review):

      Summary:

      Using C. elegans as a model, the authors present an interesting story demonstrating a new regulatory connection between olfactory neurons and the digestive system. Mechanistically, they identified key factors (NSY-1, STR-130 et.al) in neurons, as well as critical 'signaling factors' (INS-23, DAF-2) that bridge different cells/tissues to execute the digestive shutdown induced by poor-quality food (Staphylococcus saprophyticus, SS).

      Strengths:

      The conclusions of this manuscript are mostly well supported by the experimental results shown.

      Weaknesses:

      The authors have done a nice job in addressing my comments.

    3. Reviewer #3 (Public review):

      Summary:

      The study explores a molecular mechanism by which C. elegans detects low-quality food through neuron-digestive crosstalk, offering new insights into food quality control systems. Liu and colleagues demonstrated that NSY-1, expressed in AWC neurons, is a key regulator for sensing Staphylococcus saprophyticus (SS), inducing avoidance behavior and shutting down the digestive system via intestinal BCF-1. They further revealed that INS-23, an insulin peptide, interacts with the DAF-2 receptor in the gut to modulate SS digestion. The study uncovers a food quality control system connecting neural and intestinal responses, enabling C. elegans to adapt to environmental challenges.

      Strengths:

      The study employs a genetic screening approach to identify nsy-1 as a critical regulator in detecting food quality and initiating adaptive responses in C. elegans. The use of RNA-seq analysis is particularly noteworthy, as it reveals distinct regulatory pathways involved in food sensing (Figure 4) and digestion of Staphylococcus saprophyticus (Figure 5). The strategic application of both positive and negative data mining enhances the depth of analysis. Importantly, the discovery that C. elegans halts digestion in response to harmful food and employs avoidance behavior highlights a physiological adaptation mechanism.

      Weaknesses:

      Major weaknesses have been addressed.

    1. Reviewer #1 (Public review):

      The authors adequately addressed the concerns I raised in my initial review, which are noted below.

      (1) I suggest that the authors choose a different term in their title, abstract and manuscript to describe the phenotypes associated with ufd-1 and npl-4 knockdown other than an "inflammation-like response." Inflammation is a pathological term with four cardinal signs: redness (rubor), swelling (tumor), warmth (calor) and pain (dolor). These are not symptoms known to occur in C. elegans. The authors could consider using "inappropriate," "aberrant" or "toxic" immune activation in the title and abstract.

      (2) I think it is important to point out in the context of the authors novelty claim in the abstract and manuscript that the toxic effects of inappropriate immune activation in C. elegans has been widely catalogued. For example: doi.org/10.1371/journal.ppat.1011120 (2023); doi:10.1186/s12915-016-0320-z (2016).; doi:10.1126/science.1203411 (2011); doi:10.1534/g3.115.025650 (2016). In addition, doi:10.7554/eLife.74206 (2022) previously described a mutation that caused innate immune activation that reduced accumulation of P. aeruginosa in the intestine, but also caused animals to have a shortened lifespan.

      Thus, I do not think this study reveals the existence of inflammatory-like responses in C. elegans, as stated by the authors. Indeed, I think it is important for the authors to remove this novelty claim from their paper and discuss their work in the context of these studies in a paragraph in the introduction.

      (3) The authors rely on the use of RNAi of ufd-1 and npl-4 to study their effect on P. aeruginosa colonization and pathogen resistance throughout the manuscript. To address the possibility of off-target effects of the RNAi, the authors should consider both (i) showing with qRT-PCR that these genes are indeed targeted during RNAi, and (ii) confirming their phenotypes with an orthologous technique, preferably by studying ufd-1 and npl-4 loss-of-function mutants [both in the wild-type and sek-1(km4) backgrounds]. If mutation of these genes is lethal, the authors could use Auxin Inducible Degron (AID) technology to induce the degradation of these proteins in post-developmental animals.

      (4) I am confused about the author's explanation regarding their observation that inhibition of the UFD-1/ NPL-4 complex extends the lifespan of sek-1(km25) animals, but not pmk-1(km25) animals, as SEK-1 is the MAPKK that functions immediately upstream of the p38 MAPK PMK-1 to promote pathogen resistance.

      I am also confused why their RNA-seq experiment revealed a signature of intracellular pathogen response genes and not PMK-1 targets, which the authors propose is accounting for toxic immune activation. Activation of which immune response leads to toxicity?

      (5) The authors did not test alternative explanations for why UFD-1/ NPL-4 complex inhibition compromises survival during pathogen infection, other than exuberant immune activation. For example, it is possible that inhibition of this proteosome complex shortens lifespan by compromising the general health/ normal physiology of nematodes. Immune responses could be activated as a secondary consequence of this stress, and not be a direct cause of early mortality. Does sek-1(km4) mutant suppress the lifespan shortened lifespan of ufd-1 and npl-4 knockdown? This experiment should also be done with loss-of-function mutants, as noted in point 3.

      (6) The conclusion of Figure 6 hinges on an experiment that uses double RNAi to knockdown two genes at the same time (Fig. 6D and 6G), an approach that is inherently fraught in C. elegans biology owing to the likelihood that the efficiency of RNAi-mediated gene knockdown is compromised and may account for the observed phenotypes. The proper control for double RNAi is not empty vector + ufd-1(RNAi), but rather gfp(RNAi) + ufd-1(RNAi), as the introduction of a second hairpin RNA is what may compromise knockdown efficiency. In this context, it is important to confirm that knockdown of both genes occurs as expected (with qRT-PCR) and to confirm this phenotype using available elt-2 loss-of-function mutants.

      (7) A supplementary table with the source data for at least three replications (mean lifespan, n, statistical comparison) for each pathogenesis assay should be included in this manuscript.

      Comments on revisions:

      The authors adequately addressed the concerns I raised.

    2. Reviewer #2 (Public review):

      Summary:

      The authors aimed to uncover what role, if any, the UFD1/NPL4 complex might play in innate immune responses of the nematode C. elegans. The authors find that loss of the complex renders animals more sensitive to both pathogenic and non-pathogenic bacteria. However, there appears to be a complex interplay with known innate immune pathways since loss of UFD1/NPL4 actually results in increased survival of animals lacking the canonical innate immune pathways.

      Strengths:

      The authors perform robust genetic analysis to exclude and include possible mechanisms by which the UFD1/NPL4 pathway acts in the innate immune response.

      Weaknesses:

      The argument that the loss of the UFD1/NPL4 complex triggers a response that mimics that of an intracellular pathogen is not thoroughly investigated. Additionally, the finding of a role of the GATA transcription factor, ELT-2, in this response is suggestive, but experiments showing sufficiency in the context of loss of the UFD1/NPL4 complex need to be explored.

      Comments on revisions:

      The authors have performed several control experiments for their RNAi based experiments and also tested the requirement for xbp-1s in their paradigm. The findings and their interpretations are acceptable.

    1. Reviewer #3 (Public review):

      Summary:

      This study investigates the role of the host protein RBMX2 in regulating the response to Mycobacterium bovis infection and its connection to epithelial-mesenchymal transition (EMT), a key pathway in cancer progression. Using bovine and human cell models, the authors have wisely shown that RBMX2 expression is upregulated following M. bovis infection and promotes bacterial adhesion, invasion, and survival by disrupting epithelial tight junctions via the p65/MMP-9 signaling pathway. They also demonstrate that RBMX2 facilitates EMT and is overexpressed in human lung cancers, suggesting a potential link between chronic infection and tumor progression. The study highlights RBMX2 as a novel host factor that could serve as a therapeutic target for both TB pathogenesis and infection-related cancer risk.

      Strengths:

      The major strengths lie in its multi-omics integration (transcriptomics, proteomics, metabolomics) to map RBMX2's impact on host pathways, combined with rigorous functional assays (knockout/knockdown, adhesion/invasion, barrier tests) that establish causality through the p65/MMP-9 axis. Validation across bovine and human cell models and in clinical tissue samples enhances translational relevance. Finally, identifying RBMX2 as a novel regulator linking mycobacterial infection to EMT and cancer progression opens exciting therapeutic avenues.

      Weaknesses:

      There are a few minor weaknesses like grammatical errors, spelling mistakes. Also, the manuscript is too dense; improving the narratives in the Results and Discussion section could help readers follow the logic of the experimental design and conclusions.

    1. Reviewer #1 (Public review):

      In the manuscript, Aldridge and colleagues investigate the role of IL-27 in regulating hematopoiesis during T. gondii infection. Using loss-of-function approaches, reporter mice, and the generation of serial chimeric mice, they elegantly demonstrate that IL-27 induction plays a critical role in modulating bone marrow myelopoiesis and monocyte generation to the infection site. The study is well-designed, with clear experimental approaches that effectively address the mechanisms by which IL-27 regulates bone marrow myelopoiesis and prevents HSC exhaustion. I have two minor comments that could enhance the conceptual framework of this study:

      (1) The authors indirectly show that IL-27R expression on HSPCs is necessary for regulating HSC proliferation and preventing exhaustion. However, given that they have access to IL-27RFlox mice, they could cross these with Fgd5Cre mice to specifically delete IL-27R on long-term HSCs. This would provide direct evidence for the role of IL-27 signaling in LTHSCs during infection.

      (2) Since memory T and B cells often home to the bone marrow, it would be interesting to consider the potential cross-talk between these cells, HSPCs, and IL-27 signaling during secondary T. gondii infection. A brief discussion of this possibility would strengthen the study's broader implications.

    2. Reviewer #2 (Public review):

      Aldridge et al. demonstrate the important role of IL-27 in limiting emergency myelopoiesis in response to Toxoplasma gondii infection. Interestingly, IL-27 acts specifically at the level of early haematopoietic progenitors, inducing STAT signalling, which, in this case, dampens proliferation and preserves HSC fitness.

      They used different mouse genetic models such as HSC lineage tracing, IL27 and IL27R-deficient mice to show that :

      HSCs actively participate in emergency myelopoiesis during Toxoplasma gondii infection.

      The absence of IL27 and IL27R increases monocyte progenitors and monocytes, mainly inflammatory monocytes CCR2hi.

      At steady state, loss of IL27 impairs HSC fitness as competitive transplantation shows long-term engraftment deficiency of IL27 BM cells. This impairment is exacerbated after infection.

      IL27 is produced by various BM and other tissue cells at steady state and its expression increases with infection, mainly by increasing the number of monocytes producing it.

      This article highlights a new mechanism that acts directly at the level of early hematopoietic cells to limit over-inflammation during infection.

    1. Reviewer #1 (Public review):

      Summary:

      The authors were attempting to describe if trained innate immunity would modulate antibody dependent-cellular phagocytosis (ADCP) and/or efferocytosis.

      Strengths:

      The use of primary murine macrophages, and not a cell line, is considered a strength.

      The trained immunity mediated changes to phagocytosis affected both myeloma and breast cancer cells. The broad effect is consistent with trained immunity.

      In this revised manuscript, the authors now include in vivo data to show in vivo relevance.

      Weaknesses:

      There are many types of cancers so it would be helpful to focus the title more for the types of cancers included in the present study, the most relevant of course would be the type of cancer used for the in vivo model.

    2. Reviewer #3 (Public review):

      Summary:

      Chatzis et al showed that β-glucan trained macrophages have decreased phagocytic activity of apoptotic tumor cells and that is accompanied by lower levels of secreted IL-1β using mouse model.

      Strengths:

      This finding has potential impact on designing new cancer immunotherapeutic approaches by targeting macrophage efferocytosis.

      The concerns have been addressed.

    1. Reviewer #1 (Public review):

      Summary:

      This fundamental work employed multidisciplinary approaches and conducted rigorous experiments to study how a specific subset of neurons in the dorsal striatum (i.e., "patchy" striatal neurons) modulates locomotion speed depending on the valence of naturalistic contexts.

      Strengths:

      The scientific findings are novel and original and significantly advance our understanding of how the striatal circuit regulates spontaneous movement in various contexts.

      Weaknesses:

      This is extensive research involving various circuit manipulation approaches. Some of these circuit manipulations are not physiological. This is discussed.

    2. Reviewer #2 (Public review):

      Hawes et al. investigated the role of striatal neurons in the patch compartment of the dorsal striatum. Using Sepw1-Cre line, the authors combined a modified version of the light/dark transition box test that allows them to examine locomotor activity in different environmental valence with a variety of approaches, including cell-type-specific ablation, miniscope calcium imaging, fiber photometry, and opto-/chemogenetics. First, they found ablation of patchy striatal neurons resulted in an increase in movement vigor when mice stayed in a safe area or when they moved back from more anxiogenic to safe environments. The following miniscope imaging experiment revealed that a larger fraction of striatal patchy neurons was negatively correlated with movement speed, particularly in an anxiogenic area. Next, the authors investigated differential activity patterns of patchy neurons' axon terminals, focusing on those in GPe, GPi, and SNr, showing that the patchy axons in SNr reflect movement speed/vigor. Chemogenetic and optogenetic activation of these patchy striatal neurons suppressed the locomotor vigor, thus demonstrating their causal role in the modulation of locomotor vigor when exposed to valence differentials. Unlike the activation of striatal patches, such a suppressive effect on locomotion was absent when optogenetically activating matrix neurons by using the Calb1-Cre line, indicating distinctive roles in the control of locomotor vigor by striatal patch and matrix neurons. Together, they have concluded that nigrostriatal neurons within striatal patches negatively regulate movement vigor, dependent on behavioral contexts where motivational valence differs.

      The strengths of this work include the use of multiple experimental approaches, including genetic/viral ablation of patch neurons, miniscope single-cell imaging, as well as projection-specific recording of axonal activity by fiber photometry, and causal manipulation of the neurons by chemogenetic and optogenetics. Although similar findings were reported previously, the authors' results will be of value owing to multiple levels of investigation. In my view, this study will add to the important literature by demonstrating how patch (striosomal) neurons in the striatum controls movement vigor.

    3. Reviewer #3 (Public review):

      Hawes et al. combined behavioral, optical imaging, and activity manipulation techniques to investigate the role of striatal patch SPNs in locomotion regulation. Using Sepw1-Cre transgenic mice, they found that patch SPNs encode locomotion deceleration in a light-dark box procedure through optical imaging techniques. Moreover, genetic ablation of patch SPNs increased locomotion speed, while chemogenetic activation of these neurons decreased it. The authors concluded that a subtype of patch striatonigral neurons modulates locomotion speed based on external environmental cues.

      In the revision, the authors have largely addressed my concerns with additional explanation and discussion, although some of the key experiments to strengthen the authors' claim by identifying the function of specific cell populations remain to be conducted due to technical challenges. Nevertheless, the current results remain valuable and interesting to a wide audience in the field.

    1. Reviewer #1 (Public review):

      Summary:

      The authors have investigated the role of GAT3 in the visual system. First, they have developed a CRISPR/Cas9-based approach to locally knock out this transporter in the visual cortex. They then demonstrated electrophysiologically that this manipulation increases inhibitory synaptic input into layer 2/3 pyramidal cells. They further examined the functional consequences by imaging neuronal activity in the visual cortex in vivo. They found that absence of GAT3 leads to reduced spontaneous neuronal activity and attenuated neuronal responses and reliability to visual stimuli, but without an effect on orientation selectivity. Further analysis of this data suggests that Gat3 removal leads to less coordinated activity between individual neurons and in population activity patterns, thereby impaired information encoding. Overall, this is an elegant and technically advanced study that demonstrates a new and important role of GAT3 in controlling processing of visual information.

      Strengths:

      Development of a new approach for a local knockout (GAT3)

      Important and novel insights into visual system function and its dependence on GAT3

      Plausible cellular mechanism

      Weaknesses:

      No major weaknesses.

    2. Reviewer #2 (Public review):

      Summary:

      Park et al. has made a tool for spatiotemporally restricted knockout of the astrocytic GABA transporter GAT3 leveraging CRISPR/Cas9 and viral transduction in adult mice, and evaluated the effects of GAT3 on neural encoding of visual stimulation.

      Strengths:

      This concise manuscript leverages state-of-the-art gene CRISPR/Cas9 technology for knocking out astrocytic genes. This has to a little degree been preformed previously in astrocytes and represents an important development in the field. Moreover they utilize in vivo two-photon imaging of neural responses to visual stimuli as a readout of neural activity, in addition to validating their data with ex vivo electrophysiology. Lastly, they use advanced statistical modeling to analyze the impact on GAT3 knockout. Overall, the study comes across as rigorous and convincing.

      Weaknesses:

      Adding the following experiments would potentially have strengthened the conclusions and helped interpret the findings, although may be considered outside the scope of this manuscript, and be pursued in future work:

      (1) Neural activity is quite profoundly influenced by GAT3 knockout. Corroborating these relatively large changes to neural activity with in vivo electrophysiology of some sort as an additional readout would have strengthened the conclusions.

      (2) Given the quite large effects on neural coding in visual cortex assessed with jRGECO imaging it would have been interesting the mouse groups could have been subjected to behavioral testing assessing the visual system.

    1. Reviewer #1 (Public review):

      The manuscript by Yin and colleagues addresses a long-standing question in the field of cortical morphogenesis, regarding factors that determine differential cortical folding across species and individuals with cortical malformations. The authors present work based on a computational model of cortical folding evaluated alongside a physical model that makes use of gel swelling to investigate the role of a two-layer model for cortical morphogenesis. The study assesses these models against empirically derived cortical surfaces based on MRI data from ferret, macaque monkey, and human brains.

      The manuscript is clearly written and presented, and the experimental work (physical gel modeling as well as numerical simulations) and analyses (subsequent morphometric evaluations) are conducted at the highest methodological standards. It constitutes an exemplary use of interdisciplinary approaches for addressing the question of cortical morphogenesis by bringing together well-tuned computational modeling with physical gel models. In addition, the comparative approaches used in this paper establish a foundation for broad-ranging future lines of work that investigate the impact of perturbations or abnormalities during cortical development.

      The cross-species approach taken in this study is a major strength of the work. However, correspondence across the two methodologies did not appear to be equally consistent in predicting brain folding across all three species. The results presented in Figures 4 (and Figures S3 & S4) show broad correspondence in shape index and major sulci landmarks across all three species. Nevertheless, the results presented for the human brain lack the same degree of clear correspondence for the gel model results as observed in the macaque and ferret. While this study clearly establishes a strong foundation for comparative cortical anatomy across species and the impact of perturbations on individual morphogenesis, further work that fine-tunes physical modeling of complex morphologies, such as that of the human cortex, may help to further understand the factors that determine cortical functionalization and pathologies.

    2. Reviewer #2 (Public review):

      This manuscript explores the mechanisms underlying cerebral cortical folding using a combination of physical modelling, computational simulations, and geometric morphometrics. The authors extend their prior work on human brain development (Tallinen et al., 2014; 2016) to a comparative framework involving three mammalian species: ferrets (Carnivora), macaques (Old World monkeys), and humans (Hominoidea). By integrating swelling gel experiments with mathematical differential growth models, they simulate sulcification instability and recapitulate key features of brain folding across species. The authors make commendable use of publicly available datasets to construct 3D models of fetal and neonatal brain surfaces: fetal macaque (ref. [26]), newborn ferret (ref. [11]), and fetal human (ref. [22]).

      Using a combination of physical models and numerical simulations, the authors compare the resulting folding morphologies to real brain surfaces using morphometric analysis. Their results show qualitative and quantitative concordance with observed cortical folding patterns, supporting the view that differential tangential growth of the cortex relative to the subcortical substrate is sufficient to account for much of the diversity in cortical folding. This is a very important point in our field, and can be used in the teaching of medical students.

      Brain folding remains a topic of ongoing debate. While some regard it as a critical specialization linked to higher cognitive function, others consider it an epiphenomenon of expansion and constrained geometry. This divergence was evident in discussions during the Strüngmann Forum on cortical development (Silver et al., 2019). Though folding abnormalities are reliable indicators of disrupted neurodevelopmental processes (e.g., neurogenesis, migration), their relationship to functional architecture remains unclear. Recent evidence suggests that the absolute number of neurons varies significantly with position-sulcus versus gyrus-with potential implications for local processing capacity (e.g., https://doi.org/10.1002/cne.25626). The field is thus in need of comparative, mechanistic studies like the present one.

      This paper offers an elegant and timely contribution by combining gel-based morphogenesis, numerical modelling, and morphometric analysis to examine cortical folding across species. The experimental design - constructing two-layer PDMS models from 3D MRI data and immersing them in organic solvents to induce differential swelling - is well-established in prior literature. The authors further complement this with a continuum mechanics model simulating folding as a result of differential growth, as well as a comparative analysis of surface morphologies derived from in vivo, in vitro, and in silico brains.

      I offer a few suggestions here for clarification and further exploration:

      Major Comments

      (1) Choice of Developmental Stages and Initial Conditions

      The authors should provide a clearer justification for the specific developmental stages chosen (e.g., G85 for macaque, GW23 for human). How sensitive are the resulting folding patterns to the initial surface geometry of the gel models? Given that folding is a nonlinear process, early geometric perturbations may propagate into divergent morphologies. Exploring this sensitivity-either through simulations or reference to prior work-would enhance the robustness of the findings.

      (2) Parameter Space and Breakdown Points

      The numerical model assumes homogeneous growth profiles and simplifies several aspects of cortical mechanics. Parameters such as cortical thickness, modulus ratios, and growth ratios are described in Table II. It would be informative to discuss the range of parameter values for which the model remains valid, and under what conditions the physical and computational models diverge. This would help delineate the boundaries of the current modelling framework and indicate directions for refinement.

      (3) Neglected Regional Features: The Occipital Pole of the Macaque

      One conspicuous omission is the lack of attention to the occipital pole of the macaque, which is known to remain smooth even at later gestational stages and has an unusually high neuronal density (2.5× higher than adjacent cortex). This feature is not reproduced in the gel or numerical models, nor is it discussed. Acknowledging this discrepancy-and speculating on possible developmental or mechanical explanations-would add depth to the comparative analysis. The authors may wish to include this as a limitation or a target for future work.

      (4) Spatio-Temporal Growth Rates and Available Human Data

      The authors note that accurate, species-specific spatio-temporal growth data are lacking, limiting the ability to model inhomogeneous cortical expansion. While this may be true for ferret and macaque, there are high-quality datasets available for human fetal development, now extended through ultrasound imaging (e.g., https://doi.org/10.1038/s41586-023-06630-3). Incorporating or at least referencing such data could improve the fidelity of the human model and expand the applicability of the approach to clinical or pathological scenarios.

      (5) Future Applications: The Inverse Problem and Fossil Brains

      The authors suggest that their morphometric framework could be extended to solve the inverse growth problem-reconstructing fetal geometries from adult brains. This speculative but intriguing direction has implications for evolutionary neuroscience, particularly the interpretation of fossil endocasts. Although beyond the scope of this paper, I encourage the authors to elaborate briefly on how such a framework might be practically implemented and validated.

      Conclusion

      This is a well-executed and creative study that integrates diverse methodologies to address a longstanding question in developmental neurobiology. While a few aspects-such as regional folding peculiarities, sensitivity to initial conditions, and available human data-could be further elaborated, they do not detract from the overall quality and novelty of the work. I enthusiastically support this paper and believe that it will be of broad interest to the neuroscience, biomechanics, and developmental biology communities.

      Note: The paper mentions a companion paper [reference 11] that explores the cellular and anatomical changes in the ferret cortex. I did not have access to this manuscript, but judging from the title, this paper might further strengthen the conclusions.

    1. Reviewer #1 (Public review):

      Summary:

      In this work, Wang et al. use a combination of genetic tools, novel experimental approaches and biomechanical models to quantify the contribution of passive leg forces in Drosophila. They also deduce that passive forces are not sufficient to support the body weight of the animal. Overall, the contribution of passive forces reported in this work is much less than what one would expect based on the size of the organism and previous literature from larger insects and mammals. This is an interesting finding, but some major caveats in their approach remain unanswered.

      Strengths:

      (1) The authors combine experimental measurements and modeling to quantify the contributions of passive forces at limb joints in Drosophila.

      (2) The authors replicate a previous experimental strategy (Hooper et al 2009, J. Neuro) to suspend animals in air for measuring passive forces and, as in previous studies, find that passive forces are much stronger than gravitational forces acting on the limbs. While in these previous studies using large insects, a lot of invasive approaches for accurately quantifying passive forces are possible (e.g., physically cutting of nerves, directly measuring muscle forces in isolated preparations, etc), the small size of Drosophila makes this difficult. The authors overcome this using a novel approach where they attach additional weight to the leg (changes gravitational force) and inactivate motor neurons (remove active forces). With a few approximations and assumptions, the authors then deduce the contribution of passive forces at each joint for each leg.

      (3) The authors find interesting differences in passive forces across different legs. This could have behavioral implications.

      (4) Finally, the authors compare experimental results of how a free-standing Drosophila is lowered ("falls down") on silencing motor neurons, to a biomechanical "OpenSim" model for deducing the role of passive forces in supporting the body weight of the fly. Using this approach, they conclude that passive forces are not sufficient to support the body weight of the fly.

      Weaknesses:

      (1) Line 65 "(Figure 1A). Inactivation causes a change in the leg's rest position; however, in preliminary experiments, the body rotation did not have a large effect on the rest positions of the leg following inactivation. This result is consistent with the one already reported for stick insects and shows that passive forces within the leg are much larger than the gravitational force on a leg and dominate limb position [1]." This is the direct replication of the previous work by Hooper et al 2009 and therefore authors should ideally show the data for this condition (no weight attached).

      (2) The authors use vglut-gal4, a very broad driver for inactivating motor neurons. The driver labels all glutamatergic neurons, including brain descending neurons and nerve cord interneurons, in addition to motor neurons. Additionally, the strength of inactivation might differ in different neurons (including motor neurons) depending on the expression levels of the opsins. As a result, in this condition, the authors might not be removing all active forces. This is a major caveat that authors do not address. They explore that they are not potentially silencing all inputs to muscles by using an additional octopaminergic driver, but this doesn't address the points mentioned above. At the very least, the authors should try using other motor neuron drivers, as well as other neuronal silencers. This driver is so broad that authors couldn't even use it for physiology experiments. Additionally, the authors could silence VGlut-labeled motor neurons and record muscle activity (potentially using GCaMP as has been done in several recent papers cited by the authors, Azevedo et al, 2020) as a much more direct readout.

      (3) Figure 4 uses an extremely simplified OpenSim model that makes several assumptions that are known to be false. For example, the Thorax-Coxa joint is assumed to be a ball and socket joint, which it is not. Tibia-tarsus joint is completely ignored and likely makes a major contribution in supporting overall posture, given the importance of the leg "claw" for adhering to substrates. Moreover, there are a couple of recent open-source neuromechanical models that include all these details (NeuromechFly by Lobato-Rios et al, 2022, Nat. Methods, and the fly body model by Vaxenburg et al, 2025, Nature). Leveraging these models to rule in or rule out contributions at other joints that are ignored in the authors' OpenSim model would be very helpful to make their case.

      (4) Figure 5 shows the experimental validation of Figure 4 simulations; however, it suffers from several caveats.

      a) The authors track a single point on the head of the fly to estimate the height of the fly. This has several issues. Firstly, it is not clear how accurate the tracking would be. Secondly, it is not clear how the fly actually "falls" on VGlut silencing; do all flies fall in a similar manner in every trial? Almost certainly, there will be some "pitch" and "role" in the way the fly falls. These will affect the location of this single-tracked point that doesn't reflect the authors' expectations. Unless the authors track multiple points on the fly and show examples of tracked videos, it is hard to believe this dataset and, hence, any of the resulting interpretations.

      b) As described in the previous point, the "reason" the fly falls on silencing all glutamatergic neurons could be due to silencing all sorts of premotor/interneurons in addition to the silencing of motor neurons.

      c) (line 175) "The first finding is that there was a large variation in the initial height of the fly (Figure 5C), consistent with a recent study of flies walking on a treadmill[20]." The cited paper refers to how height varies during "walking". However, in the current study, the authors are only looking at "standing" (i.e. non-walking) flies. So it is not the correct reference. In my opinion, this could simply reflect poor estimation of the fly's height based on poor tracking or other factors like pitch and role.

      d) "The rate at which the fly fell to the ground was much smaller in the experimental flies than it was in the simulated flies (Figure 5E). The median rate of falling was 1.3 mm/s compared to 37 mm/s for the simulated flies (Figure 5F). (Line 190) The most likely reason for the longer than expected time for the fly to fall is delays associated with motor neuron inactivation and muscle inactivation." I don't believe this reasoning. There are so many caveats (which I described in the above points) in the model and the experiment, that any of those could be responsible for this massive difference between experiment and modeling. Simply not getting rid of all active forces (inadequate silencing) could be one obvious reason. Other reasons could be that the model is using underestimates of passive forces, as alluded to in point 3.

      (5) Final figure (Figure 6) focuses on understanding the time course of neuronal silencing. First of all, I'm not entirely sure how relevant this is for the story. It could be an interesting supplemental data. But it seems a bit tangential. Additionally, it also suffers from major caveats.

      a) The authors now use a new genetic driver for which they don't have any behavioral data in any previous figures. So we do not know if any of this data holds true for the previous experiments. The authors perform whole-cell recordings from random unidentified motor neurons labeled by E49-Gal4>GtACR1 to deduce a time constant for behavioral results obtained in the VGlut-Gal4>GtACR1 experiments.

      b) The DMD setup is useful for focal inactivation, however, the appropriate controls and data are not presented. Line 200 "A spot of light on the cell body produces as much of the hyperpolarization as stimulating the entire fly (mean of 11.3 mV vs 13.1 mV across 9 neurons). Conversely, excluding the cell body produces only a small effect on the MN (mean of 2.6 mV)." First of all, the control experiment for showing that DMD is indeed causing focal inactivation would be to gradually move the spot of light away from the labeled soma, i.e. to the neighboring "labelled" soma and show that there is indeed focal inactivation. Instead authors move it quite a long distance into unlabeled neuropil. Secondly, I still don't get why the authors are doing this experiment. Even if we believe the DMD is functioning perfectly, all this really tells us is that a random subset motor neurons (maybe 5 or 6 cells, legend is missing this info) labeled by E49-Gal4 is strongly hyperpolarized by its own GtACR1 channel opening, rather than being impacted because of hyperpolarizations in other E49-Gal4 labeled neurons. This has no relevance to the interpretation of any of the VGlut-Gal4 behavioral data. VGLut-Gal4 is much broader and also labels all glutamatergic neurons, most of which are inhibitory interneurons whose silencing could lead to disinhibition of downstream networks.

    2. Reviewer #2 (Public review):

      Summary:

      The authors aim to quantify passive muscle forces in the legs of Drosophila, and test the hypothesis that these forces would be sufficient to support body weight in small insects. They take advantage of the genetic tools available in Drosophila, and use a combination of genetic silencing (optogenetic inactivation of motor neurons), kinematic measurements, and simulations using OpenSim. This integrative toolkit is used to examine the role of passive torques across multiple leg joints. They find that passive forces are weaker than expected - in particular, passive forces were found to be too weak to support the body weight of the fly. This challenges previous scaling assumptions derived from studies in larger insects and has potential implications for our understanding of motor control in small animals.

      Strengths:

      The primary strength of this work lies in its integration of multiple analyses. By pulling together simulations, kinematic measurements from high-resolution videos, and genetic manipulation, they are able to overcome limitations of past studies. In particular, optogenetic manipulation allowed for measurements to be made in whole animals, and the modeling component is valuable because it both validates experimental findings and elucidates the mechanism behind some of the observed dynamic consequences (e.g., the rapid fall after motor inactivation). The conclusions made in the study are well-supported by the data and could have an impact on a number of fields, including invertebrate neurobiology and bioinspired design.

      Weaknesses:

      While (as mentioned above) the study's conclusions are well-supported by the results and modeling, limitations arise because of the assumptions made. For instance, using a linear approximation may not hold at larger joint angles, and future studies would benefit from accounting for nonlinearities. Future studies could also delve into the source of passive forces, which is important for more deeply understanding the anatomical and physical basis of the results in this study. For instance, assessments of muscle or joint properties to correlate stiffness values with physical structure might be an area of future consideration

    3. Reviewer #3 (Public review):

      Summary:

      The authors present a novel method to measure passive joint torques - torques due to internal forces other than active muscle contraction - in the fruit fly: genetically inactivating all motor neurons in intact limb acted upon by a gravitational load results in a change in limb configuration; evaluating the moment equilibrium condition about the limb joints then yields a direct estimate of the passive joint torques. Deactivating all motor neurons in an intact standing fly provided two further conclusions: First, because deactivation causes the fly to drop to the floor, the passive joint torques are deemed insufficient to maintain rotational equilibrium against the body weight; using a multi-body-dynamics simulation, the authors estimate that the passive torques would need to be about 40-80 times higher to maintain a typical posture without active muscle action. Second, a delay between the motor neuron inactivation and the onset of the "free fall" motivates the authors to invoke a simple exponential decay model, which is then used to derive a time constant for muscle deactivation, in robust agreement with direct electro-physiological recordings.

      Strengths:

      The experimental design that permits determination of passive joint torques is elegant, effective, novel, and altogether excellent; it permits measurements previously impossible. A careful error analysis is presented, and a spectrum of technically challenging methods, including multi-body dynamics and e-phys, is deployed to further interpret and contextualise the results.

      Weaknesses:

      (1) Passive torques are measured, but only some short speculative statements, largely based on previous work, are offered on their functional significance; some of these claims are not well supported by experimental evidence or theoretical arguments. Passive forces are judged as "large" compared to the weight force of the limb, but the arguably more relevant force is the force limb muscles can generate, which, even in equilibrium conditions, is already about two orders of magnitude larger. The conclusion that passive forces are dynamically irrelevant seems natural, but contrasts with the assertion that "passive forces [...] will have a strong influence on limb kinematics". As a result, the functional significance of passive joint torques in the fruit fly, if any, remains unclear, and this ambiguity represents a missed opportunity. We now know the magnitude of passive joint torques - do they matter and for what? Are they helpful, for example, to maintain robust neuronal control, or a mechanical constraint that negatively impacts performance, e.g., because they present a sink for muscle work?

      (2) The work is framed with a scaling argument, but the assumptions that underpin the associated claims are not explicit and can thus not be evaluated. This is problematic because at least some arguments appear to contradict textbook scaling theory or everyday experience. For example, active forces are assumed to scale with limb volume, when every textbook would have them scale with area instead; and the asserted scaling of passive forces involves some hidden assumptions that demand more explicit discussion to alert the reader to associated limitations. Passive forces are said to be important only in small animals, but a quick self-experiment confirms that they are sufficient to stabilize human fingers or ankles against gravity, systems orders of magnitude larger than an insect limb, in seeming contradiction with the alleged dominance of scale. Throughout the manuscript, there are such and similar inaccuracies or ambiguities in the mechanical framing and interpretation, making it hard to fairly evaluate some claims, and rendering others likely incorrect.

    1. Reviewer #1 (Public review):

      The manuscript by Choi and colleagues investigates the impact of variation in cortical geometry and growth on cortical surface morphology. Specifically, the study uses physical gel models and computational models to evaluate the impact of varying specific features/parameters of the cortical surface. The study makes use of this approach to address the topic of malformations of cortical development and finds that cortical thickness and cortical expansion rate are the drivers of differences in morphogenesis.

      The study is composed of two main sections. First, the authors validate numerical simulation and gel model approaches against real cortical postnatal development in the ferret. Next, the study turns to modelling malformations in cortical development using modified tangential growth rate and cortical thickness parameters in numerical simulations. The findings investigate three genetically linked cortical malformations observed in the human brain to demonstrate the impact of the two physical parameters on folding in the ferret brain.

      This is a tightly presented study that demonstrates a key insight into cortical morphogenesis and the impact of deviations from normal development. The dual physical and computational modeling approach offers the potential for unique insights into mechanisms driving malformations. This study establishes a strong foundation for further work directly probing the development of cortical folding in the ferret brain. One weakness of the current study is that the interpretation of the results in the context of human cortical development is at present indirect, as the modelling results are solely derived from the ferret. However, these modelling approaches demonstrate proof of concept for investigating related alterations more directly in future work through similar approaches to models of the human cerebral cortex.

    2. Reviewer #2 (Public review):

      Summary:

      Based on MRI data of the ferret (a gyrencephalic non-primate animal, in whom folding happens postnatally), the authors create in vitro physical gel models and in silico numerical simulations of typical cortical gyrification. They then use genetic manipulations of animal models to demonstrate that cortical thickness and expansion rate are primary drivers of atypical morphogenesis. These observations are then used to explain cortical malformations in humans.

      Strengths:

      The paper is very interesting and original, and combines physical gel experiments, numerical simulations, as well as observations in MCD. The figures are informative, and the results appear to have good overall face validity.

      Weaknesses:

      On the other hand, I perceived some lack of quantitative analyses in the different experiments, and currently, there seems to be rather a visual/qualitative interpretation of the different processes and their similarities/differences.

      Ideally, the authors also quantify local/pointwise surface expansion in the physical and simulation experiments, to more directly compare these processes. Time courses of eg, cortical curvature changes, could also be plotted and compared for those experiments.

      I had a similar impression about the comparisons between simulation results and human MRI data. Again, face validity appears high, but the comparison appeared mainly qualitative.

      I felt that MCDs could have been better contextualized in the introduction.

    1. Reviewer #1 (Public review):

      Summary:

      Foik et al. report that hypochlorous acid, a reactive chlorine species generated during host defense, activates the transcription of the froABCD in P. aeruginosa. This gene cluster had previously been associated with a potential role during the flow of fluids and appears to be regulated by the sigma factor FroR and its anti-sigma factor FroI. In the present study, the authors show that froABCD is expressed both in neutrophils and macrophages, which they claim is likely a result of HOCl but not H2O2 production. Fro expression is also induced in a murine model of corneal infection, which is characterized by immune cell invasion. Expression of the fro system can be quenched by several antioxidants, such as methionine, cysteine, and others. FroR-deficient cells that lack froABCD expression during HOCl stress appear more sensitive to the oxidant.

      Strengths:

      The authors provide a number of data supporting their claim that transcription of the froABCD system is induced by reactive chlorine species. This was shown by RNAseq, qRT-PCR, and through microscopy using a transcriptional reporter fusion. Likewise, elevated expression of froABCD was shown in vitro and in vivo, excluding potential in vitro artifacts. The manuscript, while mostly descriptive, is easy to follow, and the data were presented clearly.

      Weaknesses:

      (1) Lines 60-62: Some of the authors' conclusions are not supported by the data and thus appear unfounded. One example: "we determine that fro upregulation.....These data suggest a novel mechanism..." Their data do not show that MSR upregulation is a direct effect of FroABCD. Instead, it could be possible that the FroR sigma factor also controls the expression of msr genes, which would be independent of froABCD.

      (2) The authors show increased fro transcription both in neutrophils and macrophages; however, the two types of immune cells differ quite dramatically with respect to myeloperoxidase activation and HOCl production. Neither has this been discussed nor considered here.

      (3) With respect to the activation of fro expression upon challenge with conditioned media from stimulated neutrophils, does the conditioned media contain detectable amounts of HOCl? Do chloramines, which are byproducts of HOCl oxidation with amines, also stimulate expression?

      (4) A better control to prove that this fro expression is indeed induced by HOCl in activated neutrophils would be to conduct the experiments in the presence of a myeloperoxidase inhibitor.

      (5) The work was conducted with two different P. aeruginosa strains (i.e. AL143 and PAO1F). None of the figure legends provides details on which strain was used. For instance, in line 111, the authors refer to Figure S1B for data that I thought were done with PAO1F, while in 154, data were presented in the context of the infection model, which was conducted with the other strain.

      (6) It would be good if immune cell recruitment at 2hrs and 20hrs PI could be quantified.

      (7) The conclusions of Figure 4 are, in my opinion, weak (line 187-188; "It is possible that ....."). These antioxidants likely quench the low amounts of NaOCl directly. This would significantly reduce the NaOCl concentrations to a level that no longer activates expression of fro. There is no direct evidence provided that oxidized methionine induces fro expression. Do the authors postulate that this is free methionine, or could methionine and/or cysteine oxidation in FroR increase the binding affinity of the sigma factor to the promoter? Another possibility is that NaOCl deactivates the anti-sigma factor. None of these scenarios has been considered here.

      (8) Line 184: The reaction constants of HOCl with Cys and Met are similar.

      (9) Treatment with 16 uM NaOCl caused a growth arrest of ~15 hrs in the WT (Figure 5A), whereas no growth at all was recorded with 7.5 uM in Figure 3A.

      (10) The concentration range of NaOCl causing fro expression is extremely narrow, while oxidative burst rapidly generates HOCl at much higher concentrations. This should be discussed in more detail.

    2. Reviewer #2 (Public review):

      Summary:

      Foik et al. studied the regulation of the fro operon in response to HOCl, an oxidant derived from immune cells, especially neutrophils. They use a transcriptional fusion of YFP to the froA promoter in an mCherry-expressing P. aeruginosa strain to determine fro-induction under the microscope. They use this system to study fro expression in medium, in the presence of neutrophils and macrophages, neutrophil-conditioned medium, and several chemical stimuli, including NaCl, HOCl, hydrogen peroxide, nitric acid, hydrochloric acid, and sodium hydroxide. They also use a corneal infection model to demonstrate that froA is upregulated in P. aeruginosa 20 h post-infection and perform transcriptional analyses in WT and a froR mutant in response to HOCl.

      Strengths:

      Their data clearly shows that HOCl is a strong inducer of the fro Operon. The addition of HOCl-quenching chemicals together with HOCl abrogates the response. They also show that a froR mutant is more susceptible to HOCl than WT. Their transcriptomic data reveal genes under control of the FroR/FroI sigma factor/anti sigma factor system.

      Weaknesses:

      Although the presented evidence is mostly solid, some of their findings need to be evaluated more carefully; explaining the rationale behind some of the experiments might enhance the article, and some of the models proposed by the authors seem far-fetched, as outlined below:

      (1) In line 76 the authors claim "Relative to P. aeruginosa that were incubated in host cell-free media, P. aeruginosa in close proximity to human neutrophils or that were engulfed in mouse macrophages appeared to increase fro expression (Fig. 1C)". Counting bacterial cells in Figure 1C shows that 1 in 17 bacteria (5.8%) induce the froA-promotor in media in the absence of immune cells, while 4 in 72 bacteria (only 5.5%) do the same in the presence of neutrophils. Contrary to the authors' claims, it appears that P. aeruginosa actually decreases fro-expression in close proximity to neutrophils. There is a slight increase in fro-expression in bacteria co-incubated with macrophages (3 in 21, or 14.3%). A more rigorous statistical analysis might substantiate the authors' claim, but, as is, the claim "neutrophils increase fro expression" is untenable.

      (2) The authors should explain the rationale behind some of the chemicals used. Why did they use nitric acid? Especially at these high concentrations, a strong acid such as nitric acid might have a significant influence on the medium pH. I understand that the medium is phosphate-buffered, but 25 mM nitric acid in an unbuffered medium would shift the pH well below 2. Similar considerations apply to hydrochloric acid and sodium hydroxide.

      (3) In line 187, the authors state that "It is possible that oxidized methionine increases fro expression" and they suggest a model to that effect in Figure 5D. It is unclear why the authors singled out methionine sulfoxide, since a number of other things get oxidized by HOCl. In line 184, the authors state, in the same vein, that "HOCl oxidizes methionine residues 100-fold more rapidly than other cellular components". The authors should state which other cellular compounds they are referring to. Certainly not cysteine and other thiols, which react equally fast and are highly abundant in the cell: P. aeruginosa contains 340 µM GSH, 140 µM CoA-SH (https://doi.org/10.1074/jbc.RA119.009934) plus free cysteine and cysteines in proteins (based on codon usage, 1.34% of amino acids in proteins are cysteine, while methionine is only slightly more present at 2.10%, although a number of starting methionines are removed from mature proteins).

      (4) Overall (and this is probably not addressable with the authors' data), some very interesting questions remain unanswered: what is the molecular mechanism of fro-induction? How is the FroR/FroI system modulated by HOCl? Does the system sense free or protein-bound methionine-sulfoxide? Are certain methionine residues in these proteins directly oxidized by HOCl? Many "HOCl-sensing" proteins are also modified at cysteine residues or amino groups; could those play a role? And lastly: what is the connection between shear/fluid flow and HOCl, or are these totally separate mechanisms of fro-induction?

    1. Reviewer #1 (Public review):

      Summary:

      This study investigated the heterogeneous responses to Mycobacterium tuberculosis (Mtb) in 19 wild-derived inbred mouse strains collected from various geographic locations. The goal of this study is to identify novel mechanisms that regulate host susceptibility to Mtb infection. Using the genetically resistant C57BL/6 mouse strain as the control, they successfully identified a few mouse strains that revealed higher bacterial burdens in the lung, implicating increased susceptibility in those mouse strains. Furthermore, using flow cytometry analysis, they discovered strong correlations between CFU and various immune cell types, including T cells and B cells. The higher neutrophil numbers correlated with significantly higher CFU in some of the newly identified susceptible mouse strains. Interestingly, MANB and MANC mice exhibited comparable numbers of neutrophils but showed drastically different bacterial burdens. The authors then focused on the neutrophil heterogeneity and utilized a single-cell RNA-seq approach, which led to identifying distinct neutrophil subsets in various mouse strains, including C57BL/6, MANA, MANB, and MANC. Pathway analysis on neutrophils in susceptible MANC strain revealed a highly activated and glycolytic phenotype, implicating a possible mechanism that may contribute to the susceptible phenotype. Lastly, the authors found that a small group of neutrophil-specific genes are expressed across many other cell types in the MANC strain.

      Strengths:

      This manuscript has many strengths.

      (1) Utilizing and characterizing novel mouse strains that complement the current widely used mouse models in the field of TB. Many of those mouse strains will be novel tools for studying host responses to Mtb infection.

      (2) The study revealed very unique biology of neutrophils during Mtb infection. It has been well-established that high numbers of neutrophils correlate with high bacterial burden in mice. However, this work uncovered that some mouse strains could be resistant to infection even with high numbers of neutrophils in the lung, indicating the diverse functions of neutrophils. This information is important.

      Weaknesses:

      The weaknesses of the manuscript are that the work is relatively descriptive. It is unclear whether the neutrophil subsets are indeed functionally different. While single-cell RNA seq did provide some clues at transcription levels, functional and mechanistic investigations are lacking. Similarly, it is unclear how highly activated and glycolytic neutrophils in MANC strain contribute to its susceptibility.

    2. Reviewer #2 (Public review):

      Summary:

      These studies investigate the phenotypic variability and roles of neutrophils in tuberculosis (TB) susceptibility by using a diverse collection of wild-derived inbred mouse lines. The authors aimed to identify new phenotypes during Mycobacterium tuberculosis infection by developing, infecting, and phenotyping 19 genetically diverse wild-derived inbred mouse lines originating from different geographic regions in North America and South America. The investigators achieved their main goals, which were to show that increasing genetic diversity increases the phenotypic spectrum observed in response to aerosolized M. tuberculosis, and further to provide insights into immune and/or inflammatory correlates of pulmonary TB. Briefly, investigators infected wild-derived mice with aerosolized M. tuberculosis and assessed early infection control at 21 days post-infection. The time point was specifically selected to correspond to the period after infection when acquired immunity and antigen-specific responses manifest strongly, and also early susceptibility (morbidity and mortality) due to M. tuberculosis infection has been observed in other highly susceptible wild-derived mouse strains, some Collaborative Cross inbred strains, and approximately 30% of individuals in the Diversity Outbred mouse population. Here, the investigators normalized bacterial burden across mice based on inoculum dose and determined the percent of immune cells using flow cytometry, primarily focused on macrophages, neutrophils, CD4 T cells, CD8 T cells, and B cells in the lungs. They also used single-cell RNA sequencing to identify neutrophil subpopulations and immune phenotypes, elegantly supplemented with in vitro macrophage infections and antibody depletion assays to confirm immune cell contributions to susceptibility. The main results from this study confirm that mouse strains show considerable variability to M. tuberculosis susceptibility. Authors observed that enhanced infection control correlated with higher percentages of CD4 and CD8 T cells, and B cells, but not necessarily with the percentage of interferon-gamma (IFN-γ) producing cells. High levels of neutrophils and immature neutrophils (band cells) were associated with increased susceptibility, and the mouse strain with the most neutrophils, the MANC line, exhibited a transcriptional signature indicative of a highly activated state, and containing potentially tissue-destructive, mediators that could contribute to the strain's increased susceptibility and be leveraged to understand how neutrophils drive lung tissue damage, cavitation, and granuloma necrosis in pulmonary TB.

      Strengths:

      The strengths are addressing a critically important consideration in the tuberculosis field - mouse model(s) of the human disease, and taking advantage of the novel phenotypes observed to determine potential mechanisms. Notable strengths include,

      (1) Innovative generation and use of mouse models: Developing wild-derived inbred mice from diverse geographic locations is innovative, and this approach expands the range of phenotypic responses observed during M. tuberculosis infection. Additionally, the authors have deposited strains at The Jackson Laboratory making these valuable resources available to the scientific community.

      (2) Potential for translational research: The findings have implications for human pulmonary TB, particularly the discovery of neutrophil-associated susceptibility in primary infection and/or neutrophil-mediated disease progression that could both inform the development of therapeutic targets and also be used to test the effectiveness of such therapies.

      (3) Comprehensive experimental design: The investigators use many complementary approaches including in vivo M. tuberculosis infection, in vitro macrophage studies, neutrophil depletion experiments, flow cytometry, and a number of data mining, machine learning, and imaging to produce robust and comprehensive analyses of the wild-derives d strains and neutrophil subpopulations in 3 weeks after M. tuberculosis infection.

      Weaknesses:

      The manuscript and studies have considerable strengths and very few weaknesses. One minor consideration is that phenotyping is limited to a single limited-time point; however, this time point was carefully selected and has a strong biological rationale provided by investigators. This potential weakness does not diminish the overall findings, exciting results, or conclusions.

    1. Reviewer #1 (Public review):

      Summary:

      The work by Pinon et al describes the generation of a microvascular model to study Neisseria meningitidis interactions with blood vessels. The model uses a novel and relatively high throughput fabrication method that allows full control over the geometry of the vessels. The model is well characterized from the vascular standpoint and shows improvements when exposed to flow. The authors show that Neisseria binds to the 3D model in a similar geometry that in the animal xenograft model, induces an increase in permeability short after bacterial perfusion, and endothelial cytoskeleton rearrangements including a honeycomb actin structure. Finally, the authors show neutrophil recruitment to bacterial microcolonies and phagocytosis of Neisseria.

      Strengths:

      The article is overall well written, and it is a great advancement in the bioengineering and sepsis infection field. The authors achieved their aim at establishing a good model for Neisseria vascular pathogenesis and the results support the conclusions. I support the publication of the manuscript. I include below some clarifications that I consider would be good for readers.

      One of the most novel things of the manuscript is the use of a relatively quick photoablation system. Could this technique be applied in other laboratories? While the revised manuscript includes more technical details as requested, the description remains difficult to follow for readers from a biology background. I recommend revising this section to improve clarity and accessibility for a broader scientific audience.

      The authors suggest that in the animal model, early 3h infection with Neisseria do not show increase in vascular permeability, contrary to their findings in the 3D in vitro model. However, they show a non-significant increase in permeability of 70 KDa Dextran in the animal xenograft early infection. As a bioengineer this seems to point that if the experiment would have been done with a lower molecular weight tracer, significant increases in permeability could have been detected. I would suggest to do this experiment that could capture early events in vascular disruption.

      One of the great advantages of the system is the possibility of visualizing infection-related events at high resolution. The authors show the formation of actin of a honeycomb structure beneath the bacterial microcolonies. This only occurred in 65% of the microcolonies. Is this result similar to in vitro 2D endothelial cultures in static and under flow? Also, the group has shown in the past positive staining of other cytoskeletal proteins, such as ezrin in the ERM complex. Does this also occur in the 3D system?

      Significance:

      The manuscript is comprehensive, complete and represents the first bioengineered model of sepsis. One of the major strengths is the carful characterization and benchmarking against the animal xenograft model. Beyond the technical achievement, the manuscript is also highly quantitative and includes advanced image analysis that could benefit many scientists. The authors show a quick photoablation method that would be useful for the bioengineering community and improved the state-of-the-art providing a new experimental model for sepsis.

      My expertise is on infection bioengineered models.

    2. Reviewer #2 (Public review):

      Pinon and colleagues have developed a Vessel-on-Chip model showcasing geometrical and physical properties similar to the murine vessels used in the study of systemic infections. The vessel was created via highly controllable laser photoablation in a collagen matrix, subsequent seeding of human endothelial cells, and flow perfusion to induce mechanical cues. This model could be infected with Neisseria meningitidis as a model of systemic infection. In this model, microcolony formation and dynamics, and effects on the host were very similar to those described for the human skin xenograft mouse model (the current gold standard for systemic studies) and were consistent with observations made in patients. The model could also recapitulate the neutrophil response upon N. meningitidis systemic infection.

      The claims and the conclusions are supported by the data, the methods are properly presented, and the data is analyzed adequately. The most important strength of this manuscript is the technology developed to build this model, which is impressive and very innovative. The Vessel-on-Chip can be tuned to acquire complex shapes and, according to the authors, the process has been optimized to produce models very quickly. This is a great advancement compared with the technologies used to produce other equivalent models. This model proves to be equivalent to the most advanced model used to date (skin xenograft mouse model). The human skin xenograft mouse model requires complex surgical techniques and has the practical and ethical limitations associated with the use of animals. However, the Vessel-on-chip model is free of ethical concerns, can be produced quickly, and allows to precisely tune the vessel's geometry and to perform higher resolution microscopy. Both models were comparable in terms of the hallmarks defining the disease, suggesting that the presented model can be an effective replacement of the animal use in this area. In addition, the Vessel-on-Chip allows to perform microscopy with higher resolution and ease, which can in turn allow more complex and precise image-based analysis.

      A limitation of this model is that it lacks the multicellularity that characterizes other similar models, which could be useful to research disease more extensively. However, the authors discuss the possibilities of adding other cells to the model, for example, fibroblasts. It is also not clear whether the technology presented in the current paper can be adopted by other labs. The methodology is complex and requires specialized equipment and personnel, which might hinder its widespread utilization of this model by researchers in the field.

      This manuscript will be of interest for a specialized audience focusing on the development of microphysiological models. The technology presented here can be of great interest to researchers whose main area of interest is the endothelium and the blood vessels, for example, researchers on the study of systemic infections, atherosclerosis, angiogenesis, etc. This manuscript can have great applications for a broad audience and it can present an opportunity to begin collaborations, aimed at answering diverse research questions with the same model.

    3. Reviewer #3 (Public review):

      Summary:

      In this manuscript Pinon et al. describe the development of a 3D model of human vasculature within a microchip to study Neisseria meningitidis (Nm)- host interactions and validate it through its comparison to the current gold-standard model consisting of human skin engrafted onto a mouse. There is a pressing need for robust biomimetic models with which to study Nm-host interactions because Nm is a human-specific pathogen for which research has been primarily limited to simple 2D human cell culture assays. Their investigation relies primarily on data derived from microscopy and its quantitative analysis, which support the authors' goal of validating their Vessel-on-Chip (VOC) as a useful tool for studying vascular infections by Nm, and by extension, other pathogens associated with blood vessels.

      Strengths:<br /> • Introduces a novel human in vitro system that promotes control of experimental variables and permits greater quantitative analysis than previous models<br /> • The VOC model is validated by direct comparison to the state-of-the-art human skin graft on mouse model<br /> • The authors make significant efforts to quantify, model, and statistically analyze their data<br /> • The laser ablation approach permits defining custom vascular architecture<br /> • The VOC model permits the addition and/or alteration of cell types and microbes added to the model<br /> • The VOC model permits the establishment of an endothelium developed by shear stress and active infusion of reagents into the system

      Weaknesses:<br /> • The work presented here is mostly descriptive, with little new information that is learned about the biology of Nm or endothelial cells. However, the goal of this study was to establish the VOC model, and the validation presented here is necessary for follow-on studies on Nm pathogenesis and host response.<br /> • The VOC model contains one cell type, human umbilical cord vascular endothelial cells (HUVECs), while true vasculature contains a number of other cell types that associate with and affect the endothelium, such as smooth muscle cells, pericytes, and components of the immune system. These and other shortcomings of the VOC model as it currently stands warrant additional discussion.

      Impact:<br /> The VOC model presented by Pinon et al. is an exciting advancement in the set of tools available to study human pathogens interacting with the vasculature. This manuscript focuses on validating the model, and as such sets the foundation for impactful research in the future. Of particular value is the photoablation technique that permits the custom design of vascular architecture without the use of artificial scaffolding structures described in previously published works.

    1. Reviewer #1 (Public review):

      Summary:

      In this report, Yabaji et al describe studies designed to address the mechanism behind the TB susceptibility gene sst1. This locus is known to affect expression of IFN and synergizes with Myc to potentiate infectivity. Using a variety of molecular expression and imaging techniques, the authors demonstrate that mice harboring an sst1 transgene (compared to B6 controls) are highly susceptible to TB infection via a mechanism involving loss of antioxidant defense systems, the down regulation of key antioxidant genes and ferritin controlling intracellular iron levels. The combination of increased iron plus decreased antioxidant defense systems in turn increases lipid peroxidation and downstream sequelae. Inhibition of peroxidation diminishes infectivity increases ferritin levels. Furthermore, the authors demonstrate that Myc activation potentiates this process and that down regulation of NRF2 antioxidant defenses accompany potentiated infectivity. Increased peroxidation products (4-HNE) may activate the ASK1/JNK system leading to IFNb superinduction and diminished macrophage viability thereby diminishing ability to withstand TB infection. Extending these findings, additional mouse models plus some work in humans supports the peroxidation hypothesis. Overall, the work is significant for it introduces a molecular basis for TB infectivity and presents a potential novel therapeutic opportunity.

      Strengths:

      (1) Strengths of this study include a multi-omic analysis of infectivity combining gene expression analysis with biochemical and cell biological evaluation.

      (2) Novel identification of an iron-catalyzed lipid peroxidation based mechanism for why the sst1 locus is linked to TB infection.

      (3) Parallels to human biology are included via analysis of Myc upregulation in peripheral blood from patients.

      (4) Appropriate statistical analysis

      Weaknesses:

      (1) Lipid peroxidation is a broad phenotype process and the authors honed in on 4-HNE dependent processes as a likely mechanism because they can measure 4-HNE conjugated proteins. However, lipid peroxidation is a complex phenomenon and the work presented herein is largely descriptive.

      (2) The authors continually refer to increased 4HNE while they do not measure this 9 carbon lipid, they actually measure 4-HNE conjugated proteins immunochemically.

      (3) The authors do not distinguish between increased protein-HNE adducts and increased membrane peroxidation (or both) as mechanistically linked to infectivity.

    1. Reviewer #1 (Public review):

      Summary:

      The authors add to the body of evidence showing theta rhythmic modulations of neuronal activity and behavior.

      Strengths:

      Precise characterization of the effects of visual stimulation on theta-induced neuronal oscillations of spiking neurons in V1 and its relevance for behavior.

      The manuscript is well-written and clearly presented,

      Weaknesses:

      The advances are limited over the established body of evidence. Both theta-induced visual oscillations and their relevance for behavior have been firmly established by prior work, including prior work from the authors. There is no major new technique, data, finding, or insight that extends our knowledge in a majorly significant way beyond existing knowledge, in my opinion. I would suggest that the authors re-evaluate the body of existing work to more strongly place their work in the context of existing work. A study that targets fundamental holes or open questions in the field would have been viewed as more impactful.

    2. Reviewer #2 (Public review):

      Summary:

      Schmid & colleagues test an interesting hypothesis that V1 neurons might act as theta-tuned filters to incoming sensory information, and thereby influence downstream processing and detection performance.

      Strengths:

      The authors report that circular stimuli elicit theta oscillations in V1 single units and population activity. They also report that the phase of the theta oscillations influences performance in a change detection task.

      Weaknesses:

      The results are reported in terms of specific stimulus sizes. To truly reflect general-purpose spatial computations in the primary visual cortex, it will be important to establish a relationship between stimulus size and receptive field size.

      I have several major concerns that I would like the authors to address:

      (1) First paragraph of Results: The results are presented at very specific stimulus sizes: 0.3-degree, 1-degree, 4-degree, and so on. A key missing piece of information is the size of the receptive fields (RFs) that were recorded from. A related missing information is at what eccentricity these RFs were recorded from. Since there is nothing magical about a 1-degree stimulus, any general-purpose computation in the primary visual cortex has to establish a relationship between RF size and stimulus size.

      (2) Second paragraph of Results: The authors state that "specific stimulus sizes consistently induced strong theta rhythmic activity: 1{degree sign} in MUA and 2{degree sign} in LFP". What is the interpretation of these specific sizes? Given that the LFP and MUAe reflect different aspects of neural activity, how does one interpret the discrepancy?

      (3) Third paragraph of Results: Again related to (1), what is the relationship between the stimulus size that elicited the largest theta peaks and RF size at the population level? (1)-(3) taken together, there seems to be an opportunity to reveal something more fundamental about V1 processing that the authors might have missed here.

      (4) Change detection task: It was not clear to me whether the timing of the luminance change, which varied from 500ms to 1500ms, was drawn from an exponential distribution or a uniform distribution. Only an exponential distribution has the property of a flat hazard function, which will be important to establish that the animal could not anticipate the timing of the upcoming change.

      (5) Figure 3D: Have the authors tried to fit the data separately for each animal? There seems to be an inconsistency in the results between the 2 animals. The circular data points ('AL') seem positively correlated, similar to the overall trend, but the diamond data points ('DP') seem to have a negative slope.

    3. Reviewer #3 (Public review):

      Summary:

      This paper investigates changes in brain oscillations in V1 in response to experimentally manipulating visual stimulus features (size, contrast at optimal size) and examines whether these effects are of perceptual relevance. The results reveal prominent stimulus-related theta oscillations in V1 that match in frequency the rhythms of behavioural performance (response speed in detecting targets in the visual display). Phase analyses relate these fluctuations of detection performance more formally to opposite theta phase angles in V1.

      Strengths:

      The non-human primate model provides unique findings on how brain oscillations relate to rhythms in perception (in two rhesus monkeys) that align well with findings from human studies (as occurring in the theta band). However, theta rhythms in humans are typically associated with fronto-parietal activity in the domain of spatial orienting, attentional sampling, while here the focus is on V1. Importantly, microsaccade-controls seem to speak against a spatial orienting/ attentional sampling mechanism to explain the observed effects (at least regarding overt attention).

      Weaknesses:

      This study provides interesting clues on perceptually relevant brain oscillations. Despite the microsaccade-control, I believe it remains an open question whether the V1 rhythmicity is of pure V1 origin, or driven by top-down input, as it is conceivable that specific stimuli capture attention differently (and hence induce specific covert attentional (re)orienting patterns). For perceptually relevant (yet beta) rhythmicity over occipital areas that are top-down generated, see e.g., Veniero et al., 2019.

    1. Reviewer #2 (Public review):

      This manuscript describes the role of the production of c-di-AMP on the chlamydial developmental cycle. The main findings remain the same. The authors show that overexpression of the dacA-ybbR operon results in increased production of c-di-AMP and early expression of transitionary and late genes. The authors also knocked down the expression of the dacA-ybbR operon and reported a modest reduction in the expression of both hctA and omcB. The authors conclude with a model suggesting the amount of c-di-AMP determines the fate of the RB, continued replication, or EB conversion.

      Overall, this is a very intriguing study with important implications however the data is very preliminary and the model is very rudimentary. The data support the observation that dramatically increased c-di-AMP has an impact on transitionary gene expression and late gene expression suggesting dysregulation of the developmental cycle. This effect goes away with modest changes in c-di-AMP (detaTM-DacA vs detaTM-DacA (D164N)). However, the model predicts that low levels of c-di-AMP delays EB production is not not well supported by the data. If this prediction were true then the growth rate would increase with c-di-AMP reduction and the data does not show this. The levels of of c-di-AMP at the lower levels need to be better validated as it seems like only very high levels make a difference for dysregulated late gene expression. However, on the low end it's not clear what levels are needed to have an effect as only DacAopMut and DacAopKD show any effects on the cycle and the c-di-AMP levels are only different at 24 hours.

      The data still do not support the overall model.

      In Figure 1 the authors show at 24 hpi.

      DacA overexpression increases cdiAMP to ~4000 pg/ml

      DacAmut overexpression reduces cdiAMP dramatically to ~256 pg/ml)

      DacATM overexpression increases cdiAMP to ~4000 pg/ml.

      DacAmutTM overexpression does not seem to change cdiAMP ~1500 pg/ml .

      dacAKD decreases cdiAMP to ~300 pg/ml .

      dacAKDcom increased cdiAMP to ~8000 pg/ml.

      DacA-ybbRop overexpression increased cdiAMP to ~500,000 pg/ml.

      DacA-ybbRopmut ~300 pg/ml.

      However in Figure 2 the data show that overexpression of DacA (cdiAMP ~4000 pg/ml) did not have a different phenotype than over expression of the mutant (cdiAMP ~256 pg/ml). HctA expression down, omcB expression down, euo not much change, replication down, and IFUs down. Additionally, Figure 3 shows no differences in anything measured although cdiAMP levels were again dramatically different. DacATM overexpression (~4000 pg/ml) and DacAmutTM (~1500). This makes it unclear what cdiAMP is doing to the developmental cycle.

      In Figure 4 the authors knockdown dacA (dacA-KD) and complement the knockdown (dacA-KDcom) dacAKD decreases cdiAMP (~300) while DacA-KDcom increases cdiAMP much above wt (~8000).<br /> KD decreased hctA and omcB at 24hpi. Complementation resulted in a moderate increase in hctA at a single time point but not at 24 hpi and had no effect on euo or omcB expression. Importantly, complementation decreased the growth rate. Based on the proposed model, growth rate should increase as the chlamydia should all be RBs and replicating and not exiting the cell cycle to become EBs (not replicating). Interestingly reducing cdiAMP levels by over expressing DacAmut (~256 pg/ml) did not have an effect on the cycle but the reduction in cdiAMP by knockdown of dacA (~300 pg/ml) did have a moderate effect on the cycle.

      For Figure 5 DacA-ybbRop was overexpressed and this increased cdiAMP dramatically ~500,000 pg/ml as compared to wt ~1500. This increased hctA only at an early timepoint and not at 24hpi and again had no effect on omcB or euo. Overexpression of the operon with the mutation DacA-ybbRopmut reduced cdiAMP to ~300 pg/ml and this showed a reduction in growth rate similar to dacAmut but a more dramatic decrease in IFUs.

      Overall:

      DacA overexpression increases cdiAMP to ~4000 pg/ml (decreased everything except euo)

      DacAmut overexpression reduces cdiAMP dramatically (~256 pg/ml). (decreased everything except euo)

      DacATM overexpression increases cdiAMP to ~4000 pg/ml (no changes noted)

      DacAmutTM overexpression does not seem to change cdiAMP ~1500 pg/ml (no changes noted)

      dacAKD decrease cdiAMP to ~300 pg/ml (decreased everything except euo)

      dacAKDcom increased cdiAMP to ~8000 pg/ml (decreases growth rate, increase hctA a little but not omcB)

      DacA-ybbRop overexpression increased cdiAMP to ~500,000 pg/ml (decreases growth rate, increase hctA a little but not omcB)

      DacA-ybbRopmut ~300 pg/ml (decreased everything except euo)

      Overall, the data show that increasing cdiAMP only has a phenotype if it is dramatically increased, no effect at 4000 pg/ml. Decreasing cdiAMP has a consistent effect, decreased growth rate, IFU, hctA expression and omcB expression. However, if their proposed model was correct and low levels of cdiAMP blocked EB conversion then more chlamydial cells would be RBs (dividing cells) and the growth rate should increase. Conversely, if cdiAMP levels were dramatically raised then all RBs would all convert and the growth rate would be very low. When cdiAMP was raised to ~4000 pg/ml there was no effect on the growth rate. However, an increase to ~8000 pg/ml resulted in a significant decrease but growth continued. Increasing cdAMP to ~500,000 pg/ml had less of an impact on the growth rate. Overall, the data does not cleanly support the proposed model.

    1. Reviewer #1 (Public review):

      Summary:

      This is a contribution to the field of developmental bioelectricity. How do changes of resting potential at the cell membrane affect downstream processes? Zhou et al. reported in 2015 that phosphatidylserine and K-Ras cluster upon plasma membrane depolarization and that voltage-dependent ERK activation occurs when constitutively active K-RasG12V mutants are overexpressed. In this paper, the authors advance the knowledge of this phenomenon by showing that membrane depolarization up-regulates mitosis and that this process is dependent on voltage-dependent activation of ERK. ERK activity's voltage-dependence is derived from changes in the dynamics of phosphatidylserine in the plasma membrane and not by extracellular calcium dynamics. This paper reports an interesting and important finding. It is somewhat derivative of Zhou et al., 2015. (https://www.science.org/doi/full/10.1126/science.aaa5619). The main novelty seems to be that they find quantitatively different conclusions upon conducting similar experiments, albeit with a different cell line (U2OS) than those used by Zhou et al. Sasaki et al. do show that increased K+ levels increase proliferation, which Zhou et al. did not look at. The data presented in this paper are a useful contribution to a field often lacking such data.

      Strengths:

      Bioelectricity is an important field for areas of cell, developmental, and evolutionary biology, as well as for biomedicine. Confirmation of ERK as a transduction mechanism and a characterization of the molecular details involved in the control of cell proliferation are interesting and impactful.

      Weaknesses:

      The authors lean heavily on the assumption that the Nernst equation is an accurate predictor of membrane potential based on K+ level. This is a large oversimplification that undermines the author's conclusions, most glaringly in Figure 2C. The author's conclusions should be weakened to reflect that the activity of voltage gated ion channels and homeostatic compensation are unaccounted for.

      There are grammatical tense errors are made throughout the paper (ex line 99 "This kinetics should be these kinetics")

      Line 71: Zhou et al. use BHK, N2A, PSA-3 cells, this paper uses U2OS (osteosarcoma) cells. Could that explain the differences in bioelectric properties that they describe? In general, there should be more discussion of the choice of cell line. Why were U2OS cells chosen? What are the implications of the fact that these are cancer cells, and bone cancer cells in particular? Does this paper provide specific insights for bone cancers? And crucially, how applicable are findings from these cells to other contexts?

      Line 115: The authors use EGF to calibrate 'maximal' ERK stimulation. Is this level near saturation? Either way is fine, but it would be useful to clarify.

      Line 121: Starting line 121 the authors say "Of note, U2OS cells expressed wild-type K-Ras but not an active mutant of K-Ras, which means voltage dependent ERK activation occurs not only in tumor cells but also in normal cells". Given that U2OS cells are bone sarcoma cells, is it appropriate to refer to these as 'normal' cells in contrast to 'tumor' cells?

      Line 101: These normalizations seem reasonable, the conclusions sufficiently supported and the requisite assumptions clearly presented. Because the dish-to-dish and cell-to-cell variation may reflect biologically relevant phenomena it would be ideal if non-normalized data could be added in supplemental data where feasible.

      Figure 2C is listed as Figure 2D in the text

      There is no Figure 2F (Referenced in line 148)

    2. Reviewer #2 (Public review):

      Sasaki et al. use a combination of live-cell biosensors and patch-clamp electrophysiology to investigate the effect of membrane potential on the ERK MAPK signaling pathway, and probe associated effects on proliferation. This is an effect that has long been proposed, but a convincing demonstration has remained elusive, because it is difficult to perturb membrane potential without disturbing other aspects of cell physiology in complex ways. The time-resolved measurements here are a nice contribution to this question, and the perforated patch clamp experiments with an ERK biosensor are fantastic - they come closer to addressing the above difficulty of perturbing voltage than any prior work. It would have been difficult to obtain these observations with any other combination of tools.

      However, there are still some concerns as detailed in specific comments below:

      Specific comments:

      (1) All the observations of ERK activation, by both high extracellular K+ and voltage clamp, could be explained by cell volume increase (more discussion in subsequent comments). There is a substantial literature on ERK activation by hypotonic cell swelling (e.g. https://doi.org/10.1042/bj3090013, https://doi.org/10.1002/j.1460-2075.1996.tb00938.x, among others). Here are some possible observations that could demonstrate that ERK activation by volume change is distinct from the effects reported here:

      i) Does hypotonic shock activate ERK in U2OS cells?

      ii) Can hypotonic shock activate ERK even after PS depletion, whereas extracellular K+ cannot?

      iii) Does high extracellular K+ change cell volume in U2OS cells, measured via an accurate method such as fluorescence exclusion microscopy?

      iv) It would be helpful to check the osmolality of all the extracellular solutions, even though they were nominally targeted to be iso-osmotic.

      (2) Some more details about the experimental design and the results are needed from Figure 1:

      i) For how long are the cells serum-starved? From the Methods section, it seems like the G1 release in different K+ concentration is done without serum, is this correct? Is the prior thymidine treatment also performed in the absence of serum?

      ii) There is a question of whether depolarization constitutes a physiologically relevant mechanism to regulate proliferation, and how depolarization interacts with other extracellular signals that might be present in an in vivo context. Does depolarization only promote proliferation after extended serum starvation (in what is presumably a stressed cell state)? What fraction of total cells are observed to be mitotic (without normalization), and how does this compare to the proliferation of these cells growing in serum-supplemented media? Can K+ concentration tune proliferation rate even in serum-supplemented media?

      (3) In Figure 2, there are some possible concerns with the perfusion experiment:

      i) Is the buffer static in the period before perfusion with high K+, or is it perfused? This is not clear from the Methods. If it is static, how does the ERK activity change when perfused with 5 mM K+? In other words, how much of the response is due to flow/media exchange versus change in K+ concentration?

      ii) Why do there appear to be population-average decreases in ERK activity in the period before perfusion with high K+ (especially in contrast to Fig. 3)? The imaging period does not seem frequent enough for photobleaching to be significant.

      (4) Figure 3 contains important results on couplings between membrane potential and MAPK signaling. However, there are a few concerns:

      i) Does cell volume change upon voltage clamping? Previous authors have shown that depolarizing voltage clamp can cause cells to swell, at least in the whole-cell configuration:

      https://www.cell.com/biophysj/fulltext/S0006-3495(18)30441-7 . Could it be possible that the clamping protocol induces changes in ERK signaling due to changes in cell volume, and not by an independent mechanism?

      ii) Does the -80 mV clamp begin at time 0 minutes? If so, one might expect a transient decrease in sensor FRET ratio, depending on the original resting potential of the cells. Typical estimates for resting potential in HEK293 cells range from -40 mV to -15 mV, which would reach the range that induces an ERK response by depolarizing clamp in Fig. 3B. What are the resting potentials of the cells before they are clamped to -80 mV, and why do we not see this downward transient?

      (5) The activation of ERK by perforated voltage clamp and by high extracellular K+ are each convincing, but it is unclear whether they need to act purely through the same mechanism - while additional extracellular K+ does depolarize the cell, it could also be affecting function of voltage-independent transporters and cell volume regulatory mechanisms on the timescales studied. To more strongly show this, the following should be done with the HEK cells where there is already voltage clamp data:

      i) Measure resting potential using the perforated patch in zero-current configuration in the high K+ medium. Ideally this should be done in the time window after high K+ addition where ERK activation is observed (10-20 minutes) to minimize the possibility of drift due to changes in transporter and channel activity due to post-translational regulation.

      ii) Measure YFP/CFP ratio of the HEK cells in the high K+ medium (in contrast to the U2OS cells from Fig. 2 where there is no patch data).

      iii) The assertion that high K+ is equivalent to changes in Vmem for ERK signaling would be supported if the YFP/CFP change from K+ addition is comparable to that induced by voltage clamp to the same potential. This would be particularly convincing if the experiment could be done with each of the 15 mM, 30 mM, and 145 mM conditions.

      (6) Line 170: "ERK activity was reduced with a fast time course (within 1 minute) after repolarization to -80 mV." I don't see this in the data: in Fig. 3C, it looks like ERK remains elevated for > 10 min after the electrical stimulus has returned to -80 mV

      Comments on revisions:

      The authors have done a good job addressing the comments on the previous submission.

    3. Reviewer #3 (Public review):

      Summary:

      This paper demonstrates that membrane depolarization induces a small increase in cell entry into mitosis. Based on previous work from another lab, the authors propose that ERK activation might be involved. They show convincingly using a combination of assays that ERK is activated by membrane depolarization. They show this is Ca2+ independent and is a result of activation of the whole K-Ras/ERK cascade which results from changed dynamics of phosphatidylserine in the plasma membrane that activates K-Ras. Although the activation of the Ras/ERK pathway by membrane depolarization is not new, linking it to an increase in cell proliferation is novel.

      Strengths

      A major strength of the study is the use of different techniques - live imaging with ERK reporters, as well as Western blotting to demonstrate ERK activation as well as different methods for inducing membrane depolarization. They also use a number of different cell lines. Via Western blotting the authors are also able to show that the whole MAPK cascade is activated.

      Weaknesses

      A weakness of the study is the data in Figure 1 showing that membrane depolarization results in an increase of cells entering mitosis. There are very few cells entering mitosis in their sample in any condition. This should be done with many more cells to increase the confidence in the results. The study also lacks a mechanistic link between ERK activation by membrane depolarization and increased cell proliferation.

      The authors did achieve their aims with the caveat that the cell proliferation results could be strengthened. The results, for the most par,t support the conclusions.

      This work suggests that alterations in membrane potential may have more physiological functions than action potential in the neural system as it has an effect on intracellular signalling and potentially cell proliferation.

      In the revised manuscript, the authors have now addressed the issues with Figure 1, and the data presented are much clearer. They did also attempt to pinpoint when in the cell cycle ERK is having its activity, but unfortunately, this was not conclusive.

    1. Reviewer #1 (Public review):

      Summary:

      In this manuscript, Mack and colleagues investigate the role of posttranslational modifications, including lysine acetylation and ubiquitination, in methyltransferase activity of SETD2 and show that this enzyme functions as a tumor suppressor in a KRASG12C-driven lung adenocarcinoma. In contrast to H3K36me2-specific oncogenic methyltransferases, the deletion of SETD2, which is capable of H3K36 trimethylation, increases lethality in a KRASG12C-driven lung adenocarcinoma mouse tumor model. In vitro, the authors demonstrate that polyacetylation of histone H3, particularly of H3K27, H3K14 and H3K23, promotes the catalytic activity of SETD2, whereas ubiquitination of H2A and H2B has no effect.

      Strengths:

      Overall, this is a well-designed study that addresses an important biological question regarding the functioning of the essential chromatin component. The manuscript contains excellent quality data, and the conclusions are convincing and justified. This work will be of interest to many biochemists working in the field of chromatin biology and epigenetics.

      Comments on revisions:

      All previous comments are well addressed, and I enthusiastically support publication.

    2. Reviewer #2 (Public review):

      Summary:

      Human histone H3K36 methyltransferase Setd2 has been previously shown to be a tumor suppressor in lung and pancreatic cancer. In this manuscript by Mack et al., the authors first use a mouse KRASG12D-driven lung cancer model to confirm in vivo that Setd2 depletion exacerbates tumorigenesis. They then investigate the enzymatic regulation of the Setd2 SET domain in vitro, demonstrating that H2A, H3, or H4 acetylation stimulates Setd2-SET activity, with specific enhancement by mono-acetylation at H3K14ac or H3K27ac. In contrast, histone ubiquitination has no effect. The authors propose that H3K27ac may regulate Setd2-SET activity by facilitating its binding to nucleosomes. This work provides insight into how cross-talk between histone modifications regulates Setd2 function.

      Comments on revisions:

      (1) Regarding New Figure 2F lane 1, please reference PMID: 33972509 Fig 4D bottom. Setd2-SET is a well-known robust K36 trimethylase. Why, under the authors' conditions, do WT nucleosomes show a significant amount of K36me1 and K36me2 accumulation, whereas K36me3 is not as pronounced? As a comparison, the authors should also report the evidence for the efficiency of each chemical modification that generates K36 methylation mimic.

      (2) The bottom panel of Figure 2B does not match the top one; the number of repeats should be indicated in the figure legends.

      (3) In Figure 4E, the differences between Setd2-bound WT and acetylated nucleosomes are minimal, as judged by both the decreasing trend of unbound nucleosomes and the increasing trend of bound fractions. This experiment needs to be quantified based on multiple repeats.

    1. Reviewer #1 (Public review):

      In this manuscript, Wolfson and co-authors demonstrate a combination of an injury-specific enhancer and engineered AAV that enhances transgene expression in injured myocardium. The authors characterize spatiotemporal dynamics of TREE-directed AAV expression in the injured heart using a non-invasive longitudinal monitoring system. They show that transgene expression is drastically increased 3 days post-injury, driven by 2ankrd1a. They reported a liver-detargeted capsid, AAV cc.84, with decreased viral entry into the liver while maintaining TREE transgene specificity. They further identified the IR41 serotype with enhanced transgene expression in injured myocardium from AAV library screening. This is an interesting study that optimizes the potential application of TREE delivery for cardiac repair.

      Comments on revisions:

      The authors are responsive and have addressed my concerns.

    2. Reviewer #2 (Public review):

      Summary:

      In this manuscript by Wolfson et al., various adeno-associated viruses (AAVs) were delivered to mice to assess the cardiac-specificity, injury border-zone cardiomyocyte transduction rate, and temporal dynamics in the goal to find better AAVs for gene therapies targeting the heart. The authors delivered tissue regeneration enhancer elements (TREEs) controlling luciferase expression and used IVIS imaging to examine transduction in the heart and other organs. They found that luciferase expression increased in the first week after injury when using AAV9-TREE-Hsp68 promoter, waning to baseline levels by 7 weeks. However, AAV9 vectors transduced the liver, which was significantly reduced by using an AAV.cc84 liver de-targeting capsid. The authors then performed in vivo screening of AAV9 capsids and found AAV-IR41 to preferentially transduce injured myocardium when compared to AAV9. Finally, the authors combined TREEs with AAV-IR41 to show improved luciferase expression compared to AAV9-TREE at 7, 14 and 21 days after injury.

      Overall, this manuscript provides insights into TREE expression dynamics when paired with various heart-targeting capsids, which can be useful for researchers studying ischemic injury of murine hearts. While the authors have shown the success of using AAV9-TREEs in porcine hearts, it is unknown whether the expression dynamics would be similar in pigs or humans, as mentioned in the limitations.

      Strengths:

      Important contribution to the AAV gene therapy literature.

      Comments on revised version:

      My concerns have been adequately addressed.

    3. Reviewer #3 (Public review):

      Summary:

      The tissue regeneration enhancer elements (TREEs) identified in zebrafish have been shown to drive injury-activated temporal-spatial gene expression in mice and large animals. These findings increase the translational potential of findings in zebrafish to mammals. In this manuscript, the authors tested TREEs in combination with different adeno-associated viral (AAV) vectors using in vivo luciferase bioluminescent imaging that allows for longitudinal tracking. The TREE-driven luciferase delivered by a liver de-targeted AAV.cc84 decreased off-target transduction in liver. They further screened an AAV library to identify capsid variants that display enhanced transduction for infarcted myocardium post ischemia reperfusion and myocardial infarction. A new capsid variant, AAV.IR41, was found to show increased transduction post I/R and MI.

      Strengths:

      The authors injected AAV-cargo several days after ischemia/reperfusion (I/R) injury as a clinically relevant approach. Overall, this study is significant in that it identifies new AAV vectors that can be used to deliver promising genes as potential new gene therapies in the future. The manuscript is well-written and the data are also of high quality.

      Weaknesses:

      The authors have addressed my previous concerns.

    1. Reviewer #2 (Public review):

      This is a timely and insightful study aiming to explore the general physical principles for the sub-compartmentalization--or lack thereof--in the phase separation processes underlying the assembly of postsynaptic densities (PSDs), especially the markedly different organizations in three-dimensional (3D) droplets on one hand and the two-dimensional (2D) condensates associated with a cellular membrane on the other. Simulation of a highly simplified model (one bead per protein domain) is apparently carefully executed. Based on a thorough consideration of various control cases, the main conclusion regarding the trade-off between repulsive excluded volume interactions and attractive interactions among protein domains in determining the structures of 3D vs 2D model PSD condensates is quite convincing. The novel results in this manuscript should be published.

      Comment on the revised manuscript:

      The authors have adequately addressed all my previous concerns. The manuscript is now much improved, ready for publication as a version of record.

    2. Reviewer #3 (Public review):

      Summary:

      In this work, Yamada, Brandani and Takada have developed a mesoscopic model of the interacting proteins in the postsynaptic density. They have performed simulations, based on this model and using the software ReaDDy, to study the phase separation in this system in 2D (on the membrane) and 3D (in the bulk). They have carefully investigated the reasons behind different morphologies observed in each case, and have looked at differences in valency, specific/non-specific interactions and interfacial tension.

      Strengths:

      The simulation model is developed very carefully, with strong reliance on binding valency and geometry, experimentally measured affinities, and physical considerations like the hydrodynamic radii. The presented analyses are also thorough, and great effort has been put into investigating different scenarios that might explain the observed effects.

      Weaknesses:

      The biggest weakness of the study, in my opinion, has been a lack of more in-depth and quantitative physical insights about phase separation theories. In the revised version, the authors have added text to point the interested reader to the respective theories, and have included a qualitative assessment of their findings in the light of said theories. This better positions their discussion. I still believe the role of entropic effects need more attention, which can be the subject of future studies.

      The authors have revised their Introduction and added text to the Discussion, to enrich their view on the attractive and repulsive forces as well as mixing entropy. This version better covers the physics of phase separation.

      I appreciate the added discussion about the different diffusive behavior in the membrane in contrast to the bulk (i.e. the Saffman-Delbrück model). This paves the way for future studies, including realistic kinetics of the studied system.

    1. Joint Public Review:

      This manuscript reconsiders the "general form" of Hamilton's rule, in which "benefit" and "cost" are defined as regression coefficients. It points out that there is no reason to insist on Hamilton's rule of the form -c+br>0, and that, in fact, arbitrarily many terms (i.e. higher-order regression coefficients) can be added to Hamilton's rule to reflect nonlinear interactions. Furthermore, it argues that insisting on a rule of the form -c+br>0 can result in conditions that are true but meaningless and that statistical considerations should be employed to determine which form of Hamilton's rule is meaningful for a given dataset or model.

      Comments on latest version:

      The authors have provided a robust, valuable and detailed response to the previous reviews.

      Comments from Reviewer #1: I have nothing further to add.

      Comments from Reviewer #2: I appreciate the clarifications the author has made to the manuscript regarding (i) "sample covariance" terminology, (ii) the generality of the "generalized Price equation", and (iii) the distinction between the covariance and regression forms of the Price equation. I also appreciate that the ms now engages more deeply with some of the previous literature on regression-based Hamilton's rules (e.g. Smith et al., 2010; Rousset 2015). I feel these revisions make this contribution more valuable, and also more technically sound, since the term "sample covariance" is no longer used incorrectly.

      I also add that I agree with the substance of the authors' response to Reviewer #3. That is, the original submission was very clear that the regression-based Hamilton's rule is already completely general in the range of situations to which it applies, and that the added "generality" in the present ms refers to the variety of regression models that can be applied to these situations. In this way, the original ms already anticipates and addresses the criticism that Reviewer #3 raises.

      Reviewer #3 did not provide comments on the revised version.

    1. Reviewer #1 (Public review):

      Filamentous fungi are established work horses in biotechnology with Aspergillus oryzae as a prominent example with a thousand-year of history. Still the cell biology and biochemical properties of the production strains is not well understood. The paper of the Takeshita group describes the change in nuclear numbers and correlate it to different production capacities. They used microfluidic devices to really correlate the production with nuclear numbers. In addition, they used microdissection to understand expression profile changes and found an increase of ribosomes. The analysis of two genes involved in cell volume control in S. pombe did not reveal conclusive answers to explain the phenomenon. It appears that it is a multi-trait phenotype. Finally, they identified SNPs in many industrial strains and tried to correlate them to the capability of increasing their nuclear numbers.

      The methods used in the paper range from high quality cell biology, Raman spectroscopy to atomic force and electron microscopy and from laser microdissection to the use of microfluidic devices to study individual hyphae.

      This is a very interesting, biotechnologically relevant paper with the application of excellent cell biology.

      Comments on revised version:

      The authors addressed all suggestions satisfactorily.

    2. Reviewer #2 (Public review):

      Summary:

      In the study presented by Itani and colleagues it is shown that some strains of Aspergillus oryzae - especially those used industrially for the production of sake and soy sauce - develop hyphae with a significantly increased number of nuclei and cell volume over time. These thick hyphae are formed by branching from normal hyphae and grow faster and therefore dominate the colonies. The number of nuclei positively correlates with the thicker hyphae and also the amount of secreted enzymes. The addition of nutrients such as yeast extract or certain amino acids enhanced this effect. Genome and transcriptome analyses identified genes, including rseA, that are associated with the increased number of nuclei and enzyme production. The authors conclude from their data involvement of glycosyltransferases, calcium channels and the tor regulatory cascade in regulation of cell volume and number of nuclei. Thicker hyphae and an increased number of nuclei was also observed in high-production strains of other industrially used fungi such as Trichoderma reesei and Penicillium chrysogenum, leading to the hypothesis that the mentioned phenotypes are characteristic of production strains which is of significant interest for fungal biotechnology.

      Strengths:

      The study is very comprehensive and involves application of divers state-of-the-art cell biological, biochemical and genetical methods. Overall, the data are properly controlled and analyzed, figures and movies are of excellent quality.<br /> The results are particularly interesting with regard to the elucidation of molecular mechanisms that regulate the size of fungal hyphae and their number of nuclei. For this, the authors have discovered a very good model: (regular) strains with a low number of nuclei and strains with high number of nuclei. Also, the results can be expected to be of interest for the further optimization of industrially relevant filamentous fungi.

      In the revision the authors addressed all my comments and as a result produced an even stronger study.

    3. Reviewer #3 (Public review):

      Summary:

      The authors seek to determine the underlying traits that support the exceptional capacity of Aspergillus oryzae to secrete enzymes and heterologous proteins. To do so, they leverage the availability of multiple domesticated isolates of A. oryzae along with other Aspergillus species to perform comparative imaging and genomic analysis.

      Strengths:

      The strength of this study lies in the use of multifaceted approaches to identify significant differences in hyphal morphology that correlate with enzyme secretion, which is then followed by the use of genomics to identify candidate functions that underlie these differences.

      Weaknesses:

      Although the image analysis and data interpretation is convincing, the genetic data supporting the author's model is somewhat more speculative and will likely require additional investigation.

      Overall, the authors have achieved their aims in that they are able to clearly document the presence of two distinct hyphal forms in A. oryzae and other Aspergillus species, and to correlate the presence of the thicker rapidly growing form with enhanced enzyme secretion. The image analysis is convincing. The discovery that addition of yeast extract and specific amino acids can stimulate formation of the novel hyphal form is also notable. Although the conclusions are generally supported by the results, this is perhaps less so for the genetic analysis as it remains unclear how direct the role of RseA and the calcium transporters might be in supporting the formation of the thicker hyphae.

      The results presented here will impact the field. The complexity of hyphal morphology and how it affects secretion are not well understood despite the importance of these processes for the fungal lifestyle. In addition, the description of approaches that can be used to facilitate the study of these different hyphal forms (i.e., stimulation using yeast extract or specific animo acids) will benefit future efforts to understand the molecular basis of their formation.

    1. Reviewer #1 (Public review):

      Summary:

      The behaviour of cells expressing constitutively active HRas is examined in mosaic monolayers, both in MCF10a breast epithelial and Beas2b bronchial epithelial cell lines, mimicking the potential initial phase of development of carcinoma. Single HRas-positive cells are excluded from MCF10a but not Beas2b monolayers. Most interestingly, however, when in groups, these cells are not excluded, but rather sharply segregated within a MCF10a monolayer. In contrast, they freely mix with wt Beas2b cells. Biophysical analysis identifies high tension at heterotypic interfaces between HRas and wild-type cells as the likely reason for segregation of MCF10a cells. The hypothesis is supported experimentally, as myosin inhibition abolishes segregation. The probable reason for the lack of segregation in the bronchial epithelium is to be found in the different intrinsic properties of these cells, which form a looser tissue with lower basal actomyosin activity. The behaviour of single cells and groups is recapitulated in a vortex model based on the principle of differential interfacial tension, under the condition of high heterotypic interfacial tension.

      Strengths:

      Despite being long recognized as a crucial event during cancer development, segregation of oncogenic cells has been a largely understudied question. This nice work addresses the mechanics of this phenomenon through a straightforward experimental design, applying the biophysical analytical approaches established in the field of morphogenesis. Comparison between two cell types provides some preliminary clues on the diversity of effects in various cancers.

      Weaknesses:

      Although not calling into question the main message of this study, there are a few issues that one may want to address:

      (1) One may be careful in interpreting the comparison between MCF10a and Beas2b cells as used in this study. The conditions may not necessarily be representative of the actual properties of breast and bronchial epithelia. How much of the epithelial organization is reconstituted under these experimental conditions remains to be established. This is particularly obvious for bronchial cells, which would need quite specific culture conditions to build a proper bronchial layer. In this study, they seemed to be on the verge of a mesenchymal phenotype (large gaps, huge protrusions, cells growing on top of each other, as mentioned in the manuscript).

      As an alternative to Beas2b, comparison of MCF10a with another cell line capable of more robust in vitro epithelial organization, but ideally with different adhesive and/or tensile properties, would be highly interesting, as it may narrow down the parameters involved in segregation of oncogenic cells.

      (2) While the seminal description of tissue properties based on interfacial tensions (Brodland 2002) is clearly key to interpreting these data, the actual "Differential Interfacial Tension Hypothesis" poses that segregation results from global differences, i.e., juxtaposition of two tissues displaying different intrinsic tensions. On the contrary, the results of the present work support a different scenario, where what counts is the actual difference in tension ALONG the tissue boundary, in other words, that segregation is driven by high HETEROTYPIC interfacial tension. This is an important distinction that should be clarified.

      (3) Related: The fact that actomyosin accumulates at the heterotypic interface is key here. It would be quite informative to better document the pattern of this accumulation, which is not clear enough from the images of the current manuscript: Are we talking about the actual interface between mutant and wt cells (membrane/cortex of heterotypic contacts)? Or is it more globally overactivated in the whole cell layer along the border? Some better images and some quantification would help.

      (4) In the case of Beas2b cells, mutant cells show higher actin than wt cells, while actin is, on the contrary, lower in mutant MCF10a cells (Figure 2b). Has this been taken into account in the model? It may be in line with the idea that HRas may have a different action on the two cell types, a possibility that would certainly be worth considering and discussing.

      In conclusion, the study conveys an important message, but, as it stands, the strength of evidence is incomplete. It would greatly benefit from a more detailed and complete analysis of the experimental data, a better fit between this analysis and the corresponding vertex model, and a more in-depth discussion of biological and biophysical aspects. These revisions should be rather easily done, and would then make the evidence much more solid.

    2. Reviewer #2 (Public review):

      Summary:

      The authors investigate the behavior of oncogenic cells in mammary and bronchial epithelia. They observe that individual oncogenic cells are preferentially excluded from the mammary epithelium, but they remain integrated in the bronchial epithelium. They also observe that clusters of oncogenic cells form a compact cluster in the mammary epithelium, but they disperse in the bronchial epithelium. The authors demonstrate experimentally and in the vertex model simulations that the difference in observed behavior is due to the differential tension between the mutant and wild-type cells due to a differential expression of actin and myosin.

      Strengths:

      (1) Very detailed analysis of experiments to systematically characterize and quantify differences between mammary and bronchial epithelia.

      (2) Detailed comparison between the experiments and vertex model simulations to identify the differential cell line tension between the oncogenic and wild-type cells as one of the key parameters that are responsible for the different behavior of oncogenic cells in mammary and bronchial epithelia

      Weaknesses:

      (1) It is unclear what the mechanistic origin of the shape-tension coupling is, which is used in the vertex model, and how important that coupling is for the presented results. The authors claim that the shape-tension coupling is due to the anisotropic distribution of stress fibers when cells are under external stress. It is unclear why the stress fibers should affect an effective line tension on the cell boundaries and why the stress fibers should be sensitive to the magnitude of the internal isotropic cell pressure. In experiments, it makes sense that stress fibers form when cells are stretched. Similar stress fibers form when the cytoskeleton or polymer networks are stretched. It is unclear why the stress fibers should be sensitive to the magnitude of internal isotropic cell pressure. If all the surrounding cells have the same internal pressure, then the cell would not be significantly deformed due to that pressure, and stress fibers would not form. The authors should better justify the use of the shape-tension coupling in the model and also present simulation results without that coupling. I expect that most of the observed behavior is already captured by the differential tension, even if there is no shape-tension coupling.

      (2) The observed difference of shape indices between the interfacial and bulk cells in simulations in the absence of differential line tension is concerning. This suggests that either there are not enough statistics from the simulations or that something is wrong with the simulations. For all presented simulation results, the authors should repeat multiple simulations and then present both averages and standard deviations. This way, it would be easier to determine whether the observed differences in simulations are statistically significant.

      (3) The authors should also analyze the cell line tension data in simulations and make a comparison with experiments.

    1. Reviewer #1 (Public review):

      Summary:

      Sakelaris and Riecke used computational modeling to explore how neurogenesis and sequential integration of new neurons into a network support memory formation and maintenance. They focus on the integration of granule cells in the olfactory bulb, a brain area where adult neurogenesis is prominent. Experimental results published during recent years provide an excellent basis to address the question at hand by biologically constrained models. The study extends previous computational models and provides a coherent picture of how multiple processes may act in concert to enable rapid learning, high stability of memories, and high memory capacity. This computational model generates experimentally testable predictions and is likely to be valuable to understand roles of neurogenesis and related phenomena in memory. One of the key findings is that important features of the memory system depend on transient properties of adult-born granule cells such as enhanced excitability and apoptosis during specific phases the development of individual neurons. The model can explain many experimental observations, and suggests specific functions for different processes (e.g., importance of apoptosis for continual learning). While this model is obviously a massive simplification of the biological system, it conceptualizes diverse experimental observations into a coherent picture, it generates testable predictions for experiments, and it and will likely inspire further modeling and experimental studies.

      Strengths:

      - The model can explain diverse experimental observations

      - The model directly represents the biological network

      Weaknesses:

      - As many other models of biological networks, this model contains major simplifications.

    2. Reviewer #2 (Public review):

      Summary:

      The authors propose a mechanism to provide flexibility to learn new information while preserving stability in neural networks by combining structural plasticity and synaptic plasticity.

      Strengths:

      An intriguing idea, well embedded in experimental data.

      Authors have done a great job addressing reviewers' concerns

      Weaknesses:

      None

    3. Reviewer #3 (Public review):

      The manuscript is focused on local bulbar mechanisms to solve the flexibility-stability dilemma in contrast to long range interactions documented in other systems (hippocampus-cortex). The network performance is assessed in a perceptual learning task: the network is presented with alternating, similar artificial stimuli (defined as enrichment) and the authors assess its ability to discriminate between these stimuli by comparing the mitral cell representations quantified by Fisher discriminant analysis. The authors use enhancement in discriminability between stimuli as function of the degree of specificity of connectivity in the network to quantify the formation of an odor-specific network structure which as such has memory - they quantify memory as the specificity of that connectivity.

      The focus on neurogenesis, excitability and synaptic connectivity of abGCs is topical, and the authors systematically built their model, clearly stating their assumptions and setting up the questions and answers. In my opinion, the combination of latent dendritic representations, excitability and apoptosis in an age-dependent manner is interesting and as the authors point out leads to experimentally testable hypotheses.

      In the revised manuscript, the authors have systematically addressed my previous concerns. In particular, they now refer to previous work on granule cells-mitral cell interactions more generally, they explain the pros and cons for usage of specificity in connectivity as a proxy for memory capacity, and the biological plausibility of the model.

    1. Reviewer #1 (Public review):

      Summary:

      This work provides a comprehensive analysis of how adult zebrafish show fear responses to conspecific alarm substances (CAS) and retain their associative memory. It shows that freezing is a more reliable measure of fear response and memory compared to evasive swimming, and that the reactivity and the type of responses depend on the zebrafish strain. It further suggests neuronal substrates of different fear responses based on c-Fos mapping.

      Strengths:

      The behavioral part is the most comprehensive and detailed yet in the zebrafish field, providing strong support for the authors' claim. The flow from Figure 1 to Figure 4 is very smooth. They provide extremely detailed, yet complementary and necessary, analyses of how different categories of behavior emerge over time during the CAS exposure and memory retrieval. I'm convinced that neuro researchers who study fear/stress responses will always refer to this paper to plan and interpret their future experiments.

      Weaknesses:

      The neural analysis part is very comprehensive. Figure 5 and Figure 6 are independent but complement each other very well. They together support that the cerebellar system is the key brain component for a freezing response. Their extreme focus on high-level analyses, however, came at the expense of biological intuitions. I suggest adding some figure panels and result/discussion paragraphs to help with that aspect.

    2. Reviewer #2 (Public review):

      In this study, Fontana et al. develop a paradigm for associative conditioning by pairing exposure to an alarm substance with a novel tank. Exposure to conspecific alarm substance (CAS) in the novel tank triggers freezing and what they characterize as evasive swimming behaviour, which is subsequently seen in a re-exposure to the novel tank without the CAS present. Importantly, these states are identified via automated processes, including postural tracking and a random forest classification process, which could be very useful tools for subsequent studies.

      In their experiments, they focus on the differences in behaviour among strains of zebrafish (both males and females), and among individual zebrafish. For males and females of different strains, they find some differences, though the clearest message seems to be that the most robust measure of the behaviour in response to both the CAS and in the memory trials is the freezing behaviour, while evasive behaviour is more variable. and not always seen. This may relate to their observation of significant "evasiveness" in vehicle control experiments (discussed further below).

      Moving on to individual variation from within this multi-strain male/female dataset, they first examine transition matrices between states and find tthat his is not dramatically altered by stimulus exposure. They then use clustering to identify 4 different "classes" of zebrafish that differ in their expression (or not) of two types of behaviour: freezing and/or evasive behaviour. They show that over the three exposure epochs of the experiment, this classification is somewhat stable in an individual fish, though many fish change their behaviour - e.g., evading + freezing -> only freezing.

      In the final set of experiments, the authors move beyond behavioural analyses and perform whole-brain cFos mapping of these individual zebrafish. They perform analyses aimed at identifying correlations between individual behavioural expression and the number of cFos-positive cells in different brain regions. Using partial least squares analysis, they find areas associated with two types of behavioural contrasts, which differ in their weighting of different behavioural expression during the Memory trials. Covariation and network structure analysis within different classes of larvae also find some differences in covariation among brain areas, providing hypotheses as to underlying network effects that may govern the expression of freezing and/or evasive behavior in the memory trial phases.

      Overall, I find this to be an interesting study that employs state of the are methods of behavioural analyses and whole-brain cFos analyses, but I am left a little bit confused as to what the take home message is and what can be concluded from this complex study that mixes in analyses of strain, sex, and individuality within a quite complex assay with multiple behavioural parameters.

      My suggestions are as follows:

      (1) My first concern relates to the claim in the abstract that "We found that fear memory behavior fell into four distinct groups: non-reactive, evaders, evading freezers, and freezers".

      In my opinion, the "freezing" aspect is well supported as being both triggered by the CAS and for memory effect upon re-exposure to the tank, but I am less convinced about the "evasive" behaviour. In Figure 2, it appears that "evasiveness" is generally not increased in both the Exposure or Memory phases for many groups, and in Figure 5, it appears that "evasiveness" is expressed by nearly 50% of the fish in the pre-exposure condition before CAS addition and in all phases in the vehicle condition. Therefore, it appears that most of the expression of this behaviour is independent of any memory-based effect.

      (2) My second concern relates to the claim in the abstract that "background strain and sex influenced how fish respond to CAS, with males more likely to increase evasive behaviors than females and the TU strain more likely to be non-reactive."

      My understanding, based on the introduction and on the methods, is that it is likely important that the CAS be prepared from conspecifics of the same strain and sex, and for this reason, they prepared different CAS specific for each strain and each sex. Therefore, the "CAS" that is applied is necessarily different for each condition, and I am concerned about if the differences observed could relate more to variation in the quality, purity, concentration, etc. of the specific CAS samples for different groups, rather than their reactivity to the substance or their ability to form memories based on such experiences.

      (3) My third concern relates to the interpretation of the cFos data.

      As I mentioned above, I feel as though the behavioural analysis is perhaps more complex than is warranted via the inclusion of evasiveness, and I wonder if the conclusions from the experiments would be simpler if analyzed only from the perspective of freezing.

      But considering the presented analyses: while I dont think there is anything wrong with the partial least squares approach and the network analyses, I am concerned that the simple messaging in the text does not reflect the complexity of this analysis combining different weightings of different behavioural characteristics in a behavioural contrast, or covariations among many regions and what such analyses mean at the level of brain function. For these reasons, I feel like statements along the lines of "Behavioral variation is driven by differences in the activity of brain regions outside the telencephalon, such as the cerebellum, preglomerular nuclei, preoptic area and hypothalamus" are not well supported.

    3. Reviewer #3 (Public review):

      Summary:

      This manuscript by Fontana et al. sets out to fill a critical gap in our understanding of how individuality in fear responses corresponds to changes in brain activity. Previous work has shown in myriad species that fear behaviors are highly variable, and these variabilities correlate with sex and strain, with epigenetic modifications, and neural activity in specific regions of the brain, such as the amygdala. However, a whole-brain functional assessment of whether activity in different regions of the brain is associated with fear behavior has been difficult to assess, in part due to the large size and opacity of the brain. The Kenney group overcomes these limitations using the zebrafish, together with powerful behavioral and brain imaging approaches pioneered by their lab. To overcome the technical obstacles of delivering a reproducible unconditioned stimulus in water and quantifying nuanced behavioral responses, the authors developed a three-day conditioning paradigm in which fish were repeatedly exposed to CAS in one tank context and to control water in another. Leveraging automated cluster analysis across over 300 individuals from four inbred strains, they identified four distinct memory-recall phenotypes - non-reactive, evaders, evading freezers, and freezers - demonstrating both the robustness of their assay and the influence of genetic background and sex on fear learning. Finally, whole-brain imaging using the AZBA atlas (Kenney et al. eLife) and cfos mapping coupled with multivariate analysis revealed that although all fish reengaged telencephalic regions during recall, high-freezing phenotypes uniquely recruited cerebellar, preglomerular, and pretectal nuclei, whereas mixed evasion-freezing fish showed preferential activation of preoptic and hypothalamic areas - a finding that lays the groundwork for dissecting the distributed neural substrates of associative fear in zebrafish.

      Strengths:

      The strengths of the study lie in the use of zeberarish and the innovative behavioral, modeling, and brain imaging tools applied to address this question. The question of how brain-wide activity correlates with variations in fear behavior is fundamental, and arguably, this system is the only system that could be used to address this. The statistics are appropriate, and the study is well reasoned. Overall, I like this manuscript very much and think it adds invaluable information to the field of fear/anxiety.

      Weaknesses:

      I have a few questions and suggestions.

      (1) The three-day contextual fear paradigm, as implemented - one CAS pairing on day 2 followed by a single recall test on day 3 - inevitably conflates acquisition and long-term memory, making it impossible to know whether strains like TU truly recall the association poorly or simply learn it more slowly. For example, given that TU fish extinguish fear faster than AB or TL strains in extended protocols, they may simply require additional or repeated CAS pairings to achieve the same asymptotic performance. To disentangle learning kinetics from recall strength, the assay could be revised to include multiple acquisition trials (e.g., conditioning on two or more consecutive days) with an immediate post-conditioning probe to assess acquisition independent of consolidation, and continuous measurement of freezing and evasive behaviors across each trial to fit learning curves for each strain. Such refinements - even if on a subset of the strains - would reveal whether "non-reactive" phenotypes reflect genuine recall deficits or merely delayed acquisition.

      (2) My second major question is with respect to Figure 3 panel B. This is a complex figure, and I can understand the gist of what the authors are attempting to show, but it is difficult to understand as it is. Can this be represented in a way that is clearer and explained a bit more easily?

      (3) The brain mapping is by far one of the most interesting aspects of this study, and the methods that the group used are interesting. The brain mapping, however, relies on generating "contrasting" groups (Figure 6A), and I was not clear as to how these two groups were formed. Could the authors elaborate a bit?

    1. Reviewer #1 (Public review):

      Summary:

      In this study, the authors present a thorough mechanistic study of the J-domain protein Apj1 in Saccharomyces cerevisiae, establishing it as a key repressor of Hsf1 during the attenuation phase of the heat shock response (HSR). The authors integrate genetic, transcriptomic (ribosome profiling), biochemical (ChIP, Western), and imaging data to dissect how Apj1, Ydj1 and Sis1 modulate Hsf1 activity under stress and non-stress conditions. The work proposes a model where Apj1 specifically promotes displacement of Hsf1 from DNA-bound heat shock elements, linking nuclear PQC to transcriptional control.

      Strengths:

      Overall, the work is highly novel-this is the first detailed functional dissection of Apj1 in Hsf1 attenuation. It fills an important gap in our understanding of how Hsf1 activity is fine-tuned after stress induction, with implications for broader eukaryotic systems. I really appreciate the use of innovative techniques including ribosome profiling and time-resolved localization of proteins (and tagged loci) to probe Hsf1 mechanism. The overall proposed mechanism is compelling and clear-the discussion proposes a phased control model for Hsf1 by distinct JDPs, with Apj1 acting post-activation, while Sis1 and Ydj1 suppress basal activity.

      The manuscript is well-written and will be exciting for the proteostasis field and beyond.

      Comments on revised version:

      The authors have addressed all my concerns,

    2. Reviewer #2 (Public review):

      Summary:

      Overall, the work is exceptionally well done and controlled and the results properly and appropriately interpreted. While several of the approaches, while powerful, are somewhat indirect (i.e., following gene expression via ribosomal profiling) additional experiments utilizing traditional gene expression assays added in revision combine to ultimately provide a compelling answer to the main questions being asked.

      The key finding from this work is the discovery that Apj1 regulates Hsf1 attenuation in a manner that includes Hsp70. That finding is strongly supported by the experimental data. While it would be ideal to also demonstrate Apj1-controlled differential binding of Ssa1/2 to Hsf1 at either the N- or C-terminal binding sites during attenuation, the Hsp70-Hsf1 interactions are difficult to reproducibly assess in cell extracts and are likely beyond the scope of this study. However, this work paves the way in the future for potential biochemical reconstitution assays that could elucidate both Hsp70-Hsf1 interactions as well as the distinct JDP-Hsf1 interactions reported here.

      This discovery raises additional new questions about JDP specificity in HSR regulation and the role of JDPs in navigating protein aggregation and sensing of proteostatic challenge in the nucleus, thus advancing the field and opening new, exciting avenues for exploration.

    3. Reviewer #3 (Public review):

      Summary:

      The heat shock response (HSR) is an inducible transcriptional program that has provided paradigmatic insight into how stress cues feed information into the control of gene expression. The recent elucidation that the chaperone Hsp70 controls the DNA binding activity of the central HSR transcription factor Hsf1 by direct binding has spurred the question how such a general chaperone obtains specificity. This study has addressed the next logical question, how J-domain proteins execute this task in budding yeast, the leading cell model for studying the HSR. While an involvement and in part overlapping function of general class A and B J-domain proteins, Ydj1 and Sis1 are indicated by the genetic analysis a highly specific role for the class A Apj1 in displacing Hsf1 from the promoters is found unveiling specificity in the system.

      Strengths

      The central strong point of the paper is the identification of class A J-domain protein Apj1 as a specific regulator of the attenuation of the HSR by removing Hsf1 from HSEs at the promoters. The genetic evidence and the ChIP data strongly support this claim. This identification of a specific role for a lowly expressed nuclear J-domain protein changes how the wiring of the HSR should be viewed. It also raises important questions regarding the model of chaperone titration, the concept that a chaperone with limiting availability is involved in a thug of war involving competing interactions with misfolded protein substrates and regulatory interactions with Hsf1. Perhaps Apj1 with its low levels and interactions with misfolded and aggregated proteins in the nucleus is the titrated Hsp70 (co)chaperone that determines the extent of the HSR? This would mean that Apj1 is at the nexus of the chaperone titration mechanism. Although Apj1 is not a highly conserved J domain protein among eukaryotes the strength of the study is that is provides a conceptual framework for what may be required for chaperone titration in other eukaryotes: One or more nuclear J-domain proteins with low nuclear levels that has an affinity for Hsf1 and that can become limiting due to interactions with misfolded Hsp70 proteins. The provides a pathway for how these may be identified using for example ChIP-seq.

      Weakness

      A built-in challenge when studying the mechanism of the HSR is the general role of Hsp70 chaperone system and its J domain proteins. Indeed, a weakness of the study is that it is unclear what of the phenotypic effects have to do with directly recruiting Hsp70 to Hsf1 dependent on a J domain protein and what instead is an indirect effect of protein misfolding caused by the mutation. This interpretation problem is clearly and appropriately dealt with in the manuscript text and in experiments but is of such fundamental nature that it cannot easily be fully ruled out.

    1. Reviewer #1 (Public review):

      Summary:

      This manuscript from Jones and colleagues investigates a previously described phenomenon in which P. falciparum malaria parasites display increased trafficking of proteins displayed on the surface of infected RBCs, as well as increased cytoadherence in response to febrile temperatures. While this parasite response was previously described, it was not uniformly accepted, and conflicting reports can be found in the literature. This variability likely arises due to differences in the methods employed and the degree of temperature increase to which the parasites were exposed. Here, the authors are very careful to employ a temperature shift that likely reflects what is happening in infected humans and that they demonstrate is not detrimental to parasite viability or replication. In addition, they go on to investigate what steps in protein trafficking are affected by exposure to increased temperature and show that the effect is not specific to PfEMP1 but rather likely affects all transmembrane domain-containing proteins that are trafficked to the RBC. They also detect increased rates of phosphorylation of trafficked proteins, consistent with overall increased protein export.

      Strengths:

      The authors used a relatively mild increase in temperature (39 degrees), which they demonstrate is not detrimental to parasite viability or replication. This enabled them to avoid potential complications of a more severe heat shock that might have affected previously published studies. They employed a clever method of fractionation of RBCs infected with a var2csa-nanoluc fusion protein expressing parasite line to determine which step in the export pathway was likely accelerating in response to increased temperature. This enabled them to determine that export across the PVM is being affected. They also explored changes in phosphorylation of exported proteins and demonstrated that the effect is not limited to PfEMP1 but appears to affect numerous (or potentially all) exported transmembrane domain-containing proteins.

      Weaknesses:

      All the experiments investigating changes resulting from increased temperature were conducted after an increase in temperature from 16 to 24 hours, with sampling or assays conducted at the 24 hr mark. While this provided consistency throughout the study, this is a time point relatively early in the export of proteins to the RBC surface, as shown in Figure 1E. At 24 hrs, only approximately 50% of wildtype parasites are positive for PfEMP1, while at 32 hrs this approaches 80%. Since the authors only checked the effect of heat stress at 24 hrs, it is not possible to determine if the changes they observe reflect an overall increase in protein trafficking or instead a shift to earlier (or an accelerated) trafficking. In other words, if a second time point had been considered (for example, 32 hrs or later), would the parasites grown in the absence of heat stress catch up?

    2. Reviewer #2 (Public review):

      This manuscript describes experiments characterising how malaria parasites respond to physiologically relevant heat-shock conditions. The authors show, quite convincingly, that moderate heat-shock appears to increase cytoadherance, likely by increasing trafficking of surface proteins involved in this process.

      While generally of a high quality and including a lot of data, I have a few small questions and comments, mainly regarding data interpretation.

      (1) The authors use sorbitol lysis as a proxy for trafficking of PSAC components. This is a very roundabout way of doing things and does not, I think, really show what they claim. There could be a myriad of other reasons for this increased activity (indeed, the authors note potential PSAC activation under these conditions). One further reason could be a difference in the membrane stability following heat shock, which may affect sorbitol uptake, or the fragility of the erythrocytes to hypotonic shock. I really suggest that the authors stick to what they show (increased PSAC) without trying to use this as evidence for increased trafficking of a number of non-specified proteins that they cannot follow directly.

      (2) Supplementary Figure 6C/D: The KAHRP signal does not look like it should. In fact, it doesn't look like anything specific. The HSP70-X signal is also blurry and overexposed. These pictures cannot be used to justify the authors' statements about a lack of colocalisation in any way.

      (3) Figure 6: This experiment confuses me. The authors purport to fractionate proteins using differential lysis, but the proteins they detect are supposed to be transmembrane proteins and thus should always be found associated with the pellet, whether lysis is done using equinatoxin or saponin. Have they discovered a currently unknown trafficking pathway to tell us about? Whilst there is a lot of discussion about the trafficking pathways for TM proteins through the host cell, a number of studies have shown that these proteins are generally found in a membrane-bound state. The authors should elaborate, or choose an experiment that is capable of showing compartment-specific localisation of membrane-bound proteins (protease protection, for example).

      (4) The red blood cell contains, in addition to HSP70-X, a number of human HSPs (HSP70 and HSP90 are significant in this current case). As the name suggests, these proteins non-specifically shield exposed hydrophobic domains revealed upon partial protein unfolding following thermal insult. I would thus have expected to find significantly more enrichment following heat shock, but this is not the case. Is it possible that the physiological heat shock conditions used in this current study are not high enough to cause a real heat shock?

    3. Reviewer #3 (Public review):

      Summary:

      In this paper, it is established that high fever-like 39{degree sign}C temperatures cause parasite-infected red blood cells to become stickier. It is thought that high temperatures might help the spleen to destroy parasite-infected cells, and they become stickier in order to remain trapped in blood vessels, so they stop passing through the spleen.

      Strengths:

      The strength of this research is that it shows that fever-like temperatures can cause parasite-infected red blood cells to stick to surfaces designed to mimic the walls of small blood vessels. In a natural infection, this would cause parasite-infected red blood cells to stop circulating through the spleen, where the parasites would be destroyed by the immune system. It is thought that fevers could lead to infected red blood cells becoming stiffer and therefore more easily destroyed in the spleen. Parasites respond to fevers by making their red blood cells stickier, so they stop flowing around the body and into the spleen. The experiments here prove that fever temperatures increase the export of Velcro-like sticky proteins onto the surface of the infected red blood cells and are very thorough and convincing.

      Weaknesses:

      A minor weakness of the paper is that the effects of fever on the stiffness of infected red blood cells were not measured. This can be easily done in the laboratory by measuring how the passage of infected red blood cells through a bed of tiny metal balls is delayed under fever-like temperatures.

    1. Reviewer #1 (Public review):

      Summary:

      In this article, Almeida and colleagues use a combination of NMR and ITC to study the interaction of the EBH domain of microtubule end-binding protein 1 (EB1) with SxIP peptides derived from the MACF plus-end tracking protein. EBH forms a dimer and in isolation has previously been shown to have a disordered C-terminal tail. Here, the authors use NMR to determine a solution structure of the EBH dimer bound to 11-mer SxIP peptides derived from MACF, and observe that the disordered C-terminal of EBH is recruited by residues C-terminal to the SxIP motif to fold into the final complex. By comparison of binding in different length peptides, and of EBH lacking the C-terminal tail, they show that these additional contacts increase binding affinity by an order of magnitude, greatly stabilising the interaction, in a binding mode they term 'dock-and-lock'.

      The authors also use their new structural knowledge to design peptides with higher affinities, and show in a cell model that these can be weakly recruited to microtubule ends - although a dimeric construct is necessary for efficient recruitment. Ultimately, by demonstrating the feasibility of targeting these proteins, this work points towards the possibility of designing small-molecules to block the interactions.

    2. Reviewer #2 (Public review):

      Summary:

      The C-terminal region of EB1 is responsible for protein-protein interactions, thereby recruiting the binding partners of EB1 to microtubules; the coiled-coil region (EBH) and the acidic tail are critical for their binding partners. The authors demonstrated by using NMR that the binding mode of EBH with the SxIP motif, which is a two-step process termed "dock-and-lock". The ITC analysis supports the results obtained from NMR. The initial version of the manuscript contained ambiguities on the ITC data; however, the results of the revised manuscript are convincing and support the two-step binding model.

      Strength:

      The authors propose a novel model of "dock-and-lock" by using multiple methods of NMR, ITC and cell biology.

    1. Reviewer #1 (Public review):

      This paper presents a computational model of the evolution of two different kinds of helping ("work," presumably denoting provisioning, and defense tasks) in a model inspired by cooperatively breeding vertebrates. The helpers in this model are a mix of previous offspring of the breeder and floaters that might have joined the group, and can either transition between the tasks as they age or not. The two types of help have differential costs: "work" reduces "dominance value," (DV), a measure of competitiveness for breeding spots, which otherwise goes up linearly with age, but defense reduces survival probability. Both eventually might preclude the helper from becoming a breeder and reproducing. How much the helpers help, and which tasks (and whether they transition or not), as well as their propensity to disperse, are all evolving quantities. The authors consider three main scenarios: one where relatedness emerges from the model, but there is no benefit to living in groups, one where there is no relatedness, but living in larger groups gives a survival benefit (group augmentation, GA), and one where both effects operate. The main claim is that evolving defensive help or division of labor requires the group augmentation; it doesn't evolve through kin selection alone in the authors' simulations.

      This is an interesting model, and there is much to like about the complexity that is built in. Individual-based simulations like this can be a valuable tool to explore the complex interaction of life history and social traits. Yet, models like this also have to take care of both being very clear on their construction and exploring how some of the ancillary but potentially consequential assumptions affect the results, including robust exploration of the parameter space. I think the current manuscript falls short in these areas, and therefore, I am not yet convinced of the results.

      In this round, the authors provided some clarity, but some questions still remain, and I remain unconvinced by a main assumption that was not addressed.

      Based on the authors' response, if I understand the life history correctly, dispersers either immediately join another group (with 1-the probability of dispersing), or remain floaters until they successfully compete for a breeder spot or die? Is that correct? I honestly cannot decide because this seems implicit in the first response but the response to my second point raises the possibility of not working while floating but can work if they later join a group as a subordinate. If it is the case that floaters can have multiple opportunities to join groups as subordinates (not as breeders; I assume that this is the case for breeding competition), this should be stated, and more details about how.

      So there is still some clarification to be done, and more to the point, the clarification that happened only happened in the response. The authors should add these details to the main text. Currently, the main text only says vaguely that joining a group after dispersing " is also controlled by the same genetic dispersal predisposition" without saying how.

      In response to my query about the reasonableness of the assumption that floaters are in better condition (in the KS treatment) because they don't do any work, the authors have done some additional modeling but I fail to see how that addresses my point. The additional simulations do not touch the feature I was commenting on, and arguably make it stronger (since assuming a positive beta_r -which btw is listed as 0 in Table 1- would make floaters on average be even more stronger than subordinates). It also again confuses me with regard to the previous point, since it implies that now dispersal is also potentially a lifetime event. Is that true?

      Meanwhile, the simplest and most convincing robustness check, which I had suggested last round, is not done: simply reduce the increase in the R of the floater by age relative to subordinates. I suspect this will actually change the results. It seems fairly transparent to me that an average floater in the KS scenario will have R about 15-20% higher than the subordinates (given no defense evolves, y_h=0.1 and H_work evolves to be around 5, and the average lifespan for both floaters and subordinates are in the range of 3.7-2.5 roughly, depending on m). That could be a substantial advantage in competition for breeding spots, depending on how that scramble competition actually works. I asked about this function in the last round (how non-linear is it?) but the authors seem to have neglected to answer.

      More generally, I find that the assumption (and it is an assumption) floaters are better off than subordinates in a territory to be still questionable. There is no attempt to justify this with any data, and any data I can find points the other way (though typically they compare breeders and floaters, e.g.: https://bioone.org/journals/ardeola/volume-63/issue-1/arla.63.1.2016.rp3/The-Unknown-Life-of-Floaters--The-Hidden-Face-of/10.13157/arla.63.1.2016.rp3.full concludes "the current preliminary consensus is that floaters are 'making the best of a bad job'."). I think if the authors really want to assume that floaters have higher dominance than subordinates, they should justify it. This is driving at least one and possibly most of the key results, since it affects the reproductive value of subordinates (and therefore the costs of helping).

      Regarding division of labor, I think I was not clear so will try again. The authors assume that the group reproduction is 1+H_total/(1+H_total), where H_total is the sum of all the defense and work help, but with the proviso that if one of the totals is higher than "H_max", the average of the two totals (plus k_m, but that's set to a low value, so we can ignore it), it is replaced by that. That means, for example, if total "work" help is 10 and "defense" help is 0, total help is given by 5 (well, 5.1 but will ignore k_m). That's what I meant by "marginal benefit of help is only reduced by a half" last round, since in this scenario, adding 1 to work help would make total help go to 5.5 vs. adding 1 to defense help which would make it go to 6. That is a pretty weak form of modeling "both types of tasks are necessary to successfully produce offspring" as the newly added passage says (which I agree with), since if you were getting no defense by a lot of food, adding more food should plausibly have no effect on your production whatsoever (not just half of adding a little defense). This probably explains why often the "division of labor" condition isn't that different than the no DoL condition.

    2. Reviewer #2 (Public review):

      Summary:

      This paper formulates an individual-based model to understand the evolution of division of labor in vertebrates. The model considers a population subdivided in groups, each group has a single asexually-reproducing breeder, other group members (subordinates) can perform two types of tasks called "work" or "defense", individuals have different ages, individuals can disperse between groups, each individual has a dominance rank that increases with age, and upon death of the breeder a new breeder is chosen among group members depending on their dominance. "Workers" pay a reproduction cost by having their dominance decreased, and "defenders" pay a survival cost. Every group member receives a survival benefit with increasing group size. There are 6 genetic traits, each controlled by a single locus, that control propensities to help and disperse, and how task choice and dispersal relate to dominance. To study the effect of group augmentation without kin selection, the authors cross-foster individuals to eliminate relatedness. The paper allows for the evolution of the 6 genetic traits under some different parameter values to study the conditions under which division of labour evolves, defined as the occurrence of different subordinates performing "work" and "defense" tasks. The authors envision the model as one of vertebrate division of labor.

      The main conclusion of the paper is that group augmentation is the primary factor causing the evolution of vertebrate division of labor, rather than kin selection. This conclusion is drawn because, for the parameter values considered, when the benefit of group augmentation is set to zero, no division of labor evolves and all subordinates perform "work" tasks but no "defense" tasks.

      Strengths:

      The model incorporates various biologically realistic details, including the possibility to evolve age polytheism where individuals switch from "work" to "defence" tasks as they age or vice versa, as well as the possibility of comparing the action of group augmentation alone with that of kin selection alone.

      Weaknesses:

      The model and its analysis is limited, which makes the results insufficient to reach the main conclusion that group augmentation and not kin selection is the primary cause of the evolution of vertebrate division of labor. There are several reasons.

      First, the model strongly restricts the possibility that kin selection is relevant. The two tasks considered essentially differ only by whether they are costly for reproduction or survival. "Work" tasks are those costly for reproduction and "defense" tasks are those costly for survival. The two tasks provide the same benefits for reproduction (eqs. 4, 5) and survival (through group augmentation, eq. 3.1). So, whether one, the other, or both tasks evolve presumably only depends on which task is less costly, not really on which benefits it provides. As the two tasks give the same benefits, there is no possibility that the two tasks act synergistically, where performing one task increases a benefit (e.g., increasing someone's survival) that is going to be compounded by someone else performing the other task (e.g., increasing that someone's reproduction). So, there is very little scope for kin selection to cause the evolution of labour in this model. Note synergy between tasks is not something unusual in division of labour models, but is in fact a basic element in them, so excluding it from the start in the model and then making general claims about division of labour is unwarranted. I made this same point in my first review, although phrased differently, but it was left unaddressed.

      Second, the parameter space is very little explored. This is generally an issue when trying to make general claims from an individual-based model where only a very narrow parameter region has been explored of a necessarily particular model. However, in this paper, the issue is more evident. As in this model the two tasks ultimately only differ by their costs, the parameter values specifying their costs should be varied to determine their effects. Instead, the model sets a very low survival cost for work (yh=0.1) and a very high survival cost for defense (xh=3), the latter of which can be compensated by the benefit of group augmentation (xn=3). Some very limited variation of xh and xn is explored, always for very high values, effectively making defense unevolvable except if there is group augmentation. Hence, as I stated in my previous review, a more extensive parameter exploration addressing this should be included, but this has not been done. Consequently, the main conclusion that "division of labor" needs group augmentation is essentially enforced by the limited parameter exploration, in addition to the first reason above.

      Third, what is called "division of labor" here is an overinterpretation. When the two tasks evolve, what exists in the model is some individuals that do reproduction-costly tasks (so-called "work") and survival-costly tasks (so-called "defense"). However, there are really no two tasks that are being completed, in the sense that completing both tasks (e.g., work and defense) is not necessary to achieve a goal (e.g., reproduction). In this model there is only one task (reproduction, equation 4,5) to which both "tasks" contribute equally and so one task doesn't need to be completed if the other task compensates for it. So, this model does not actually consider division of labor.

    1. Reviewer #1 (Public review):

      Jiang et al. present a measure of phenological lag by quantifying the effects of abiotic constraints on the differences between observed and expected phenological changes, using a combination of previously published phenology change data for 980 species, and associated climate data for study sites. They found that, across all samples, observed phenological responses to climate warming were smaller than expected responses for both leafing and flowering spring events. They also show that data from experimental studies included in their analysis exhibited increased phenological lag compared to observational studies, possibly as a result of reduced sensitivity to climatic changes. Furthermore, the authors present evidence that spatial trends in phenological responses to warming may differ than what would be expected from phenological sensitivity, due to the seasonal timing of when warming occurs. Thus, climate change may not result in geographic convergences of phenological responses. This study presents an interesting way to separate the individual effects of climate change and other abiotic changes on the phenological responses across sites and species.

      Strengths:

      A straightforward mathematical definition of phenological lag allows for this method to potentially be applied in different geographic contexts. Where data exists, other researchers can partition the effects of various abiotic forcings on phenological responses that differ from those expected from warming sensitivity alone.

      Identifying phenological lag, and associated contributing factors, provides a method by which more nuanced predictions of phenological responses to climate change can be made. Thus, this study could improve ecological forecasting models.

      Weaknesses:

      The analysis here could be more robust. A more thorough examination of phenological lag would provide stronger evidence that the framework presented has utility. The differences in phenologica lag by study approach, species origin, region, and growth form are interesting, and could be expanded. For example, the authors have the data to explore the relationships between phenological lag and the quantitative variables included in the final model (altitude, latitude, mean annual temperature) and other spatial or temporal variables. This would also provide stronger evidence for the author's claims about potential mechanisms that contribute to phenological lag.

      The authors include very little data visualizations, and instead report results and model statistics in tables. This is difficult to interpret and may obscure underlying patterns in the data. Including visual representations of variable distributions and between-variable relationships, in addition to model statistics, provides stronger evidence than model statistics alone.

    2. Reviewer #3 (Public review):

      Summary:

      The authors developed a new phenological lag metric and applied this analytical framework to a global dataset to synthesize shifts in spring phenology and assess how abiotic constraints influence spring phenology.

      Strengths:

      The dataset developed in this study is extensive, and the phenological lag metric is valuable.

      Weaknesses:

      The stability of the method used in this study needs improvement, particularly in the calculation of forcing requirements. In addition, the visualization of the results (such as Table 1) should be enhanced.

    1. Reviewer #1 (Public review):

      This work provides a new Python toolkit for combining generative modeling of neural dynamics and inversion methods to infer likely model parameters that explain empirical neuroimaging data. The authors provided tests to show the toolkit's broad applicability, accuracy, and robustness; hence, it will be very useful for people interested in using computational approaches to better understand the brain.

      Strengths:

      The work's primary strength is the tool's integrative nature, which seamlessly combines forward modelling with backward inference. This is important as available tools in the literature can only do one and not the other, which limits their accessibility to neuroscientists with limited computational expertise. Another strength of the paper is the demonstration of how the tool can be applied to a broad range of computational models popularly used in the field to interrogate diverse neuroimaging data, ensuring that the methodology is not optimal to only one model. Moreover, through extensive in-silico testing, the work provided evidence that the tool can accurately infer ground-truth parameters even in the presence of noise, which is important to ensure results from future hypothesis testing are meaningful.

      Weaknesses

      The paper still lacks appropriate quantitative benchmarking relative to non-Bayesian-based inference tools, especially with respect to performance accuracy and computational complexity and efficiency. Without this benchmarking, it is difficult to fully comprehend the power of the software or its ability to be extended to contexts beyond large-scale computational brain modelling.

    2. Reviewer #2 (Public review):

      Summary:

      Whole-brain network modeling is a common type of dynamical systems-based method to create individualized models of brain activity incorporating subject-specific structural connectome inferred from diffusion imaging data. This type of model has often been used to infer biophysical parameters of the individual brain that cannot be directly measured using neuroimaging but may be relevant to specific cognitive functions or diseases. Here, Ziaeemehr et al introduce a new toolkit, named "Virtual Brain Inference" (VBI), offering a new computational approach for estimating these parameters using Bayesian inference powered by artificial neural networks. The basic idea is to use simulated data, given known parameters, to train artificial neural networks to solve the inverse problem, namely, to infer the posterior distribution over the parameter space given data-derived features. The authors have demonstrated the utility of the toolkit using simulated data from several commonly used whole-brain network models in case studies.

      Strength:

      - Model inversion is an important problem in whole-brain network modeling. The toolkit presents a significant methodological step up from common practices, with the potential to broadly impact how the community infers model parameters.<br /> - Notably, the method allows the estimation of the posterior distribution of parameters instead of a point estimation, which provides information about the uncertainty of the estimation, which is generally lacking in existing methods.<br /> - The case studies were able to demonstrate the detection of degeneracy in the parameters, which is important. Degeneracy is quite common in this type of models. If not handled mindfully, they may lead to spurious or stable parameter estimation. Thus, the toolkit can potentially be used to improve feature selection or to simply indicate the uncertainty.<br /> - In principle, the posterior distribution can be directly computed given new data without doing any additional simulation, which could improve the efficiency of parameter inference on the artificial neural network is well-trained.

      Weaknesses:

      - The z-scores used to measure prediction error are generally between 1-3, which seems quite large to me. It would give readers a better sense of the utility of the method if comparisons to simpler methods, such as k-nearest neighbor methods, are provided in terms of accuracy.<br /> - A lot of simulations are required to train the posterior estimator, which is computationally more expensive than existing approaches. Inferring from Figure S1, at the required order of magnitudes of the number of simulations, the simulation time could range from days to years, depending on the hardware. The payoff is that once the estimator is well-trained, the parameter inversion will be very fast given new data. However, it is not clear to me how often such use cases would be encountered. It would be very helpful if the authors could provide a few more concrete examples of using trained models for hypothesis testing, e.g., in various disease conditions.

    1. Reviewer #1 (Public review):

      Summary:

      Ramirez Carbo et al. use the powerful M. xanthus spore morphogenesis model to address fundamental mechanisms in coordinated peptidoglycan remodeling and degradation. As peptidoglycan is an essential macromolecule and difficult to study in vivo, the authors use indirect but important methodology. The authors first identify two lytic transglycosylase (Ltg) enzymes necessary for spore morphogenesis using mutant phenotypic studies. They characterize these mutants for their role in coordinating spore morphogenesis induced either in fruiting bodies (starvation-dependent) or in liquid-rich media conditions (chemical-dependent). They conclude from these phenotypic and epistatic analyses that LtgA is necessary for morphogenesis during chemical-induced sporulation, and LtgB appears to be necessary to coordinate LtgA activity by interfering with LtgA function. Under starvation-induced sporulation, the absence of LtgB interferes with the building of fruiting bodies. LtgA does not appear to play a primary role in promoting aggregation into fruiting bodies, nor in degradation of peptidoglycan as assayed by loss of signal in anti-PG immunofluorescence. The authors demonstrate that the purified periplasmic domain of LtgA is highly active in degrading purified PG sacculi in vitro, while that of LtgB is highly reduced (relative to LtgA or lysozyme). The authors use photoactivated mCherry Lyt fusions and PALM to track the fusion protein mobility, which they state correlates with activity as immobilization results from PG binding. They demonstrate that in vegetative cells, a greater proportion of LtgA-PAmCh is more immobile (more active) than LtgB-PAmCh, but that directly after chemical-induction of sporulation, LtgB-PAmCh becomes more immobile (active). These analyses in the partner mutant backgrounds suggest that LtgA-PAmCh is more immobile (less active) in the absence of LtgB, but the reverse is not observed. Finally, the authors demonstrate that overexpression of LtgA in vegetative conditions leads to cell rounding, likely because of uncontrolled PG degradation, while overexpression of LtgB displays no phenotype.

      Strengths:

      This paper capitalizes on a novel spore morphogenesis mechanism to define proteins and mechanisms involved in peptidoglycan reorganization. The authors use the powerful PALM microscopy technique to assess Ltg activity in vivo by assaying for immobility as a proxy for PG binding. The authors elucidate a novel mechanism by which two Ltg's function together- with one (LtgB) seeming to regulate the activity of the other (the primary Ltg).

      Despite some weaknesses, there is no question that this study provides important insight into mechanisms of peptidoglycan remodeling- a difficult but highly impactful area of study with implications for the development of novel therapeutics and the discovery of mechanisms of fundamental bacterial physiology.

      Weaknesses:

      In many places, the authors do not adequately justify interpretations of their assays, leading to some apparently unjustified conclusions. Many of these are minor and may just require citations to demonstrate that the interpretations are justified by previous studies (detailed in recommendations below), but two bigger concerns are as follows:

      (1) It is not clear how the muropeptides listed in Figure 1 were assigned, and it is missing in the methods. In the sporulating conditions, the spectra look like combinations of multiple peaks, and the data, as stated, is not convincing to the non-specialist eye.

      (2) The observation that the lytB mutant prevents appropriate aggregation into fruiting bodies does not allow the interpretation that the absence of LytB prevents PG morphogenesis in the starvation-induced sporulation pathway, per se. It is more likely that in the lytB mutant, the morphogenesis program is not even triggered. This is because signaling proteins and regulators (specifically, C-signal accumulation/activated FruA), which are dependent on increased cell-cell signaling in the fruiting body, do not accumulate appropriately in shallow aggregates. C-signal/FruA are necessary to trigger the sporulation program in FBs. BTW: A hypothesis to explain the indirect effect of ltgB absence on aggregation could be that UDP-precursors are not regulated appropriately (unregulated LtyA??), so polysaccharides necessary for motility are not properly produced.

      Along these lines, fruiting body formation does not equal sporulation, and even "darkened" fruiting bodies can be misleading, as some mutants form polysaccharide-rich fruiting bodies (that appear dark under certain light conditions in the stereomicroscope) but do not sporulate efficiently. The wording in the text suggests that the authors assume that sporulation levels are normal because fruiting bodies are produced (see specific comments for details).

      (3) The authors repeatedly state that production of spore coat polysaccharides likely affects the PG IP staining (see below), but this is not well justified. A citation is needed if this has already been directly shown, or the language needs to be softened.

      (4) Better justification for the immobility of Lyt proteins in vivo as an assay for activity may be required. If this is well known in the field, it should be explicitly stated. The authors address this better in the discussion - but still state it is a correlation.

    2. Reviewer #2 (Public review):

      Summary:

      The authors' initial goal was to demonstrate loss of PG during the slow sporulation process of Myxococcus xanthus, with examination of the PG degradation products in order to implicate possible enzymes involved. Upon finding a predominance of LGT products, they examined sporulation in strains lacking each of the 14 candidate LGTs encoded in the genome, leading to the identification of two sporulation-linked LGTs. An extensive characterization of the roles played by these LGTs. One LGT is responsible for the slow sporulation PG degradation, while another is required for the rapid sporulation process. Interestingly, the "slow" LGT seems to provide an important regulatory brake on the rapid enzyme. Single-molecule fluorescent tracking of these enzymes was used to develop a model for their interaction with PG that mimics their observed activity. The rate of PG synthesis activity was also shown to impact the rate of PG degradation, suggesting potential interplay between the synthetic and degradative enzymes.

      Strengths:

      The genetic analysis to identify sporulation-linked LGTs and their effects on growth, sporulation, and spore properties was well done and productive. The fluorescence microscopy to track LGT mobility, presumably tied to activity, produced a convincing argument about the mechanism of regulation of one LGT by another.

      Weaknesses:

      While the impact of LGTs on sporulation was clearly demonstrated, the PG analysis that resulted from the study of LGTs raised some important unanswered questions. The analyses suggest that the PG is degraded to quite small fragments, which would normally be lost during the purification of PG. How these small fragments were thus detected is unclear, and this suggests a more complex story concerning PG metabolism during sporulation. An anti-PG antibody is used to quantify PG in the spores, but it is not made clear what the specificity of this antibody is, and thus whether it would recognize the LGT-altered PG of the spore. The authors suggest a "new mechanism of sporulation" when they have actually simply identified an important factor (PG degradation by LGTs) within a complex "process of sporulation".

    1. Reviewer #1 (Public review):

      Summary:

      In this manuscript, Chen et al. use cryo-electron tomography and an in vitro reconstitution system to demonstrate that the autoinhibited form of LRRK2 can assemble into filaments that wrap around microtubules. These filaments are generally shorter and less ordered than the previously characterized active-LRRK2 filaments. The structure reveals a novel interface involving the N-terminal repeats, which were disordered in the earlier active filament structure. Additionally, the autoinhibited filaments exhibit distinct helical parameters compared to the active form.

      Strengths:

      This study presents the highest-resolution structure of LRRK2 filaments obtained via subtomogram averaging, marking a significant technical advance over the authors' previous work published in Cell. The data are well presented, with high-quality visualizations, and the findings provide meaningful insights into the structural dynamics of LRRK2.

      Weaknesses and Suggestions:

      The revised manuscript by Chen et al. has fully addressed all of my previous suggestions regarding the rearrangement of the main figures.

    2. Reviewer #2 (Public review):

      The authors of this paper have done much pioneering work to decipher and understand LRRK2 structure and function and uncover the mechanism by which LRRK2 binds to microtubules and to study the roles that this may play in biology. Their previous data demonstrated that LRRK2 in the active conformation (pathogenic mutation or Type I inhibitor complex) bound to microtubule filaments in an ordered helical arrangement. This they showed induced a "roadblock" in the microtubule impacting vesicular trafficking. The authors have postulated that this is a potentially serious flaw with Type 1 inhibitors and that companies should consider generating Type 2 inhibitors in which the LRRK2 is trapped in the inactive conformation. Indeed the authors have published much data that LRRK2 complexed to Type 2 inhibitors does not seem to associate with microtubules and cause roadblocks in parallel experiments to those undertaken with type 1 inhibitors published above.

      In the current study the authors have undertaken an in vitro reconstitution of microtubule bound filaments of LRRK2 in the inactive conformation, which surprisingly revealed that inactive LRRK2 can also interact with microtubules in its auto-inhibited state. The authors' data shows that while the same interphases are seen with both the active LRRK2 and inactive microtubule bound forms of LRRK2, they identified a new interphase that involves the WD40-ARM-ANK- domains that reportedly contributes to the ability of the inactive form of LRRK2 to bind to microtubule filaments. The structures of the inactive LRRK2 complexed to microtubules are of medium resolution and do not allow visualisation of side chains.

      This study is extremely well written and the figures incredibly clear and well presented. The finding that LRRK2 in the inactive autoinhibited form can associate with microtubules is an important observation that merits further investigation. This new observation makes an important contribution to the literature and builds upon the pioneering research that this team of researchers has contributed to the LRRK2 fields.

      Comments on revised version:

      The authors have adequately addressed my questions and those of the other Reviewers in my opinion.

    3. Reviewer #3 (Public review):

      Summary:

      The manuscript by Chen et al examines the structure of the inactive LRRK2 bound to microtubules using cryo-EM tomography. Mutations in this protein have been shown to be linked to Parkinson's Disease. It is already shown that the active-like conformation of LRRK2 binds to the MT lattice, but this investigation shows that full-length LRRk2 can oligomerize on MTs in its autoinhibited state with different helical parameters than were observed with active-like state. The structural studies suggest that the autoinhibited state is less stable on MTs.

      Strengths:

      The protein of interest is very important biomedically and a novel conformational binding to microtubules in proposed

      The authors have addressed my original critique.

    1. Reviewer #1 (Public review):

      This study examined the effect of blood pressure variability on brain microvascular function and cognitive performance. By implementing a model of blood pressure variability using intermittent infusion of AngII for 25 days, the authors examined different cardiovascular variables, cerebral blood flow and cognitive function during midlife (12-15-month-old mice). Key findings from this study demonstrate that blood pressure variability impairs baroreceptor reflex and impairs myogenic tone in brain arterioles, particularly at higher blood pressure. They also provide evidence that blood pressure variability blunts functional hyperemia and impairs cognitive function and activity. Simultaneous monitoring of cardiovascular parameters, in vivo imaging recordings, and the combination of physiological and behavioral studies reflect rigor in addressing the hypothesis. The experiments are well designed, and data generated are clear.

      A number of issues raised earlier were addressed by the authors in the revised manuscript. The responses are convincing. These included circadian rhythm considerations, baroreflex findings, BP fluctuations driven by animal movement, and data presentation.

      Overall, this is a solid study with huge physiological implications. I believe that it will be of great benefit to the field.

    1. Reviewer #1 (Public review):

      Summary:

      Meijer et al. sought to investigate the role of cortical layer 6b (L6b) neurons in modulating sleep-wake states and cortical oscillations under baseline and sleep deprived conditions and in response to orexin A and B. Using chronic EEG recordings in mice with silencing of Drd1a+ neurons (via constitutive Cre-dependent knockout of SNAP25), the authors report that while overall baseline sleep-wake architecture and response to sleep deprivation minimal/unchanged, "L6b silencing" leads to a slowing of theta activity during wakefulness and REM sleep, and a reduction in EEG power during NREM sleep. Additionally, orexin B-induced increases in theta activity were attenuated in L6b silenced mice, which the authors state suggests a modulatory role for L6b in orexin-mediated arousal regulation. The manuscript is generally well written with clarity and transparency. However, a major concern is the lack of specificity in the genetic manipulation, which targets Drd1a+ neurons not exclusive to L6b, undermining the attribution of observed effects solely to L6b. Verification of neuronal silencing is also unclear, and statistical inconsistencies between the main text and figures/tables make it difficult to effectively evaluate the text and stated outcomes.

      Strengths:

      (1) The text is well written.

      (2) The authors are transparent about methodological details.

      (3) The stated sleep, circadian, and orexin infusion experiments appear to be well designed, executed, and analyzed (with the exceptions of some statistical analyses detailed below).

      Weaknesses:

      (1) All outcomes are attributed specifically to L6b neurons, but the genetic manipulation is not specific to L6b neurons. The authors acknowledge this as a limitation, but in my view, this global manipulation is more than a limitation - it affects the overall interpretations of the data. The Hoerder-Suabedissen et al., 2018 paper shows sparse, but also dense, expression of Drd1a+ neurons in brain regions outside of the L6b. Given this issue, the results are largely overstated throughout the paper.

      (2) It is not clear to me that the "silencing" of Drd1a+ neurons was verified.

      (3) There were various discrepancies (and potentially misattributions) between the stated significant differences in Supplementary Table T1 data and Figure 3a & S2 spectral plots. This issue makes it difficult to effectively evaluate the main text and stated outcomes.

      Related, the authors stated that post hoc comparisons of EEG spectral frequency bins were not corrected for multiple testing. Instead, significance was only denoted if changes in at least two consecutive frequency bins were significant. However, there are multiple plots in which a single significance marker is placed over an isolated bin (i.e., 4c, 6, S5, S6). Unless each marker is equivalent to 2 consecutive frequency bins, these markers should be removed from the plots. Otherwise, please define the frequency and size of these markers in the main text.

      (4) A rainbow color scale, as in Figure 3, we've now learned, can be misleading and difficult to interpret. The viridis color scale or a different diverging color scale are good alternatives.

      (5) How much time elapsed between vehicle/orexin A & B infusions?

      (6) For Figure 6, there are statistical discrepancies between the main text and the plots (pg. 10):

      a) The text claims post hoc differences for relative ORXA frontal EEG, but there are no significance markers on the plot.<br /> b) The text states that there were no post hoc differences for the relative ORXA occipital EEG, but significance markers are on the plot.<br /> c) The main test for the relative ORXB frontal EEG was not significant, but there are post hoc significance markers on the plot.<br /> d) For relative ORXB occipital EEG, there are significant markers on the plot outside of the stated range in the text.

      (7) Some important details are only available in figure captions, making it difficult to understand the main text. For example, when describing Figure 3c in the main text on page 7, it is not clear what type of transitions are being discussed without reading the figure caption. Likewise, a "decrease," "shift," and "change" are mentioned, but relative to what? Similar comment for the EEG theta activity description on pages 7 - 8. Please add relevant details to the main text.

      (8) Statistical comparisons for data in Figure 3e, post hoc analyses for data in Figure S7a-b REM data, and post hoc analyses for Figure S7c (not b) occipital EEG should be included to support differences claims. Please denote these differences on the respective plots.

      (9) In the subsection titled "Layer 6b mediates effects of orexin on vigilance states (pg. 8)," there does not seem to be any stated differences between control and L6b silenced mice. A more accurate subtitle is needed.

    2. Reviewer #2 (Public review):

      Summary:

      In this manuscript, Meijer and colleagues investigated the effects of inactivation (conditional silencing) of cortical layer 6b neurons on sleep-wake states and EEG spectral power under the following three conditions: during natural sleep-wake states, after sleep deprivation, or after intracerebroventricular administration of orexin A and B. The authors report that silencing of L6b neurons did not have a significant effect on the total time spent in sleep-wake states, duration, or number of state epochs, or the response to sleep deprivation. However, silencing of L6b neurons did slow down theta-frequency (6-9 Hz) during wake and REM sleep, and reduced the total EEG power during NREM sleep. Infusion of orexin A in the mice in which cortical layer 6b neurons were inactivated produced an increase in wakefulness. A similar effect was observed after infusion of orexin A in the mice in which these neurons were not silenced, but the effect (i.e., increase in wakefulness) was of a smaller magnitude. Silencing of cortical layer 6b neurons attenuated the effect of orexin B in increasing theta activity, as was observed in the control mice. The authors conclude that the cortical neurons in layer 6b play an essential role in state-dependent dynamics of brain activity, vigilance state control, and sleep regulation.

      Strengths:

      (1) A focus on cortical layer 6b neurons, which are an understudied neuronal population, especially in the context of brain and behavioral state transitions.

      (2) The authors used a well-established mouse model to study the effect of inactivation of cortical layer 6b neurons.

      Weaknesses:

      (1) Although the authors used a highly selective approach to silence layer 6b neurons, the observed changes in EEG oscillations cannot be solely attributed to layer 6b neurons because of the ICV route for orexin administration.

      (2) The rationale for using only male rats is not provided.

    1. Joint Public Review:

      Summary:

      There has been extensive electrophysiological research investigating the relationship between local field potential patterns and individual cell spike patterns in the hippocampus. In this study, the authors used innovative imaging techniques to examine spike synchrony of hippocampal cells during locomotion and immobility states. The authors report that hippocampal place cells exhibit prominent synchronous spikes that co-occur with theta oscillations during exploration of novel environments.

      Strengths:

      The single cell voltage imaging used in this study is a highly novel method that may allow recordings that were not previously possible using traditional methods.

      Weaknesses:

      Local field potential recordings were obtained from the contralateral hemisphere for technical reasons, which limits some of the study's claims.

    1. Reviewer #1 (Public review):

      Summary:

      The work from this paper successfully mapped transcriptional landscape and identified EA-responsive cell types (endothelial, microglia). Data suggest EA modulates BBB via immune pathways and cell communication. However, claims of "BBB opening" are not directly proven (no permeability data).

      Strengths:

      First scRNA-seq atlas of EA effects on BBB, revealing 23 cell clusters and 8 cell types. High cell throughput (98,338 cells), doublet removal, and robust clustering (Seurat, SingleR). Comprehensive bioinformatics (GO/KEGG, CellPhoneDB for ligand-receptor interactions). Raw data were deposited in GEO (GSE272895) and can be accessed.

      Weaknesses:

      (1) No in vivo/in vitro assays confirm BBB permeability changes (e.g., Evans blue leakage, TEER).

      (2) Only male rats were used, ignoring sex-specific BBB differences.

      (3) Pericytes and neurons, critical for the BBB, were not captured, likely due to dissociation artifacts.

      (4) Protein-level validation (Western blot, IHC) absent for key genes (e.g., LY6E, HSP90).

      (5) Fixed stimulation protocol (2/100 Hz, 40 min); no dose-response or temporal analysis.

    2. Reviewer #2 (Public review):

      Summary:

      This study uses single-cell RNA sequencing to explore how electroacupuncture (EA) stimulation alters the brain's cellular and molecular landscape after blood-brain barrier (BBB) opening. The authors aim to identify changes in gene expression and signaling pathways across brain cell types in response to EA stimulation using single-cell RNA sequencing. This direction holds promise for understanding the consequences of noninvasive methods of BBB opening for therapeutic drug delivery across the BBB.

      Strengths:

      (1) The study addresses an emerging and potentially important application of noninvasive stimulation methods to manipulate BBB permeability.

      (2) The dataset provides broad transcriptional profiling across multiple brain cell types using single-cell resolution, which could serve as a valuable community resource.

      (3) Analyses of receptor-ligand signaling and cell-cell communication are included and have the potential to offer mechanistic insight into BBB regulation.

      Weaknesses:

      (1) The work falls short in its current form. The experimental design lacks a clear justification, and readers are not provided with sufficient background information on the extent, timing, or regional specificity of BBB opening in this EA model. These details, established in prior work, are critical to understanding the rationale behind the current transcriptomic analyses.

      (2) Further, the results are often presented with minimal context or interpretation. There is no model of intercellular or molecular coordination to explain the BBB-opening process, despite the stated goal of identifying such mechanisms. The statement that EA induces a "unique frontal cortex-specific transcriptome signature" is not supported, as no data from other brain regions are presented. Biological interpretation is at times unclear or inaccurate - for instance, attributing astrocyte migration effects to endothelial cell clusters or suggesting microglial tight junction changes without connecting them meaningfully to endothelial function.

      (3) The study does include analyses of receptor-ligand signaling and cell-cell communication, which could be among its most biologically rich outputs. However, these are relegated to supplementary material and not shown in the leading figures. This choice limits the utility of the manuscript as a hypothesis-generating resource.

      (4) Overall, while the dataset may be of interest to BBB researchers and those developing technologies for drug delivery across the BBB, the manuscript in its current form does not yet fulfill its interpretive goals. A more integrated and biologically grounded analysis would be beneficial.

    1. Reviewer #1 (Public review):

      Summary:

      This work investigates the neural basis of continual motor learning, specifically how brains might accommodate new motor memories without interfering with previously learned behaviours. Mainly drawing inspiration from recent experimental studies in monkeys (Losey et al. and Sun, O'Shea et al.), the authors use recurrent neural networks (RNNs) to model sequential learning and examine the emergence and properties of two proposed neural signatures of motor memory: the "uniform shift" observed in preparatory activity and the "memory trace" observed in execution activity.

      Strengths:

      The work's main contribution is demonstrating that both uniform shifts and memory traces emerge in RNN models trained on a sequential BCI task, without requiring explicit additional mechanisms. The work explores the relationship between these signatures and behavioural savings, finding that the memory trace correlates with immediate retention savings in networks without context, while the uniform shift does not. The study also investigates how properties of the new task perturbation (within- vs. outside-manifold) and the presence of explicit context cues affect these signatures and their relationship to savings, generally finding that context signals and outside-manifold perturbations reduce savings by decreasing the inherent overlap in the neural strategies used to solve the task.

      Weaknesses:

      A primary weakness is the lack of clear definitions of the uniform shift and the memory trace, which are quite different metrics. Another primary weakness is that the task modelled is well-matched to the Losey et al. BCI paradigm, but not well-matched to the Sun, O'Shea et al.'s curl field paradigm, which is likely impacting some of the results, primarily the lack of a relationship between the uniform shift and motor memories. While there are improvements that could be made in this work, we think it is a demonstration that modeling learning in neural activity using neural network models continues to be a valuable tool, moving the field forward.