10,000 Matching Annotations
  1. Dec 2023
    1. Reviewer #1 (Public Review):

      Summary:<br /> The presented study focuses on the role of formin-like 2 (FMNL2) in oocyte meiosis. The authors assessed FMNL2 expression and localization in different meiotic stages and subsequently, by using siRNA, investigated the role of FMNL2 in spindle migration, polar body extrusion, and distribution of mitochondria and endoplasmic reticulum (ER) in mouse oocytes.

      Strengths:<br /> Novelty in assessing the role of formin-like 2 in oocyte meiosis.

      Weaknesses:<br /> Methods are not properly described.<br /> Overstating presented data.<br /> It is not clear what statistical tests were used.

      My main concern is that there are missing important details of how particular experiments and analyses were done. The material and methods section is not written in the way that presented experiments could be repeated - it is missing basic information (e.g., used mouse strain, timepoints of oocytes harvest for particular experiments, used culture media, image acquisition parameters, etc.). Some of the presented data are overstated and incorrectly interpreted. It is not clear to me how the analysis of ER and mitochondria distribution was done, which is an important part of the presented data interpretation. I'm also missing important information about the timing of particular stages of assessed oocytes because the localization of both ER and mitochondria differs at different stages of oocyte meiosis. The data interpretation needs to be justified by proper analysis based on valid parameters, as there is considerable variability in the ER and mitochondria structure and localization across oocytes based on their overall quality and stage.

    2. Reviewer #2 (Public Review):

      Summary:<br /> This research involves conducting experiments to determine the role of Fmnl2 during oocyte meiosis I.

      Strengths:<br /> Identifying the role of Fmnl2 during oocyte meiosis I is significant.

      Weaknesses:<br /> The quantitative analysis and the used approach to perturb FMNL2 function are currently incomplete and would benefit from more confirmatory approaches and rigorous analysis.

      1- Most of the results are expected. The new finding here is that FMNL2 regulates cytoplasmic F-actin in mouse oocytes, which is also expected given the role of FMNL2 in other cell types. Given that FMNL2 regulates cytoplasmic F-actin, it is very expected to see all the observed phenotypes. It is already established that F-actin is required for spindle migration to the oocyte cortex, extruding a small polar body and normal organelle distribution and functions.

      2-The authors used Fmnl2 cRNA to rescue the effect of siRNA-mediated knockdown of Fmnl2. It is not clear how this works. It is expected that the siRNA will also target the exogenous cRNA construct (which should have the same sequence as endogenous Fmnl2) especially when both of them were injected at the same time. Is this construct mutated to be resistant to the siRNA?

      3-The authors used only one approach to knockdown FMNL2 which is by siRNA. Using an additional approach to inhibit FMNL2 would be beneficial to confirm that the effect of siRNA-mediated knockdown of FMNL2 is specific.

    3. Reviewer #3 (Public Review):

      Summary:<br /> The authors focus on the role of formin-like protein 2 in the mouse oocyte, which could play an important role in actin filament dynamics. The cytoskeleton is known to influence a number of cellular processes from transcription to cytokinesis. The results show that downregulation of FMNL2 affects spindle migration with resulting abnormalities in cytokinesis in oocyte meiosis I.

      Weaknesses:<br /> The overall description of methods and figures is overall dismissively poor. The description of the sample types and number of replicate experiments is impossible to interpret throughout, and the quantitative analysis methods are not adequately described. The number of data points presented is unconvincing and unlikely to support the conclusions. On the basis of the data presented, the conclusions appear to be preliminary, overstated, and therefore unconvincing.

    1. Reviewer #1 (Public Review):

      Summary:<br /> Fukuhara, Maenaka, and colleagues report a crystal structure of the canine distemper virus (CDV) attachment hemagglutinin protein globular domain. The structure shows a dimeric organization of the viral protein and describes the detailed amino-acid side chain interactions between the two protomers. The authors also use their best judgement to comment on predicted sites for the two cellular receptors - Nectin-4 and SLAM - and thus speculate on the CDV host tropism. A complementary AFM study suggests a breathing movement at the hemagglutinin dimer interface.

      Strengths:<br /> The study of CDV and related Paramyxoviruses is significant for human/animal health and is very timely. The crystallographic data seem to be of good quality.

      Weaknesses:<br /> While the recent CDV hemagglutinin cryo-EM structure is mentioned, it is not compared to the present crystal structure, and thus the context of the present study is poorly justified. Additionally, the results of the AFM experiment are not unexpected. Indeed, other paramyxoviral RBP/G proteins also show movement at the protomer interface.

    2. Reviewer #2 (Public Review):

      Summary:<br /> The authors solved the crystal structure of CDV H-protein head domain at 3,2 A resolution to better understand the detailed mechanism of membrane fusion triggering. The structure clearly showed that the orientation of the H monomers in the homodimer was similar to that of measles virus H and different from other paramyxoviruses. The authors used the available co-crystal strictures of the closely related measles virus H structures with the SLAM and Nectin4 receptors to map the receptor binding site on CDV H. The authors also confirmed which N-linked sites were glycosylated in the CDV H protein and showed that both wildtype and vaccine strains of CDV H have the same glycosylation pattern. The authors documented that the glycans cover a vast majority of the H surface while leaving the receptor binding site exposed, which may in part explain the long-term success of measles virus and CDV vaccines. Finally, the authors used HS-AFM to visualize the real-time dynamic characteristics of CDV-H under physiological conditions. This analysis indicated that homodimers may dissociate into monomers, which has implications for the model of fusion triggering.

      The structural data and analysis were thorough and well-presented. However, the HS-AFM data, while very exciting, was not presented in a manner that could be easily grasped by readers of this manuscript. I have some suggestions for improvement.

      1) The authors claim their structure is very similar to the recently published croy-EM structure of CDV H. Can the authors provide us with a quantitative assessment of this statement?

      2) The results for the HS-AFM are difficult to follow and it is not clear how the authors came to their conclusions. Can the authors better explain this data and justify their conclusions based on it?

      3) The fusion triggering model in Figure 8 is ambiguous as to when H-F interactions are occurring and when they may be disrupted. The authors should clarify this point in their model.

    1. Reviewer #1 (Public Review):

      In the manuscript, the authors tried to explore the molecular alterations of adipose tissue and skeletal muscle in PCOS by global proteomic and phosphorylation site analysis. In the study, the samples are valuable, while there are no repeats for MS and there are no functional studies for the indicted proteins, phosphorylation sites. The authors achieved their aims to some extent, but not enough.

    2. Reviewer #2 (Public Review):

      This study provides the proteomic and phosphoproteomics data for our understanding of the molecular alterations in adipose tissue and skeletal muscle from women with PCOS. This work is useful for understanding of the characteristics of PCOS, as it may provide potential targets and strategies for the future treatment of PCOS. While the manuscript presents interesting findings on omics and phenotypic research, the lack of in-depth mechanistic exploration limits its potential impact.

      The study primarily presents findings from omics and phenotypic research, but fails to provide a thorough investigation into the underlying mechanisms driving the observed results. Without a thorough elucidation of the mechanistic underpinnings, the significance and novelty of the study are compromised.

    1. Reviewer #1 (Public Review):

      In this revised preprint the authors investigate whether a presumably allosteric P2RX7 activating compound that they previously discovered reduces fibrosis in a bleomycin mouse model. They chose this particular model as publicly available mRNA data indicate that the P2RX7 pathway is downregulated in idiopathic pulmonary fibrosis patients compared to control individuals. In their revised manuscript, the authors use three proxies of lung damage, Ashcroft score, collagen fibers, and CD140a+ cells, to assess lung damage following the administration of bleomycin. These metrics are significantly reduced on HEI3090 treatment. Additional data implicate specific immune cell infiltrates and cytokines, namely inflammatory macrophages and damped release of IL-17A, as potential mechanistic links between their compound and reduced fibrosis. Finally, the researchers transplant splenocytes from WT, NLRP3-KO, and IL-18-KO mice into animals lacking the P2RX7 receptor to specifically ascertain how the transplanted splenocytes, which are WT for P2RX7 receptor, respond to HEI3090 (a P2RX7 agonist). Based on these results, the authors conclude that HEI3090 enhanced IL-18 production through the P2RX7-NLRP3 inflammasome axis to dampen fibrosis.

      These findings could be interesting to the field, as there are conflicting results as to whether NLRP3 activation contributes to fibrosis and if so, at what stage(s) (e.g., acute damage phase versus progression). The revised manuscript is more convincing in that three orthogonal metrics for lung damage were quantified. However, major weaknesses of the study still include inconsistent and small effect sizes of HEI3090 treatment versus either batch effects from transplanted splenocytes or the effects of different genetic backgrounds. Moreover, the fundamental assumption that HEI3090 acts specifically and functionally through the P2RX7 pathway in this model cannot be directly tested, as the authors now provide results indicating that P2RX7 knockout mice do not establish lung fibrosis on bleomycin treatment.

      In order to provide clear evidence that HEI3090 functions through P2RX7, a different lung fibrosis model that does not require P2RX7 would be necessary. For example, in such a system the authors could demonstrate a lack of HEI3090-mediated therapeutic effect on P2RX7 knockout. Molecularly, additional evidence on specificity, such as thermal proteome profiling and direct biophysical binding experiments, would also enhance the authors' argument that the compound indeed binds P2RX7 directly and specifically. Since all small molecules have some degree of promiscuity, the absence of an additional P2RX7 modulator, or direct recombinant IL-18 administration (as suggested by another reviewer), is needed to orthogonally validate the functional importance of this pathway. Another way the authors could probe pathway specificity would involve co-administering α-IL-18 with HEI3090 in several key experiments (similar to Figure 4L).

    2. Reviewer #2 (Public Review):

      In the study by Hreich et al, the potency of P2RX7-specific positive modulator HEI3090, developed by the authors, for the treatment of Idiopathic pulmonary fibrosis (IPF) was investigated. Recently, the authors have shown that HEI3090 can protect against lung cancer by stimulating dendritic cell P2RX7, resulting in IL-18 production that stimulates IFN-γ production by T and NK cells (DOI: 10.1038/s41467-021-20912-2). Interestingly, HEI3090 increases IL-18 levels only in the presence of high eATP. Since the treatment options for IPF are limited, new therapeutic strategies and targets are needed. The authors first show that P2RX7/IL-18/IFNG axis is downregulated in patients with IPF. Next, they used a bleomycin-induced lung fibrosis mouse model to show that the use of a positive modulator of P2RX7 leads to the activation of the P2RX7/IL-18 axis in immune cells that limits lung fibrosis onset or progression. Mechanistically, treatment with HEI3090 enhanced IL-18-dependent IFN-γ production by lung T cells leading to a decreased production of IL-17 and TGFβ, major drivers of IPF. The major novelty is the use of the small molecule HEI3090 to stimulate the immune system to limit lung fibrosis progression by targeting the P2RX7, which could be potentially combined with current therapies available. Overall, the study was well performed and the manuscript is clear. However, there is need for more details on the description and interpretation of the adoptive transfer experiments, as well as the statistical analyses and number of replicate independent experiments.

    1. Reviewer #1 (Public Review):

      Summary:

      In this manuscript, the authors observed that miR-199b-5p is elevated in osteoarthritis (OA) patients. They also found that overexpression of miR-199b-5p induced OA-like pathological changes in normal mice and inhibiting miR-199b-5p alleviated symptoms in knee OA mice. They concluded that miR-199b-5p is not only a potential micro-target for knee OA but also provides a potential strategy for the future identification of new molecular drugs.

      Strengths:

      The data are generated from both human patients and animal models.

      Weaknesses:

      The data presented in this manuscript is not solid enough to support their conclusions. There are several questions that need to be addressed to improve the quality of this study.

      The following questions that need to be addressed to improve the quality of the study.

      1. Exosomes were characterized by electron microscopy and western blot analysis (for CD9, 264 CD63, and CD81). However, figure S1 only showed two sample WB results and there is no positive and negative control as well as the confused not clear WB figure.

      2. The sequencing of miRNAs in serum exosomes showed that 88 miRNAs were upregulated and 89 miRNAs were downregulated in KOA patients compared with the control group based on fold change > 1.5 and p < 0.05. Figure 2 legend did not clearly elucidate what those represent and why the authors chose those five miRNAs to further validate although they did mention it with several words in line 108 'based on the p-value and exosomal'.

      3. In Figure 3 legend and methods, the authors did not mention how they performed the cell viability assay. What cell had been used? How long were they treated and all the details? Other figure legends have the same problem without detailed information.

      4. The authors claimed that Gcnt2 and Fzd6 are two target genes of miR-199b-5p. However, there is no convincing evidence such as western blot to support their bioinformatics prediction.

      5. To verify the binding site on 3'UTR of two potential targets, the authors designed a mouse sequence for luciferase assay, but not sure if it is the same when using a human sequence.

    2. Reviewer #2 (Public Review):

      Summary:

      The authors identified miR-199b-5p as a potential OA target gene using serum exosomal small RNA-seq from human healthy and OA patients. Their RNA-seq results were further compared with publicly available datasets to validate their finding of miR-199b-5p. In vitro chondrocyte culture with miR-199b-5p mimic/inhibitor and in vivo animal models were used to evaluate the function of miR-199b-5p in OA. The possible genes that were potentially regulated by miR-199b-5p were also predicted (i.e., Fzd6 and Gcnt2) and then validated by using Luciferase assays.

      Strengths:

      1. Strong in vivo animal models including pain tests.<br /> 2. Validates the binding of miR-199b-5p with Fzd6 and binding of miR-199b-5p with Gcnt2.

      Weaknesses:

      1. The authors may overinterpret their results. The current work shows the possible bindings between miR-199b-5p and Fzd6 as well as bindings between miR-199b-5p and Gcnt2. However, whether miR-199b-5p truly functions through Fzd6 and/or Gcnt2 requires genetic knockdown of Fzd6 and Gcnt2 in the presence of miR-199b-5p.<br /> 2. In vitro chondrocyte experiments were conducted in a 2D manner, which led to chondrocyte de-differentiation and thus may not represent the chondrocyte response to the treatments.<br /> 3. There is a lack of description for bioinformatic analysis.<br /> 4. There are several errors in figure labeling.

  2. Nov 2023
    1. Reviewer #2 (Public Review):

      Summary:

      In this study, Christin Krause et al mapped the hepatic miRNA-transcriptome of type 2 diabetic obese subjects, and identified miR-182-5p and its target genes LRP6 as potential drivers of dysregulated glucose tolerance and fatty acid metabolism in obese T2-diabetics.

      Strengths:

      This study contains some interesting findings and is valuable for the understanding of the key regulatory role of miRNAs in the pathogenesis of T2D.

      Weaknesses:

      The authors didn't systemically investigate the function of miR-182 in T2DM or NAFLD.

    2. Reviewer #1 (Public Review):

      Summary:

      This study demonstrated a novel exciting link between the conserved miRNA-target axis of miR-182-Lrp6 in liver metabolism which causatively contributes to type 2 diabetes and NAFLD in mice and, potentially, humans.

      Strengths:

      The direct interaction and inhibition of Lrp6 by miR-182 are convincingly shown. The effects of miR-182-5p on insulin sensitivity are also credible for the in vivo and in vitro gain-of-function experiments.

      Weaknesses:

      However, the DIO cohorts lack key assays for insulin sensitivity such as ITT or insulin-stimulated pAKT, as well as histological evidence to support their claims and strengthen the link between miR-182-5p and T2D or NAFLD. Besides, the lack of loss-of-function experiments limits its aptitude as a potential therapeutic target.

    3. Reviewer #3 (Public Review):

      Summary:

      In this manuscript, Krause and colleagues identify miR-182 as diabetes-associated microRNA: miR-182 is increased in bariatric surgery patients with versus without T2D; miR-182 was the only microRNA associated with three metabolic traits; miR-182 levels were associated with increased body weight in mice under different dietary manipulations; overexpression in Hep-G2 led to a decrease in LRP6; and overexpression in HFD fed mice led to increased insulin and liver TG. The manuscript provides a potentially useful resource for microRNA expression in human livers, though the functional importance of miR-182 remains unclear.

      Strengths:

      The use of human tissues and good sample sizes is strong.

      Weaknesses:

      The study is primarily correlative; the in vivo overexpression is non-physiological; and the mechanisms by which miR-182 exerts its effects are not rigorously tested.

    1. Reviewer #2 (Public Review):

      Summary:<br /> In this paper, Chamness and colleagues make a pioneering effort to map epistatic interactions among mutations in a membrane protein. They introduce thousands of mutations to the mouse GnRH Receptor (GnRHR), either under wild-type background or two mutant backgrounds, representing mutations that destabilize GnRHR by distinct mechanisms. The first mutant background is W107A, destabilizing the tertiary fold, and the second, V276T, perturbing the efficiency of cotranslational insertion of TM6 to the membrane, which is essential for proper folding. They then measure the surface expression of these three mutant libraries, using it as a proxy for protein stability, since misfolded proteins do not typically make it to the plasma membrane. The resulting dataset is then used to shed light on how diverse mutations interact epistatically with the two genetic background mutations. Their main conclusion is that epistatic interactions vary depending on the degree of destabilization and the mechanism through which they perturb the protein. The mutation V276T forms primarily negative (aggravating) epistatic interactions with many mutations, as is common to destabilizing mutations in soluble proteins. Surprisingly, W107A forms many positive (alleviating) epistatic interactions with other mutations. They further show that the locations of secondary mutations correlate with the types of epistatic interactions they form with the above two mutants.

      Strengths:<br /> Such a high throughput study for epistasis in membrane proteins is pioneering, and the results are indeed illuminating. Examples of interesting findings are that: (1) No single mutation can dramatically rescue the destabilization introduced by W107A. (2) Epistasis with a secondary mutation is strongly influenced by the degree of destabilization introduced by the primary mutation. (3) Misfolding caused by mis-insertion tends to be aggravated by further mutations. The discussion of how protein folding energetics affects epistasis (Fig. 7) makes a lot of sense and lays out an interesting biophysical framework for the findings.

      Weaknesses:<br /> The major weakness comes from the potential limitations in the measurements of surface expression of severely misfolded mutants. This point is discussed quite fairly in the paper, in statements like "the W107A variant already exhibits marginal surface immunostaining" and many others. It seems that only about 5% of the W107A makes it to the plasma membrane compared to wild-type (Figures 2 and 3). This might be a low starting point from which to accurately measure the effects of secondary mutations.

      Still, the authors claim that measurements of W107A double mutants "still contain cellular subpopulations with surface immunostaining intensities that are well above or below that of the W107A single mutant, which suggests that this fluorescence signal is sensitive enough to detect subtle differences in the PME of these variants". I was not entirely convinced that this was true. Firstly, I think it would be important to test how much noise these measurements have and how much surface immunostaining the W107A mutant displays above the background of cells that do not express the protein at all. But more importantly, it is not clear if under this regimen surface expression still reports on stability/protein fitness. It is unknown if the W107A retains any function or folding at all. For example, it is possible that the low amount of surface protein represents misfolded receptors that escaped the ER quality control. The differential clustering of epistatic mutations (Fig. 6) provides some interesting insights as to the rules that dictate epistasis, but these too are dominated by the magnitude of destabilization caused by one of the mutations. In this case, the secondary mutations that had the most interesting epistasis were exceedingly destabilizing. With this in mind, it is hard to interpret the results that emerge regarding the epistatic interactions of W107A. Furthermore, the most significant positive epistasis is observed when W107A is combined with additional mutations that almost completely abolish surface expression. It is likely that either mutation destabilizes the protein beyond repair. Therefore, what we can learn from the fact that such mutations have positive epistasis is not clear to me. Based on this, I am not sure that another mutation that disrupts the tertiary folding more mildly would not yield different results.

      With that said, I believe that the results regarding the epistasis of V276T with other mutations are strong and very interesting on their own.

      Additionally, the study draws general conclusions from the characterization of only two mutations, W107A and V276T. At this point, it is hard to know if other mutations that perturb insertion or tertiary folding would behave similarly. This should be emphasized in the text.

      Some statistical aspects of the study could be improved:

      1. It would be nice to see the level of reproducibility of the biological replicates in a plot, such as scatter or similar, with correlation values that give a sense of the noise level of the measurements. This should be done before filtering out the inconsistent data.

      2. The statements "Variants bearing mutations within the C- terminal region (ICL3-TMD6-ECL3-TMD7) fare consistently worse in the V276T background relative to WT (Fig. 4 B & E)." and "In contrast, mutations that are 210 better tolerated in the context of W107A mGnRHR are located 211 throughout the structure but are particularly abundant among residues 212 in the middle of the primary structure that form TMD4, ICL2, and ECL2 213 (Fig. 4 C & F)." are both hard to judge. Inspecting Figures 4B and C does not immediately show these trends, and importantly, a solid statistical test is missing here. In Figures 4E and F the locations of the different loops and TMs are not indicated on the structure, making these statements hard to judge.

      3. The following statement lacks a statistical test: "Notably, these 98 variants are enriched with TMD variants (65% TMD) relative to the overall set of 251 variants (45% TMD)." Is this enrichment significant? Further in the same paragraph, the claim that "In contrast to the sparse epistasis that is generally observed between mutations within soluble proteins, these findings suggest a relatively large proportion of random mutations form epistatic interactions in the context of unstable mGnRHR variants". Needs to be backed by relevant data and statistics, or at least a reference.

    2. Reviewer #1 (Public Review):

      Summary:<br /> The paper carries out an impressive and exhaustive non-sense mutagenesis using deep mutational scanning (DMS) of the gonadotropin-releasing hormone receptor for the WT protein and two single point mutations that I) influence TM insertion (V267T) and ii) influence protein stability (W107A), and then measures the effect of these mutants on correct plasma membrane expression (PME).

      Overall, most mutations decreased mGnRHR PME levels in all three backgrounds, indicating poor mutational tolerance under these conditions. The W107A variant wasn't really recoverable with low levels of plasma membrane localisation. For the V267T variant, most additional mutations were more deleterious than WT based on correct trafficking, indicating a synergistic effect. As one might expect, there was a higher degree of positive correlation between V267T/W107A mutants and other mutants located in TM regions, confirming that improper trafficking was a likely consequence of membrane protein co-translational folding. Nevertheless, context is important, as positive synergistic mutants in the V27T could be negative in the W107A background and vice versa. Taken together, this important study highlights the complexity of membrane protein folding in dissecting the mechanism-dependent impact of disease-causing mutations related to improper trafficking.

      Strengths:<br /> This is a novel and exhaustive approach to dissecting how receptor mutations under different mutational backgrounds related to co-translational folding, could influence membrane protein trafficking.

      Weaknesses:<br /> The premise for the study requires an in-depth understanding of how the single-point mutations analysed affect membrane protein folding, but the single-point mutants used seem to lack proper validation. Furthermore, plasma membrane expression has been used as a proxy for incorrect membrane protein folding, but this not necessarily be the case, as even correctly folded membrane proteins may not be trafficked correctly, at least, under heterologous expression conditions. In addition, mutations can affect trafficking and potential post-translational modifications, like glycosylation.

    1. Reviewer #1 (Public Review):

      The Eph receptor tyrosine kinase family plays a critical function in multiple physiological and pathophysiological processes. Hence, understating the regulation of these receptors is a highly important question. Through extensive experiments in cell lines and cultured neurons Chang et.al show that the signaling hub protein, MYCBP2 positively regulates the overall stability of a specific member of the family, EPHB2, and by that the cellular response to ephrinBs.<br /> Overall, this work sheds light on the divergent in the regulatory mechanisms of the Eph receptors family. Although the physiological importance of this new regularly mechanism in mammals awaits to be discovered, the authors provide genetic evidence using C.elegans that it is evolutionarily conserved.

    2. Reviewer #2 (Public Review):

      Members of the EphB family of tyrosine kinase receptors are involved in a multitude of diverse cellular functions, ranging from the control of axon growth to angiogenesis and synaptic plasticity. In order to provide these diverse functions, it is expected that these receptors interact in a cell-type specific manner with a diverse variety of downstream signalling molecules.

      The authors have used proteomics approaches to characterise some of these molecules in further detail. This molecule, myc-binding protein 2 (MYCBP2) is also known as highwire, has been identified in the context of establishment of neural connectivity. Another molecule coming up on this screen was identified as FBXO45.

      The authors use classical methods of co-IP to show a kinase-independent binding of MYCBP2 to EphB2. They further showed that FBXO45 within a ternary complex increased the stability of the EphB2/MYCBP2 complex.

      To define the interacting domains, they used clearly designed swapping experiments to show that the extracellular and transmembrane domains are necessary and sufficient for the formation of the ternary complex.

      Using a cellular contraction assay, the authors showed the necessity of MYCBP2 in mediating the cytoskeletal response of EphB2 forward signalling. Furthermore, they used the technically challenging stripe assay of alternating lanes of ephrinB-Fc and Fc to show that also in this migration-based essay MYCBP2 is required for EphB mediated differential migration pattern.

      MYCBP2 in addition is necessary to stabilize EphB2, that is in the absence of MYCBP2, EphB2 is degraded in the lysosomal pathway.

      Interestingly, the third protein in this complex, Fbxo45, was further characterized by overexpression of the domain of MYCBP2, known to interact with Fbxo45. Here the authors showed that this approach led to the disruption of the EphB2 / MYCBP2 complex, and also abolished the ephrinB mediated activation of EphB2 receptors and their differential outgrowth on ephrinB2-Fc / Fc stripes.

      Finally, the authors demonstrated an in vivo function of this complex using another model system, C elegans where they were able to show a genetic interaction.

      Data show in a nice set of experiments a novel level of EphB2 forward signalling where a ternary complex of this receptor with multifunctional MYCBP2 and Fbxo45 controls the activity of EphB2, allowing a further complex regulation of this important receptors. Additionally, the authors challenge pre-existing concepts of the function of MYCBP2 which might open up novel ways to think about this protein.<br /> Of interest is this work also in terms of development of the retinotectal projection in zebrafish where MYCBP2/highwire plays a crucial role, and thus might lead to a better understanding of patterning along the DV axis, for which it is known that EphB family members are crucial.

      Overall, the experiments are classical experiments of co-immunoprecipitations, swapping experiments, collapse assays, and stripe assays which all are well carried out and are convincing.

    3. Reviewer #3 (Public Review):

      In this improved version of the manuscript, Chang et al set out to find direct interactions with the Eph-B2 receptor, as our knowledge of its function/regulation is still incomplete. Using proteomic analysis of Hela cells expressing EPHB2, they identified MYCBP2 a potential binder, which they then confirm using extensive biochemical analyses, an interaction that seems to be negatively affected by binding of ephrin-B2 (but not B1). Furthermore, they find that FBXO45, a known MYCBP2 interaction, strongly facilitates its binding to EPHB2. Intriguingly, these interactions depend on the extracellular domains of EPHB2, suggesting the involvement of additional proteins as MYCBP2 is thought to be a cytoplasmic protein. Finally, they find that, in contrast to what could be expected given the known function of MYCBP2 as a ubiquitin E3 ligase, it actually positively regulates EPHB2 protein stability, and function.

      The strength of this manuscript is the extensive biochemical analysis of the EPHB2/MYCBP2/FBXO43 interactions. The vast majority of the conclusions are supported by the data.

      The attempt to extend the study to an in vivo animal using the worm is important, however the additive insight is, unfortunately, minimal.

    1. Reviewer #1 (Public Review):

      The study presented in this manuscript presents very convincing evidence that purifying selection is the main force shaping the landscape of TE polymorphisms in B. distachyon, with only a few putatively adaptive variants detected, even though most conclusions are based on the 10% of polymorphisms contributed by retrotransposons. That first conclusion is not novel, however, as it had already been clearly established in natural A. thaliana strains (Baduel et al. Genome Biol 2021) and in experimental D. simulans lines (Langmüller et al. NAR 2023), two studies that the authors do not mention, or improperly mention. In contrast to the conclusions reached in A. thaliana, however, Horvath et al. report here a seemingly deleterious effect of TE insertions even very far away from genes (>5kb), a striking observation for a genome of relatively similar size. If confirmed, as a caveat of this study is the lack of benchmarking of the TE polymorphisms calls by a pipeline known for a high rate of false positives (see detailed Private Recommendations #1), this set of observations would make an important addition to the knowledge of TE dynamics in the wild and questioning our understanding of the main molecular mechanisms through which TEs can impact fitness.

    2. Reviewer #2 (Public Review):

      Summary:<br /> Transposable elements are known to have a strong potential to generate diversity and impact gene regulation, and they are thought to play an important role in plant adaptation to changing environments. Nevertheless, very few studies have performed genome-wide analyses to understand the global effect of selection on TEs in natural populations. Horvath et al. used available whole-genome re-sequencing data from a representative panel of B. distachyon accessions to detect TE insertion polymorphisms (TIPs) and estimate their time of origin. Using a thorough combination of population genomics approaches, the authors demonstrate that only a small amount of the TE polymorphisms are targeted by positive selection or potentially involved in adaptation. By comparing the age-adjusted population frequencies of TE polymorphisms and neutral SNPs, the authors found that retrotransposons are affected by purifying selection independently of their distance to genes. Finally, using forward simulations they were able to quantify the strength of selection acting on TE polymorphisms, finding that retrotransposons are mainly under moderate purifying selection, with only a minority of the insertions evolving neutrally.

      Strengths:<br /> Horvath et al., use a convincing set of strategies, and their conclusions are well supported by the data. I think that incorporating polymorphism's age into the analysis of purifying selection is an interesting way to reduce the possible bias introduced by the fact that SNPs and TEs polymorphisms do not occur at the same pace. The fact that TE polymorphisms far from genes are also under purifying selection is an interesting result that reinforces the idea that the trans-regulatory effect of TE insertions might not be a rare phenomenon, a matter that may be demonstrated in future studies.

      Weaknesses:<br /> TEs from different classes and orders strongly differ in multiple features such as size, the potential impact of close genes upon insertion, insertion/elimination ratio (ie, MITE/TIR excision, solo-LTR formation), or insertion preference. Given such diversity, it is expected that their survival rates on the genome and the strength of selection acting on them could be different. The authors differentiate DNA transposons and retrotransposons in some of the analyses, the specificities of the most abundant plant TE types (ie, LTR/Gypsy, LTR/Copia, MITE DNA transposons) are not considered.

      The authors used a short-read-based approach to detect TIPs and TAPs. It is known that detecting TE polymorphisms is challenging and can lead to false negatives, depending on the method used and the sequencing coverage. The methodology used here (TEPID) has been previously applied to other species, but it is unclear if the sensitivity of the TIP/TAP caller is equivalent to that of the SNP caller and how these potential differences may affect the results.

    1. Reviewer #1 (Public Review):

      The regulation of motor autoinhibition and activation is essential for efficient intracellular transport. This manuscript used biochemical approaches to explore two members in the kinesin-3 family. They found that releasing UNC-104 autoinhibition triggered its dimerization whereas unlocking KLP-6 autoinhibition is insufficient to activate its processive movement, which suggests that KLP-6 requires additional factors for activation, highlighting the common and diverse mechanisms underlying motor activation. They also identified a coiled-coil domain crucial for the dimerization and processive movement of UNC-104. Overall, these biochemical and single-molecule assays were well performed, and their data support their statements. The manuscript is also clearly written, and these results will be valuable to the field.

    2. Reviewer #2 (Public Review):

      The Kinesin superfamily motors mediate the transport of a wide variety of cargos which are crucial for cells to develop into unique shapes and polarities. Kinesin-3 subfamily motors are among the most conserved and critical classes of kinesin motors which were shown to be self-inhibited in a monomeric state and dimerize to activate motility along microtubules. Recent studies have shown that different members of this family are uniquely activated by to undergo transition from monomers to dimers.

      Niwa and colleagues study two well-described members of the kinesin-3 superfamily, unc104 and KLP6, to uncover the mechanism of monomer to dimer transition upon activation. Their studies reveal that although both Unc104 and KLP6 are both self-inhibited monomers, their propensities for forming dimers are quite different. The authors relate this difference to a region in the molecules called CC2 which has a higher propensity for forming homodimers. Unc104 readily forms homodimers if its self-inhibited state is disabled while KLP6 does not.

      The work suggests that although mechanisms for self-inhibited monomeric states are similar, variations in the kinesin-3 dimerization may present a unique forms of kinesin-3 motor regulation with implications on the forms of motility functions carried out by these unique kinesin-3 motors.

    3. Reviewer #3 (Public Review):

      In this work, Kita et al., aim to understand the activation mechanisms of the kinesin-3 motors KLP-6 and UNC-104 from C. elegans. As many other motor proteins involved in intracellular transport processes, KLP-6 and UNC-104 motors suppress their ATPase activities in the absence of cargo molecules. Relieving the autoinhibition is thus a crucial step that initiates directional transport of intracellular cargo. To investigate the activation mechanisms, the authors make use of mass photometry to determine the oligomeric states of the full length KLP-6 and the truncated UNC-104(1-653) motors at sub-micromolar concentrations. While full length KLP-6 remains monomeric, the truncated UNC-104(1-653) displays a sub-population of dimeric motors that is much more pronounced at high concentrations, suggesting a monomer-to-dimer conversion. The authors push this equilibrium towards dimeric UNC-104(1-653) motors solely by introducing a point mutation into the coiled-coil domain and ultimately unleash a robust processivity of the UNC-104 dimer. The authors find that the same mechanistic concept does not apply to the KLP-6 kinesin-3 motor, suggesting an alternative activation mechanism of the KLP-6 that remains to be resolved. The present study encourages further dissection of the kinesin-3 motors with the goal of uncovering the main factors needed to overcome the 'self-inflicted' deactivation.

    1. Reviewer #1 (Public Review):

      The study examines how hemocytes control whole-body responses to oxidative stress. Using single cell sequencing they identify several transcriptionally distinct populations of hemocytes, including one subset that show altered immune and stress gene expression. They also find that knockdown of DNA Damage Response (DDR) genes in hemocytes increases expression of the immune cytokine, upd3, and that both upd3 overexpression in hemocytes and hemocyte knockdown of DDR genes leads to increased lethality upon oxidative stress. And they find that the PQ-induced lethality seen when the DDR is disrupted can be rescued in upd3 null background, suggesting links between proper regulation of DDR in hemocytes, modulation of systemic upd3 signaling, and the control of oxidative stress survival.

      The paper has two key strengths:

      1, The single cell analyses provide a clear description of how oxidative stress can cause distinct transcriptional changes in different populations of hemocytes. These results add to the emerging them in the field that there functionally different subpopulations of hemocytes that can control organismal responses to stress.<br /> 2, The discovery that DDR genes are required upon oxidative stress to modulate upd3 cytokine production and lethality provides interesting new insight into the DDR may play non-canonical roles in controlling organismal responses to stress.

    2. Reviewer #2 (Public Review):

      Hersperger et al. investigated the importance of Drosophila immune cells, called hemocytes, in the response to oxidative stress in adult flies. They found that hemocytes are essential in this response, and using state-of-the-art single-cell transcriptomics, they identified expression changes at the level of individual hemocytes. This allowed them to cluster hemocytes into subgroups with different responses, which certainly represents very valuable work. One of the clusters appears to respond directly to oxidative stress and shows a very specific expression response that could be related to the observed systemic metabolic changes and energy mobilization.

      Using hemocyte-specific genetic manipulation, the authors convincingly show that the DNA damage response in hemocytes regulates JNK activity and subsequent expression of the JAK/STAT ligand Upd3. Silencing of the DNA damage response or excessive activation of JNK and Upd3 leads to increased susceptibility to oxidative stress. This nicely demonstrates the importance of tight control of JNK-Upd3 signaling in hemocytes during oxidative stress. The treatment the authors used is quite harsh, and in such a situation it is simply better not to use upd3 signaling, but it is still worth bearing in mind that upd3 signaling may have a protective role under milder stresses, but Upd3 could require very tight control - this could be an interesting objective for future studies.

      The authors demonstrate that hemocytes play an important role in energy mobilization during oxidative stress, suggesting that control of energy mobilization by hemocytes is essential for the response. They further postulate that "hemocyte-derived upd3 is most likely released by the activated plasmatocyte cluster C6 during oxidative stress in vivo and is subsequently controlling energy mobilization and subsequent tissue wasting upon oxidative stress." It is important to note here that the association of upd3 with the observed changes in energy metabolism has not been tested, and the subsequent tissue wasting allegedly caused by excessive upd3 as a cause of death remains an open question.

    3. Reviewer #3 (Public Review):

      In this study, Kierdorf and colleagues investigated the function of hemocytes in oxidative stress response and found that non-canonical DNA damage response (DDR) is critical for controlling JNK activity and the expression of cytokine unpaired3. Hemocyte-mediated expression of upd3 and JNK determines the susceptibility to oxidative stress and systemic energy metabolism required for animal survival, suggesting a new role for hemocytes in the direct mediation of stress response and animal survival.

      In the revised manuscript, the authors provide additional evidence to support the role of DNA damage-modulated cytokine release by hemocytes during oxidative stress responses and strengthen the connection between DNA damage and the regulation of upd3 release from hemocytes. The authors have also included new analyses to emphasize the significance of hemocytes in the regulation of energy during oxidative stress. Following the reviewers' suggestions, the authors made improvements to the manuscript and the graphical abstract to better display their findings. Overall, the revised manuscript makes it easier to understand the main points, flows better, and is supported by convincing data and analysis throughout.

    1. Reviewer #1 (Public Review):

      Summary:

      This study is one of several around the world to investigate how urban wildlife responded to changes in human activity during the lockdowns associated with the COVID-19 pandemic. Unlike several other studies on the topic that used observational data from citizen science programs, this project relied on passive acoustic monitoring to record bird vocalizations during and after stringent lockdown periods in an urban environment. The authors focused on three species that differ in their level of adaptation to human presence, providing an ecologically relevant comparison that highlights the importance of micro-habitats for species living in close proximity to humans.

      Strengths:

      The element that sets this study apart from most others examining avian responses to COVID-19 lockdowns is the use of passive acoustic monitoring. As the authors describe, this method offers several advantages over other methods (though, it does come with some limitations on what questions can be addressed). Perhaps the most relevant advantage is that it offers the ability to concurrently measure anthropogenic noise in the environment, which is one of the most likely mechanisms for effects on wildlife from changes to human activity. These authors were, therefore, able to show local-scale differences in bird responses to human activity measured at the same scale. To my knowledge, only one other study (Derryberry et al. Science. 2020) has used recordings of vocalizations to examine the influence of COVID-19 lockdowns on a bird species.

      It was encouraging to see a study that focused on the local-scale impacts of lockdowns, with methods that could investigate effects within micro-habitats. Logistics prevented many other projects from operating at such fine scales, making the results from this study particularly useful for the examination of rapid changes in bird behavior. This does mean that comparisons between this study and others examining the effects of COVID-19 lockdowns on birds should be done with care, as the effects described here may have been the result of different processes, operating at different spatial and temporal scales. However, that also means this study fills an important gap in our knowledge of how wildlife reacts to human activity in urban spaces.

      Weaknesses:

      One drawback of the approach is that the acoustic sampling only occurred during the pandemic: samples were taken during several lockdown periods in the early spring (March through early May) of 2020 and then for a period of 10 days after the end of the final lockdown period in late May of 2020. Unfortunately, this means that the interpretation of the effect of lockdowns could have been affected by any shifts in the birds' vocal behavior that resulted from unmodeled environmental factors or normal seasonal phenology during that three-month period. However, the authors chose focal species that would be less prone to seasonal changes in vocal behavior and their approach did account for several factors to minimize any such effects.

    2. Reviewer #2 (Public Review):

      In this study, the authors tried to gauge the effect of human activity on three species, (1) the Hooded grow, an urban exploiter, (2) the Rose ring parakeet, an invasive, alien species that has adapted to exploit human resources, and (3) the Graceful prinia, an urban adapter, which is relatively shy of humans. A goal of the study was to increase awareness of the importance of urban parks.

      Strengths:<br /> Strengths of the study include the fact that it was conducted at 17 different sites, including parks, roads and residential areas, and included three species with different habitat preferences. Each species produced relatively loud and repeatable vocalizations. To avoid the effect of seasonal changes, sounds were sampled within a 10 day period of the lockdown as well as post-lockdown. The analysis included a comparison of the number of sound files, binary values indicating emission of a common syllable, and also the total number of syllables emitted as a measurement of bird activity. Ambient temperatures and sound levels of human activity were also recorded. All of these factors speak to the comprehensive approach and analysis adopted in this study. The results are based on a rigorous statistical analysis, ruling out the effects of various extraneous parameters.

      Weaknesses:<br /> Most significant changes may occur near the ambient noise levels and this could lead to a different conclusion, but the authors authors acknowledge this possibility and clarify that they only analyzed vocalizations with high signal-to-noise signals to avoid ambiguity. In the revised version, they also replaced the previous ambient noise parameter with an estimate of ambient noise under 1kHz, assuming that it reflects most anthropogenic noise (not restricted to human speech). This seems reasonable and this new model gave very similar results to the previous one.

      In interpreting the data, the authors mention the effect of human activity on bird vocalizations in the context of inter-species predator-prey interactions; however, the presence of humans could also modify intraspecies interactions by acting as triggers for communication of warning and alarm, and/or food calls (as may sometimes be the case) to conspecifics. The behavioral significance of the syllables used to monitor animal activity could be informative in this context; however, the authors acknowledge this possibility in the Discussion. Most importantly, the authors acknowledge the possibility of the above-noted bias, and the potential of a transient nature of the observed effects.

      Conclusion:<br /> In general, the authors achieved their aim of illustrating the complexity of the affect of human activity on animal behavior notwithstanding the caveats noted above. Their study also makes it clear that estimating such affects is not simple given the dynamics of animal behavior. For example, seasonality, temperature changes, animal migration and movement, as well as interspecies interactions, such as those related to predator-prey behavior, and inter/intra-species competition in other respects can all play into site-specific changes in the vocal activity of a particular species.

    1. Reviewer #1 (Public Review):

      Summary:<br /> This important study nicely integrates a breadth of experimental and computational data to address fundamental aspects of RNA methylation by an important for biology and health RNA methyltransferases (MTases). 



      Strengths:<br /> The authors offer compelling and strong evidence, based on carefully performed work with appropriate and well-established techniques to shed light on aspects of the methyl transfer mechanism of the methyltransferase-like protein 3 (METTL3), which is part of the methyltransferase-like proteins 3 & 14 (METTL3-14) complex. 


      Weaknesses:<br /> 
The significance of this foundational work is somewhat diminished mostly due to mostly efficient communication of certain aspects of this work. Parts of the manuscript are somewhat uneven and don't quite mesh well with one another. The manuscript could be enhanced by careful revision and significant textual and figure edits. 

Examples of recommended edits that would improve clarity and allow accessibility to a broader audience are highlighted in some detail below.

    2. Reviewer #2 (Public Review):

      Summary:<br /> Caflisch and coworkers investigate the methyltransferase activity of the complex of methyltransferase-like proteins 3 and 14 (METTL3-14). To obtain a high-resolution description of the complete catalytic cycle they have carefully designed a combination of experiments and simulations. Starting from the identification of bisubstrate analogues (BAs) as binders to stabilise a putative transition state of the reaction, they have determined multiple crystal structures and validated relevant interactions by mutagenesis and enzymatic assays.

      Using the resolved structure and classical MD simulations they obtained a kinetic picture of the binding and release of the substrates. Of note, they accumulated very good statistics on these processes using 16 simulation replicates over a time scale of 500 ns. To compare the time scale of the release of the products with that of the catalytic step they performed state-of-the-art QM/MM free energy calculations (testing multiple levels of theory) and obtained a free energy barrier that indicates how the release of the product is slower than the catalytic step.

      Strengths:<br /> All the work proceeds through clear hypothesis testing based on a combination of literature and new results. Eventually, this allows them to present in Figure 10 a detailed step-by-step description of the catalytic cycle. The work is very well crafted and executed.

      Weaknesses:<br /> To fulfill its potential of guiding similar studies for other systems as well as to allow researchers to dig into their vast work, the authors should share the results of their simulations (trajectories, key structures, input files, protocols, and analysis) using repositories like Zenodo, the plumed-nest, figshare or alike.

    3. Reviewer #3 (Public Review):

      Summary:<br /> The manuscript by Coberski et al describes a combined experimental and computational study aimed to shed light on the catalytic mechanism in a methyltransferase that transfers a methyl group from S-adenosylmethionine (SAM) to a substrate adenosine to form N6-methyladenosine (m6A).

      Strengths:<br /> The authors determine crystal structures in complex with so-called bi-substrate analogs that can bridge across the SAM and adenosine binding sites and mimic a transition state or intermediate of the methyl-transfer reaction. The crystal structures suggest dynamical motions of the substrate(s) that are examined further using classical MD simulations. The authors then use QM/MM calculations to study the methyl-transfer process. Together with biochemical assays of ligand/substrate binding and enzyme turnover, the authors use this information to suggest what the key steps are in the catalytic cycle. The manuscript is in most places easy to read.

      Weaknesses:<br /> My main suggestion for the authors is that they show better how their conclusions are supported by the data. This includes how the electron density maps for example support the key interactions and water molecules in the active site and a better error analysis of the computational analyses.

    1. Reviewer #1 (Public Review):

      The manuscript by Dr. Shinkai and colleagues is about the posttranslational modification of a highly important protein, MT3, also known as the growth inhibitory factor. Authors postulate that MT3, or generally all MT isoforms, are sulfane sulfur binding proteins. The presence of sulfane sulfur at each Cys residue has, according to the authors, a critical impact on redox protein properties and almost does not affect zinc binding. They show a model in which 20 Cys residues with sulfane sulfur atoms can still bind seven zinc ions in the same clusters as unmodified protein. They also show that recombinant MT3 (but also MT1 and MT2) protein can react with HPE-IAM, an efficient trapping reagent of persulfides/polysulfides. This reaction performed in a new approach (high temperature and high reagent concentration) resulted in the formation of bis-S-HPE-AM product, which was quantitatively analyzed using LC-MS/MS. This analysis indicated that all Cys residues of MT proteins are modified by sulfane sulfur atoms. The authors performed a series of experiments showing that such protein can bind zinc, which dissociates in the reaction with hydrogen peroxide or SNAP. They also show that oxidized MT3 is reduced by thioredoxin. It gives a story about a new redox-dependent switching mechanism of zinc/persulfide cluster involving the formation of cystine tetrasulfide bridge.

      The whole story is hard to follow due to the lack of many essential explanations or full discussion. What needs to be clarified is the conclusion (or its lack) about MT3 modification proven by mass spectrometry. Figure 1B shows the FT-ICR-MALDI-TOF/MS spectrum of recombinant MT3. It clearly shows the presence of unmodified MT3 protein without zinc ions. Ions dissociate in acidic conditions used for MALDI sample preparation. If the protein contained all Cys residues modified, its molecular weight would be significantly higher. Then, they show the MS spectrum (low quality) of oxidized protein (Fig. 1C), in which new signals (besides reduced apo-MT3) are observed. They conclude that new signals come from protein oxidation and modification with one or two sulfur atoms. If the conclusion on Cys residue oxidation is reasonable, how this protein contains sulfur is unclear. What is the origin of the sulfur if apo-MT does not contain it? Oxidized protein was obtained by acidification of the protein, leading to zinc dissociation and subsequent neutralization and air oxidation. Authors should perform a detailed isotope analysis of the isotopic envelope to prove that sulfur is bound to the protein. They say that the +32 mass increase is not due to the appearance of two oxygen donors. They do not provide evidence. This protein is not a sulfane sulfur binding protein, or its minority is modified. Moreover, it is unacceptable to write that during MT3 oxidation are "released nine molecules of H2". How is hydrogen molecule produced? Moreover, zinc is not "released", it dissociates from protein in a chemical process.

      Another important point is a new approach to the HPE-IAM application. Zinc-binding MT3 was incubated with 5 mM reagent at 60oC for 36 h. Authors claim that high concentration was required because apoMT3 has stable conformation. Figure 2B shows that product concentration increases with higher temperature, but it is unclear why such a high temperature was used. Figure 1D shows that at 37oC, there is almost no reaction at 5 mM reagent. Changing parameters sounds reasonable only when the reaction is monitored by mass spectrometry. In conclusion, about 20 sulfane sulfur atoms present in MT3 would be clearly visible. Such evidence was not provided. Increased temperature and reagent concentration could cause modification of cysteinyl thiol/thiolates as well, not only persulfides/polysulfides. Therefore, it is highly possible that non-modified MT3 protein could react with HPE-IAM, giving false results. Besides mass spectrometry, which would clearly prove modifications of 20 Cys, authors should use very important control, which could be chemically synthesized beta- or alfa-domain of MT3 reconstituted with zinc (many protocols are present in the literature). Such models are commonly used to test any kind of chemistry of MTs. If a non-modified chemically obtained domain would undergo a reaction with HPE-IAM under such rigorous conditions, then my expectation would be right.

      - The remaining experiments provided in the manuscript can also be applied for non-modified protein (without sulfane sulfur modification) and do not provide worthwhile evidence. For instance, hydrogen peroxide or SNAP may interact with non-modified MTs. Zinc ions dissociate due to cysteine residue modification, and TCEP may reduce oxidized residue to rescue zinc binding. Again, mass spectrometry would provide nice evidence.

      - The same is thioredoxin (Fig. 7) and its reaction with oxidized MT3. Nonmodified and oxidized MT3 would react as well.

      - If HPE-IAM reacts with Cys residues with unmodified MT3, which is more likely the case under used conditions, the protein product of such reaction will not bind zinc. It could be an explanation of the cyanolysis experiment (Fig. 6).

      - Figure 4 shows the reactivity of (pol)sulfides with TCEP and HPE-IAM. What are redox potentials? Do they correlate with the obtained results?

      - Raman spectroscopy experiments would illustrate the presence of sulfane sulfur in MT3 only if all Cys were modified.

      - The modeling presented in this study is very interesting and confirms the flexibility of metallothioneins. MT domains are known to bind various metal ions of different diameters. They adopt in this way to larger size the ions. The same mechanism could be present from the protein site. The presence of 9 or 11 sulfur atoms in the beta or alfa domain would increase the size of the domains without changing the cluster structure.

      - Comment to authors. Apo-MT is not present in the cell. It exists as a partially metallated species. The term "apo-MT" was introduced to explain that MTs are not fully saturated by metals and function as a metal buffer system. Apo-MT comes from old ages when MT was considered to be present only in two forms: apo-form and fully saturated forms.

    2. Reviewer #3 (Public Review):

      Summary:<br /> The authors were trying to show that a novel neuronal metallothionein of poorly defined function, GIF/MT3, is actually heavily persulfidated in both the Zn-bound and apo (metal-free) forms of the molecule as purified from a heterologous or native host. Evidence in support of this conclusion is compelling, with both spectroscopic and mass spectrometry evidence strongly consistent with this general conclusion. The authors would appear to have achieved their aims.

      Strengths:<br /> The analytical data are compelling in support of the author's primary conclusions are strong. The authors also provide some modeling evidence that strongly supports the contention that MT3 (and other MTs) can readily accommodate sulfane sulfur on each of the 20 cysteines in the Zn-bound structure, with little perturbation of the structure. This is not the case with Cys trisulfides, which suggests that the persulfide-metallated state is clearly positioned at lower energy relative to the immediately adjacent thiolate- or trisulfidated metal coordination complexes.

      Weaknesses:<br /> The biological significance of the findings is not entirely clear. On the one hand, the analytical data are clearly solid (albeit using a protein derived from a bacterial over-expression experiment), and yes, it's true that sulfane S can protect Cys from overoxidation, but everything shown in the summary figure (Fig. 8D) can be done with Zn release from a thiol by ROS, and subsequent reduction by the Trx/TR system. In addition, it's long been known that Zn itself can protect Cys from oxidation. I view this as a minor weakness that will motivate follow-up studies. Fig. 1 was incomplete in its discussion and only suggests that a few S atoms may be covalently bound to MT3 as isolated. This is in contrast to the sulfate S "release" experiment, which I find quite compelling.

      Impact:<br /> The impact will be high since the finding is potentially disruptive to the metals in the biology field in general and the MT field for sure. The sulfane sulfur counting experiment (the HPE-IAM electrophile trapping experiment) may well be widely adopted by the field. Those of us in the metals field always knew that this was a possibility, and it will interesting to see the extent to which metal-binding thiolates broadly incorporate sulfate sulfur into their first coordination shells.

    3. Reviewer #2 (Public Review):

      Summary:<br /> In this manuscript, the authors reveal that GIF/MT-3 regulates zinc homeostasis depending on the cellular redox status. The manuscript technically sounds, and their data concretely suggest that the recombinant MTs, not only GIF/MT-3 but also canonical MTs such as MT-1 and MT-2, contain sulfane sulfur atoms for the Zn-binding. The scenario proposed by the authors seems to be reasonable to explain the Zn homeostasis by the cellular redox balance.

      Strengths:<br /> The data presented in the manuscript solidly reveal that recombinant GIF/MT-3 contains sulfane sulfur.

      Weaknesses:<br /> It is still unclear whether native MTs, in particular, induced MTs in vivo contain sulfane sulfur or not.

    1. Reviewer #3 (Public Review):

      Summary:<br /> This manuscript describes some biochemical experiments on the crucial virulence factor EsxA (ESAT-6) of Mycobacterium tuberculosis. EsxA is secreted via the ESX-1 secretion system. Although this system is recognized to be crucial for virulence the actual mechanisms employed by the ESX-1 substrates are still mostly unknown. The EsxA substrate is attracting the most attention as the central player in virulence, especially phagosomal membrane disruption. EsxA is secreted as a dimer together with EsxB. The authors show that EsxA is also able to form homodimers and even tetramers, albeit at very low pH (below 5). Furthermore, the addition of a nanobody that specifically binds EsxA blocks intracellular survival, as well as if the nanobody is produced in the cytosol of the infected macrophages.

      Strengths:<br /> -Decent biochemical characterization of EsxA and identification of a new and interesting tool to study the function of EsxA (nanobody).

      -The manuscript is well-written.

      Weaknesses:<br /> The findings are not critically evaluated using extra experiments or controls.

      For instance, tetrameric EsxA in itself is interesting and could reveal how EsxA works. But one would say that this is a starting point to make small point mutations that specifically affect tetramer formation and then evaluate what the effect is on phagosomal membrane lysis. Also one would like to see experiments to indicate whether these structures can be produced under in vitro conditions, especially because it seems that this mainly happens when the pH is lower than 5, which is not normally happening in phagosomes that are loaded with M. tuberculosis.

      Also, the fact that the addition of the nanobody, either directly to the bacteria or produced in the cytosol of macrophages is interesting, but again it is the starting point for further experimentation. As a control, one would like to see the effect on an Esx-1 secretion mutant. Furthermore, does cytosolic production or direct addition of the nanobody affect phagosomal escape? What happens if an EsxA mutant is produced that does not bind the nanobody?

      Finally, it is a bit strange that the authors use a non-native version of esxA that has not only an additional His-tag but also an additional 12 amino acids, which makes the protein in total almost 20% bigger. Of course, these additions do not have to alter the characteristics, but they might. On the other hand, they easily discard the natural acetylation of EsxA by mycobacteria itself (proven for M. marinum) as not relevant for the function because it might not happen in (the close homologue) M. tuberculosis.

    2. Reviewer #1 (Public Review):

      Summary:<br /> The authors sought to establish a biochemical strategy to study ESAT-6 and CFP-10 biochemistry. They established recombinant reagents to study these protein associations in vitro revealing an unexpected relationship at low pH. They next develop much-needed reagents to study these proteins in an infection context and reveal that treatment with an ESAT-6 nanobody enhances Mtb control.

      Strengths:<br /> The biochemical conclusions are supported by multiple configurations of the experiments. They combine multiple approaches to study a complex problem.

      Weaknesses:<br /> It would be valuable to understand if the nanobody is disrupting the formation of the ESAT6-CFP10 complex. It is unclear how the nanobody is functioning to enhance control in the infection context. More detail or speculation in the discussion would have been valuable. Where is the nanobody in the cell during infection?

    3. Reviewer #2 (Public Review):

      Summary:<br /> Bates TA. et al. studied the biochemical characteristics of ESAT-6, a major virulence factor of Mycobacterium tuberculosis (Mtb), as part of the heterodimer with CFP10, a molecular chaperon of ESAT-6, as in homodimer and in homotetramer using recombinant ESAT-6 and CFP10 expressed in E. coli by applying several biochemical assays including Biolayer Interferometry (BLI) assay. The main findings show that ESAT-6 forms a tight interaction with CFP10 as a heterodimer at neutral pH, and ESAT-6 forms homodimer and even tetramer-based larger molecular aggregates at acidic pH. Although the discussion of the potential problems associated with the contamination of ESAT-6 preparations with ASB-14 during the LPS removal step is interesting, this research does not test the potential impact of residual ASB-14 contaminant on the biochemical behavior ESAT-6-CFP10 heterodimer and ESAT-6 homodimer or tetramer and their hemolytic activity in comparison with the ones without ASB-14. The main strength of this study is the generation of ESAT-6 specific nanobodies and the demonstration of its anti-tuberculosis efficiency in THP-1 cell lines infected with Mtb strains with reporter genes.

      Strengths:<br /> Generation and demonstration of the anti-ESAT-6 nanobodies against tuberculosis infection in a cell line based Mtb infection model.

      Weaknesses:<br /> Although the biochemistry studies provide quantitative data about the interactions of ESAT-6 with its molecular chaperon CFP10 and the interaction of ESAT-6 homodimer and tetramers, the novel information from these studies is minimal.

    1. Reviewer #1 (Public Review):

      Summary:

      Exposure to cranial irradiation (IR) leads to cognitive deficits in the survivors of brain cancer. IR upregulates miR-206-3p, which in turn reduces the PAK3-LIMK1 axis leading to the loss of F and G-actin ratio and, thereby, mature dendritic spine loss. Silencing miR-206-3p reverses these degenerative consequences.

      Strengths:<br /> The authors show compelling data indicating a clear correlation between PAK3 knockdown and the loss of mature dendritic spine density. In contrast, overexpression of PAK3 in the irradiated neurons restored mature spine types and recovered the F/G ratio. These in vitro results support the authors' hypotheses that PAK3 and LIMK1-mediated downstream signaling impact neuronal structure and reorganization in vitro. These data were supported by similar experiments using differentiated human neurons. Importantly, silencing miR-206-30 using antagonist miR also reverses IR-induced downregulation of the PAK3-LIMK1 axis, preventing spine loss and cognitive deficits.

      Weaknesses:

      All the miR-206-3p data are presented from in vitro cortical neurons or human stem cell-derived neuron cultures. This data (IR-induced elevation of miR-206-3p) should also be confirmed in vivo using an irradiated mouse brain to correlate the cognitive dysfunction timepoint.

      Antago-miR-206-3p reversed Ir-induced upregulation of miR-206 (in vitro), and prevent reductions in PAK3 and downstream markers. Importantly, it reversed cognitive deficits induced by IR. This data should be supported by in vivo staining for important dendritic markers, including cofillin, p-cofilin, PSD-95, F- and G-actin within the hippocampal and PFC regions.

      Other neuronal and non-neuronal targets of miR-206-3p should be discussed and looked into as a downstream impact of IR-induced functional and physiological impairments in the brain.

    2. Reviewer #2 (Public Review):

      Summary:<br /> The paper entitled "PAK3 downregulation induces cognitive 1 impairment following cranial irradiation" by Lee et al. aimed at investigating the functional impact of cranial irradiation in mouse and propose PAK3 as molecular element involved in radiation-induced cognitive decrement. The results provided in this paper are problematic as both the irradiation paradigm (5X2 Gy) as well as the timing of investigation (3 to 8 days post-IR) are completely irrelevant to investigate radiation induced neurocognitive impairment. This testifies to the team's lack of knowledge in radiobiology/radiotherapy and the methodology to explore radiation induced neurocognitive damages. It precludes any further relevance of the molecular results.

      Weaknesses:

      First and according to the BED equation a single dose of 10 Gy cannot not be approximated by 5 fractions of 2 Gy, as fractionation is known to decrease normal tissue toxicity. Note that in radiobiology/radio-oncology, the BED stands for "Biologically Effective Dose." This equation is used to compare the effects of different radiation treatments on biological tissues, taking into account the dose, fractionation, and the overall biological response of the tissue to radiation.<br /> The BED equation is commonly used to calculate the equivalent dose of a fractionated radiation treatment, which is the dose that would produce the same biological effect as a single, higher dose delivered in a single fraction.<br /> The general formula for BED is:BED = D * (1 + d / α/β)<br /> D is the total physical dose of radiation delivered in Grays (Gy)<br /> d is the dose per fraction in Gy<br /> α/β is the tissue-specific ratio of the linear (α) and quadratic (β) components of the radiation response. It is measured in Gy and describes how the tissue responds to different fractionation schedules (usually equal to 3 for the normal brain).<br /> Please refer to radiobiology/radiotherapy textbooks by Hall or Joiner.

      Second, the brain is a late responding organ. GBM patients treated with 60 Gy exhibit progressive and debilitating impairments in memory, attention and executive function several month post-irradiation. In mice, neurocognitive decrements after a single dose of 10 Gy delivered to the whole brain does occur at late time point, usually > 2 months post-exposure. Multiple publications such as the one by Limoli C lab, Rossi S lab, Britten R lab or earlier Fike J lab and Robin M lab support this. Next, 5 fractions of 2 Gy will be more protective than a single dose of 10 Gy and neurocognitive decrements will require at least 5-6 months to occur if they ever occur. In Figure 1, the decrement reported is marginal, the number of animals included (4 to 5 at most?) The number of animals is not specified) is too low to draw any significant conclusions. In addition to the timing issue, the strategy described for NOR analysis shows methodological issues with the habituation period being too short and exploration level being very low.

    1. Reviewer #1 (Public Review):

      In this study, authors performed multiple sets of mesoscale chromatin simulations at nucleosome resolution to study the effects of TF binding on chromatin structures. Through simulations at various conditions, authors performed systemically analysis to investigate how linker histone, tail acetylation, and linker DNA length can operate together with TFs to regulate chromatin architecture. Using gene Eed as one example, authors found that binding of Myc:Max could repress the gene expression by increasing fiber folding and compaction and this repression can be reversed by the linker histone. Understanding how transcription factors bind to regulatory DNA elements and modulate chromatin structure and accessibility is an essential question in epigenetics. Through modelling of TF binding to chromatin structures at nucleosome levels, authors demonstrated that TF binding could create microdomains that are visible in the ensemble-based contact maps and short DNA linkers prevent the formation microdomains. It has also been shown that tail acetylation and TF binding have opposite effects on chromatin compaction and linker histone can compete for the linker DNA with TF binding to impair the effect of TF binding. This study improves our knowledge on how TFs collaborate with different epigenetic marks and chromatin features to regulate chromatin structure and accessibility, which will be of broad interest to the community.

      For this reviewer, there were a few notable limitations. One was the implicit model of TF binding, which is modelled by adding harmonic restraints at two DNA beads. The model is very simple and it lacks kind of validation of how the results can be extended to many other TFs. In addition, the results of TF binding creating microdomains are very interesting but it requires further quantitative analysis of how microdomains was affected under different conditions. Also, some definitions and protocols demand further elucidation.

    2. Reviewer #2 (Public Review):

      Summary: In this paper, Portillo-Ledesma et al. study chromatin organization in the length scale of a gene, simulating the polymer at nucleosome resolution. The authors have presented an extensive simulation study with an excellent model of chromatin. The model has linker DNA and nucleosomes with all relevant interactions (electrostatics, tails, etc). Authors simulate 10 to 26 kb chromatin with varying linker lengths, linker histones (LH), and acetylated tails. The authors then study the effect of a transcription factor (TF) Myc: Max binding. The critical physical feature of the TF in the model is that it binds to the linker region and bends the DNA to make loops/intra-chromatin contacts. Authors systematically investigate the interplay between different variables such as linker DNA length, LH density, and the TF concentration in determining chromatin compaction and 3D organization.

      Strengths: The manuscript is well-written and is a relevant study with many useful results. The biggest strength of the work is the fact that the authors start with a relevant model that incorporates well-known biophysical properties of DNA, nucleosomes, linker histones, and the transcription factor Myc:Max. One of the novel results is the demonstration of how linker lengths play an important role in chromatin compaction (measured by computing packing ratio) in the presence of DNA-bending TFs. As the TF concentration increases, chromatin with short linker lengths does not compact much (only a small change in packing ratio). If the linker lengths are long, a higher percentage of TFs leads to an increase in packing ratio (higher compaction). Authors further show that TFs are able to compact Life-like chromatin fiber with linker length taken from a realistic distribution. The authors compute inter-nucleosomal contact maps from their simulated configurations and show that the map has features similar to what is observed in Hi-C/Micro-C experiments. Authors study the compaction of the Eed gene locus and show that TF binding leads to the formation of small domains known as micro-domains. Authors have predicted many relevant and testable quantities. Many of the results agree with known experiments like the formation of the micro-domains. Hence, the conclusions made in this study are justified - they follow from the simulation results.

      Weaknesses: (1) While this has the advantage of a minimal model (model with minimal factors incorporated), it is a disadvantage for predicting in vivo organization; one might need to incorporate the action of many other proteins (for example, PRC, HP1, etc) and several other histone modifications to predict in vivo organization. (2) While this forward model produces features of relevant contact maps, one would need to tune some of the intra-chromatin interaction parameters to obtain an accurate contact map and radius of gyration.

    1. Reviewer #1 (Public Review):

      Summary:

      This revised study follows up on previous work showing a female-specific enhancer region of PAX1 is associated with adolescent idiopathic scoliosis (AIS). This new analysis combines human GWAS analysis from multiple countries to identify a new AIS-associated coding variant in the COL11A1 gene (COL11A1P1335L). Using a Pax1 knockout mouse they go on to find that PAX1 and Collagen XI protein are expressed in the intervertebral discs (IVDs) and robustly in the growth plate, showing that COL11A1 expression is reduced in Pax1 mutant growth plate. Moreover, other AIS-associated genes, Gpr126 and Sox6, were also reduced in Pax1 mutant mice, suggesting a common pathway is involved in AIS.

      Using SV40 immortalized costal cartilage cells, derived from floxed Col11a1 mice primary rib cage cartilage, they go to show that removal of Col11a1 leads to reduction of Mmp3 expression. In this context, the expression of wild-type Col11a1 restored regular levels of Mmp3 expression, while expression of the AIS-associated Col11a1P1335L allele failed to restore normal Mmp3 expression. This supports a model that the AIS-associated Col11a1P1335L allele leads to the dysregulation of ECM in vivo.

      Using this culture system, they go on to test the role of the estrogen receptor ESR2, showing that loss of this receptor leads to reduced Mmp3 and Pax1 expression, and increased Col11a1 expression. They support this by showing similar gene expression changes and estrogen receptor function in Rat cartilage endplate cell culture.

      Altogether, this study nicely brings together an impressive number of human genetic data from multi-ethnic AIS cohorts and controls from across the globe and functionally tests these findings in cell culture and animal models. This study wonderfully integrates other findings from other human and mouse work in AIS and supports a new molecular mechanism by which estrogen can interact and synergize with COL11A1/PAX1/MMP3 signaling to change ECM development and dynamics, thus providing a tangible model for mutations and dysregulation of this pathway can increase the susceptibility of scoliosis.

      Strengths:

      This work integrates a large cohort of human genetic data from AIS patient and control from diverse ethnic backgrounds, across the globe. This work attempts to functionally test their findings in vivio and by use of cell culture.

      Weaknesses:

      Many of the main functional work was done in cell culture and not in vivo.

    2. Reviewer #2 (Public Review):

      Summary:

      In this manuscript, Yu and colleagues sought to identify new susceptibility genes for adolescent idiopathic scoliosis (AIS). Significance for this work is high, especially given the still large knowledge gap of the mechanistic underpinnings for AIS. In this multidisciplinary body of work, the authors first performed a genetic association study of AIS case-control cohorts (combined 9,161 cases and 80,731 controls) which leveraged common SNPs in 1027 previously defined matrisome genes. Two nonsynonymous variants were found to be significantly associated with AIS: MMP14 p.Asp273Asn and COL11A1 p.Pro1153Leu, the latter of which had the more robust association and remained significant when females were tested independent of males. Next, the authors followed a series of functional validation experiments to support biological involvement of COL11A1 p.Pro1153Leu in AIS through expression, biochemical, and histological studies in physiologically relevant cell and mouse models. Together, the authors propose a hitherto unreported model for AIS that involves the interplay of the COL11A1 susceptibility locus with estrogen signaling to alter a Pax1-Col11a1-Mmp3 signaling axis at the growth plate.

      Strengths:

      The manuscript is clearly written and follows a series of logical steps toward connecting multiple matrisome genes and putative AIS effectors in a new framework of pathomechanism. The multidisciplinary nature of the work makes it a strong body of work wherein multiple models offer multiple lines of supportive data.

      Weaknesses:

      This manuscript remains an important multidisciplinary study of the genetic and functional basis of adolescent idiopathic scoliosis (AIS). To the benefit of the overall manuscript quality, the reviewers have addressed most concerns to satisfaction. I have a few remaining suggestions:

      1. Regarding the genetic association of the common COL11A1 variant rs3753841, p.Pro1335Leu, please soften this statement to indicate that the variant could be a "risk locus" rather than "causal" in the following sentence on page 7-8: "These observations suggested that rs3753841 itself could be causal, although our methods would not detect deep intronic variants that could contribute to the overall association signal."

      2. Include the list of three rare missense variants mentioned in the response to reviewers as a supplementary table. Please also include methods for the SKATO rare variant burden analysis.

      3. Thank you for addressing the question of whether p.Pro1335Leu is a loss of function, gain of function, or dominant negative variant. The rationale in the response to reviewers was helpful, so please include this line of reasoning, and that there remains uncertainty, in the Discussion of the main text of the manuscript.

    3. Reviewer #3 (Public Review):

      Summary:

      This article demonstrates a Pax1-Col11a1-Mmp3 signaling axis associated with adolescent idiopathic scoliosis and finds that estrogen affects this signaling axis. In addition, the authors have identified a new COL11A1 mutation and verified its effect on the Pax1-Col11a1-Mmp3 axis.

      Strengths:

      1. Col11a1P1335L is verified in multicenter cohorts with high confidence.

      2. The article identified a potential pathogenesis of AIS.

      Weaknesses:

      The SV40-immortalized cell line established from Col11a1fl/fl mouse rib cartilage was applied in the study, and overexpression system was used to confirm that P1335L variant in COL11A1 perturbs its regulation of MMP3. However, due to the absence of P1335L point mutant mice, it cannot be confirmed whether P1335L can actually cause AIS, and the pathogenicity of this mutation cannot be directly verified.

    1. Reviewer #1 (Public Review):

      Summary:<br /> The study's abstract, introduction, and conclusions are not supported by the methods and results conducted. In fact, the results presented suggest that Arabidopsis could easily adapt to an extremely high CO2 environment.

      This study offers good evidence pointing to a genetic basis for Arabidopsis thaliana's response to elevated CO2 (eCO2) levels and its subsequent impact on the leaf ionome. The natural variation analyses in the study support the hypothesis that genetic factors, rather than local adaptation, guide the influence of eCO2 on the ionome of rosette leaves in Arabidopsis. However, the manuscript's claim regarding its role in "the development of biofortified crops adapted to a high-CO2 world" (line 23) is overstated, especially given the absence of any analysis on the influence of eCO2 on the seed ionome and Arabidopsis is a poor model for harvest index for any crop. The manuscript, in its current form, necessitates massive revisions, particularly in clarifying its broader implications and in providing more substantial evidence for some of its assertions.

      Major Drawbacks and Questions:

      1. Evidence for the Central Premise:<br /> The foundational premise of the study is the assertion that rising atmospheric CO2 levels result in a decline in plant mineral content. This phenomenon is primarily observed in C3 plants, with C4 plants seemingly less affected. The evidence provided on this topic is scant and, in some instances, contradicts the authors' own references. The potential reduction of certain minerals, especially in grains, can be debated. For instance, reduced nitrogen (N) and phosphorus (P) content in grains might not necessarily be detrimental for human and animal consumption. In fact, it could potentially mitigate issues like nitrogen emissions and phosphorus leaching. Labeling this as a "major threat to food security" (line 30) is exaggerated. While the case for microelements might be more compelling, the introduction fails to articulate this adequately. Furthermore, the introduction lacks any discussion on how eCO2 might influence nutrient allocation to grains, which would be crucial in substantiating the claim that eCO2 poses a threat to food security. A more comprehensive introduction that clearly delineates the adverse effects of eCO2 and its implications for food security would greatly enhance the manuscript.

      2. Exaggerated Concerns:<br /> The paper begins with the concern that carbon fertilization will lead to carbon dilution in our foods. While we indeed face numerous genuine threats in the coming decades, this particular issue is manageable. The increase in CO2 alone offers many opportunities for boosting yield. However, the heightened heat and increased evapotranspiration will pose massive challenges in many environments.

      Figure 4 in fact suggests that 43% of the REGMAP panel (cluster 3) is already pre-adapted to very high CO2 levels. This suggests annual species could adapt very rapidly.

      3. Assumptions on CO2 Levels:<br /> The assumption of 900ppm seems to be based on a very extreme climate change scenario. Most people believe we will overshoot the 1.5{degree sign}C scenario, however, it seems plausible that 2.5 to 3{degree sign}C scenarios are more likely. This would correspond to around 500ppm of CO2. https://www.nature.com/articles/s41597-022-01196-7/tables/4

      4. Focus on Real Challenges:<br /> We have numerous real challenges, such as extreme heat and inconsistent rainfall, to address in the context of climate change. However, testing under extreme CO2 conditions and then asserting that carbon dilution will negatively impact nutrition is exaggerated.

      In contrast, the FACE experiments are fundamental and are conducted at more realistic eCO2 levels. Understanding the interaction between a 20% increase in CO2 and new precipitation patterns is key for global carbon flux prediction.

      As I look at the literature on commercial greenhouse tomato production, 1000ppm of eCO2 is common, but it also looks like the breeders and growers have already solved for flavor and nutrition under these conditions.

      Conclusion:<br /> While the study provides valuable insights into the genetic underpinnings of Arabidopsis thaliana's response to elevated CO2 levels, it requires an entirely revised writeup, especially in its abstract, broader claims and implications. The manuscript would benefit from a more thorough introduction, a clearer definition of its scope, and a clear focus on the limits of this study.

    2. Reviewer #2 (Public Review):

      Strengths:<br /> The authors have conducted a large, well-designed experiment to test the response to eCO2. Overall, the experimental design is sound and appropriate for the questions about how a change in CO2 affects the ionome of Arabidopsis. Most of the conclusions in this area are well supported by the data that the authors present.

      Weakness:<br /> While the authors have done good experiments, it is a big stretch from Arabidopsis grown in an arbitrary concentration of CO2 to relevance to human and animal nutrition in future climates. Arabidopsis is a great model plant, but its leaves are not generally eaten by humans or animals.

      The authors don't justify their choice of a CO2 concentration. Given the importance of the parameter for the experiment, the rationale for selecting 900 ppm as elevated CO2 compared to any other concentration should be addressed. And CO2 is just one of the variables that plants will have to contend with in future climates, other variables will also affect elemental concentrations.

      Given these concerns, I think the emphasis on biofortification for future climates is unwarranted for this study.

      Additionally, I have trouble with these conclusions:

      -Abstract "Finally, we demonstrate that manipulating the function of one of these genes can mitigate the negative effect of elevated CO2 on the plant mineral composition. "<br /> -Discussion "Consistent with these results, we show that manipulating TIP2;2 expressions with a knock-out mutant can modulate the Zn loss observed under high CO2."

      The authors have not included the data to support this conclusion as stated. They have shown that this mutant increases the Zn content of the leaves when compared to WT but have not demonstrated that this response is different than in ambient CO2. This is an important distinction: one way to ameliorate the reduction of nutrients due to eCO2 is to try to identify genes that are involved in the mechanism of eCO2-induced reduction. Another way is to increase the concentration of nutrients so that the eCO2-induced reduction is not as important (i.e. a 10% reduction in Zn due to eCO2 is not as important if you have increased the baseline Zn concentration by 20%). The authors identified tip2 as a target from the GWAS on difference, but their validation experiment only looks at eCO2.

    1. Reviewer #1 (Public Review):

      Peng et al develop a computational method to predict/rank transcription factors (TFs) according to their likelihood of being pioneer transcription factors--factors that are capable of binding nucleosomes--using ChIP-seq for 225 human transcription factors, MNase-seq and DNase-seq data from five cell lines. The authors developed relatively straightforward, easy to interpret computational methods that leverage the potential for MNase-seq to enable relatively precise identification of the nucleosome dyad. Using an established smoothing approach and local peak identification methods to estimate positions together with identification of ChIP-seq peaks and motifs within those peaks which they referred to as "ChIP-seq motifs", they were able to quantify "motif profiles" and their density in nucleosome regions (NRs) and nucleosome depleted regions (NDRs) relative to their estimated nucleosome dyad positions. Using these profiles, they arrived at an odd-ratio based motif enrichment score along with a Fisher's exact test to assess the odds and significance that a given transcription factor's ChIP-seq motifs are enriched in NRs compared to NDRs, hence, its potential to be a pioneer transcription factor. They showed that known pioneer transcription factors had among the highest enrichment scores, and they could identify a number of relatively novel pioneer TFs with high enrichment scores and relatively high expression in their corresponding cell line. They used multiple validation approaches including (1) calculating the ROC-AUC and Matthews correlation coefficient (MCC) and generating ROC and precision-recall curves associated with their enrichment score based on 32 known pioneer TFs among their 225 TFs which they used as positives and the remaining TFs (among the 225) as negatives; (2) use of the literature to note that known pioneer TFs that acted as key regulators of embryonic stem cell differentiation had a highest enrichment scores; (3) comparison of their enrichment scores to three classes of TFs defined by protein microarray and electromobility shift assays (1. strong binder to free and nucleosomal DNA, 2. weak binder to free and nucleosomal DNA, 3. strong binding to free but not nucleosomal DNA); and (4) correlation between their calculated TF motif nucleosome end/dyad binding ratio and relevant data from an NCAP-SELEX experiment. They also characterize the spatial distribution of TF motif binding relative to the dyad by (1) correlating TF motif density and nucleosome occupancy and (2) clustering TF motif binding profiles relative to their distance from the dyad and identifying 6 clusters.

      The strengths of this paper are the use of MNase-seq data to define relatively precise dyad positions and ChIP-seq data together with motif analysis to arrive at relatively accurate TF binding profiles relative to dyad positions in NRs as well as in NDRs. This allowed them to use a relatively simple odds ratio based enrichment score which performs well in identifying known pioneer TFs. Moreover, their validation approaches either produced highly significant or reasonable, trending results.

      The weaknesses of the paper are relatively minor, and the authors do a good job describing the limitations of the data and approach.

    2. Reviewer #2 (Public Review):

      In this study, the authors utilize a compendium of public genomic data to identify transcription factors (TF) that can identify their DNA binding motifs in the presence of nuclosome-wrapped chromatin and convert the chromatin to open chromatin. This class of TFs are termed Pioneer TFs (PTFs). A major strength of the study is the concept, whose premise is that motifs bound by PTFs (assessed by ChIP-seq for the respective TFs) should be present in both "closed" nucleosome wrapped DNA regions (measured by MNase-seq) as well as open regions (measured by DNAseI-seq) because the PTFs are able to open the chromatin. Use of multiple ENCODE cell lines, including the H1 stem cell line, enabled the authors to assess if binding at motifs changes from closed to open. Typical, non-PTF TFs are expected to only bind motifs in open chromatin regions (measured by DNaseI-seq) and not in regions closed in any cell type. This study contributes to the field a validation of PTFs that are already known to have pioneering activity and presents an interesting approach to quantify PTF activity.

      For this reviewer, there were a few notable limitations. One was the uncertainty regarding whether expression of the respective TFs across cell types was taken into account. This would help inform if a TF would be able to open chromatin. Another limitation was the cell types used. While understandable that these cell types were used, because of their deep epigenetic phenotyping and public availability, they are mostly transformed and do not bear close similarity to lineages in a healthy organism. Next, the methods used to identify PTFs were not made available in an easy-to-use tool for other researchers who may seek to identify PTFs in their cell type(s) of interest. Lastly, some terms used were not define explicitly (e.g., meaning of dyads) and the language in the manuscript was often difficult to follow and contained improper English grammar.

    3. Reviewer #3 (Public Review):

      Peng et al. designed a computational framework for identifying pioneer factors using epigenomic data from five cell types. The identification of pioneer factors is important for our understanding of the epigenetic and transcriptional regulation of cells. A computational approach toward this goal can significantly reduce the burden of labor-intensive experimental validation.

      The authors have addressed my previous comments.

      The main issue identified in this re-review is based on the authors' additional experiments to investigate the reproducibility of the pioneer factors identified in the previously analysis that anchored on H1 ESCs.

      The additional analysis that uses the other four cell types (HepG2, HeLa-S3, MCF-7, and K562) as anchors reveals the low reproducibility/concordance and high dependence on the selection of anchor cell type in the computational framework. In particular, now several stem cell related TFs (e.g. ESRRB, POU5F1) are ranked markedly higher when H1 ESC is not used as the anchor cell type as shown in Supplementary Figure 5.

      Of note, the authors have now removed the shape labels that denote Yamanaka factors in Figure 2c (revised manuscript) that was presented in the main Figure 2a in the initial submission. The NFYs and ESRRB labels in Supplementary 4a are also removed and the boxplot comparing NFYs and ESRRB with other TF are also removed in this figure. Removing these results effectively hides the issues of the computational framework we identified in this revision. Please justify why this was done.

      In summary, these new results reveal significant limitations of the proposed computational framework for identifying pioneer factors. The current identifications appear to be highly dependent on the choice of cell types.

    1. Reviewer #2 (Public Review):

      Summary:

      In this article, the authors employed modified CRISPR screens ["guide-only (GO)-CRISPR"] in the attempt to identify the genes which may mediate cancer cell dormancy in the high grade serous ovarian cancer (HGSOC) spheroid culture models. Using this approach, they observed that abrogation of several of the components of the netrin (e.g., DCC, UNC5Hs) and MAPK pathways compromise the survival of non-proliferative ovarian cancer cells. This strategy was complemented by the RNAseq approach which revealed that a number of the components of the netrin pathway are upregulated in non-proliferative ovarian cancer cells and that their overexpression is lost upon disruption of DYRK1A kinase that has been previously demonstrated to play a major role in survival of these cells. Perampalam et al. then employed a battery of cell biology approaches to support the model whereby the Netrin signaling governs the MEK-ERK axis to support survival of non-proliferative ovarian cancer cells. Moreover, the authors show that overexpression of Netrins 1 and 3 bolsters dissemination of ovarian cancer cells in the xenograft mouse model, while also providing evidence that high levels of the aforementioned factors are associated with poor prognosis of HGSOC patients.

      Strengths:

      Overall it was thought that this study is of potentially broad interest inasmuch as it provides previously unappreciated insights into the potential molecular underpinnings of cancer cell dormancy, which has been associated with therapy resistance, disease dissemination, and relapse as well as poor prognosis. Notwithstanding the potential limitations of cellular models in mimicking cancer cell dormancy, it was thought that the authors provided sufficient support for their model that netrin signaling drives survival of non-proliferating ovarian cancer cells and their dissemination. Collectively, it was thought that these findings hold a promise to significantly contribute to the understanding of the molecular mechanisms of cancer cell dormancy and in the long term may provide a molecular basis to address this emerging major issue in the clinical practice.

      Weaknesses:

      Several issues were observed regarding methodology and data interpretation. The major concerns were related to the reliability of modelling cancer cell dormancy. To this end, it was relatively hard to appreciate how the employed spheroid model allows to distinguish between dormant and e.g., quiescent or even senescent cells. This was in contrast to solid evidence that netrin signaling stimulates abdominal dissemination of ovarian cancer cells in the mouse xenograft and their survival in organoid culture. Moreover, the role of ERK in mediating the effects of netrin signaling in the context of the survival of non-proliferative ovarian cancer cells was found to be somewhat underdeveloped.

    2. Reviewer #1 (Public Review):

      Summary:

      Perampalam et al. describe novel methods for genome-wide CRISPR screening to identify and validate genes essential for HGSOC spheroid viability. In this study, they report that Netrin signaling is essential for maintaining disseminated cancer spheroid survival, wherein overexpression of Netrin pathway genes increases tumor burden in a xenograft model of ovarian cancer. They also show that high netrin expression correlates with poor survival outcomes in ovarian cancer patients. The study provides insights into the biology of netrin signaling in DTC cluster survival and warrants development of therapies to block netrin signaling for treating serous ovarian cancer.

      Strengths:

      - The study identifies Netrin signaling to be important in disseminated cancer spheroid survival<br /> - A Novel GO-CRISPR methodology was used to find key genes and pathways essential for disseminated cancer cell survival

      Weaknesses:

      - The term dormancy is not fully validated and requires additional confirmation to claim the importance of Netrin signaling in "dormant" cancer survival.<br /> - Findings shown in the study largely relate to cancer dissemination and DTS survival rather than cancer dormancy.

    1. Reviewer #3 (Public Review):

      Summary:

      In this study, Warfvinge and colleagues use CITE-seq to interrogate how CML stem cells change between diagnosis and after one year of TKI therapy. This provides important insight into why some CML patients are "optimal responders" to TKI therapy while others experience treatment failure. CITE-seq in CML patients revealed several important findings. First, substantial cellular heterogeneity was observed at diagnosis, suggesting that this is a hallmark of CML. Further, patients who experienced treatment failure demonstrated increased numbers of primitive cells at diagnosis compared to optimal responders. This finding was validated in a bulk gene expression dataset from 59 CML patients, in which it was shown that the proportion of primitive cells versus lineage-primed cells correlates to treatment outcome. Even more importantly, because CITE-seq quantifies cell surface protein in addition to gene expression data, the authors were able to identify that BCR/ABL+ and BCR/ABL- CML stem cells express distinct cell surface markers (CD26+/CD35- and CD26-/CD35+, respectively). In optimal responders, BCR/ABL- CD26-/CD35+ CML stem cells were predominant, while the opposite was true in patients with treatment failure. Together, these findings represent a critical step forward for the CML field and may allow more informed development of CML therapies, as well as the ability to predict patient outcomes prior to treatment.

      Strengths:

      This is an important, beautifully written, well-referenced study that represents a fundamental advance in the CML field. The data are clean and compelling, demonstrating convincingly that optimal responders and patients with treatment failure display significant differences in the proportion of primitive cells at diagnosis, and the ratio of BCR-ABL+ versus negative LSCs. The finding that BCR/ABL+ versus negative LSCs display distinct surface markers is also key and will allow for a more detailed interrogation of these cell populations at a molecular level.

      Weaknesses:

      CITE-seq was performed in only 9 CML patient samples and 2 healthy donors. Additional samples would greatly strengthen the very interesting and notable findings.

    2. Reviewer #1 (Public Review):

      Summary:

      This manuscript by Warfvinge et al. reports the results of CITE-seq to generate single-cell multi-omics maps from BM CD34+ and CD34+CD38- cells from nine CML patients at diagnosis. Patients were retrospectively stratified by molecular response after 12 months of TKI therapy using European Leukemia Net (ELN) recommendations. They demonstrate heterogeneity of stem and progenitor cell composition at diagnosis, and show that compared to optimal responders, patients with treatment failure after 12 months of therapy demonstrate increased frequency of molecularly defined primitive cells at diagnosis. These results were validated by deconvolution of an independent previously published dataset of bulk transcriptomes from 59 CML patients. They further applied a BCR-ABL-associated gene signature to classify primitive Lin-CD34+CD38- stem cells as BCR:ABL+ and BCR:ABL-. They identified variability in the ratio of leukemic to non-leukemic primitive cells between patients, showed differences in the expression of cell surface markers, and determined that a combination of CD26 and CD35 cell surface markers could be used to prospectively isolate the two populations. The relative proportion of CD26-CD35+ (BCR:ABL-) primitive stem cells was higher in optimal responders compared to treatment failures, both at diagnosis and following 3 months of TKI therapy.

      Strengths:

      The studies are carefully conducted and the results are very clearly presented. The data generated will be a valuable resource for further studies. The strengths of this study are the application of single-cell multi-omics using CITE-Seq to study individual variations in stem and progenitor clusters at diagnosis that are associated with good versus poor outcomes in response to TKI treatment. These results were confirmed by deconvolution of a historical bulk RNAseq data set. Moreover, they are also consistent with a recent report from Krishnan et al. and are a useful confirmation of those results. The major new contribution of this study is the use of gene expression profiles to distinguish BCR-ABL+ and BCR-ABL- populations within CML primitive stem cell clusters and then applying antibody-derived tag (ADT) data to define molecularly identified BCR:ABL+ and BCR-ABL- primitive cells by expression of surface markers. This approach allowed them to show an association between the ratio of BCR-ABL+ vs BCR-ABL- primitive cells and TKI response and study dynamic changes in these populations following short-term TKI treatment.

      Weaknesses:

      One of the limitations of the study is the small number of samples employed, which is insufficient to make associations with outcomes with confidence. Although the authors discuss the potential heterogeneity of primitive stem, they do not directly address the heterogeneity of hematopoietic potential or response to TKI treatment in the results presented. Another limitation is that the BCR-ABL + versus BCR-ABL- status of cells was not confirmed by direct sequencing for BCR-ABL. The BCR-ABL status of cells sorted based on CD26 and CD35 was evaluated in only two samples. We also note that the surface markers identified were previously reported by the same authors using different single-cell approaches, which limits the novelty of the findings. It will be important to determine whether the GEP and surface markers identified here are able to distinguish BCR-ABL+ and BCR-ABL- primitive stem cells later in the course of TKI treatment. Finally, although the authors do describe differential gene expression between CML and normal, BCR:ABL+ and BCR:ABL-, primitive stem cells they have not as yet taken the opportunity to use these findings to address questions regarding biological mechanisms related to CML LSC that impact on TKI response and outcomes.

    3. Reviewer #2 (Public Review):

      Summary:

      The authors use single-cell "multi-comics" to study clonal heterogeneity in chronic myeloid leukemia (CML) and its impact on treatment response and resistance. Their main results suggest 1) Cell compartments and gene expression signatures both shared in CML cells (versus normal), yet 2) some heterogeneity of multiomic mapping correlated with ELN treatment response; 3) further definition of s unique combination of CD26 and CD35 surface markers associated with gene expression defined BCR::ABL1+ LSCs and BCR::ABL1- HSCs. The manuscript is well-written, and the method and figures are clear and informative. The results fit the expanding view of cancer and its therapy as a complex Darwinian exercise of clonal heterogeneity and the selective pressures of treatments.

      Strengths:

      Cutting-edge technology by one of the expert groups of single-cell 'comics.

      Weaknesses:

      Very small sample sizes, without a validation set.<br /> The obvious main problem with the study is that an enormous amount of results and conjecture arise from a very small data set: only nine cases for the treatment response section (three in each of the ELN categories), only two normal marrows, and only two patient cases for the division kinetic studies. Thus, it is very difficult to know the "noise" in the system - the stability of clusters and gene expression and the normal variation one might expect, versus patterns that may be reproducibly study artifact, effects of gene expression from freezing-thawing, time on the bench, antibody labeling, etc. This is not so much a criticism as a statement of reality: these elegant experiments are difficult, time-consuming, and very expensive. Thus in the Discussion, it would be helpful for the authors to just frankly lay out these limitations for the reader to consider. Also in the Discussion, it would be interesting for the authors to consider what's next: what type of validation would be needed to make these studies translatable to the clinic? Is there a clever way to use these data to design a faster/cheaper assay?

    1. Reviewer #1 (Public Review):

      Summary:

      Mainali and colleagues provide evidence for Itaconate stabilising Cpt1a via a decrease in ubiquitination. This in turn likely regulates fatty acid oxidation which in turn would appear to be involved in thermoregulation in the context of sepsis.

      Strengths:

      These findings add to our knowledge of the role of Itaconate in sepsis and its rather complex effects on metabolism, specifically lipid metabolism.

      Weaknesses:

      1. This is a complex paper and would benefit from a schematic depicting the key findings.

      2. The paper would benefit from additional supporting evidence. Would it be possible to measure fatty acid oxidation by metabolic tracing here, in IRG-deficient cells or in response to 4-OI? Although changes in protein level for Cpt1A are seen, this is correlated with fatty acid oxidation rather than direct demonstration. This may be challenging but would strengthen the manuscript.

      3. The aspect concerning body temperature regulation is confusing. Would Itaconate not promote fatty acid oxidation to increase or maintain body temperature? Itaconate must therefore not be involved in the hypothermic response? Bringing UCP1 into the finding is confusing and needs to be better explained. Again a diagram would help, but enhanced BAT fatty acid oxidation and UCP1 expression appear linked here, with both being affected by Itaconate. This needs clarifying.

    2. Reviewer #2 (Public Review):

      Summary:

      This manuscript provides important new findings regarding the connection between inflammation and metabolism. It also identifies a new type of post-translational modification and its connection to protein stability. This finding is expected to be generalizable to other protein targets. In vitro evidence is solid. In vivo evidence needs some additional controls.

      Strengths:

      A new connection between inflammation and metabolism.

      A novel type of PTM was identified.

      Findings would be of broad interest and the mechanisms are likely generalizable to related control systems.

      In vitro data are well-supported.

      The authors successfully demonstrated that treatment with 4-octyl Itaconate (4-OI), a prodrug form of itaconate, reduces neutral lipid accumulation in the AML12 cell line and primary hepatocytes. They show that 4-OI promotes fatty acid beta-oxidation through increased stability of CPT1a protein, the rate-limiting step in this process.

      Weaknesses:

      Some conclusions involving the Irg1 knockout mice require important controls and clarifications to be fully convincing and some controls are missing.

    1. Reviewer #1 (Public Review):

      The association of vitamin D supplementation in reducing Asthma risk is well studied, although the mechanistic basis for this remains unanswered. In the presented study, Kilic and co-authors aim to dissect the pathway of Vitamin D-mediated amelioration of allergic airway inflammation. They use initial leads from bioinformatic approaches, which they then associate with results from a clinical trial (VDAART) and then validate them using experimental approaches in murine models. The authors identify a role of VDR in inducing the expression of the key regulator Ikzf3, which possibly suppresses the IL-2/STAT5 axis, consequently blunting the Th2 response and mitigating allergic airway inflammation.

      The major strength of the paper lies in its interdisciplinary approach, right from hypothesis generation, and linkage with clinical data, as well as in the use of extensive ex vivo experiments and in vivo approaches using knock-out mice. The study presents some interesting findings including an inducible baseline absence/minimal expression of VDR in lymphocytes, which could have physiological implications and needs to be explored in future studies.<br /> However, the study presents a potential for further dissection of relevant pathophysiological parameters using additional techniques, to explain certain seemingly associative results, and allow for a more effective translation.

      Several results in the study suggest multiple factors and pathways influencing the phenotype seen, which remain unexplored. The inferences of this study also need to be read in the context of the different sub-phenotypes and endotypes of Asthma, where the Th2 response may not be predominant. While this does not undermine the importance of this elegant study, it is essential to emphasise a holistic picture while interpreting the results.

    2. Reviewer #2 (Public Review):

      Summary:<br /> This study seeks to advance our knowledge of how vitamin D may be protective in allergic airway disease in both adult and neonatal mouse models. The rationale and starting point are important human clinical, genetic/bioinformatic data, with a proposed role for vitamin D regulation of 2 human chromosomal loci (Chr17q12-21.1 and Chr17q21.2) linked to the risk of immune-mediated/inflammatory disease. The authors have made significant contributions to this work specifically in airway disease/asthma. They link these data to propose a role for vitamin D in regulating IL-2 in Th2 cells implicating genes associated with these loci in this process.

      Strengths:<br /> Here the authors draw together evidence form. multiple lines of investigation to propose that amongst murine CD4+ T cell populations, Th2 cells express high levels of VDR, and that vitamin D regulates many of the genes on the chromosomal loci identified to be of interest, in these cells. The bottom line is the proposal that vitamin D, via Ikfz3/Aiolos, suppresses IL-2 signalling and reduces IL-2 signalling in Th2 cells. This is a novel concept and whilst the availability of IL-2 and the control of IL-2 signalling is generally thought to play a role in the capacity of vitamin D to modulate both effector and especially regulatory T cell populations, this study provides new data.

      Weaknesses:<br /> Overall, this is a highly complicated paper with numerous strands of investigation, methodologies etc. It is not "easy" reading to follow the logic between each series of experiments and also frequently fine detail of many of the experimental systems used (too numerous to list), which will likely frustrate immunologists interested in this. There is already extensive scientific literature on many aspects of the work presented, much of which is not acknowledged and largely ignored. For example, reports on the effects of vitamin D on Th2 cells are highly contradictory, especially in vitro, even though most studies agree that in vivo effects are largely protective. Similarly other reports on adult and neonatal models of vitamin D and modulation of allergic airway disease are not referenced. In summary, the data presentation is unwieldy, with numerous supplementary additions, that makes the data difficult to evaluate and the central message lost. Whilst there are novel data of interest to the vitamin D and wider community, this manuscript would benefit from editing to make it much more readily accessible to the reader.

      Wider impact: Strategies to target the IL-2 pathway have long been considered and there is a wealth of knowledge here in autoimmune disease, transplantation, GvHD etc - with some great messages pertinent to the current study. This includes the use of IL-2, including low dose IL-2 to boost Treg but not effector T cell populations, to engineered molecules to target IL-2/IL-2R.

    1. Reviewer #2 (Public Review):

      Summary:

      This manuscript from Liu et al. examines the role of Fat and Dachsous, two transmembrane proto-cadherins that function both in planar cell polarity and in tissue growth control mediated by the Hippo pathway. The authors developed a new method for measuring growth of the wing imaginal disc during late larval development and then used this approach to examine the effects of disruption of Fat/Dachsous function on disc growth. The authors show that during mid to late third instar the wing imaginal disc normally grows in a linear rather than exponential fashion and that this occurs due to slowing of the mitotic cell cycle as the disc grows during this period. Consistent with their known role in regulating Hippo pathway activity, this slowing of growth is disrupted by loss of Fat/Dachsous function. The authors also observed a previously unreported gradient of Fat protein across the wing blade. However, graded expression of Fat or Dachsous is not necessary for proper growth regulation in the late third instar because ectopic Dachsous expression, which affects gradients of both Dachsous and Fat, has no growth phenotype.

      Strengths:

      Although the role of the Hippo pathway in growth control has been extensively studied, our understanding of how the pathway controls growth during normal development remains relatively weak. This work addresses this question by examining normal growth of the wing imaginal disc during part of its development in the larva and characterizing the effects of Fat/Dachsous manipulation on that growth. The authors developed tools for measuring wing growth by measuring wing volume, an approach that could be useful in future studies of tissue growth.

      Weaknesses:

      1) Although the approach used to measure volume is new to this study, the basic finding that imaginal disc growth slows at the mid-third instar stage has been known for some time from studies that counted disc cell number during larval development (Fain and Stevens, 1982; Graves and Schubiger, 1982). Although these studies did not directly measure disc volume, because cell size in the disc is not known to change during larval development, cell number is an accurate measure of tissue volume. However, it is worth noting that the approach used here does potentially allow for differential growth of different regions of the disc.

      2) Related to point 1, a main conclusion of this study, that cell cycle length scales with growth of the wing, is based on a developmentally limited analysis that is restricted to the mid-third instar larval stage and later (early third instar begins at 72 hr - the authors' analysis started at 84 hr). The previous studies cited above made measurements from the beginning of the 3rd instar and combined them with previous histological analyses of cell numbers starting at the beginning of the 2nd instar. Interestingly, both studies found that cell number increases exponentially from the start of the 2nd instar until mid-third instar, and only after that point does the cell cycle slow resulting in the linear growth reported here. The current study states that growth is linear due to scaling of cell cycle with disc size as though this is a general principle, but from the earlier studies, this is not the case earlier in disc development and instead applies only to the last day of larval life.

      3) The analysis of the roles of Fat and Dachsous presented here has weaknesses that should be addressed. It is very curious that the authors found that depletion of Fat by RNAi in the wing blade had essentially no effect on growth while depletion of Dachsous did, given that the loss of function overgrowth phenotype of null mutations in fat is more severe than that of null mutations in dachsous (Matakatsu and Blair, 2006). An obvious possibility is that the Fat RNAi transgene employed in these experiments is not very efficient. The authors tried to address this by doubling the dose of the transgene, but it is not clear to me that this approach is known to be effective. The authors should test other RNAi transgenes and additionally include an analysis of growth of discs from animals homozygous for null alleles, which as they note survive to the late larval stages.

      4) It is surprising that the authors detect a gradient of Fat expression that has not been seen previously given that this protein has been extensively studied. It is also surprising that they find that expression of Nubbin Gal4 is graded across the wing blade given that previous studies indicate that it is uniform (ie. Martín et al. 2004). These two surprising findings raise the possibility that the quantification of fluorescence could be inaccurate. The curvature of the wing blade makes it a challenging tissue to image, particularly for quantitative measurements.

      5) Overall, in my view the impact of these findings is limited. The focus on growth solely at the end of larval development, when there are a number of potentially confounding variables (for example hormonal cues), makes the generality of the findings reported here difficult to judge. Additionally, the functional analysis of Fat/Dachsous function in this process is limited - for example does disruption of other Hippo pathway components have a similar effect?

    2. Reviewer #1 (Public Review):

      Summary and Strengths:

      The manuscript presents novel results on the regulation of Drosophila wing growth by the protocadherins Ds and Fat. The manuscript performs a more careful analysis of disc volume, larval size, and the relationship between the two, in normal and mutant larvae, and after localized knockdown or overexpression of Fat and Ds. Not all of the results are equally surprising given the previous work on Fat, Ds, and their regulation of disc growth, pupariation, and the Hippo pathway, but the presentation and detail of the presented data is new. The most novel results concern the scaling of gradients of Fat and Ds protein during development, a largely unstudied gradient of Fat protein, and using overexpression of Ds to argue that changes in the Ds gradient do not underlie the slowing and halting of cell divisions during development.

      Weaknesses:

      Below I list questions and suggestions about the methodology, the presentation, and the interpretation of the data.

      1) Pouch growth: division or recruitment? The study chooses to examine growth only in the prospective wing blade (the "pouch") rather than the wing disc as a whole. This can create biases, as fat and ds manipulations often cause stronger effects on growth, and on Hippo signaling targets, in the adjacent hinge regions of the disc. So I am curious about this choice.

      The limitation to the wing region also creates some problems for the measurements themselves. The division between wing and pouch is not a strict lineage boundary, and thus cells can join or leave this region, creating two different reasons for changes in wing pouch size; growth of cells already in the region, or recruitment of cells into or out of the region. The authors do not discuss the second mechanism.

      It is not at all clear that the markers for the pouch used by the authors are stable during development. One of these is Vg expression, or the Vg quadrant enhancer. But the Vg-expressing region is thought to increase by recruitment over late second and third instar through a feed-forward mechanism by which Vg-expressing cells induce Vg expression in adjacent cells. In fact, this process is thought to be driven in part by Fat and Ds (Zecca et al 2010). So when the authors manipulate Fat and Ds are they increasing growth or simply increasing Vg recruitment? I would prefer that this limitation be addressed.

      The second pouch marker the authors use is epithelial folding, but this also has problems, as Fat and Ds manipulations change folding. Even in wild type, the folding patterns are complex. For instance, to make folding fit the Vg-QE pattern at late third the authors appear to be jumping in the dorsal pouch between two different sets of folds (Fig 1S2A). The authors also do not show how they use folding patterns in younger, less folded discs, nor provide evidence that the location of the folds are the same and do not shift relative to the cells. They also do not explain how they use folds and measure at later wpp and bpp stages, as the discs unfold and evert, exposing cells that were previously hidden in the folds.

      Finally, the authors limit their measurements to cells with exposed apical faces and thus a measurable area but apparently ignore the cells inside the folds. At late third, however, a substantial amount of the prospective wing blade is found within the folds, especially where they are deepest near the A/P compartment boundary. Using the third vein sensory organ precursors as markers, the L3-2 sensillum is found just distal to the fold, the L3-1 and the ACV sensilla are within the fold, and the GSR of the distal hinge is found just proximal to the fold. That puts the proximal half of the central wing blade in the fold, and apparently uncounted in their assays. These cells will however be exposed at wpp and especially bpp stages. How are the authors adjusting for this?

      2) Stabilizing and destabilizing interactions between Fat and Ds- The authors describe a distal accumulation of Fat protein in the wing, and show that this is unlikely to be through Fat transcription. They further try to test whether the distal accumulation depends on destabilization of proximal Fat by proximal Ds by looking at Fat in ds mutant discs.

      However, the authors do not describe how they take into account the stabilizing effects of heterophilic binding between the extracellular domains (ECDs) of Fat and Ds; without one, the junctional levels and stability of the other is reduced (Ma et al., 2003; Hale et al. 2015). So when they show that the A-P gradient of Fat is reduced in a ds mutant, is this because of the loss of a destabilizing effect of Ds on Fat, as they assume, or is it because all junctional Fat has been destabilized by loss of extracelluarlar binding to Ds? The description of the Fat gradient in Ds mutants is also confusing (see note 6 below), making this section difficult for the reader to follow.

      The authors do not propose or test a mechanism for the proposed destabilization. Fat and Ds bind not only through their ECDs, but binding has now also been demonstrated through their ICDs (Fulford et al. 2023)

      3) Ds gradient scales by volume, rather than cell number - This is an intriguing result, but the authors do not discuss possible mechanisms.

      4) Autonomous effects on growth- Fat and Ds are already known to have autonomous effects on growth and Hippo signaling from clonal analyses and localized knockdowns. One novelty here is showing that localized knockdown does not delay pupariation in the way that whole animal knockdown does, although the mechanism is not investigated. Another novelty is that the authors find stronger wing pouch overgrowth after localized ds RNAi or whole disc loss of fat than after localized fat RNAi, the latter being only 11% larger. The fat RNAi result would have strengthened by testing different fat RNAi stocks, which vary in their strength and are commonly weaker than null mutations, or stronger drivers such as the ap-gal4 they used for some of their ds-RNAi experiments or use of UAS-dcr2. Another reason for caution is that Garoia (2005) found much stronger overgrowth in fat mutant clones, which were about 75% larger than control clones.

      5) Flattening of Ds gradients does not slow growth. One model suggests that the flattening of the Ds gradient, and thus polarized Ds-Fat binding, account for slowed growth in older discs. The difficulty in the past has been that two ways of flattening the Ds gradient, either removing Ds or overexpressing Ds uniformly, give opposite results; the first increases growth, while the latter slows it. Both experiments have the problem of not just flattening the gradient, but also altering overall levels of Ds-Fat binding, which will likely alter growth independent of the gradients. Here, the authors instead use overexpression to create a strong Ds gradient (albeit a reversely oriented one) that does not flatten, and show that this does not prevent growth from slowing and arresting.

      To make sure that this is not some effect caused by using a reverse gradient, one might instead induce a more permanent normally oriented Ds gradient and see if this also does not alter growth; there is a ds Trojan gal4 line available that might work for this, and several other proximal drivers.

      Another possible problem is that, unlike previous studies, the authors have not blocked the Four-jointed gradient; Fj alters Fat-Ds binding and might regulate polarity independently of Ds expression. A definitive test would be to perform the tests above in four-joined mutant discs.

      The Discussion of these data should be improved. The authors state in the Discussion "The significance of these dynamics is unclear, but the flattening of the Fat gradient is not a trigger for growth cessation." While the Discussion mentions the effects of Ds on Fat distribution in some detail, this is the only phrase that discusses growth, which is surprising given how often the gradient model of growth control is mentioned elsewhere. The reader would be helped if details are given about what experiment supports this conclusion, the effect on not only growth cessation but cell cycle time, and why the result differs from those of Rogjula 2008 and Willecke 2008 using Ds and Fj overexpression.

      6) Discussion of Dpp. The authors spend much of the discussion speculating on the possibility that Fat and Ds control growth by changing the wing's sensitivity to the BMP Dpp. As the manuscript contains no new data on Dpp, this is somewhat surprising. The discussion also ignores Schwank (2011), who argues that Fat and Dpp are relatively independent. There have also been studies showing genetic interactions between Fat and signaling pathways such as Wg (Cho and Irvine 2004) and EGF (Garoia 2005).

    1. Reviewer #1 (Public Review):

      The manuscript by Geurrero and colleagues introduces two new metrics that extend the concept of "druggability"- loosely speaking, the potential suitability of a particular drug, target, or drug-target interaction for pharmacological intervention-to collections of drugs and genetic variants. The study draws on previously measured growth rates across a combinatoriality complete mutational landscape involving 4 variants of the TEM-50 (beta lactamase) enzyme, which confers resistance to commonly used beta-lactam antibiotics. To quantify how growth rate - in this case, a proxy for evolutionary fitness - is distributed across allelic variants and drugs, they introduce two concepts: "variant vulnerability" and "drug applicability".

      Variant vulnerability is the mean vulnerability (1-normalized growth rate) of a particular variant to a library of drugs, while drug applicability measures the mean across the collection of genetic variants for a given drug. The authors rank the drugs and variants according to these metrics. They show that the variant vulnerability of a particular mutant is uncorrelated with the vulnerability of its one-step neighbors, and analyze how higher-order combinations of single variants (SNPs) contribute to changes in growth rate in different drug environments.

      The work addresses an interesting topic and underscores the need for evolution-based metrics to identify candidate pharmacological interventions for treating infections. The authors are clear about the limitations of their approach - they are not looking for immediate clinical applicability - and provide simple new measures of druggability that incorporate an evolutionary perspective, an important complement to the orthodoxy of aggressive, kill-now design principles.

      As I said in my initial review, I think the work could be improved with additional analysis that tie the new metrics to evolutionary outcomes. Without this evidence, or some other type of empirical or theoretical support for the utility of these metrics, I am not fully convinced that these concepts have substantial impact. The new metrics could indeed be useful--and they have intuitive appeal--but the current revisions stop short of demonstrating that these intuitive notions hold up under "realistic" conditions (whether in simulation, theory, or experiment).

    2. Reviewer #2 (Public Review):

      In the main text, the authors apply their metrics to a data set that was published by Mira et al. in 2015. The data consist of growth rate measurements for a combinatorially complete set of 16 genetic variants of the antibiotic resistance enzyme beta-lactamase across 10 drugs and drug combinations at 3 different drug concentrations, comprising a total of 30 different environmental conditions. In my previous report I had asked the authors to specify why they selected only 7 out of 30 environments for their analysis, with only one concentration for drug, but a clear explanation is still lacking. In the Data section of Material and Methods, the authors describe their criterion for data selection as follows: "we focus our analyses on drug treatments that had a significant negative effect on the growth of wildtype/TEM-1 strains". However, in Figure 2 it is seen that, even for the selected data sets, not all points are significant compared to wild type (grey points). So what criterion was actually applied?

      In effect, for each chosen drug or drug combination, the authors choose the data set corresponding to the highest drug concentration. As a consequence, they cannot assess to what extent their metrics depend on drug concentration. This is a major concern, since Mira et al. concluded in their study that the differences between growth rate landscapes measured at different concentrations were comparable to the differences between drugs. I argued before that, if the new metrics display a significant dependence on drug concentration, this would considerably limit their usefulness. The authors challenge this, saying in their rebuttal that "no, that drug concentration would<br /> be a major actor in the value of the metrics does not limit the utility of the metric. It is simply another variable that one can consider when computing the metrics." While this is true in principle, I don't think any practicing scientist would disagree with the statement that the existence of additional confounding factors (in particular if they are unknown) reduces the usefulness<br /> of a quantitative metric.

      As a consequence of the small number of variant-drug-combinations that are used, the conclusions that the authors draw from their analysis are mostly tentative. For example, on line 123 the authors write that the observation that<br /> the treatment of highest drug applicability is a combination of two drugs "fits intuition". In the Discussion this statement is partly retracted with reference to the piperacillin/tazobactam-combination which has low drug applicability. Being based on only a handful of data points, both observations are essentially anecdotal and it is unclear what the reader is supposed to learn.

      To assess the environment-dependent epistasis among the genetic mutations comprising the variants under study, the authors decompose the data of Mira et al. into epistatic interactions of different orders. This part of the analysis is incomplete in two ways. First, in their study, Mira et al. pointed out that a fairly large fraction of the fitness differences between variants that they measured were not statistically significant. This information has been removed in the depiction of the Mira et al. fitness landscapes in Figure 1 of the present manuscript, and it does not seem to be reflected in the results of the interaction analysis in Figure 4. Second, the interpretation of the coefficients obtained from the epistatic decomposition depends strongly on the formalism that is being used. In a note added on page 15 of the revised manuscript, the authors write that they have used the LASSO regression for their analysis and refer the reader to a previous publication (Guerrero et al. 2019) which however (as far as I could see) also does not fully explain how the method works. To give an example of the difficulty of interpreting the data in Figure 4 without further information: The substitution C (G238S) is well known to have a strong positive effective in cefotaxime, but the corresponding coefficient is essentially zero. So whatever the LASSO regression does, it cannot simply measure the effect on growth.

    3. Reviewer #3 (Public Review):

      The authors introduce two new concepts for antimicrobial resistance borrowed from pharmacology, "variant vulnerability" (how susceptible a particular resistance gene variant is across a class of drugs) and "drug applicability" (how useful a particular drug is against multiple allelic variants). They group both terms under an umbrella term "drugability". They demonstrate these features for an important class of antibiotics, the beta-lactams, and allelic variants of TEM-1 beta-lactamase. In the revised version, they investigate a second drug class that targets dihydrofolate reductase in Plasmodium (the causative agent of malaria).

      The strength of the result is in its conceptual advance and that the concepts seem to work for beta-lactam resistance and DHFR inhibitors in a protozoan. However, I do not necessarily see the advance of lumping both terms under "drugability", as this adds an extra layer of complicaton in my opinion.

      I think that the utility of the terms will be more comprehensively demonstrated by using examples across a breadth of drug classes classes and/or resistance genes. For instance, another good bacterial model with published data might have been trimethoprim resistance, which arises through point mutations in the folA gene (although, clinical resistance tends to be instead conferred by a suite of horizontally acquired dihydrofolate reductase genes, which are not so closely related as the TEM variants explored here).

      The impact of the work on the field depends on a more comprehensive demonstration of the applicability of these new concepts to other drugs. This would be demonstrated in future work.

    1. Reviewer #1 (Public Review):

      Summary:<br /> The study follows the role of yeast eIF2A protein as a potential translation initiation factor engaged in the non-canonical translation initiation under stress conditions and as a substitute for eIF2. Using ribosome profiling, RNA-Seq and reporter-based assays authors evaluated the role of eIF2A protein under regular or stress conditions (cells starved for branched amino acids). The authors found that yeast cells depleted of eIF2A protein do not change significantly their translation initiation, or translation in general. In contrast to previously reported data for human homolog, yeast eIF2A does not significantly contribute to the regulation of the uORFs, regardless of whether they start with canonical AUG or near cognate start codons. eIF2A is not involved in the repression of IRES element in the URE2 gene or has a role in purine biosynthesis. It appears that in yeast eIF2A contributes to the regulation of a very limited number of mRNAs (32 with significant changes in translation efficiency), where only 17 of such messages indeed are consistent with eIF2A deletion, and single mRNA (HKR1) could be validated in reporter assay.

      Strengths:<br /> The main strength of the manuscript is a complete analysis and unbiased approach using genomic analysis methods (ribosome profiling and RNA-seq) as well as reporter validation studies. Additional strengths of the manuscript are scientific rigor and statistics associated with data analyses, clear data presentation, and discussion of the results in the context of the previous studies and results.

      Weaknesses:<br /> No weaknesses were noted by this reviewer.

    2. Reviewer #2 (Public Review):

      Summary:<br /> Gaikwad et al. investigated the role of eIF2A in translational response to stress in yeast. For this purpose, the authors conducted ribosome profiling under SM treatment in an eIF2A-depleted strain. Data analysis revealed that eIF2A did not influence translation from mRNAs bearing uORFs or cellular IRESes, in the stress condition, broadly. The authors found that only a small number of mRNAs were supported by eIF2A. The data should be helpful for researchers in the field.

      Major points:<br /> 1. The weakness of this work is the lack of clarification on the function of eIF2A in general. The novelty of this study was limited.

      2. Related to this, it would be worth investigating common features in mRNAs selectively regulated (surveyed in Figure 3A). Also, it would be worth analyzing the effect of eIF2A deletion on elongation (ribosome occupancy on each codon and/or global ribosome footprint distribution along CDS) and termination/recycling (footprint reads on stop codon and on 3′ UTR).

      3. Regarding Figure 3D, the reporters were designed to include promoter and 5′ UTR of the target genes. Thus, it should be worth noting that reporter design was based on the assumption that eIF2A-dependency in translation regulation was not dependent on 3′ UTR or CDS region. The reason why the effects on ribosome profiling-supported mRNAs could not be recapitulated in reporter assay may originate from this design. This should be also discussed.

      4. Related to the point above, the authors claimed that eIF2A affects "possibly only one" (HKR1) mRNA. However, this was due to the reporter assay which is technically variable and could not allow some of the constructs to pass the authors' threshold. Alternative wording for this point should be considered.

      5. For Figure 3D, it would be worth considering testing the #-marked genes (in Figure 3C) in this set up.

      6. In box plots, the authors should provide the statistical tests, at least where the authors explained in the main text.

    3. Reviewer #3 (Public Review):

      Summary:<br /> The authors have undertaken a study to rigorously characterize the possible role of eIF2A in regulating translation in yeast. The authors test for the role of eIF2A in the absence or presence of cellular stress and conclude that eIF2A does not play any significant role in regulating translation initiation in yeast.

      Strengths:<br /> The authors have used rigorous experimental approaches, including genome-wide ribosome profiling analysis in the absence or presence of stress, to show that eIF2A does not function in translation initiation on most mRNAs in yeast. Interestingly, the authors do identify a small number of mRNAs that possess some eIF2A dependency, so they constructed reporters to rigorously test them. One mRNA, HKR1, appears to possess a degree of eIF2A-dependent translation regulation.

      Weaknesses:<br /> While no role of eIF2A in translation initiation is apparent, the authors do not determine what function eIF2A does play in yeast. Whether it plays a role in regulating translation in a different stress response is not determined.

    1. Reviewer #1 (Public Review):

      Summary and Strengths:

      Zhang et al. conducted a study in which they isolated and characterized a Marburg virus (MARV) glycoprotein-specific antibody, AF-03. The antibody was obtained from a phage-display library. The study shows that AF-03 competes with the previously characterized MARV-neutralizing antibody MR78, which binds to the virus's receptor binding site. The authors also performed GP mutagenesis experiments to confirm that AF-03 binds near the receptor binding site. In addition, the study confirmed that AF-03, like MR78, can neutralize Ebola viruses with cleaved glycoproteins. Finally, the authors demonstrated that NPC2-fused AF-03 was effective in neutralizing several filovirus species.

      Weaknesses:

      1. The main premise of this study is unclear. Flyak et al. in 2015 described the isolation and characterization of a large panel of neutralizing antibodies from a Marburg survivor (Flyak et al., Cell, 2015). Based on biochemical and structural characterization, Flyak proposed that the Marburg neutralizing antibodies bind to the NPC1 receptor binding side. In the same study, it has been shown that several MARV-neutralizing antibodies can bind to cleaved Ebola glycoproteins that were enzymatically treated to remove the mucin-like domain and glycan cap. In the following study, it has been shown that the bispecific-antibody strategy can be used to deliver Marburg-specific antibodies into the endosome, where they can neutralize Ebola viruses (Wec et al., Science 2016). Finally, the use of lysosome-resident protein NPC2 to deliver antibody cargos to late endosomes has been previously described (Wirchnianski et al., Front. Immunol, 2021)

      The above-mentioned studies are not referenced in the introduction. The authors state that "there is no licensed treatment or vaccine for Marburg [virus] infection." While this is true, there are human antibodies that recognize neutralizing epitopes - that information can't be excluded while providing the rationale for the study. Furthermore, the authors use the word "novel" to describe the AF-03 antibody. How novel is AF-03 if multiple Marburg-neutralizing antibodies were previously characterized in multiple studies? Since AF-03 competes with previously characterized MR78, it binds to the same antigenic region as MR78. AF-03 also has comparable neutralization potency as MR78.

      2. Without the AF-03-MARV GP crystal structure, it's unclear how van der Waals interactions, H-bonds, and polar and electrostatic interactions can be evaluated. While authors use computer-guided homology modeling, this technique can't be used to determine critical interactions. Furthermore, Flyak et al. reported that binding to the NPC1 receptor binding site is the main mechanism of Marburg virus neutralization by human monoclonal antibodies. Since both AF-03 (this study) and MR78 (Flyak study) competed with each other, that information alone was sufficient for GP mutagenesis experiments that identified the NPC1 receptor binding site as the main region for mutagenesis.

      3. The AF-03-GP affinity measurements were performed using bivalent IgG molecules and trimeric GP molecules. This format does not allow accurate measurements of affinity due to the avidity effect. The reported KD value is abnormally low due to avidity effects. The authors need to repeat the affinity experiments by immobilizing trimeric GPs and then adding monovalent AF-03 Fab.

    2. Reviewer #2 (Public Review):

      Summary:

      The authors describe the discovery of a filovirus neutralizing antibody, AF03, by phage display, and its subsequent improvements to include NPC2 that resulted in a greater breadth of neutralization. Overall, the manuscript would benefit from considerable grammatical review, which would improve the communication of each point to the reader. The authors do not convincingly map the AF03 epitope, nor do they provide any strong support for their assumption that AF03 targets the NPC1 binding site. However, the authors do show that AF03 competes for MR78 binding to its epitope, and provides good support for the internalization of AF03-NL as the mechanism for improved breadth over the original AF03 antibody.

      Strengths:

      This study shows convincing binding to Marburgvirus GP and neutralization of Marburg viruses by AF03, as well as convincing neutralization of Ebolaviruses by AF03-NL. While there are no distinct populations of PE-stained cells shown by FACS in Figure 5A, the cell staining data in Figure 5C are compelling to a non-expert in endosomal staining like me. The control experiments in Figure 7 are compelling showing neutralization by AF03-NL but not AF03 or NPC2 alone or in combination. Altogether these data support the internalisation and stabilisation mechanism that is proposed for the gain in neutralization breadth observed for Ebolaviruses by AF03-NL over AF03 alone.

      Weaknesses:

      Overall, this reviewer is of the opinion that this paper is constructed haphazardly. For instance, the neutralization of mutant pseudoviruses is shown in Figure 2 before the concept of pseudovirus neutralization by AF03 is introduced in Figure 3. Similarly, the control experiments for AF03+NPC2 are described in Figure 7 after the data for breadth of neutralization are shown in Figure 6. GP quality controls are shown in Figure 2 after GP ELISAs / BLI experiments are done in Figure 1. This is disorienting for the reader.

      Figure 1: The visualisation of AF03 modelling and docking endeavours is extremely difficult to interpret. Firstly, there is no effort to orient the non-specialist reader with respect to the Marburgvirus GP model. Secondly, from the figures presented it is impossible to tell if the Fv docks perfectly onto the GP surface, or if there are violent clashes between the deeply penetrating AF03 CDRs and GP. This information would be better presented on a white background, perhaps showing GP in surface view from multiple angles and slices. The authors attempt to label potential interactions, but these are impossible to read, and labels should be added separately to appropriately oriented zoomed-in views.

      Figure 2: The neutralization of mutant pseudoviruses cannot be properly assessed using bar graphs. These data should be plotted as neutralization curves as they were done for the wild-type neutralization data in Figure 3. The authors conclude that Q128 & N129 are contact residues, but the neutralization data for this mutant appear odd as the lowest two concentrations of AF03 show higher neutralization than the second highest AF03 concentration. Neutralization of T204/Q205/T206 (green), Y218 (orange), K222 (blue), or C226 (purple) appears to be better than neutralization of the wild-type MARV. The authors do not discuss this oddity. What are the IC50's? The omission of antibody concentrations on the x-axis and missing IC50 values give a sense of obscuring the data, and the manuscript would benefit from greater transparency, and be much easier to interpret if these were included. I am intrigued that the Q128S/N129S mutant is reported as having little effect on the neutralization of MR78. The bar graph appears to show some effect (difficult to interpret without neutralization curves and IC50 data), and indeed PDB:5UQY seems to suggest that these amino acids form a central component of the MR78 epitope (Q128 forms potential hydrogen bonds with CDRH1 Y35 and CDRL3 Y91, while N129 packs against the MR78 CDRH3 and potentially makes additional polar contact with the backbone). Lastly, since neutralization was tested in both HEK293T cells and Huh7 cells in Figure 3, the authors should clarify which cells were used for neutralization in Figure 2.

      Figure 3: The first two images in Figure 3C showing bioluminescent intensity from pseudovirus-injected mice pretreated with either 10mg/kg or 3mg/kg AF03 are identical images. This is apparent from the location, shape, and intensity of the bioluminescence, as well as the identical foot placement of each mouse in these two panels. Currently, this figure is incomplete and should be corrected to show the different mice treated with either 10mg/kg or 3mg/kg of AF03.

      Figure 4 would benefit from a control experiment without antibodies comparing infection with GP-cleaved and GP-uncleaved pseudoviruses. The paragraph describing these data was also difficult to read and would benefit from additional grammatical review.

      Figure 5: The authors should clarify in the methods section that the "mock" experiment included the PE anti-human IgG Fc antibody. Without this clarification, the lack of a distinct negative population in the FACS data could be interpreted as non-specific staining with PE. If the PE antibody was added at an equivalent concentration to all panels, what does the directionality of the arrowheads in Figure 5A (labelled PE) and 5B (labelled pHrodo Red) indicate?

      Figure 6B: These data would benefit from the inclusion of IC50, transparency of antibody concentrations used, and consistency in the direction of antibody concentrations (increasing to the right or left of the x-axis) when compared to Figure 2.

    1. Reviewer #1 (Public Review):

      Summary:

      Mandal et al build upon their earlier work in CD 4 T cells to address the role of WASP in cytotoxic T cell mechanosensing. As shown previously by this group and others, the authors present evidence that tumour cell lysis is stiffness dependent and requires CTL WASP expression. They proceed to show that CTLs engaging targets form actin-rich foci, that the formation of these structures is dependent upon tumour cell stiffness and WASP dependent actin nucleation. Traction force measurements show that WASP is involved in force generation, and evidence that WASP plays a role in mechanosensing comes from studies showing that stiffness dependent phosphorylation of early TCR signalling intermediates (but not the later stages of T cell activation) is WASP dependent, as is phosphorylation of the tension sensor CasL. Finally, the authors provide in vivo data that WASP-deficient T cells kill tumours inefficiently.

      Strengths:

      The paper is well-written and brings together a range of well-established techniques for measuring T cell stiffness responses, force production, signalling, and effector function. Although some of the findings are necessarily correlative, the authors have largely achieved their aims. One particularly interesting observation is that stiffness dependent phosphorylation of ZAP70 requires WASP expression. Evidence that ZAP70 phosphorylation is WASP dependent is important, as it suggests that forces exerted by WASP are needed for some of the earliest stages of TCR signalling, perhaps TCR deformation itself. This observation, made in CD8 T cells, is particularly interesting given that previous work from this group [Kumari et al eLife 2015] showed that ZAP70 phosphorylation was intact in WASP-/- CD4 T cell blasts. In that study, the first clear differences in TCR signaling were seen at the level of PLCγ phosphorylation. This could represent an interesting difference between CD4 and CD8 T cells, but supplemental data from Figure S2 also show WASP dependence for CD3ζ and ZAP70 phosphorylation in naïve CD4 T cells. Unfortunately, this interesting issue was not discussed or pursued experimentally.

      Weaknesses:

      While the study is well executed, it is rather limited in scope, and many of the observations have been reported previously in other systems. These weaknesses limit the impact of the study. In particular, the authors have previously shown in CD4 T cells that the nucleation promoting activity of WASP is responsible for the formation of actin foci, for early TCR signalling events associated with T cell activation, for traction force generation and for CasL phosphorylation [Kumari et al eLife 2015, Kumari et al EMBO J 2020]. It could be argued that this paper extends findings made originally in CD4 cells to include CD8 T cells. But the authors did not make this clear, and the advance is rather incremental. Moreover, similar studies have been done in CD8 T cells by other labs. Most notably, the Huse group has conducted highly relevant work investigating the mechanobiology of CTL function in vitro and in vivo [Basu et al Cell 2016, Wang et al Nat Comms 2022, Tamzalit et al Sci Immunol 2019, Tello-Lafoz et al Immunity 2021, de Jesus et al bioRxiv Preprint 2023]. Indeed, one study showed that WASP depletion impairs the formation of protrusions that deform the target cell surface and promote target lysis [Tamzalit et al Sci Immunol 2019]. Mandal et al cite this work and argue that what they show differs from the mechanopotentiation shown in Tamzalit et al, but they don't explore the issue further. They also fail to cite work from Tello-Lafoz et al showing that regulated changes in target cell stiffness contribute to CTL vulnerability. Finally, Mandal et al. fail to deal with evidence that WASP participates in many phases of the CTL response, including adhesion, migration, granule release, and serial killing. All of these are likely contributors to the in vivo phenotypes shown in Figure 4.

    2. Reviewer #2 (Public Review):

      Summary:

      Mandal et al. use WASP-deficient T cells to study the role of WASP in T cell signaling and activation and tying WASP to mechanosensing in T cells. Using both CD8 and CD4 T cells from WASP-deficient animals, the authors show defects in T cell signaling and function as well as defects in mechanosensing in activated CD8 T cells.

      Strengths:

      Confirming findings from many previous studies, Mandal et al. demonstrate that WASP-deficiency in T cells leads to defective T cell function (Figs 1, 2, 3, and 4). Fig 3 shows direct effects of mechanical stress on CD8 T cell signaling in the absence of WASP.

      Weaknesses:

      The title does not reflect the data presented as the only data demonstrating a role for WASP in mechanosensing in this manuscript doesn't directly connect WASP mechanosensing with tumors (Fig 3). The results shown in Fig 1 using an actin inhibitor doesn't directly connect WASP with mechanosensing. Fig 4 uses WASP-deficient animals in a tumor model, but doesn't demonstrate any role for mechanosensing in the WASP-deficient animals. The title should reflect the lack of data connecting WASP in mechanosensing to a tumor context.

      One major oversight is the absence of discussion of a previous publication demonstrating a direct role of WASP in mechanosensing to the actin cytoskeleton in dendritic cells and naive CD4 and CD8 T cells (Gaertner et al. Dev Cell 2022). There should be a discussion of how the findings in Gaertner et al. shed light on the results from this manuscript.

      The use of Myca to disrupt the actin cytoskeleton as a "modulator of stiffness" is problematic. While one of the potential effects of disrupting the actin cytoskeleton is changing stiffness, as shown in Figure 1, many other functions are simultaneously disturbed also. The use of B16 tumor cells is simply for antigen presentation, and not in a tumor context, so generalized statements about "stiffness" or "softness" and "tumor cells" in reference to Figure 1 should be changed to account for these alternative explanations.

      Fig S2 shows Myca treatment of BMDCs leads to decreased functionality of OTII CD4s. Interpretation in the manuscript claims "This indicates that leaching of Myca from treated cells does not cause inhibition of bystander cells". This would not be my interpretation of the data. An alternative interpretation is that if Myca is remaining in the media, then effects on APCS (either BMDCs or B16s) could lead to decreased CD4 or CD8 T cell activation and thus be responsible for effects seen in Fig 1. This possibility should be considered.

      Fig 4 claims that high rigidity leads to downstream effects of WASP-/- T cell function. But there is no demonstration of the role of mechanosensing in Figure 4. To make this claim, the authors would need to compare high and low rigidity conditions.

      Fig 4 also shows that WASP-/- showed higher tumor growth in an implanted tumor model. For 4F, since WASP is deficient in all hematopoietic cells, the finding in 4G may not be due to T cells. In 4H-J, because implantation of tumors occurs within 1 day of lymphodepletion and assessing tumor growth prior to reconstitution of the hematopoietic compartment, there should be control experiments shown to demonstrate that other hematopoietic cell types that remain are not function and thus do not participate in the differences seen in tumor growth. Also, statistical tests need to be done to show the significance of the differences between groups in Fig 4I and 4J (also 4G).

    3. Reviewer #3 (Public Review):

      The manuscript from Mandal et al. aims to show that the actin cytoskeleton is the key mechanosensitive element in cytotoxic T lymphocytes, enabling them to discriminate between target cells of different cortical stiffness. They further examine whether WASP activation is sensitive to substrate stiffness, and thus modulates actin polymerization and early T cell signaling in a mechanosensitive manner. Overall, the mechanosensitivity of CTLs has attracted a lot of attention in the last few years and this study explores new and interesting facets. The manuscript asks an important question regarding the mechanisms underlying the stiffness dependent response observed in T cells. The authors have used a variety of techniques ranging from mouse models and in vivo studies, cell biological manipulations and biophysical measurements which is commendable. Their work suggests that the actin cytoskeleton regulated by WASP plays a key role in mechanosensitivity - which is an intriguing finding.

      While this manuscript has wide-ranging experiments and interesting results, a number of points need to be carefully addressed to support the central claims.

      The first major issue is that the irreversible actin inhibitor myca can have a number of non-specific effects on CTL activation. It is not clear that the effects observed are due to the change in stiffness alone. Since Myca depolymerizes actin, the B16 target cells would have altered MHC mobility or impaired receptor-ligand engagement - which might affect actin foci formation and signaling. There is also no gain of function experiment, wherein the stiffness of the target cell is enhanced. Moreover, there are two populations in both the control and myca-treated Young's modulus histograms for B16 cells. Are these sub-populations fundamentally different in their cytoskeletal organization? This can also confound or introduce variability in results on stiffness-dependence of CTL function, given the second sub-population of Myca-treated cells overlaps with the first sub-population of control cells. The authors need to provide a justification for these.

      Secondly, the WASP knockout still shows mechanosensitivity but at reduced force levels (Fig. 3B). Similarly, other measures (Fig. 3) still show increases with stiffness. Thus, it is not clear whether WASP is necessary for mechanosensing but simply for maintaining force levels and (expectedly) lower actin levels and foci in the WASP knockout. In fact, Fig 3 implies WASP is required for signaling and not for mechanosensing, undermining the main claim of the paper. At the very least, ANOVA or factor analysis (stiffness x WASP) needs to be done to demonstrate the requirement of WASP for CTL mechanosensitivity.

      Third, there are some concerns regarding the traction force microscopy. The authors do not present key details in the manuscript about the methods used. Secondly, the traction values are entirely too high compared to reported values in the literature for CTLs. A back-of-the-envelope calculation of the total force yields ~30 nN for wild-type cells) on 10 kPa gels, which is about an order of magnitude higher than reported values (Tamzalit et al. 2020, Hui et al. 2017, Bashour et al. 2014, Pathni et al. 2022). The authors should clearly demonstrate and justify that their measured values are reasonable and accurate. The lack of representative movies and displacement maps used for the traction force measurements make it hard to evaluate the results. Typical bead displacements for CTLs on softer gels are on the order of 1 micron (Mustapha et al. 2022), which should decrease to 0.1 micron or less on 50 kPa gels. These would make the tractions hard to estimate accurately. The authors should evaluate and show the displacements underneath the cell and outside the cell boundaries to give estimates of the noise floor for tractions. Finally, there is no discussion of how the tractions were calculated from the displacements - was Fourier Transform or Finite element method used? What is the noise level of the measurements and how were the traction estimates regularized?

      Fourth, many of the plots in the manuscripts are not accompanied by representative images to show how these aspects (distribution of actin and signaling markers for example) change qualitatively under different conditions (e.g. stiffness). Details of analysis and quantification need to be provided for a clearer understanding of the results and interpretations. All figures and captions should include information about the number of cells and experiments. Along these lines, there is very little detail in the methods, statistical power, calculations are not mentioned, there is little description of the pmel-1 knockout mouse, all of which make it hard to evaluate the soundness of the results.

      Finally, the study as presented, doesn't conclusively show that WASP is required for mechanosensitive CTL function. The results presented show that WASP is required for early and longer-term signaling events and cytolytic activity, and that knocking out WASP reduces early TCR signaling, actin foci formation in response to substrate stiffness. To make the claim of WASP-mediated regulation of CTL mechanosensitivity stronger, it would be helpful to see how WASP knockout affects CTL killing in response to softened and (possibly) stiffened B16 targets.

    1. Reviewer #1 (Public Review):

      The authors investigate the function of the PTB domain containing adaptor protein Numb in skeletal muscle structure and function. In particular, the effects of reduced Numb expression in aging muscle is proposed as a mechanism for reduced contractile function associated with sarcopenia. Using ex-vivo analysis of conditional Numb and Numblike knockout muscle the authors demonstrate that loss of Numb but not the related Numblike gene expression perturbs muscle force generation. In order to explore the molecular mechanisms involved, Numb interacting proteins were identified in C2C12 cell cultured myotubes by immunoprecipitation and LC-MS/MS. The authors identify Septin 7 as well as Septin 2, 9 and 10 as a Numb binding proteins and demonstrate that loss of Numb/Numblike in myofibers causes changes in Septin 7 subcellular localization. Of note, whether additional septins form a complex or are also disrupted by Numb/Numblike loss remains an interesting area for further investigation. Additional investigation of the specificity and mapping of the Numb-Septin 7 (or another Septin) interaction would be of interest and provide an approach for future studies to demonstrate the biological relevance and specificity of the Numb-Septin 7 interaction in skeletal muscle

    2. Reviewer #2 (Public Review):

      Summary:

      The main purpose of this investigation was to 1) compare the effects of a single knockout (sKO) of Numb or a double knockout (dKO) of Numb and NumbL on ex-vivo physiological properties of the extensor digitorium longus (EDL) muscle in C57BL/6NCrl mice; and 2) analyze protein complexes isolated from C2C12 myotubes via immunoprecipitation and LC/MS/MS for potential Numb binding partners. The main findings are 1) the muscles from sKO and dKO were significantly weaker with little difference between the sKO and dKO lines, indicating the reduced force is mainly due to the inactivation of the Numb gene; and 2) there were 11 potential Numb binding proteins that were identified and cytoskeletal specific proteins including Septin 7.

      Strengths:

      Straight-forward yet elegant design to help determine the important role the Numb has in skeletal muscle.

      Weaknesses:

      There were a limited number of samples (3-6) that were used for the physiological experiments; however, there was a very large effect size in terms of differences in muscle tension development between the induced KO models and the controls.

    1. Reviewer #1 (Public Review):

      Summary:

      This manuscript presents the development of a new microscope method termed "open-top two-photon light sheet microscopy (OT-TP-LSM)". While the key aspects of the new approach (open-top LSM and Two-photon microscopy) have been demonstrated separately, this is the first system of integrating the two. The integration provides better imaging depth than a single-photon excitation OT-LSM.

      Strengths:

      - The use of liquid prism to minimize the aberration induced by index mismatching is interesting and potentially helpful to other researchers in the field.<br /> - The use of propidium iodide (PI) provided a deeper imaging depth.

      Weaknesses:

      - Details are lacking on imaging time, data size, the processing time to generate large-area en face images, and inference time to generate pseudo H&E images. This makes it difficult to assess how applicable the new microscope approach might be in various pathology applications.

    2. Reviewer #2 (Public Review):

      Summary:

      In this manuscript, the authors developed an open-top two-photon light sheet microscopy (OT-TP-LSM) that enables high-throughput and high-depth investigation of 3D cell structures. The data presented here shows that OT-T-LSM could be a complementary technique to traditional imaging workflows of human cancer cells.

      Strengths:

      High-speed and high-depth imaging of human cells in an open-top configuration is the main strength of the presented study. An extended depth of field of 180 µm in 0.9 µm thickness was achieved together with an acquisition of 0.24 mm2/s. This was confirmed by 3D visualization of human cancer cells in the skin, pancreas, and prostate.

      Weaknesses:

      The complementary aspect of the presented technique in human pathological samples is not convincingly presented. The traditional hematoxylin and eosin (H&E) staining is a well-established and widely used technique to detect human cancer cells. What would be the benefit of 3D cell visualization in an OT-TP-LSM microscope for cancer detection in addition to H&E staining?

    1. Joint Public Review:

      In countries endemic for P vivax the need to administer a primaquine (PQ) course adequate to prevent relapse in G6PD deficient persons poses a real dilemma. On one hand PQ will cause haemolysis; on the other hand, without PQ the chance of relapse is very high. As a result, out of fear of severe haemolysis, PQ has been under-used.

      In view of the above, the authors have investigated in well-informed volunteers, who were kept under close medical supervision in hospital throughout the study, two different schedules of PQ administration: (1) escalating doses (to a total of 5-7 mg/kg); (2) single 45 mg dose (0.75 mg/kg).

      It is shown convincingly that regimen (1) can be used successfully to deliver within 3 weeks, under hospital conditions, the dose of PQ required to prevent P vivax relapse.

      As expected, with both regimens acute haemolytic anaemia (AHA) developed in all cases. With regimen (2), not surprisingly, the fall in Hb was less, although it was abrupt. With regimen (1) the average fall in Hb was about 4 G. Only in one subject the fall in Hb mandated termination of the study.

      Since the data from the Chicago group some sixty years ago, there has been no paper reporting a systematic daily analysis of AHA in so many closely monitored subjects with G6PD deficiency. The individual patient data in the Supplementary material are most informative and more than precious.

      Comments on the revised version:

      In my view this important paper is further improved in this revised version (R2), particularly with respect to clarity in the discussion. All the points I had previously raised have been tackled.

    1. Reviewer #1 (Public Review):

      Amyotrophic lateral sclerosis (ALS) is a neurodegenerative disorder leading to the loss of innervation of skeletal muscles, caused by the dysfunction and eventual death of lower motor neurons. A variety of approaches have been taken to treat this disease. With the exception of three drugs that modestly slow progression, most therapeutics have failed to provide benefit. Replacing lost motor neurons in the spinal cord with healthy cells is plagued by a number of challenges, including the toxic environment, inhibitory cues that prevent axon outgrowth to the periphery, and proper targeting of the axons to the correct muscle groups. These challenges seem to be well beyond our current technological approaches. Avoiding these challenges altogether, Bryson et al. seek to transplant the replacement motor neurons into the peripheral nerves, closer to their targets. The current manuscript addresses some of the challenges that will need to be overcome, such as immune rejection of the allograft and optimizing maturation of the neuromuscular junction.

    2. Reviewer #2 (Public Review):

      The authors provide convincing evidence that optogenetic stimulation of ChR2-expressing motor neurons implanted in muscles effectively restore innervation of severely affected skeletal muscles in the aggressive SOD1 mouse model of ALS, and concluded that this method can be applied to selectively control the function of implicated muscles, which was supported by convincing data presented in the paper.

    1. Reviewer #1 (Public Review):

      Summary:

      The authors have implemented the Optimal Transport algorithm in GromovMatcher for comparing LC/MS features from different datasets. This paper gains significance in the proteomics field for performing meta-analysis of LC/MS data.

      Strengths:

      The main strength is that GromovMatcher achieves significant performance metrics compared to other existing methods. The authors have done extensive comparisons to claim that GromovMatcher performs well.

      Weaknesses:

      There are two weaknesses.

      1. When the number of features is reduced the precision drops to ~0.8.<br /> 2. How applicable is the method for other non-human datasets?

    2. Reviewer #2 (Public Review):

      Summary:

      The goal of untargeted metabolomics is to identify differences between metabolomes of different biological samples. Untargeted metabolomics identifies features with specific mass-to-charge ratio (m/z) and retention time (RT). Matching those to specific metabolites based on the model compounds from databases is laborious and not always possible, which is why methods for comparing samples on the level of unmatched features are crucial.

      The main purpose of the GromovMatcher method presented here is to merge and compare untargeted metabolomes from different experiments. These larger datasets could then be used to advance biological analyses, for example, for the identification of metabolic disease markers. The main problem that complicates merging different experiments is m/z and RT vary slightly for the same feature (metabolite).

      The main idea behind the GromovMatcher is built on the assumption that if two features match between two datasets (that feature i from dataset 1 matches feature j from dataset 2, and feature k from dataset 1 matches feature l from dataset 2), then the correlations or distances between the two features within each of the datasets (i and k, and j and l) will be similar. The authors then use the Gromov-Wasserstein method to find the best matches matrix from these data.

      The variation in m/z between the same features in different experiments is a user-defined value and it is initially set to 0.01 ppm. There is no clear limit for RT deviations, so the method estimates a non-linear deviation (drift) of RT between two studies. GromovMatcher estimates the drift between the two studies and then discards the matching pairs where the drift would deviate significantly from the estimate. It learns the drift from a weighted spline regression.

      The authors validate the performance of their GromovMatcher method by a validation experiment using a dataset of cord blood. They use 20 different splits and compare the GromovMatcher (both its GM and GMT iterations, whereby the GMT version uses the deviation from estimated RT drift to filter the matching matrix) with two other matching methods: M2S and metabCombiner.

      The second validation was done using a (scaled and centered) dataset of metabolics from cancer datasets from the EPIC cohort that was manually matched by an expert. This dataset was also used to show that using automatic methods can identify more features that are associated with a particular group of samples than what was found by manual matching. Specifically, the authors identify additional features connected to alcohol consumption.

      Strengths:

      I see the main strength of this work in its combination of all levels of information (m/z, RT, and higher-order information on correlations between features) and using each of the types of information in a way that is appropriate for the measure. The most innovative aspect is using the Gromov-Wasserstein method to match the features based on distance matrices.

      The authors of the paper identify two main shortcomings with previously established methods that attempt to match features from different experiments: a) all other methods require fine-tuning of user-defined parameters, and, more importantly, b) do not consider correlations between features. The main strength of the GromovMatcher is that it incorporates the information on distances between the features (in addition to also using m/z and RT).

      Weaknesses:

      The first, minor, weakness I could identify is that there seem not to be plenty of manually curated datasets that could be used for validation. The second is also emphasized by the authors in the discussion. Namely, the method as it is set up now can be directly used only to compare two datasets.

    1. Reviewer #1 (Public Review):

      Summary:

      Liang et. al., uses a previously devised full isotope labeling of peptidoglycan followed by mass spec to study the kinetics of Lpp tethering to PG and the hydrolysis of this bond by YafK.

      Strengths:

      -The labeling and mass spec analysis technique works very well to discern differentially labelled Tri-KR muropeptide containing new and old Lpp and PG.

      Weaknesses:

      -Only one line of experimentation using mass spec based analysis of labeled PG-Lpp is used to make all conclusions in the paper. The evidence is also not enough to fully deleanate the role of YafK.<br /> -Only one mutant (YafK) is used to make the conclusion.<br /> -The paper makes a lot of 'implications' with minimal proof to support their hypothesis. Other lines of experimentations must be added to fully delineate their claims.<br /> -Time points to analyse Tri-KR isotopologues in Wt (0,10,20,40,60 min) and yafK mutant (0,15, 25, 40, 60 min) are not the same.<br /> -Experiments to define physiological role of YafK are also missing.

    2. Reviewer #2 (Public Review):

      Summary:

      The authors of this study have sought to better understand the timing and location of the attachment of the lpp lipoprotein to the peptidoglycan in E. coli, and to determine whether YafK is the hydrolase that cleaves lpp from the peptidoglycan.

      Strengths:

      The method is relatively straightforward. The authors are able to draw some clear conclusions from their results, that lpp molecules get cleaved from the peptidoglycan and then re-attached, and that YafK is important for that cleavage.

      Weaknesses:

      However, the authors make a few other conclusions from their data which are harder to understand the logic of, or to feel confident in based on the existing data. They claim that their 5-time point kinetic data indicates that new lpp is not substantially added to lipidII before it is added to the peptidoglycan, and that instead lpp is attached primarily to old peptidoglycan. I believe that this conclusion comes from the comparison of Fig.s 3A and 3C, where it appears that new lpp is added to old peptidoglycan a few minutes before new lpp is added to new peptidoglycan. However, the very small difference in the timing of this result, the minimal number of time points and the complete lack of any presentation of calculated error in any of the data make this conclusion very tenuous. In addition, the authors conclude that lpp is not significantly attached to septal peptidoglycan. The logic behind this conclusion appears to be based on the same data, but the authors do not provide a quantitative model to support this idea.

      This work will have a moderate impact on the field of research in which the connections between the OM and peptidoglycan are being studied in E. coli. Since lpp is not widely conserved in gram negatives, the impact across species is not clear. The authors do not discuss the impact of their work in depth.

    1. Reviewer #1 (Public Review):

      Guan et al. explored the mechanisms responsible for the development, maintenance, and functional properties of a specific subset of unconventional T cells expressing a Va3.2 T cell receptor that recognizes a peptide, QFL, presented by the class Ib protein Qa-1. Prior studies from this group showed that cells from mice deficient in the ER protease ERAAP elicit responses in wild-type animals enriched for Qa-1-restricted CD8 T cells. They further showed that a significant proportion of these responses were directed against the QFL peptide derived from a conserved protein with incompletely understood functions. Many of these so-called QFL T cells expressed Va3.2-Ja21, were present in the spleen of wild-type mice, and exhibited a memory-like phenotype. Due to their relatively low frequency and weak staining with Qa-1 tetramers, analyzing QFL T cells has been challenging. Therefore, the authors generated dextramers, which permitted them to more rigorously identify these cells. They confirmed some of their previous findings and further showed that Va3.2+ and Va3.2- QFL T cells were present in the intestinal epithelium, where they also express CD8alpha homodimers, a characteristic of most small intestinal intraepithelial lymphocytes (siIELs), and most similar to the so-called natural siIELs that acquire their innate functions in the thymus. The authors show that TAP but not Qa-1 or ERAAP expression are required for the development of these cells, and both Qa-1 and ERAAP are required for the natural siIEL phenotype. Some of these findings were confirmed using a new TCR transgenic mouse expressing the QFL TCR. They further show that retention but not homing of QFL T cells to the intestinal epithelium involves commensal microorganisms, and using in silico approaches, they identify a commensal that contains a peptide similar to QFL that can activate QFL T cells. Finally, they show that this organism, P. pentosaceus, can promote gut retention of QFL T cells when it is introduced into germ-free mice. From these findings, the authors conclude that the microbiota influence the maintenance of Qa-1-restricted T cells.

      Comments:

      1. The authors employ a number of new reagents and elegant approaches to explore the development, maintenance and functional properties of QFL T cells.<br /> 2. Generally, conclusions made are well supported by the data presented.<br /> 3. One limitation of the work is that the immunological functions of QFL T cells remain unclear.<br /> 4. In their revised manuscript, the authors present additional data that have appropriately addressed the reviewer comments.

    2. Reviewer #2 (Public Review):

      Summary: CD8+ QFL T cells recognize a peptide, FYAEATPML (FL9), presented on Erap1-deficient cells. QFL T cells are present at a high frequency in the spleen of naïve mice. They express an antigen-experienced phenotype, and about 80% express an invariant TCRα chain Vα3.2Jα21.

      Here, Guan and coll. report that QFL T cells are present not only in the spleen but also in the intestinal epithelium, where they display several phenotypic and functional peculiarities. The establishment of spleen and gut Vα3.2+ QFL T cells is TAP-dependent, and their phenotype is regulated by the presence/absence of Qa-1b and Erap1. Maintenance of gut Vα3.2+ QFL T cells depends on the gut microbiota and is associated with colonization by Pediococcus pentosaceus.

      Strengths:

      This article contains in-depth studies of a peculiar and interesting subset of unconventional CD8 T cells, based partly on generating two novel TCR-transgenic models.

      The authors discovered a clear relation between the gut microbiome and the maintenance of gut QFL T cells. One notable observation is that monocolonization of the gut with Pediococcus pentosaceus is sufficient to sustain gut QFL T cells.

      Weaknesses:

      In the absence of immunopeptidomic analyses, the presence or absence of the FL9 peptide on various cell types is inferred based on indirect evidence. Hence, whether the FL9 peptide is presented by some cells that express Qa-1b but not Erap1 remains unknown.

      Analyses of the homology between the FL9 and bacterial peptides were limited to two amino acid residues (P4 and P6). This limitation is mitigated in part by the justifications provided by the authors in the revised preprint.

      The potential function of QFL T cells remains elusive. The present article should provide an incentive for further functional studies.

    3. Reviewer #3 (Public Review):

      The authors investigate the role of commensal microbes and molecules in the antigen presentation pathway in the development and phenotype of CD8 T cells specific for the Qa-1b-restricted peptide FL9 (QFL). The studies track both endogenous QFL-specific T cells and utilize a recently generated TCR transgenic model. The authors confirm that QFL-specific T cells in the spleen and small intestine intraepithelial lymphocyte (IEL) pool show an antigen-experienced phenotype as well as unique phenotypic and innate-like functional traits, especially among CD8+ T cells expressing Va3.2+ TCRs. They find that deficiency in the TAP transporter leads to almost complete loss of QFL-specific T cells but that loss of either Qa1 or the ERAAP aminopeptidase does not impact QFL+ T cell numbers but does cause them to maintain a more conventional, naïve-like phenotype. In germ-free (GF) mice, the QFL-specific T cells are present at similar numbers and with a similar phenotype to SPF animals, but in older animals (>18w) there is a notable loss of IEL QFL-specific cells. This drop can be avoided by neonatal colonization of GF mice with the commensal microbe Pediococcus pentosaceus but not a different commensal, Lactobacillus johnsonii, and the authors show that P. pentosaceus encodes a peptide that weakly stimulates QFL-specific T cells, while the homologous peptide from L. johnsonii does not stimulate such cells.

      This study provides new insights into the way in which the differentiation, phenotype, and function of CD8+ T cells specific for Qa-1b/FL9 is regulated by peptide processing and Qa1 expression, and by interactions with the microbiota. The approaches are well designed, the data compelling, and the interpretation, for the most part, appropriate.

      The response to several of my concerns involved reference to a different manuscript from the authors (which has not been through peer review), and for point #3, it would have been useful to provide experimental evidence (e.g., competitive inhibition assays) to justify their hypothesis that P4 serves as a TCR contact while P6 may be a Qa-1b contact residue. Nevertheless, the authors have made considerable efforts to clarify their approaches and interpretation, which strengthens the manuscript.

    1. Reviewer #1 (Public Review):

      Summary:

      Khan et. al., investigated the functional redundancy of the non-canonical L-cysteine synthases of M. tuberculosis, CysM and CysK2, focussing on their role in mitigating the effects of host-derived stress. They found that while deletion mutants of the two synthases (Rv∆cysM, Rv∆cysK2) have similar transcriptomes under standard conditions, their transcriptional response to oxidative stress is distinct. The impact of deleting the synthases also differentially affected the pools of L-cysteine-derived metabolites. They show that the mutants (Rv∆cysM, Rv∆cysK2) have impaired survival in peritoneal macrophages and in a mouse model of infection. Importantly, they show that the survival of the mutants increases when the host is defective in producing reactive oxygen and nitrogen species, linking the phenotype to a defect in combating host-derived stress. Finally, they show that compounds inhibiting L-cysteine synthases reduce the intracellular survival of M. tuberculosis.

      Strengths:

      1. The distinct transcriptome of the Rv∆cysM and Rv∆cysK2 mutants in the presence of oxidative stress provides solid evidence that these mutants are distinct in their response to oxidative stress, and suggests that they are not functionally redundant.<br /> 2. The use of macrophages from phox-/- and INF-/- mice and an iNOS inhibitor for the intracellular survival assays provides solid evidence that the survival defect seen for the Rv∆cysM and Rv∆cysK2 mutants is related to their reduced ability to combat host-derive oxidative and nitrosative stress. This is further supported by the infection studies in phox-/- and INF-/- mice.

      Weaknesses:

      1. There are several previous studies looking at the transcriptional response of M. tuberculosis to host-derived stress, however, the authors do not discuss initial RNA-seq data in the context of these studies. Furthermore, while several of the genes in sulfur assimilation and L-cysteine biosynthetic pathway genes are upregulated by more than one stress condition, the data does not support the statement that it is the "most commonly upregulated pathway in Mtb exposed to multiple host-like stresses".<br /> 2. For the quantification of the metabolites, it isn't clear how the abundance was calculated (e.g., were standards for each metabolite used? How was abundance normalised between samples?), and this information should be included to strengthen the data. Furthermore, labelling with L-methionine was performed to determine the rate of synthesis of the L-cysteine-derived metabolites. L-cysteine is produced from L-methionine via the transsulfuration pathway, which is independent of CysM and CysK2. It is therefore difficult to interpret this experiment, as the impact of deleting CysM and CysK2 on the transsulfuration pathway is likely indirect.

      3. The ability of L-cysteine to rescue the survival defect of the Rv∆cysM and Rv∆cysK2 mutants in macrophages is interpreted as exogenous L-cysteine being able to compensate for reduced intracellular levels. However, there is no evidence that L-cysteine is being taken up by the mutants and an alternate explanation is that L-cysteine functions as an antioxidant within cells i.e., it reduces intracellular ROS.

      The authors sought to investigate the functional redundancy of the non-canonical L-cysteine synthases CysM and CysK2. While their distinct transcriptional response to oxidative stress suggests distinct physiological roles, the study did not explore these differences and therefore provides only preliminary insight into the underlying reasons for this observation. In the context of drug development, this work suggests that while L-cysteine synthase inhibitors do not have high potency for killing intracellular M. tuberculosis, they have the potential to decrease the pathogen's survival in the presence of host-derive stress.

    2. Reviewer #2 (Public Review):

      Summary:

      The paper examines the role L-cysteine metabolism plays in the biology of Mycobacterium tuberculosis. The authors have preliminary data showing that Mycobacterium tuberculosis has two unique pathways to synthesize cysteine. The data showing new compounds that act synergistically with INH is very interesting.

      Strengths:

      RNAseq data is interesting and important.

      Weaknesses:

      The paper would be strengthened if the authors were to add further detail to their genetic manipulations.

      The authors provide evidence that they have successfully made a cysK2 mutant by recombineering. This data looks promising, but I do not see evidence for the cysM deletion. It is also important to state what sort of complementation was done (multicopy plasmid, integration proficient vector, or repair of the deletion). Since these mutants are the basis for most of the additional studies, these details are essential. It is important to include complementation in mouse studies as unexpected loss of PDIM could have occurred.

    3. Reviewer #3 (Public Review):

      In this work, the authors conduct transcriptional profiling experiments with Mtb under various different stress conditions (oxidative, nitrosative, low pH, starvation, and SDS). The Mtb transcriptional responses to these stress conditions are not particularly new, having been reported extensively in the literature over the past ~20 years in various forms. A common theme from the current work is that L-cysteine synthesis genes are seemingly up-regulated by many stresses. Thus, the authors focused on deleting two of the three L-cysteine synthesis genes (cysM and cysK2) in Mtb to better understand the roles of these genes in Mtb physiology.

      The cysM and cysK2 mutants display fitness defects in various media (Sautons media, starvation, oxidative and nitrosative stress) noted by CFU reductions. Transcriptional profiling studies with the cysM and cysK2 mutants revealed that divergent gene signatures are generated in each of these strains under oxidative stress, suggesting that cysM and cysK2 have non-redundant roles in Mtb's oxidative stress response which likely reflects the different substrates used by these enzymes, CysO-L-cysteine and O-phospho-L-serine, respectively. Note that these studies lack genetic complementation and are thus not rigorously controlled for the engineered deletion mutations.

      The authors quantify the levels of sulfur-containing metabolites (methionine, ergothioneine, mycothiol, mycothionine) produced by the mutants following exposure to oxidative stress. Both the cysM or cysK2 mutants produce more methionine, ergothioneine, and mycothionine relative to WT under oxidative stress. Both mutants produce less mycothiol relative to WT under the same condition. These studies lack genetic complementation and thus, do not rigorously control for the engineered mutations.

      Next, the mutants were evaluated in infection models to reveal fitness defects associated with oxidative and nitrosative stress in the cysM or cysK2 mutants. In LPS/IFNg activated peritoneal macrophages, the cysM or cysK2 mutants display marked fitness defects which can be rescued with exogenous cysteine added to the cell culture media. Peritoneal macrophages lacking the NADPH oxidase (Phox) or IFNg fail to produce fitness phenotypes in the cysM or cysK2 mutants suggesting that oxidative stress is responsible for the phenotypes. Similarly, chemical inhibition of iNOS partly abrogated the fitness defect of the cysM or cysK2 mutants. Similar studies were conducted in mice lacking IFNg and Phox establishing that cysM or cysK2 mutants have fitness defects in vivo that are dependent on oxidative and nitrosative stress.

      Lastly, the authors use small molecule compounds to inhibit cysteine synthases. It is demonstrated that the compounds display inhibition of Mtb growth in 7H9 ADC media. No evidence is provided to demonstrate that these compounds are specifically inhibiting the cysteine synthases via "on-target inhibition" in the whole Mtb cells. Additionally, it is wrongly stated in the discussion that "combinations of L-cys synthase inhibitors with front-line TB drugs like INH, significantly reduced the bacterial load inside the host". This statement suggests that the INH + cysteine synthase inhibitor combinations reduce Mtb loads within a host in an infection assay. No data is presented to support this statement.

    1. Reviewer #1 (Public Review):

      Summary:<br /> In this paper by Zhang, the authors build a physical framework to probe the mechanisms that underlie the exchange of molecules between coexisting dense and dilute liquid-like phases of condensates. They first propose a continuum model, in the context of a FRAP-like experiment where the fluorescently labeled molecules inside the condensate are bleached at t=0 and the recovery of fluorescence is measured. Through this model, they identify how the key timescales of internal molecular mixing, replenishment from dilute phase, and interface transfer contribute to molecular exchange timescale. Motivated by a recent experiment reported by some of the co-authors previously (Brangwynne et al. in 2019) finding strong interfacial resistance in in-vitro protein droplets of LAF-1, they seek to understand the microscopic features contributing to the interfacial conductance (inversely proportional to the resistance). To check, they perform coarse-grained MD simulations of sticker-spacer self-associative polymers and report how conductance varies significantly even across the few explored sequences. Further, by looking at individual trajectories, they postulate that "bouncing" - i.e., molecules that approach the interface but are not successfully absorbed - is a strong contributor to this mass transfer limitation. Consistent with their predictions, sequences that have more free unbound stickers (i.e., for example through imbalance sequence sticker stoichiometries) have higher conductances and they show a simple linear scaling between the number of unbound stickers and conductance. Finally, they predict a droplet-size-dependent transition in recovery time behavior.

      Strengths:<br /> 1. This paper is well-written overall and clear to understand.

      2. By combining coarse-grained simulations, continuum modeling, and comparison to published data, the authors provide a solid picture of how their proposed framework relates to molecular exchange mechanisms that are dominated by interface resistance and LAF-1 droplets.

      3. The choice of different ways to estimate conductance from simulation and reported data are thoughtful and convincing in their near agreement (although a little discussion of why and when they differ would be merited as well).

      Weaknesses:<br /> 1. Almost the entirety of this paper is motivated by a previously reported FRAP experiment on a particular LAF-1 droplet in vitro. There are a few major concerns I have with how the original data is used, how these results may generalize, and the lack of connection of predictions with any other experiments (published or new).

      a. The mean values of cdense, cdilute, diffusivities, etc. are taken from Taylor et al. to rule in the importance of interfacial mass transfer limits. While this may be true, the values originally inferred (in the 2019 paper that this paper is strongly built off) report extremely large confidence intervals/inferred standard errors. The authors should accordingly report all their inferences with correct standardized errors or confidence intervals, which in turn, allow us to better understand these data.

      b. The generalizability of this model is hard to gauge when all comparisons are made to a single experiment reported in a previous paper.<br /> i. Conceptually, the model is limited to single-component sticker-spacer polymers undergoing phase separation which is already a very simplified model of condensates - for e.g., LAF1 droplets in the cell have no perceptible interfacial mass limitations, also reported in Taylor et al. 2019 - so how these mechanisms relate to living systems as opposed to specific biochemistry experiments. So the authors need to discuss the implications and limitations of their model in the living context where there are multiple species, finite-size effects, and active processes at play.

      ii. Second, can the authors connect their model to make predictions of the impact of perturbations to LAF-1 on exchange timescales? For example, are mutants (which change the number or positioning of "stickers") expected to show particular trends in conductances or FRAP timescales? Since LAF-1 is a relatively well-studied protein in vitro, can the authors further contrast their expectations with already published datasets that explore these perturbations, even if they don't generate new data?

      iii. A key prediction of the interface limitation model is the size-dependent crossover in FRAP dynamics. Can the authors reanalyze published data on LAF-1 (albeit of different-size droplets) to check their predictions? At the least, is the crossover radius within experimentally testable limits?

      c. The authors nicely relate the exchange timescale to various model parameters. Is LAF-1 the only protein for which the various dilute/dense concentrations/diffusivities are known? Given the large number of FRAP and other related studies, can the authors report on a few other model condensate protein systems? This will help broaden the reach of this model in the context of other previously reported data. If such data are lacking, a discussion of this would be important.

      2. The reported sticker-spacer simulations, while interesting, represent a very small portion of the parameter space. Can the authors - through a combination of simulation, analyses, or physical reasoning, comment on how the features of their underlying microscopic model (sequence length, implicit linker length, relative stoichiometry of A/B for a given length, overall concentration, sequence pattern properties like correlation length) connect to conductance? This will provide more compelling evidence relating their studies beyond the cursory examination of handpicked sequences. A more verbose description of some of the methods would be appreciated as well, including specifically how to (a) calculate the bond lifetime of isolated A-B pair, and (b) how equilibration/convergence of MD simulations is established.

      3. A lot of the main text repeats previously published models (continuum ones in Taylor et al. 2019 and Hubsatch et al., 2021, amongst others) and the idea of interface resistance being limiting was already explored quantitatively in Taylor 2019 (including approximate estimates of mass transfer limitations) - this is fine in context. While the authors do a good job of referring to past work in context, the main results of this paper, in my reading, are:<br /> - a simplified physical form relating conductance timescales.<br /> - sticker-spacer simulations probing microscopic origins.<br /> - analysis of size-dependent FRAP scaling.

      I am stating this not as a major weakness, but, rather - I would recommend summarizing and categorizing the sections to make the distinctions between previously reported work and current advances sufficiently clear.

    2. Reviewer #2 (Public Review):

      Summary:<br /> In this paper, the authors have obtained an analytical expression that provides intuition about regimes of interfacial resistance that depend on droplet size. Additionally, through simulations, the authors provide microscopic insight into the arrangement of sticky and non-sticky functional groups at the interface. The authors introduce bouncing dynamics for rationalizing quantity recovery timescales.

      I found several sections that felt incomplete or needed revision and additional data to support the central claim and make the paper self-contained and coherent.

      First, the analytical theory operates with diffusion coefficients for dilute and dense phases. For the dilute phase, this is fine. For the dense phase, I have doubts that dynamics can be described as diffusive. Most likely, dynamics is highly subdiffusive due to crowded, entangled, and viscoelastic environments of densely packed interactive biomolecules. Some explanation and justification are in order here.

      The second major issue is that I did not find a clean comparison of simulations with the derived analytical expression. Simulations test various microscopic properties on the value of k, which is important. But how do we know that it is the same quantity that appears in the expressions? Also, how can we be sure that analytical expressions can guide simulations and experiments as claimed? The authors should provide sound evidence of the predictive aspect of their derived expressions.

      Are the plots in Figure 4 coming from experiment, theory, and simulation? I could not find any information either in the text or in the caption.

    1. Reviewer #1 (Public Review):

      Summary:<br /> This manuscript explores the importance of food type on virus infection dynamics using a nematode virus as a model system. The authors demonstrate that susceptibility to viral infection can change by several orders of magnitude based on the type of bacterial food that potential hosts consume. They go on to show that, for the bacterial food source that reduces susceptibility, the effect is modulated by quorum sensing molecules that the bacteria produce.

      Strengths:<br /> This manuscript shows convincingly that nematode susceptibility to viral infection changes by several orders of magnitude (i.e. doses must be increased by several orders of magnitude to infect the same fraction of the population) depending on the bacterial food source on which hosts are reared. The authors then focus on the bacteria that reduce host susceptibility to viral infection and demonstrate that certain bacterial quorum-sensing compounds are required to see this effect of reduced susceptibility. Overall, sample sizes are large, methods are generally rigorous, experiments are repeated, and patterns are clear.

      Weaknesses:<br /> Although the molecular correlate of reduced susceptibility is identified (i.e. quorum sensing compounds) the mechanisms underlying this effect are missing. For example, there are changes in susceptibility due to altered nutrition, host condition, the microbiome, feeding rate, mortality of infected hosts, etc. In addition, the authors focus almost entirely on the reduction in susceptibility even though I personally find the increased susceptibility generated when reared on Ochrobactrum to be much more exciting.

      I was a bit surprised that there was no data on basic factors that could have led to reductions in susceptibility. In particular, data on feeding rates and mortality rates seem really important. I would expect that feeding rates are reduced in the presence of Pseudomonas. Reduced feeding rates would translate to lower consumed doses, and so even though the same concentration of virus is on a plate, it doesn't mean that the same quantity of virus is consumed. Likewise, if Pseudomonas is causing mortality of virus-infected hosts, it could give the impression of lower infection rates. Perhaps mortality rates are too small in the experimental setup to explain this pattern, but that isn't clear in the current version of the manuscript. Is mortality greatly impacted by knocking out quorum-sensing genes? Also, the authors explored susceptibility to infection, but completely ignored variation in virus shedding.

      I was also curious why the authors did not further explore the mechanism behind the quorum-sensing effect. Not sure whether this is possible, but would it be possible to add spent media to the infection plates where the spent media was from Pseudomonas that produce the quorum sensing compound but the plates contain OOP50, Pseudomonas, or the quorum sensing knockout of Pseudomonas? That would reveal whether it is the compound itself vs. something that the compound does.

      In addition, I was surprised by how much focus there was on the attenuation of infection and how little there was on the enhancement of infection. To me, enhancement seems like the more obvious thing to find a mechanism for -- is the bacteria suppressing immunity, preventing entry to gut cells, etc?

      I was a bit concerned about the "arbitrary units", which were used without any effort to normalize them. David Wang and Hongbing Jiang have developed a method based on tissue culture infectious dose 50 (TCID50) that can be used to measure infectious doses in a somewhat repeatable way. Without some type of normalization, it is hard to imagine how this study could be repeated. The 24-hour time period between exposure and glowing suggests very high doses, but it is still unclear precisely how high. Also, it is clear that multiple batches of virus were used in this study, but it is entirely unclear how variable these batches were.

      The authors in several places discuss high variability or low variability in incidence as though it is a feature of the virus or a feature of the host. It isn't. For infection data (or any type of binomial data) results are highly variable in the middle (close to 50% infection) and lowly variable at the ends (close to 0% or 100% infection). This is a result that is derived from a binomial distribution and it should not be taken as evidence that the bacteria or the host affect randomness. If you were to conduct dose-response experiments, on any of your bacterial food source treatments, you would find that variability is lowest at the extremely high and extremely low doses and it is most variable in the middle when you are at doses where about 50% of hosts are infected.

    2. Reviewer #2 (Public Review):

      Summary and Major Findings/Strengths:

      Across diverse hosts, microbiota can influence viral infection and transmission. C. elegans is naturally infected by the Orsay virus, which infects intestinal cells and is transmitted via the fecal-oral route. Previous work has demonstrated that host immune defense pathways, such as antiviral RNAi and the intracellular pathogen response (IPR), can influence host susceptibility to virus infection. However, little is known about how bacteria modulate viral transmission and host susceptibility.

      In this study, the authors investigate how diverse bacterial species influence Orsay virus transmission and host susceptibility in C. elegans. When C. elegans is grown in the presence of two Ochrobactrum species, the authors find that animals exhibit increased viral transmission, as measured by the increased proportion of newly infected worms (relative to growth on E. coli OP50). The presence of the two Ochrobactrum species also resulted in increased host susceptibility to the virus, which is reflected by the increased fraction of infected animals following exposure to the exogenous Orsay virus. In contrast, the presence of Pseudomonas lurida MYb11, as well as Pseudomonas PA01 or PA14, attenuates viral transmission and host susceptibility relative to E. coli OP50. For growth in the presence of P. aeruginosa PA01 and PA14, the attenuated transmission and susceptibility are suppressed by mutations in regulators of quorum sensing and the gacA two-component system. The authors also identify six virulence genes in P. aeruginosa PA14 that modulate host susceptibility to virus and viral transmission, albeit to a lesser extent. Based on the findings in P. aeruginosa, the authors further demonstrate that deletion of the gacA ortholog in P. lurida results in loss of the attenuation of viral transmission and host susceptibility.

      Taken together, these findings provide important insights into the species-specific effects that bacteria can have on viral infection in C. elegans. The authors also describe a role for Pseudomonas quorum sensing and virulence genes in influencing viral transmission and host susceptibility.

      Major weaknesses:

      The manuscript has several issues that need to be addressed, such as insufficient rigor of the experiments performed and questions about the reproducibility of the data presented in some places. In addition, confounding variables complicate the interpretations that can be made from the authors' findings and weaken some of the conclusions that are stated in the manuscript.

      1. The authors sometimes use pals-5p::GFP expression to indicate infection, however, this is not necessarily an accurate measure of the infection rate. Specifically, in Figures 4-6, the authors should include measurements of viral RNA, either by FISH staining or qRT-PCR, to support the claims related to differences in infection rate.

      2. In several instances, the experimental setup and presentation of data lack sufficient rigor. For example, Fig 1D and Fig 2B only display data from one experimental replicate. The authors should include information from all 3 experimental replicates for more transparency. In Fig 3B, the authors should include a control that demonstrates how RNA1 levels change in the presence of E. coli OP50 for comparison with the results showing replication in the presence of PA14. In order to support the claim that "P. aeruginosa and P. lurida MYb11 do not eliminate Orsay virus infection", the authors should also measure RNA1 fold change in the presence of PA01 and P. lurida in the context of exogenous Orsay virus. Additionally, the authors should standardize the amount of bacteria added to the plate and specify how this was done in the Methods, as differing concentrations of bacteria could be the reason for species-specific effects on infection.

      3. The authors should be more careful about conclusions that are made from experiments involving PA14, which is a P. aeruginosa strain (isolated from humans), that can rapidly kill C. elegans. To eliminate confounding factors that are introduced by the pathogenicity of PA14, the authors should address how PA14 affects the health of the worms in their assays. For example, the authors should perform bead-feeding assays to demonstrate that feeding rates are unaffected when worms are grown in the presence of PA14. Because Orsay virus infection occurs through feeding, a decrease in C. elegans feeding rates can influence the outcome of viral infection. The authors should also address whether or not the presence of PA14 affects the stability of viral particles because that could be another trivial reason for the attenuation of viral infection that occurs in the presence of PA14.

    1. Joint Public Review:

      Summary:<br /> In this interesting work, the authors investigated an important topical question: when we see travelling waves in cortical activity, is this due to true wave-like spread, or due to sequentially activated sources? In simulations, it is shown that sequential brain module activation can show up as a travelling wave - even in improved methods such as phase delay maps - and a variety of parameters is investigated. Then, in ex-vivo turtle eye-brain preparations, the authors show that visual cortex waves observable in local field potentials are in fact often better explained as areas D1 and D2 being sequentially activated. This has implications for how we think about travelling wave methodology and relevant analytical tools.

      Strengths:<br /> I enjoyed reading the discussion. The authors are careful in their claims, and point out that some phenomena may still indeed be genuine travelling waves, but we should have a higher evidence bar to claim this for a particular process in light of this paper and Zhigalov & Jensen (2023) (ref 44). Given this careful discussion, the claims made are well-supported by the experimental results. The discussion also gives a nice overview of potential options in light of this and future directions.

      The illustration of different gaussian covariances leading to very different latency maps was interesting to see.

      Furthermore, the methods are detailed and clearly structured and the Supplementary Figures, particularly single trial results, are useful and convincing.

      Weaknesses:<br /> The details of the sequentially activated Gaussian simulations give some useful results, but the fundamental idea still appears to be "sequential activation is often indistinguishable from a travelling wave", an idea advanced e.g. by Zhigalov & Jensen (2023). It takes a while until the (in my opinion) more intriguing experimental results.

      One of the key claims is that the spikes are more consistent with two sequentially activated modules rather than a continuous wave (with Fig 3k and 3l key to support this). Whilst this is *more* consistent, it is worth mentioning that there seems to be stochasticity to this and between-trial variability, especially for spikes.

    1. Reviewer #3 (Public Review):

      Summary:<br /> Here, Osnes et al examine the population dynamics of Neisseria gonorrhoeae. They develop new methodologies to deal with the issue of recombination, as well as using ancestral state reconstruction approaches to quantify the number of import and export transmission events occurring in different regions in the world. Overall, they provide a framework for understanding intercontinental transmission that could be applied to other microbial pathogens.

      Strengths:<br /> A major strength of this study is the incredibly large number of genomes analysed, which span a wide temporal range with significant geographical diversity. The use of ancestral state reconstruction to quantitatively determine the number of import and export events of N. gonorrhoeae in densely sampled Norway and Victoria, Australia, is an interesting application of this well-known method and could be applied to other bacterial species that have been well-sampled.

      Weaknesses:<br /> The methods development to deal with the issue of recombination in their dataset to ensure that the recombination signal does not affect their dating estimates and effective population size analysis is thorough but has likely not been able to remove all bias. Additionally, the authors discuss the utility of using the identified transmission lineages in this study to better type N. gonorrhoeae as there are issues with traditional typing, such as MLST, due to the highly recombinogenic nature of this species. However, no method seems to be provided to enable future researchers to easily assign their genomes to the transmission lineages identified in this study.

    2. Reviewer #1 (Public Review):

      Summary:<br /> Osnes et al., describe a large and impressive study of the population and transmission dynamics of Neisseria gonorrhoeae using a global dataset of 9,732 genomes. This included dense sampling from Norway and the state of Victoria, Australia. Understanding the transmission dynamics of this increasingly drug-resistant pathogen is crucial for designing optimal public health interventions. This study provides useful insights into the differing transmission dynamics between two well-sampled populations.

      The authors have also developed novel techniques to address the size and complexity of the dataset, including an approach to account for recombination when building large phylogenies. While the authors have made significant efforts to account for sampling biases in the data, it is not clear that this has been sufficient to address the problem. The use of non-standard analysis techniques also requires further validation.

      Strengths:<br /> The size of the dataset and the comparisons between densely sampled regions are major strengths of the manuscript. While sampling biases may limit the generalizability of the results, as acknowledged by the authors, the characterization of local and inter-country transmission, will help to inform future studies into N. gonorrhoeae.

      Weaknesses:<br /> Sampling bias:<br /> The authors have gone to considerable efforts to acknowledge and account for biases in the sampling between different locations. Despite this, comparisons are still frequently made in the manuscript between populations with very different sampling profiles, which are likely to dominate the import, export and local transmission signals.

      To determine the sensitivity of their results to sampling, the authors randomly took subsamples of each population at varying sizes. While this would address issues with the overall number of genomes being considered, it is not obvious that it would account for biases in sampling including the differing dates over which each population was sampled.<br /> Randomly subsampling tips of the tree is unlikely to change the overall population structure of each dataset much. For example, subsampling a single outbreak would result in highly similar genomes each time. Subsampling clades would provide a better indication of how sensitive the results are to particular clusters within each population. Simulations would also help to determine under what conditions the inferred asymptotes for import and export fractions are likely to hold.

      The text states that Europe and the USA have 'older' transmission lineages than Norway and Victoria. Norway is also found to export more lineages than Victoria, which is likely to be heavily influenced by biases in the distribution of the 'rest of world' samples. Although the impact of sampling bias is acknowledged by the authors in cases such as these, it would be better to avoid making direct comparisons in the first place.

      Recombination detection and filtering:<br /> The authors introduce a novel pipeline for masking recombination before building phylogenetic trees, based on randomly subsetting the dataset and running the Gubbins algorithm. While I appreciate it is challenging to account for recombination in a dataset of this size, further verification needs to be done to demonstrate the effectiveness of this approach.

      In particular, this approach resulted in ~ 10% of sites being filtered out from a diverse set of genomes. This is considerably less than a previous publication that considered ~400 diverse gonococcal genomes, where just under 50% of sites were removed using the Gubbins algorithm (Sánchez-Busó et al., 2019).

      One reason for this is that the new approach requires recombination events called by Gubbins to meet additional filtering requirements before they are masked from the alignment. This may exclude rarer recombination events, which could subsequently impact the length of branches in the final phylogeny.

      Transmission clustering:<br /> The use of LineageHomology and ancestral state reconstruction to determine transmission clusters may be susceptible to biases in sampling between locations. As noted by the authors, locations with sparse sampling, such as the USA, are likely to have older ancestral nodes that are exclusive to that location. Biases in the sampling of countries that transmit to and from each location will also heavily impact the size of the inferred clusters.

      This could potentially explain the occurrence of larger 'mixed outbreaks' in Victoria when compared to Norway, as these clusters may be older and driven by a lack of observed isolates in the 'rest of world' subset.

      While it would not solve the problem entirely, a SNP-based cut-off as used in the original study of Victorian isolates by Williamson et al., is less likely to be as heavily biased.

      Import and export estimates:<br /> Using LineageHomology to define import and export estimates may have a similar problem with sampling biases. This is acknowledged by the authors and nicely described in Supplementary Figure 6. The authors make a comparison with the analysis of SARS-CoV-2 genomes by du Plessis et. al. (2021). However, in the analysis of SARS-CoV-2, the sampling times were far more consistent than those observed in the gonococcal dataset. To address this, the authors could compare their results to an analysis restricted to samples observed in a similar time period. This could most easily be achieved by cutting the inferred phylogeny at a particular date and re-running the LineageHomology analyses.

    3. Reviewer #2 (Public Review):

      Summary:<br /> This work analyses the historical spread and evolution, termed 'population dynamics', of a human bacterial pathogen, Neisseria gonorrhoeae, the cause of the sexually transmitted infection, gonorrhoea. N. gonorrhoeae is classified as a high priority pathogen by the World Health Organisation, due to infections numbering in the tens of millions annually, with high levels of antibiotic resistance and no vaccine available, meaning treating and preventing infections is becoming increasingly more difficult. To implement interventions effectively, important resistant lineages and their transmission routes must be identified on a national and international level.

      In this work, Osnes et al. use genomic data, coupled with geographic, temporal and demographic metadata, to analyse the global population dynamics N. gonorrhoeae using 9,732 genomes. The study also includes a granular analysis of transmission between and within four regions of different sizes with high levels of data coverage: USA, Europe, Norway, and Victoria state in Australia.<br /> The authors built a phylogenetic tree including all genomes using a novel computationally efficient method for removing genome regions resulting from recombination, which would otherwise result in incorrect branch lengths and tree topology. Using the tree, the authors show that the effective population size of N. gonorrhoeae, describing population size and diversity, decreased in the period from 2010 to present day, and was not entirely an artefact of sampling bias. The authors then stratified the tree based on isolates that contained alleles that are associated with resistance to antibiotics commonly used to treat gonorrhoea. The authors found resistance was associated with particular lineages, of which most, but not all, underwent shrinking in effective population size in the last decade.<br /> Using the tree, the authors then inferred likely importation, exportation, and local transmission events, finding notable differences in the contribution of imports to local incidence between locations, as well as the likelihood of exportation. As inference of these events relies on sampling density, the authors used a novel method for identifying whether sampling was representative of the population diversity of a given location. Using this approach, they found that the densely sampled regions, Norway and Victoria, were likely representative of the local N. gonorrhoeae population diversity, whilst the larger, less densely sampled regions, Europe and USA, were not. Finally, they investigated the contribution of specific transmission networks to the spread gonorrhoea, finding that the frequency of males within a transmission network may play a role in the rate of N. gonorrhoeae transmission in Norway, but not Victoria.<br /> This work introduces several novel approaches to the analysis of pathogen population dynamics, and highlights notable differences in N. gonorrhoeae transmission between and within distinct geographic locations.

      Strengths:<br /> • The authors have collated a large global collection of N. gonorrhoeae genomes with associated metadata, and in some cases generated assemblies themselves. A dataset of this size and detail is a valuable asset to the public health community, enabling analysis of both national and international population dynamics.<br /> • The stratification of the phylogenetic tree by antimicrobial resistance gene alleles enables the study of how antibiotic usage has shaped global and regional N. gonorrhoeae populations. Analysis of changes in the effective population size of clades harbouring resistance alleles is particularly impactful, as this can be used to show how changes in treatment patterns affect the growth or decline of drug-resistant pathogen populations. This analysis also enables the determination of the frequency of multiple resistance alleles being present in single isolates, important for determining the scale of multidrug resistance within the N. gonorrhoeae global population.<br /> • The use of ancestral trait reconstruction to quantify importation, exportation and local transmission is an important contribution to public health efforts tackling N. gonorrhoeae spread. Understanding the differences in transmission networks within and between different geographic locations provides public health researchers with crucial information to model and implement effective targeted interventions on regional and international scales.

      Weaknesses:<br /> • The method used to generate the phylogenetic tree and mask regions of recombination is likely flawed. The authors repeatedly down-sampled the whole population to 500 genomes, using Gubbins to identify regions that have recombined and therefore would not follow the clonal history of the N. gonorrhoeae population. This small sample size will result in the same ancient internal nodes being sampled repeatedly, whilst more recent internal nodes will not. Therefore, more recent recombination events would not be identified by this method and were therefore likely included in the whole genome alignment used to build the tree. Furthermore, Gubbins was designed to identify recombination between closely related genomes, not across a whole species, where the background mutation rate will be too high to differentiate between recombined regions and the clonal frame. Both of these factors will mean that the amount of the genome predicted to have recombined will likely be underestimated, resulting in inflated branch lengths and incorrect tree topology. This effect is potentially the cause of the observed drop in N. gonorrhoeae effective population size between 2010-present day in Figure 2, which does not align with gonorrhoea incidence, and the elevated estimated mutation rate of 7.41x10-6 substitutions per site per year, which is higher than previous estimates based on N. gonorrhoeae global populations. The result of underestimation of recombined regions will be two-fold. Inclusion of recombined regions in the alignment will result in inflated branch lengths, which will impact all estimates of effective population size in the study. Furthermore, tree topology may be incorrect, which will impact ancestral trait reconstruction and result in incorrect inference of import, export and local transmission events in Figures 3, 4 and 5. Additionally, the clade-specific resistance gene analyses will be affected in Figure 2, as certain isolates may be incorrectly included or excluded within stratified clades. Therefore, the conclusions made about the changes in effective population size for the global population, and individual clades, as well as the differences in transmission dynamics between locations, are likely to be incorrect.<br /> • The method used to identify sampling bias, shown in Figure 4, is a novel and interesting take on the problem. However, it is not clear whether the effect being measured is the presence of sampling bias or an artefact of differences in N. gonorrhoeae diversity between locations. The results in Figure 4 do align with what is known about the population datasets; the data from Norway and Victoria is more comprehensive than that of the USA and Europe due to the difference in size of the respective human populations, meaning the likelihood of sampling bias will be lower in the smaller population. However, with increased human population size, we would also expect a greater amount of pathogen diversity, due to increased within-region transmission and greater numbers of importation events. Supporting this, we see in Figure 3 that the transmission lineages in the USA and Europe are estimated to have emerged earlier than Norway or Victoria, indicative of a greater amount of standing population diversity. Therefore, the reason why convergence is observed when up-sampling from smaller populations may be because a vast majority of isolates will sit within a small part of the tree, whilst from a larger, more diverse population, isolates will be placed all across the tree and so convergence will never be observed. In effect, it is unknown whether increasing the sample size of the USA and Europe to be truly representative of their respective N. gonorrhoeae populations would ever result in convergence between the two methods of up-sampling. Testing this method using simulations could be used to determine whether it is sensitive to sampling bias, or population diversity.<br /> • In Figure 5, a significant difference in transmission lineage size was only found between male-dominated and mixed lineages in Norway and not Victoria. Therefore, the conclusion that sex distribution within transmission networks affects the size of transmission lineages is not supported by the data, and could also be due to geographical and other demographic differences between the datasets which were not accounted for.

    1. Joint Public Review:

      Randomized clinical trials use experimental blinding and compare active and placebo conditions in their analyses. In this study, Fassi and colleagues explore how individual differences in subjective treatment (i.e., did the participant think they received the active or placebo treatment) influence symptoms and how this is related to objective treatment. Authors address this highly relevant and interesting question using a powerful method by (re-)analyzing data from four published neurostimulation studies and including subjective treatment in statistical models explaining treatment response. The major strengths include the innovative and important research question, the inclusion of four different studies with different techniques and populations to address this question, sound statistical analyses, and findings that are of high interest and relevance to the field.

      The paper will have significant impact on the field. It will promote further investigation of the effects of sham vs active treatment by the introduction of the terms subjective treatment vs objective treatment and subjective dosage that can be used consistently in the future. The suggestions to assess the expectation of sham vs active earlier on in clinical trials will advance the understanding of subjective treatment in future studies. Overall, I believe the data will substantially contribute to the design and interpretation of future clinical trials by underscoring the importance of subjective treatment.

    1. Reviewer #1 (Public Review):

      This work seeks to understand how behaviour-related information is represented in the neural activity of the primate motor cortex. To this end, a statistical model of neural activity is presented that enables a non-linear separation of behaviour-related from unrelated activity. As a generative model, it enables the separate analysis of these two activity modes, here primarily done by assessing the decoding performance of hand movements the monkeys perform in the experiments. Several lines of analysis are presented to show that while the neurons with significant tuning to movements strongly contribute to the behaviourally-relevant activity subspace, less or un-tuned neurons also carry decodable information. It is further shown that the discovered subspaces enable linear decoding, leading the authors to conclude that motor cortex read-out can be linear.

      Strengths:

      In my opinion, using an expressive generative model to analyse neural state spaces is an interesting approach to understanding neural population coding. While potentially sacrificing interpretability, this approach allows capturing both redundancies and synergies in the code as done in this paper. The model presented here is a natural non-linear extension of a previous linear model (PSID) and

      Weaknesses:

      First, the model in the paper is almost identical to an existing VAE model (TNDM) that makes use of weak supervision with behaviour in the same way [1]. This paper should at least be referenced. If the authors wish they could compare their model to TNDM, which combines a state space model with smoothing similar to LFADS. Given that TNDM achieves very good behaviour reconstructions, it may be on par with this model without the need for a Kalman filter (and hence may achieve better separation of behaviour-related and unrelated dynamics).

      Second, in my opinion, the claims regarding identifiability are overstated - this matters as the results depend on this to some extent. Recent work shows that VAEs generally suffer from identifiability problems due to the Gaussian latent space [2]. This paper also hints that weak supervision may help to resolve such issues, so this model as well as TNDM and CEBRA may indeed benefit from this. In addition however, it appears that the relative weight of the KL Divergence in the VAE objective is chosen very small compared to the likelihood (0.1%), so the influence of the prior is weak and the model may essentially learn the average neural trajectories while underestimating the noise in the latent variables. This, in turn, could mean that the model will not autoencode neural activity as well as it should, note that an average R2 in this case will still be high (I could not see how this is actually computed). At the same time, the behaviour R2 will be large simply because the different movement trajectories are very distinct. Since the paper makes claims about the roles of different neurons, it would be important to understand how well their single trial activities are reconstructed, which can perhaps best be investigated by comparing the Poisson likelihood (LFADS is a good baseline model). Taken together, while it certainly makes sense that well-tuned neurons contribute more to behaviour decoding, I worry that the very interesting claim that neurons with weak tuning contain behavioural signals is not well supported.

      Third, and relating to this issue, I could not entirely follow the reasoning in the section arguing that behavioural information can be inferred from neurons with weak selectivity, but that it is not linearly decodable. It is right to test if weak supervision signals bleed into the irrelevant subspace, but I could not follow the explanations. Why, for instance, is the ANN decoder on raw data (I assume this is a decoder trained fully supervised) not equal in performance to the revenant distilled signals? Should a well-trained non-linear decoder not simply yield a performance ceiling? Next, if I understand correctly, distilled signals were obtained from the full model. How does a model perform trained only on the weakly tuned neurons? Is it possible that the subspaces obtained with the model are just not optimally aligned for decoding? This could be a result of limited identifiability or model specifics that bias reconstruction to averages (a well-known problem of VAEs). I, therefore, think this analysis should be complemented with tests that do not depend on the model.

      Finally, a more technical issue to note is related to the choice to learn a non-parametric prior instead of using a conventional Gaussian prior. How is this implemented? Is just a single sample taken during a forward pass? I worry this may be insufficient as this would not sample the prior well, and some other strategy such as importance sampling may be required (unless the prior is not relevant as it weakly contributed to the ELBO, in which case this choice seems not very relevant). Generally, it would be useful to see visualisations of the latent variables to see how information about behaviour is represented by the model.

      Summary:

      This paper presents a very interesting analysis, but I have several concerns as to well the analysis supports the main conclusions. I think the work could benefit from an additional complementary analysis that seeks to confirm with another method if weakly tuned neurons indeed show an encoding that differs qualitatively from the strongly tuned ones.

      [1] Hurwitz, Cole, et al. "Targeted neural dynamical modeling." Advances in Neural Information Processing Systems 34 (2021): 29379-29392.<br /> [2] Hyvarinen, Aapo, Ilyes Khemakhem, and Hiroshi Morioka. "Nonlinear Independent Component Analysis for Principled Disentanglement in Unsupervised Deep Learning." arXiv preprint arXiv:2303.16535 (2023).

    2. Reviewer #2 (Public Review):

      Li et al present a method to extract "behaviorally relevant" signals from neural activity. The method is meant to solve a problem which likely has high utility for neuroscience researchers. There are numerous existing methods to achieve this goal some of which the authors compare their method to, though there are notable omissions. However, I do believe that d-VAE is a promising approach that has its own advantages.

      That being said, there are issues with the paper as-is. This could have been a straightforward "methods" paper describing their approach and validating it on different ground truth and experimental datasets. Instead, the authors focus on the neuroscientific results and their implications for brain mechanisms. Unfortunately, while the underlying method seems sound and performs well relative to the assessed competition, the scientific results and presentation they put forward were not sufficiently strong to support these claims, especially given the small amount of data (recordings of one monkey per task, with considerable variability between them).

      Specific comments<br /> - Is the apparently increased complexity of encoding vs decoding so unexpected given the entropy, sparseness, and high dimensionality of neural signals (the "encoding") compared to the smoothness and low dimensionality of typical behavioural signals (the "decoding") recorded in neuroscience experiments? This is the title of the paper so it seems to be the main result on which the authors expect readers to focus.

      - I take issue with the premise that signals in the brain are "irrelevant" simply because they do not correlate with a fixed temporal lag with a particular behavioural feature hand-chosen by the experimenter. As an example, the presence of a reward signal in motor cortex [1] after the movement is likely to be of little use from the perspective of predicting kinematics from time-bin to time-bin using a fixed model across trials (the apparent definition of "relevant" for behaviour here), but an entire sub-field of neuroscience is dedicated to understanding the impact of these reward-related signals on future behaviour. Is there method sophisticated enough to see the behavioural "relevance" of this brief, transient, post-movement signal? This may just be an issue of semantics, and perhaps I read too much into the choice of words here. Perhaps the authors truly treat "irrelevant" and "without a fixed temporal correlation" as synonymous phrases and the issue is easily resolved with a clarifying parenthetical the first time the word "irrelevant" is used. But I remain troubled by some claims in the paper which lead me to believe that they read more deeply into the "irrelevancy" of these components.

      - The authors claim the "irrelevant" responses underpin an unprecedented neuronal redundancy and reveal that movement behaviors are distributed in a higher-dimensional neural space than previously thought." Perhaps I just missed the logic, but I fail to see the evidence for this. The neural space is a fixed dimensionality based on the number of neurons. A more sparse and nonlinear distribution across this set of neurons may mean that linear methods such as PCA are not effective ways to approximate the dimensionality. But ultimately the behaviourally relevant signals seem quite low-dimensional in this paper even if they show some nonlinearity may help.

      - Relatedly, I would like to note that the exercise of arbitrarily dividing a continuous distribution of a statistic (the "R2") based on an arbitrary threshold is a conceptually flawed exercise. The authors read too much into the fact that neurons which have a low R2 w.r.t. PDs have behavioural information w.r.t. other methods. To this reviewer, it speaks more about the irrelevance, so to speak, of the preferred direction metric than anything fundamental about the brain.

      - there is an apparent logical fallacy that begins in the abstract and persists in the paper: "Surprisingly, when incorporating often-ignored neural dimensions, behavioral information can be decoded linearly as accurately as nonlinear decoding, suggesting linear readout is performed in motor cortex." Don't get me wrong: the equivalency of linear and nonlinear decoding approaches on this dataset is interesting, and useful for neuroscientists in a practical sense. However, the paper expends much effort trying to make fundamental scientific claims that do not feel very strongly supported. This reviewer fails to see what we can learn about a set of neurons in the brain which are presumed to "read out" from motor cortex. These neurons will not have access to the data analyzed here. That a linear model can be conceived by an experimenter does not imply that the brain must use a linear model. The claim may be true, and it may well be that a linear readout is implemented in the brain. Other work [2,3] has shown that linear readouts of nonlinear neural activity patterns can explain some behavioural features. The claim in this paper, however, is not given enough

      - I am afraid I may be missing something, as I did not understand the fano factor analysis of Figure 3. In a sense the behaviourally relevant signals must have lower FF given they are in effect tied to the temporally smooth (and consistent on average across trials) behavioural covariates. The point of the original Churchland paper was to show that producing a behaviour squelches the variance; naturally these must appear in the behaviourally relevant components. A control distribution or reference of some type would possibly help here.

      - The authors compare the method to LFADS. While this is a reasonable benchmark as a prominent method in the field, LFADS does not attempt to solve the same problem as d-VAE. A better and much more fair comparison would be TNDM [4], an extension of LFADS which is designed to identify behaviourally relevant dimensions.

      [1] https://doi.org/10.1371/journal.pone.0160851<br /> [2] https://doi.org/10.1101/2022.03.31.486635<br /> [3] https://doi.org/10.1038/s41593-017-0028-6<br /> [4] Hurwitz et al, Targeted Neural Dynamical Modeling, NeurIPS 2021.

    3. Reviewer #3 (Public Review):

      The authors develop a variational autoencoder (VAE), termed d-VAE (or distill VAE) that aims to tease apart the behaviorally relevant and irrelevant sections of each neuron's firing rate. The input to the VAE is the population activity for a given time step, and the output is the inferred behaviorally relevant section of the population activity at that time step. The residual is referred to as behaviorally irrelevant: total neural activity = behaviorally relevant + behaviorally irrelevant (x = x_r + x_i). The mapping from the raw neural signals (x) to the bottlenecked latent in the autoencoder (called z, z=f(x)) and back to the inferred behaviorally relevant single-neuron activities (x_r = g(z)) is applied per time step (does not incorporate any info from past/future time steps) and, critically, it is nonlinear (f and g are nonlinear feedforward neural networks). The key technical novelty that encourages x_r to encode behaviorally relevant information is a term added to the loss, which penalizes bad linear behavior decoding from the latent z. Otherwise the method is very similar to a prior method called pi-VAE, which should be explained more thoroughly in the manuscript to clearly highlight the technical novelty.

      The authors apply their method to 3 non-human primate datasets to infer behaviorally relevant signals and contrast them with the raw neural signals and the residual behaviorally irrelevant signals. As a key performance metric, they compute the accuracy of decoding behavior from the inferred behaviorally relevant signals (x_r) using a linear Kalman filter (KF) or alternatively using a nonlinear feed forward neural network (ANN). They highlight 3 main conclusions in the abstract: first, that single neurons from which behavior is very poorly decodable do encode considerable behavior information in a nonlinear manner, which the ANN can decode. Second, they conclude from various analyses that behavior is occupying a higher dimensional neural space than previously thought. Third, they find that linear KF decoding and nonlinear ANN decoding perform similarly when provided with the inferred behaviorally relevant signals (x_r), from which they conclude that a linear readout must be performed in motor cortex.

      The paper is well-written in many places and has high-quality graphics. The questions that it aims to address are also of considerable interest in neuroscience. However, unfortunately, several main conclusions, including but not limited to all 3 conclusions that are highlighted in the abstract, are not fully supported by the results due to confounds, some of which are fundamental to the method. Several statements in the text also seem inaccurate due to use of imprecise language. Moreover, the authors fail to compare with some more relevant existing methods that are specifically designed for extracting behaviorally relevant signals. In addition, for some of the methods they compare with, they do not use an appropriate setup for the benchmark methods, rendering the validation of the proposed method unconvincing. Finally, in many places imprecise language that is not accompanied with an operational definition (e.g., smaller R2 [of what], similar [per what metric]) makes results hard to follow, unless most of the text is read very carefully. Some key details of the methods are also not explained anywhere.

    1. Reviewer #1 (Public Review):

      The authors present a number of deep-learning models to analyse the dynamics of epithelia. In this way, they want to overcome the time-consuming manual analysis of such data and also remove a potential operator bias. Specifically, they set up models for identifying cell division events and cell division orientation. They apply these tools to the epithelium of the developing Drosophila pupal wing. They confirm a linear decrease of the division density with time and identify a burst of cell division after the healing of a wound that they had induced earlier. These division events happen a characteristic time after and a characteristic distance away from the wound. These characteristic quantities depend on the size of the wound.

      Strength:<br /> The methods developed in this work achieve the goals set by the authors and are a very helpful addition to the toolbox of developmental biologists. They could potentially be used on various developing epithelia. The evidence for the impact of wounds on cell division is solid.

      Weakness:<br /> Some aspects of the deep-learning models remained unclear, and the authors might want to think about adding details. First of all, for readers not being familiar with deep-learning models, I would like to see more information about ResNet and U-Net, which are at the base of the new deep-learning models developed here. What is the structure of these networks? How many parameters do you use? What is the difference between validating and testing the model? Do the corresponding data sets differ fundamentally? How did you assess the quality of the training data classification?

    2. Reviewer #2 (Public Review):

      In this manuscript, the authors propose a computational method based on deep convolutional neural networks (CNNs) to automatically detect cell divisions in two-dimensional fluorescence microscopy timelapse images. Three deep learning models are proposed to detect the timing of division, predict the division axis, and enhance cell boundary images to segment cells before and after division. Using this computational pipeline, the authors analyze the dynamics of cell divisions in the epithelium of the Drosophila pupal wing and find that a wound first induces a reduction in the frequency of division followed by a synchronised burst of cell divisions about 100 minutes after its induction.

      In general, novelty over previous work does not seem particularly important. From a methodological point of view, the models are based on generic architectures of convolutional neural networks, with minimal changes, and on ideas already explored in general. The authors seem to have missed much (most?) of the literature on the specific topic of detecting mitotic events in 2D timelapse images, which has been published in more specialized journals or Proceedings. (TPMAI, CCVPR etc., see references below). Even though the image modality or biological structure may be different (non-fluorescent images sometimes), I don't believe it makes a big difference. How the authors' approach compares to this previously published work is not discussed, which prevents me from objectively assessing the true contribution of this article from a methodological perspective.

      On the contrary, some competing works have proposed methods based on newer - and generally more efficient - architectures specifically designed to model temporal sequences (Phan 2018, Kitrungrotsakul 2019, 2021, Mao 2019, Shi 2020). These natural candidates (recurrent networks, long-short-term memory (LSTM), gated recurrent units (GRU), or even more recently transformers), coupled to CNNs are not even mentioned in the manuscript, although they have proved their generic superiority for inference tasks involving time series (Major point 2). Even though the original idea/trick of exploiting the different channels of RGB images to address the temporal aspect might seem smart in the first place - as it reduces the task of changing/testing a new architecture to a minimum - I guess that CNNs trained this way may not generalize very well to videos where the temporal resolution is changed slightly (Major point 1). This could be quite problematic as each new dataset acquired with a different temporal resolution or temperature may require manual relabeling and retraining of the network. In this perspective, recent alternatives (Phan 2018, Gilad 2019) have proposed unsupervised approaches, which could largely reduce the need for manual labeling of datasets.

      Regarding the other convolutional neural networks described in the manuscript:

      1) the one proposed to predict the orientation of mitosis performs a regression task, predicting a probability for the division angle. The architecture, which must be different from a simple Unet, is not detailed anywhere, so the way it was designed is difficult to assess. It is unclear if it also performs mitosis detection, or if it is instead used to infer orientation once the timing and location of the division have been inferred by the previous network.

      2) the one proposed to improve the quality of cell boundary images before segmentation is nothing new, it has now become a classic step in segmentation, see for example Wolny et al. eLife 2020.

      As a side note, I found it a bit frustrating to realise that all the analysis was done in 2D while the original images are 3D z-stacks, so a lot of the 3D information had to be compressed and has not been used. A novelty, in my opinion, could have resided in the generalisation to 3D of the deep-learning approaches previously proposed in that context, which are exclusively 2D, in particular, to predict the orientation of the division.

      Concerning the biological application of the proposed methods, I found the results interesting, showing the potential of such a method to automatise mitosis quantification for a particular biological question of interest, here wound healing. However, the deep learning methods/applications that are put forward as the central point of the manuscript are not particularly original.

      Major point 1: generalisation potential of the proposed method.

      The neural network model proposed for mitosis detection relies on a 2D convolutional neural network (CNN), more specifically on the Unet architecture, which has become widespread for the analysis of biology and medical images. The strategy proposed here exploits the fact that the input of such an architecture is natively composed of several channels (originally 3 to handle the 3 RGB channels, which is actually a holdover from computer vision, since most medical/biological images are gray images with a single channel), to directly feed the network with 3 successive images of a timelapse at a time. This idea is, in itself, interesting because no modification of the original architecture had to be carried out. The latest 10-channel model (U-NetCellDivision10), which includes more channels for better performance, required minimal modification to the original U-Net architecture but also simultaneous imaging of cadherin in addition to histone markers, which may not be a generic solution.

      Since CNN-based methods accept only fixed-size vectors (fixed image size and fixed channel number) as input (and output), the length or time resolution of the extracted sequences should not vary from one experience to another. As such, the method proposed here may lack generalization capabilities, as it would have to be retrained for each experiment with a slightly different temporal resolution. The paper should have compared results with slightly different temporal resolutions to assess its inference robustness toward fluctuations in division speed.

      Another approach (not discussed) consists in directly convolving several temporal frames using a 3D CNN (2D+time) instead of a 2D, in order to detect a temporal event. Such an idea shares some similarities with the proposed approach, although in this previous work (Ji et al. TPAMI 2012 and for split detection Nie et al. CCVPR 2016) convolution is performed spatio-temporally, which may present advantages. How does the authors' method compare to such an (also very simple) approach?

      Major point 2: innovatory nature of the proposed method.

      The authors' idea of exploiting existing channels in the input vector to feed successive frames is interesting, but the natural choice in deep learning for manipulating time series is to use recurrent networks or their newer and more stable variants (LSTM, GRU, attention networks, or transformers). Several papers exploiting such approaches have been proposed for the mitotic division detection task, but they are not mentioned or discussed in this manuscript: Phan et al. 2018, Mao et al. 2019, Kitrungrotaskul et al. 2019, She et al 2020.

      An obvious advantage of an LSTM architecture combined with CNN is that it is able to address variable length inputs, therefore time sequences of different lengths, whereas a CNN alone can only be fed with an input of fixed size.

      Another advantage of some of these approaches is that they rely on unsupervised learning, which can avoid the tedious relabeling of data (Phan et al. 2018, Gilad et al. 2019).

      References :<br /> Ji, S., Xu, W., Yang, M., & Yu, K. (2012). 3D convolutional neural networks for human action recognition. IEEE transactions on pattern analysis and machine intelligence, 35(1), 221-231. >6000 citations

      Nie, W. Z., Li, W. H., Liu, A. A., Hao, T., & Su, Y. T. (2016). 3D convolutional networks-based mitotic event detection in time-lapse phase contrast microscopy image sequences of stem cell populations. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops (pp. 55-62).

      Phan, H. T. H., Kumar, A., Feng, D., Fulham, M., & Kim, J. (2018). Unsupervised two-path neural network for cell event detection and classification using spatiotemporal patterns. IEEE Transactions on Medical Imaging, 38(6), 1477-1487.

      Gilad, T., Reyes, J., Chen, J. Y., Lahav, G., & Riklin Raviv, T. (2019). Fully unsupervised symmetry-based mitosis detection in time-lapse cell microscopy. Bioinformatics, 35(15), 2644-2653.

      Mao, Y., Han, L., & Yin, Z. (2019). Cell mitosis event analysis in phase contrast microscopy images using deep learning. Medical image analysis, 57, 32-43.

      Kitrungrotsakul, T., Han, X. H., Iwamoto, Y., Takemoto, S., Yokota, H., Ipponjima, S., ... & Chen, Y. W. (2019). A cascade of 2.5 D CNN and bidirectional CLSTM network for mitotic cell detection in 4D microscopy image. IEEE/ACM transactions on computational biology and bioinformatics, 18(2), 396-404.

      Shi, J., Xin, Y., Xu, B., Lu, M., & Cong, J. (2020, November). A Deep Framework for Cell Mitosis Detection in Microscopy Images. In 2020 16th International Conference on Computational Intelligence and Security (CIS) (pp. 100-103). IEEE.

      Wolny, A., Cerrone, L., Vijayan, A., Tofanelli, R., Barro, A. V., Louveaux, M., ... & Kreshuk, A. (2020). Accurate and versatile 3D segmentation of plant tissues at cellular resolution. Elife, 9, e57613.

    1. Reviewer #1 (Public Review):

      In the submitted manuscript, Port et al. investigated the host and viral factors influencing the airborne transmission of SARS-CoV-2 Alpha and Delta variants of concern (VOC) using a Syrian hamster model. The authors analyzed the viral load profiles of the animal respiratory tracts and air samples from cages by quantifying gRNA, sgRNA, and infectious virus titers. They also assessed the breathing patterns, exhaled aerosol aerodynamic profile, and size distribution of airborne particles after SARS-CoV-2 Alpha and Delta infections. The data showed that male sex was associated with increased viral replication and virus shedding in the air. The relationship between co-infection with VOCs and the exposure pattern/timeframe was also tested. This study appears to be an expansion of a previous report (Port et al., 2022, Nature Microbiology). The experimental designs were rigorous, and the data were solid. These results will contribute to the understanding of the roles of host and virus factors in the airborne transmission of SARS-CoV-2 VOCs.

    2. Reviewer #2 (Public Review):

      This manuscript by Port and colleagues describes rigorous experiments that provide a wealth of virologic, respiratory physiology, and particle aerodynamic data pertaining to aerosol transmission of SARS-CoV-2 between infected Syrian hamsters. The data is particularly significant because infection is compared between alpha and delta variants, and because viral load is assessed via numerous assays (gRNA, sgRNA, TCID) and in tissues as well as the ambient environment of the cage. The paper will be of interest to a broad range of scientists including infectious diseases physicians, virologists, immunologists and potentially epidemiologists. The strength of evidence is relatively high but limited by unclear presentation in certain parts of the paper.

      Important conclusions are that infectious virus is only detectable in air samples during a narrow window of time relative to tissue samples, that airway constriction increases dynamically over time during infection limiting production of fine aerosol droplets, that variants do not appear to exclude one another during simultaneous exposures and that exposures to virus via the aerosol route lead to lower viral loads relative to direct inoculation suggesting an exposure dose response relationship.

      While the paper is valuable, I found certain elements of the data presentation to be unclear and overly complex.

    1. Reviewer #2 (Public Review):

      Tian et al. performed a meta-analysis of 113 genome-wide origin profile datasets in humans to assess the reproducibility of experimental techniques and shared genomics features of origins. Techniques to map DNA replication sites have quickly evolved over the last decade, yet little is known about how these methods fare against each other (pros and cons), nor how consistent their maps are. The authors show that high-confidence origins recapitulate several known features of origins (e.g., correspondence with open chromatin, overlap with transcriptional promoters, CTCF binding sites). However, surprisingly, they find little overlap between ORC/MCM binding sites and origin locations.

      Overall, this meta-analysis provides the field with a good assessment of the current state of experimental techniques and their reproducibility, but I am worried about: (a) whether we've learned any new biology from this analysis; (b) how binding sites and origin locations can be so mismatched, in light of numerous studies that suggest otherwise; and (c) some methodological details described below.

      -- I understand better the inclusion/exclusion logic for the samples. But I'm still not sure about the fragments. As the authors wrote, there is both noise and stochasticity; the former is not important but the latter is essential to include. How can these two be differentiated, and what may be the expected overlap as a function of different stochasticity rates?

      -- Many of the major genomic features analyzed have already been found to be associated with origin sites. For example, the correspondence with TSS has been reported before:

      https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6320713/<br /> https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6547456/

      -- Line 250: The most surprising finding is that there is little overlap between ORC/MCM binding sites and origin locations. The authors speculate that the overlap between ORC1 and ORC2 could be low because they come from different cell types. Equally concerning is the lack of overlap with MCM. If true, these are potentially major discoveries that butts heads with numerous other studies that have suggested otherwise.

      The key missing dataset is ORC1 and ORC2 CHiP-seq from the same cell type. This shouldn't be too expensive to perform, and I hope someone performs this test soon. Without this, I remain on the fence about how much existing datasets are "junk" vs how much the prevailing hypothesis about replication needs to be revisited. Nonetheless, the authors do perform a nice analysis showing that existing techniques should be carefully used and interpreted.

    2. Reviewer #3 (Public Review):

      Summary: The authors present a thought-provoking and comprehensive re-analysis of previously published human cell genomics data that seeks to understand the relationship between the sites where the Origin Recognition Complex (ORC) binds chromatin, where the replicative helicase (Mcm2-7) is loaded, and where DNA replication actually beings (origins). The view that these should coincide is influenced by studies in yeast where ORC binds site-specifically to dedicated nucleosome-free origins where Mcm2-7 can be loaded and remains stably positioned for subsequent replication initiation. However, this is most certainly not the case in metazoans where it has already been reported that chromatin bindings sites of ORC and Mcm2-7 do not necessarily overlap, nor do they always overlap with origins. This is likely due to Mcm2-7 possessing linear mobility on DNA (i.e., it can slide) such that other chromatin-contextualized processes can displace it from the site in which it was originally loaded. Additionally, Mcm2-7 is loaded in excess and thus only a fraction of Mcm2-7 would be predicted to coincide with replication start sites. This study reaches a very similar conclusion of these previous studies: they find a high degree of discordance between ORC, Mcm2-7, and origin positions in human cells.

      Strengths: The strength of this work is its comprehensive and unbiased analysis of all relevant genomics datasets. To my knowledge, this is the first attempt to integrate these observations. It also is an important cautionary tale to not confuse replication factor binding sites with the genomic loci where replication actually begins, although this point is already widely appreciated in the field.

      Weaknesses: The major weakness of this paper is the lack of novel biological insight and that the comprehensive approach taken failed to provide any additional mechanistic insight regarding how and why ORC, Mcm2-7, and origin sites are selected or why they may not coincide.

    3. Reviewer #1 (Public Review):

      In the best genetically and biochemically understood model of eukaryotic DNA replication, the budding yeast, Saccharomyces cerevisiae, the genomic locations at which DNA replication initiates are determined by a specific sequence motif. These motifs, or ARS elements, are bound by the origin recognition complex (ORC). ORC is required for loading of the initially inactive MCM helicase during origin licensing in G1. In human cells, ORC does not have a specific sequence binding domain and origin specification is not specified by a defined motif. There have thus been great efforts over many years to try to understand the determinants of DNA replication initiation in human cells using a variety of approaches, which have gradually become more refined over time.

      In this manuscript Tian et al. combine data from multiple previous studies using a range of techniques for identifying sites of replication initiation to identify conserved features of replication origins and to examine the relationship between origins and sites of ORC binding in the human genome. The authors identify a) conserved features of replication origins e.g. association with GC-rich sequences, open chromatin, promoters and CTCF binding sites. These associations have already been described in multiple earlier studies. They also examine the relationship of their determined origins and ORC binding sites and conclude that there is no relationship between sites of ORC binding and DNA replication initiation. While the conclusions concerning genomic features of origins are not novel, if true, a clear lack of colocalization of ORC and origins would be a striking finding. However, the majority of the datasets used do not report replication origins, but rather broad zones in which replication origins fire. Rather than refining the localisation of origins, the approach of combining diverse methods that monitor different objects related to DNA replication leads to a base dataset that is highly flawed and cannot support the conclusions that are drawn, as explained in more detail below.

      Methods to determine sites at which DNA replication is initiated can be divided into two groups based on the genomic resolution at which they operate. Techniques such as bubble-seq, ok-seq can localise zones of replication initiation in the range ~50kb. Such zones may contain many replication origins. Conversely, techniques such as SNS-seq and ini-seq can localise replication origins down to less than 1kb. Indeed, the application of these different approaches has led to a degree of controversy in the field about whether human replication does indeed initiate at discrete sites (origins), or whether it initiates randomly in large zones with no recurrent sites being used. However, more recent work has shown that elements of both models are correct i.e. there are recurrent and efficient sites of replication initiation in the human genome, but these tend to be clustered and correspond to the demonstrated initiation zones (Guilbaud et al., 2022).

      These different scales and methodologies are important when considering the approach of Tian et al. The premise that combining all available data from five techniques will increase accuracy and confidence in identifying the most important origins is flawed for two principal reasons. First, as noted above, of the different techniques combined in this manuscript, only SNS-seq can actually identify origins rather than initiation zones. It is the former that matters when comparing sites of ORC binding with replication origin sites, if a conclusion is to be drawn that the two do not co-localise.

      Second, the authors give equal weight to all datasets. Certainly, in the case of SNS-seq, this is not appropriate. The technique has evolved over the years and some earlier versions have significantly different technical designs that may impact the reliability and/or resolution of the results e.g. in Foulk et al. (Foulk et al., 2015), lambda exonuclease was added to single stranded DNA from a total genomic preparation rather than purified nascent strands), which may lead to significantly different digestion patterns (ie underdigestion). Curiously, the authors do not make the best use of the largest SNS-seq dataset (Akerman et al., 2020) by ignoring these authors separation of core and stochastic origins. By blending all data together any separation of signal and noise is lost. Further, I am surprised that the authors have chosen not to use data and analysis from a recent study that provides subsets of the most highly used and efficient origins in the human genome, at high resolution (Guilbaud et al., 2022).

      References

      Akerman I, Kasaai B, Bazarova A, Sang PB, Peiffer I, Artufel M, Derelle R, Smith G, Rodriguez-Martinez M, Romano M, Kinet S, Tino P, Theillet C, Taylor N, Ballester B, Méchali M (2020) A predictable conserved DNA base composition signature defines human core DNA replication origins. Nat Commun, 11: 4826

      Foulk MS, Urban JM, Casella C, Gerbi SA (2015) Characterizing and controlling intrinsic biases of lambda exonuclease in nascent strand sequencing reveals phasing between nucleosomes and G-quadruplex motifs around a subset of human replication origins. Genome Res, 25: 725-735

      Guilbaud G, Murat P, Wilkes HS, Lerner LK, Sale JE, Krude T (2022) Determination of human DNA replication origin position and efficiency reveals principles of initiation zone organisation. Nucleic Acids Res, 50: 7436-7450

      Update in response to authors' comments on the original review:

      While the authors have clarified their approach to some aspects of their analysis, I believe they and I are just going to have to disagree about the methodology and conclusions of this work. I do not find the authors responses sufficiently compelling to change my mind about the significance of the study or veracity of the conclusions. In my opinion, the method for identification of strong origins is not robust and of insufficient resolution. In addition, the resolution and the overlap of the MCM Chip-seq datasets is poor. While the conclusion of the paper would indeed be striking and surprising if true, I am not at all persuaded that it is based on the presented data.

    1. Reviewer #1 (Public Review):

      Summary of what the author was trying to achieve:<br /> In this study, the author aimed to develop a method for estimating neuronal-type connectivity from transcriptomic gene expression data, specifically from mouse retinal neurons. They sought to develop an interpretable model that could be used to characterize the underlying genetic mechanisms of circuit assembly and connectivity.

      Strengths:<br /> The proposed bilinear model draws inspiration from commonly implemented recommendation systems in the field of machine learning. The author presents the model clearly and addresses critical statistical limitations that may weaken the validity of the model such as multicollinearity and outliers. The author presents two formulations of the model for separate scenarios in which varying levels of data resolution are available. The author effectively references key work in the field when establishing assumptions that affect the underlying model and subsequent results. For example, correspondence between gene expression cell types and connectivity cell types from different references are clearly outlined in Tables 1-3. The model training and validation are sufficient and yield a relatively high correlation with the ground truth connectivity matrix. Seemingly valid biological assumptions are made throughout, however, some assumptions may reduce resolution (such as averaging over cell types), thus missing potentially important single-cell gene expression interactions.

      Weaknesses:<br /> The main results of the study could benefit from replication in another dataset beyond mouse retinal neurons, to validate the proposed method. Dimensionality reduction significantly reduces the resolution of the model and the PCA methodology employed is largely non-deterministic. This may reduce the resolution and reproducibility of the model. It may be worth exploring how the PCA methodology of the model may affect results when replicating. Figure 5, 'Gene signatures associated with the two latent dimensions', lacks some readability and related results could be outlined more clearly in the results section. There should be more discussion on weaknesses of the results e.g. quantification of what connectivity motifs were not captured and what gene signatures might have been missed.

      The main weakness is the lack of comparison against other similar methods, e.g. methods presented in<br /> Barabási, Dániel L., and Albert-László Barabási. "A genetic model of the connectome." Neuron 105.3 (2020): 435-445.<br /> Kovács, István A., Dániel L. Barabási, and Albert-László Barabási. "Uncovering the genetic blueprint of the C. elegans nervous system." Proceedings of the National Academy of Sciences 117.52 (2020): 33570-33577.<br /> Taylor, Seth R., et al. "Molecular topography of an entire nervous system." Cell 184.16 (2021): 4329-4347.

      Appraisal of whether the author achieved their aims, and whether results support their conclusions:<br /> The author achieved their aims by recapitulating key connectivity motifs from single-cell gene expression data in the mouse retina. Furthermore, the model setup allowed for insight into gene signatures and interactions, however could have benefited from a deeper evaluation of the accuracy of these signatures. The author claims the method sets a new benchmark for single-cell transcriptomic analysis of synaptic connections. This should be more rigorously proven. (I'm not sure I can speak on the novelty of the method)

      Discussion of the likely impact of the work on the field, and the utility of methods and data to the community :<br /> This study provides an understandable bilinear model for decoding the genetic programming of neuronal type connectivity. The proposed model leaves the door open for further testing and comparison with alternative linear and/or non-linear models, such as neural network-based models. In addition to more complex models, this model can be built on to include higher resolution data such as more gene expression dimensions, different types of connectivity measures, and additional omics data.

    2. Reviewer #2 (Public Review):

      Summary:<br /> In this study, Mu Qiao employs a bilinear modeling approach, commonly utilized in recommendation systems, to explore the intricate neural connections between different pre- and post-synaptic neuronal types. This approach involves projecting single-cell transcriptomic datasets of pre- and post-synaptic neuronal types into a latent space through transformation matrices. Subsequently, the cross-correlation between these projected latent spaces is employed to estimate neuronal connectivity. To facilitate the model training, connectomic data is used to estimate the ground-truth connectivity map. This work introduces a promising model for the exploration of neuronal connectivity and its associated molecular determinants. However, it is important to note that the current model has only been tested with Bipolar Cell and Retinal Ganglion Cell data, and its applicability in more general neuronal connectivity scenarios remains to be demonstrated.

      Strengths:<br /> This study introduces a succinct yet promising computational model for investigating connections between neuronal types. The model, while straightforward, effectively integrates single-cell transcriptomic and connectomic data to produce a reasonably accurate connectivity map, particularly within the context of retinal connectivity. Furthermore, it successfully recapitulates connectivity patterns and helps uncover the genetic factors that underlie these connections.

      Weaknesses:<br /> 1. The study lacks experimental validation of the model's prediction results.<br /> 2. The model's applicability in other neuronal connectivity settings has not been thoroughly explored.<br /> 3. The proposed method relies on the availability of neuronal connectomic data for model training, which may be limited or absent in certain brain connectivity settings.

    1. Reviewer #2 (Public Review):

      Summary:<br /> The authors investigate the effect of oscillatory activity on the chaotic dynamics of high-dimensional networks. The network oscillations are internally generated by synaptic delays which are known to produce oscillations. The authors demonstrate that the intensity of the chaos and the dimension of the chaotic attractor picks at a delay value. A similar effect is found when an external input drives the network. In this case, these quantities pick at the network's resonant frequency. This shows that the intensity of the chaotic dynamics can be boosted by internally or externally generated oscillations.

      Strengths:<br /> The paper is technically solid. They introduce a novel method to perform calculations of the Lyapunov spectrum in networks with delays, which have infinite dimensions, effectively transforming it into a network of finite dimensions. The conclusions of the paper are supported by strong analytical calculations and novel and intensive numerical methods.

      Weaknesses:<br /> The main weakness is that is difficult to find the relevance of the paper's findings to neuroscience. It is not clear to me that measures such as the rate of production of entropy of a chaotic attractor in spiking networks, its dimension, and its Lyapunov spectra are experimentally relevant. Moreover, the authors make little to no attempt to provide interpretations for these quantities nor put their work in a broader context in the field of systems neuroscience. The paper also is written in an overly technical way with sometimes the use of technical jargon which might be difficult to follow for a non-expert in mean field theories and statistical physics.

    2. Reviewer #1 (Public Review):<br /> <br /> Summary:<br /> Cortical activity displays high trial-to-trial variability and oscillatory transients. These dynamical features have implications for how information is encoded and transmitted in the brain. While trial-to-trial variability has been widely studied, via mathematical models, in asynchronous dynamical states, works investigating variability in synchronous states are more sparse. In this study, the authors characterise the nature of the chaotic attractor underlying neural activity at the onset of oscillations induced by transmission delays. They find that variability is boosted by delay-induced oscillations in comparison to the asynchronous state.


      Strengths:
<br /> 1. Quantifying the chaotic nature of high-dimensional neural activity is a hard mathematical challenge. This work builds upon prior theoretical work to study how spike chaos is affected by oscillatory mean activity, a phenomenon frequently observed in the cortex.


      2. The evidence supporting all findings appears to be highly robust.


      3. The manuscript is well written.

      Weaknesses:
<br /> 1. The core contribution of the paper is a description of chaotic activity as delays are increased (Fig. 2). Within the main text, it is noted that two instabilities leading to oscillatory activity emerge. However, the definition and nature of these two transitions lack some clarity. In particular, whether the two transitions are "real" (meaning that they separate three distinct regimes of activity), or whether they rather correspond to different measures of the same underlying instability, remains opaque.


      2. While the mathematical aspects of the analysis are discussed in detail, the biological implications of the findings remain rather less clear. In particular, a discussion regarding the implications of the findings for cortical coding is missing. Furthermore, while the authors have put forth efforts to contextualize their findings within the domain of the dynamical systems and applied math literature, the relationship with the corresponding neuroscience literature seems less developed.


      3. The connection with biology is also hindered by the fact that measures used to characterise trial-to-trial variability (metric entropy and Kaplan-Yorke dimension) significantly differ from those commonly used in the analysis of experimental data, and these measures are not contextualized within the manuscript.


      4. The text comprises a significant amount of undefined mathematical jargon.

      5. For the purpose of the mathematical analysis, the original delayed model is substituted with an effectively delayed version. The authors convincingly demonstrate an alignment between the outcomes from the two models. This alignment appears to be unaffected by variations in the reset parameter of the effective model (Fig. S2). Nonetheless, a systematic discussion on the efficacy and limitations of this replacement approach seems absent. Under what circumstances are the two models equivalent? Conversely, when does their correspondence become very poor?

    3. Reviewer #3 (Public Review):

      Summary:<br /> In this work, the authors propose a novel method for analyzing spiking neuron network models with delays. By modeling the delay as an additional axonal component to relay spikes, the infinite-dimensional system of the delayed network is transformed into a system of finite dimensions. This allows the calculation of the entire spectrum of Lyapunov exponents which provide information on the dimensionality of attractor and noise entropy of network responses. The authors demonstrate that chaos intensifies at the onset of oscillations as synaptic delay increases. This is surprising since network oscillation has been thought to indicate regular firing activity. The authors find similar results in different types of networks and in networks driven by oscillatory inputs, suggesting that the boosting of chaos by oscillation can be a general feature of spiking networks.

      Strengths:<br /> This work builds on the authors' past work on characterizing chaos in spiking networks and extends to include synaptic delays. The transformation of a delayed network into a network of two-compartment neurons, modeling the spike generation and transmission, is novel and interesting. This allows for an analytical expression of the single spike Jacobian of the network dynamics, which can be used to calculate the full spectrum of Lyapunov exponents.

      The analysis is rigorous and the parameter study is comprehensive.

      Weaknesses:<br /> Because the delayed interaction is spike-triggered, effectively it only requires N variables to count the remaining time since the last spike from each neuron. The axon component only implements the delay time to transmit a spike with no interaction with other neurons. It seems that the axon component can be simply modeled as a variable counting the time since the last spike and does not need to be modeled as a QIF model. Is there any advantage of modeling the axon component as a QIF model? The supplemental figure S2 considers the case of "dynamic delay", where delay time can depend on network activity, but the Lyapunov exponents seem to be largely independent of the reset parameter.

      In most of the results, the network mean firing rate is kept at a fixed value while the delay time parameter varies. What would be the results if only the delay parameter changes? It would be helpful if the authors could provide some reasoning as to why it is a better comparison with the network rate kept as a constant.

      The majority of the neurons have a CV below 1 (Fig 2d and Fig S3c). This indicates that many neurons are in the mean-driven regime. This is different from balanced networks where CVs are around 1. It would be helpful for the authors to comment on this discrepancy.

    1. Reviewer #1 (Public Review):

      Summary:<br /> This study investigated behavioural performance on a competing speech task and neural attentional filtering over the course of two years in a group of middle-aged to older adults. Neural attentional filtering was quantified using EEG by comparing neural envelope tracking to an attended vs. an unattended sentence. This dataset was used to examine the stability of the link between behavior and neural filtering over time. They found that neural filtering and behavior were correlated during each measurement, but EEG measures at the first time point did not predict behavioural performance two years later. Further, while behavioural measures showed relatively high test-retest reliability, the neural filtering reliability was weak with an r-value of 0.21. The authors conclude that neural tracking-based metrics have limited ability to predict longitudinal changes in listening behavior.

      Strengths:<br /> This study is novel in its tracking of behavioural performance and neural envelope tracking over time, and it includes an impressively large dataset of 105 participants. The manuscript is clearly written.

      Weaknesses:<br /> The weaknesses are minor, primarily concerning how the reviewers interpret their data. Specifically, the envelope tracking measure is often quite low, close to the noise floor, and this may affect test-retest reliability. Furthermore, the trajectories may be affected by accelerated age-related declines that are more apparent in neural tracking than in behaviour.

    2. Reviewer #2 (Public Review):

      Summary:<br /> This study examined the longitudinal brain-behaviour link between attentional neural filtering and listening behaviour among a sample of aging individuals. The results based on the latent change score modeling showed that neither attentional neural filtering at T1 nor its T1-T2 change predicted individual two-year listening performance change. The findings suggest that neural filtering and listening behaviour may follow independent developmental trajectories. This study focuses on an interesting topic and has the potential to contribute a better understanding of the neurobiological mechanisms of successful communication across the lifespan.

      Strengths:<br /> Although research suggests that speech comprehension is neurally supported by an attention-guided filter mechanism, the evidence of their causal association is limited. This study addresses this gap by testing the longitudinal stability of neural filtering as a neural mechanism upholding listening performance, potentially shedding light on translational efforts aiming at the preservation of speech comprehension abilities among aging individuals.

      The latent change score modeling approach is appropriately used as a tool to examine key developmental questions and distinguish the complex processes underlying lifespan development in brain and behaviour with longitudinal data.

      Weaknesses:<br /> Although the paper does have strengths in principle, the weaknesses of the paper are that the findings are merely based on a single listening task. Since both neural and behavioral indicators are derived from the same task, the results may be applicable only to this specific task, and it is difficult to extrapolate them to cognitive and listening abilities measured by the other tasks. Therefore, more listening tasks are required to comprehensively measure speech comprehension and neural markers.

      The age span of the sample is relatively large. Although no longitudinal change from T1 to T2 was found at the group-level, from the cross-sectional and longitudinal change results (see Figure 3), individuals of different age groups showed different development patterns. Particularly, individuals over the age of 70 show a clear downward trend in both neural filtering index and accuracy. Therefore, different results may be found based on different age groups, especially older groups. However, due to sample limitations, this study was unable to examine whether age has a moderating effect on this brain-behaviour link.

      In the Dichotic listening task, valid and invalid cues were manipulated. According to the task description, the former could invoke selective attention, whereas the latter could invoke divided attention. It is possible that under the two conditions, the neural filtering index may reflect different underlying cognitive processes, and thus may differ in its predictive effect on behavioral performance. The author could perform a more in-depth data analysis on indicators under different conditions.

    3. Reviewer #3 (Public Review):

      Summary:<br /> The study investigates the longitudinal changes in hearing threshold, speech recognition behavior, and speech neural responses in 2 years, and how these changes correlate with each other. A slight change in the hearing threshold is observed in 2 years (1.2 dB on average) but the speech recognition performance remains stable. The main conclusion is that there is no significant correlation between longitudinal changes in neural and behavioral measures.

      Strengths:<br /> The sample size (N>100) is remarkable, especially for longitudinal studies.

      Weaknesses:<br /> The participants are only tracked for 2 years and relatively weak longitudinal changes are observed, limiting how the data may shed light on the relationships between basic auditory function, speech recognition behavior, and speech neural responses.

      Suggestions<br /> First, it's not surprising that a 1.2 dB change in hearing threshold does not affect speech recognition, especially for the dichotic listening task and when speech is always presented 50 dB above the hearing threshold. For the same listener, if the speech level is adjusted for 1.2 dB or much more, the performance will not be influenced during the dichotic listening task. Therefore, it is important to mention in the abstract that "sensory acuity" is measured using the hearing threshold and the change in hearing threshold is only 1.2 dB.

      Second, the lack of correlation between age-related changes in "neuronal filtering" and behavior may not suggest that they follow independent development trajectories. The index for "neuronal filtering" does not seem to be stable and the correlation between the two tests is only R = 0.21. This low correlation probably indicates low test-retest reliability, instead of a dramatic change in the brain between the two tests. In other words, if the "neuronal filtering" index only very weakly correlates with itself between the two tests, it is not surprising that it does not correlate with other measures in a different test. If the "neuronal filtering" index is measured on two consecutive days and the index remains highly stable, I'm more convinced that it is a reliable measure that just changes a lot within 2 years, and the change is dissociated with the changes in behavior.

      The authors attempted to solve the problem in the section entitled "Neural filtering reliably supports listening performance independent of age and hearing status", but I didn't follow the logic. As far as I could tell, the section pooled together the measurements from two tests and did not address the test-retest stability issue.

      Third, the behavioral measure that is not correlated with "neuronal filtering" is the response speed. I wonder if the participants are asked to respond as soon as possible (not mentioned in the method). If not, the response speed may strongly reflect general cognitive function or a personal style, which is not correlated with the changes in auditory functions. This can also explain why the hearing threshold affects speech recognition accuracy but not the response speed (lines 263-264).

    1. Reviewer #1 (Public Review):

      Summary:<br /> The manuscript by Adelus and colleagues investigates the snRNA sequencing of endothelial cells isolated from deceased heart donor aortic trimmings. From n=6 donors, the authors have identified 5 distinct endothelial cell (EC) populations. The expression levels of a set of genes are different among the different donors and different EC clusters. Furthermore, treatment with IL-1B, TGFB, or ERGsi decreased the proportion of some of these clusters and increased others, with some migratory and ECM-producing capacity. Another interesting observation in this study is that IL-1B alone induces a shift in the clusters and that is different from the TGFB-induced cells. However, ex vivo analyses showed most of the TGFB-induced population matched the in vitro observations. Another interesting finding of the work is that the authors detected SNPs linked to chromatin accessibility to the set of genes identified within these EC populations.

      Strengths:<br /> Overall, the work is intriguing and has some novel aspects to it, especially the link between EC-derived EndMT in culture and comparing that with ex vivo atherosclerotic samples.

      Weaknesses:<br /> The experiments are lacking in controls, the purity of the isolation, and the use of multiple donors (deceased hearts) to draw conclusions. The lack of validation of the work is a concern.

    2. Reviewer #2 (Public Review):

      This study by Adelus et al. profiled the transcriptome and chromatin accessibility in cultured human aortic endothelial cells (ECs) at single-cell resolution. They also stimulated these cells with EC-activating agents, such as IL1b, TGFB2, or si-EGR, to knock down this master transcription factor in ECs. The results show a subpopulation, EC3, with the highest plasticity and sensitivity to perturbations. The authors also reviewed and meta-analyzed three independent publicly available scRNA-seq datasets, identifying two distinct EC subpopulations. Additionally, they aligned CAD-related SNPs with open chromatin regions in EC subpopulations. This study provides fundamental evidence to enrich our understanding of vascular ECs and highlights potential subpopulations that may contribute to health and diseases. The work exhibits the potential impact in the field. While the manuscript is comprehensive, there are some concerns that should be addressed.

      1. My major concern is whether EC4 is derived from ECs. It seems that EC4 showed a lesser reaction to those perturbations and had lower expression levels of EC marker genes. Did the authors evaluate the purity of their isolated HAECs? Please discuss the potential cell lineage mapping of EC4.

      2. Although all the donors are de-identified, is there any information about the severity of their vascular impairment, particularly in the case of patient 5, who exhibits the unique EC5?

      3. The meta-analysis of the published datasets is comprehensive. The identified EC heterogeneity corresponds to their in vitro data. I am wondering, in terms of transcriptome, is there any similarity between endo1 and EC1/EC2, and also endo2 and EC3/EC4?

      4. The in vitro data indicates that EC3 shows the highest plasticity and sensitivity to perturbations, which may act as the major subtype of ECs responding to risk factors. It's very interesting that CAD-related SNPs do not seem to be enriched in EC3. Please discuss this discrepancy.

      5. The last sentence in the legend of Figure 1 seems incomplete: 'Module scores are generated for each cell barcode with Seurat function AddModuleScore().'

    1. Reviewer #2 (Public Review):

      Summary:<br /> The authors present a theoretical study of the length dynamics of bundles of actin filaments. They first show a "balance point model" in which the bundle is described as an effective polymer. The corresponding assembly and disassembly rates can depend on bundle length. This model generates a steady-state bundle-length distribution with a variance that is proportional to the average bundle length. Numerical simulations confirm this analytic result. The authors then present an analysis of previously published length distributions of actin bundles in various contexts and argue that these distributions have variances that depend quadratically with the average length. They then consider a bundle of N-independent filaments that each grow in an unregulated way. Defining the bundle length to be that of the longest filament, the resulting length distribution has a variance that scales quadratically with the average bundle length.

      Strengths:<br /> The manuscript is very well written, and the computations are nicely presented. The work gives fundamental insights into the length distribution of filamentous actin structures. The universal dependence of the variance on the mean length is of particular interest. It will be interesting to see in the future, how many universality classes there are, and which features of a growth process determine to which class it belongs.

      Weaknesses:<br /> 1) You present the data in Fig. 3 as arguments against the balance point model. Although I agree that the data is compatible with your description of a bundle of filaments, I think that the range of mean lengths you can explore is too limited to conclusively argue against the balance point model. In most cases, your data extend over half an order of magnitude only. Could you provide a measure to quantify how much your model of independent filaments fits better than the balance point model?

      2) Concerning your bundled-filament model, why do you consider the polymerizing ends to be all aligned? Similarly to the opposite end, fluctuations should be present. Furthermore, it is not clear to me, where the presence of crosslinking proteins enters your description. Finally, linked to my first remark on this model, why is the longest filament determining the length of the bundle in all the biological examples you cite? I am thinking in particular about the actin cables in yeast.

    2. Reviewer #1 (Public Review):

      Actin filaments and their kinetics have been the subject of extensive research, with several models for filament length control already existing in the literature. The work by Rosario et al. focuses instead on bundle length dynamics and how their fluctuations can inform us of the underlying kinetics. Surprisingly, the authors show that irrespective of the details, typical "balance point" models for filament kinetics give the wrong scaling of bundle length variance with mean length compared to experiments. Instead, the authors show that if one considers a bundle made of several individual filaments, length control for the bundle naturally emerges even in the absence of such a mechanism at the individual filament level. Furthermore, the authors show that the fluctuations of the bundle length display the same scaling with respect to the average as experimental measurements from different systems. This work constitutes a simple yet nuanced and powerful theoretical result that challenges our current understanding of actin filament kinetics and helps relate accessible experimental measurements such as actin bundle length fluctuations to their underlying kinetics. Finally, I found the manuscript to be very well written, with a particularly clear structure and development which made it very accessible.

    1. Reviewer #1 (Public Review):

      Summary:<br /> The emergence of catalytic self-replication of polymers is an important question in the context of the origin of life. Tkachenko and Maslov present a model in which such a catalytic polymer sequence emerges from a random pool of replicating polymers.

      Strengths:<br /> The model is part of a theme from many previous papers from the same authors and their colleagues. The model is interesting, technically correct, and demonstrates qualitatively new phenomena. It is good that the paper also makes a connection with possible experimental scenarios -- specifically, concrete proposals are made for testing the core ideas of the model. It would indeed be an exciting demonstration when such an experiment does indeed materialize.

      Weaknesses:<br /> Unlike the rest of the paper which is very tight in its arguments, I find that the discussion section is not so. Specifically, sentences such as " In fact, this can be seen as a special case of the classical error catastrophe" are a bit loose and not well substantiated -- although these are in the discussion section, I find this to be a weakness of an otherwise good paper. Tightening some of the arguments here will make it an excellent paper in my opinion.

    2. Reviewer #2 (Public Review):

      Summary:<br /> The replication of information-coding polymers and the emergence of catalytic ribozymes pose significant challenges, both experimentally and theoretically, in the study of the RNA world hypothesis. In this context, Tkachenko et al. put forth a novel hypothesis regarding a replication oligomer system based on a cleavage ribozyme. They initially highlighted that the breakage of oligomers could contribute to self-replication, provided that these fragments function as primers for subsequent replications. Next, they proposed a self-replicating system of oligomers founded on a hammerhead structure that catalyzes cleavage. By a simple dynamical model, they demonstrated that such a system is self-sustainable in certain parameter regimes. Furthermore, they delved into discussions regarding the potential emergence of such a system and the evolution toward further optimized ribozymes.

      Strengths:<br /> Although the cleavage (hammerhead) ribozyme has been discussed in the context of the origins of life, the authors are the first to discuss how they could be selected using a mathematical model as far as I know. The idea is simple: ribozyme activity creates fragments by breakage of an oligomer, which works as a primer for the ribozyme itself, resulting in a positive feedback system (i.e., autocatalytic sets in a broader sense). This potentially enables us to resolve at the same time problems on the (i) supply of new primers (but note that there is a major concern on this as described in the 'weakness'), and (ii) the sustaining of the cleavage ribozyme.

      Weaknesses:<br /> The major weakness of their theory is that the ends of the new primers, formed through the breakage/cleavage of polymers, must be chemically active (as the authors have already emphasized in the last paragraph of their discussion) to enable further elongation. Reactivating the ends of preexisting oligomers without enzymes, to the best of our current knowledge, could be a challenging task. Although their model heavily relies on this aspect, the authors do not elaborate on it.

      Another weakness is in the setup of their discussion on evolutionary dynamics. While they claim that their model is robust against replication errors, their approach to evolutionary dynamics appears unconventional, and it remains unclear under what conditions their assumptions are founded. They treat a whole set of oligos as a subject of evolution, rather than each individual oligo. This may necessitate more complex assumptions, such as the encapsulation of sets of oligos inside a protocell, to be adequately rationalized. Thus, it remains uncertain whether the system is indeed robust against replication errors in a more natural context. For example, if a mutant oligo, denoted as b', arises due to an error in the replication of oligo b, and if b' has lower catalytic activity but replicates more rapidly than b, it may ultimately come to dominate the system.

    3. Reviewer #3 (Public Review):

      Summary:<br /> Non-enzymatic replication of RNA or a similar polymer is likely to be important for the origin of life. The authors present a model of how a functional catalytic sequence could emerge from a mixture of sequences undergoing non-enzymatic replication.

      Strengths:<br /> Interesting model describing details of the proposed replication mechanism.

      Weaknesses:<br /> A discussion of the virtual circular genome idea proposed in [33] is included in the discussion section together with the problem of sequence scrambling faced by this mechanism that was raised in [34]. However, the authors state that sequence scrambling is a special case of the classical error catastrophe. This should be reworded, because these phenomena are completely different. The error catastrophe occurs due to single-point mutational errors in a model that assumes that a complete template is being copied in one cycle. Sequence scrambling arises in models that assume cycles of melting and reannealing, in which case only part of a template is copied in one cycle. Scrambling is due to the many alternative ways in which pairs of sequences can reanneal. Many of these alternatives are incorrect and this leads to the disappearance of the original sequence. This problem exists even in the limit where there is zero mutational error rate. Therefore, it cannot be called a special case of the error catastrophe problem.

      The authors seem to believe that their model avoids the scrambling problem. If this is the case, a clear explanation should be added about why this problem is avoided. Two possible points are mentioned.<br /> (i) Replication is bidirectional in this model. This seems like a small detail to me. I don't think it makes any difference to whether scrambling occurs.<br /> (ii) The functional activity is located in a short sequence region. I can imagine that if the length of a strand that is synthesized in a single cycle is long enough to cover the complete functional region, then sometimes the complete functional sequence can be copied in one cycle. Is this what is being argued? If so, it depends a lot on rates of primer extension and lengths of melting cycles etc, and some comment on this should be made.

    1. Reviewer #2 (Public Review):

      Overall: This paper describes new material of Acanthomeridion serratum that the authors claim supports its synonymy with Acanthomeridion anacanthus. The material is important and the description is acceptable after some modification. In addition, the paper offers thoughts and some exploration of the possibility of multiple origins of the dorsal facial suture among artiopods, at least once within Trilobita and also among other non-trilobite artiopods. Although this possibility is real and apparently correct, the suggestions presented in this paper are both surprising and, in my opinion, unlikely to be true because the potential homologies proposed with regard to Acanthomeridion and trilobite-free cheeks are unconventional and poorly supported.

      What to do? I can see two possibilities. One, which I recommend, is to concentrate on improving the descriptive part of the paper and omit discussion and phylogenetic analysis of dorsal facial suture distribution, leaving that for more comprehensive consideration elsewhere. The other is to seek to improve both simultaneously. That may be possible but will require extensive effort.

      Major concerns

      Concern 1 - Ventral sclerites as free cheek homolog, marginal sutures, and the trilobite doublure

      Firstly, a couple of observations that bear on the arguments presented - the eyes of A. serratum are almost marginal and it is not clear whether a) there is a circumocular suture in this animal and b) if there was, whether it merged with the marginal suture. These observations are important because this animal is not one in which an impressive dorsal facial suture has been demonstrated - with eyes that near marginal it simply cannot do so. Accordingly, the key argument of this paper is not quite what one would expect. That expectation would be that a non-trilobite artiopod, such as A. serratum, shows a clear dorsal facial suture. But that is not the case, at least with A. serratum, because of its marginal eyes. Rather, the argument made is that the ventral doublure of A. serratum is the homolog of the dorsal free cheeks of trilobites. This opens up a series of issues.

      The paper's chief claim in this regard is that the "teardrop" shaped ventral, lateral cephalic plates in Acanthomeridion serratum are potential homologs of the "free cheeks" of those trilobites with a dorsal facial suture. There is no mention of the possibility that these ventral plates in A. serratum could be homologs of the lateral cephalic doublure of olenelloid trilobites, which is bound by an operative marginal suture or, in those trilobites with a dorsal facial suture, that it is a homolog of only the doublure portions of the free cheeks and not with their dorsal components.

      The introduction to the paper does not inform the reader that all olenelloids had a marginal suture - a circumcephalic suture that was operative in their molting and that this is quite different from the situation in, say, "Cedaria" woosteri in which the only operative cephalic exoskeletal suture was circumocular. The conservative position would be that the olenelloid marginal suture is the homolog of the marginal suture in A. serratum: the ventral plates thus being homolog of the trilobite cephalic doublure, not only potential homolog to the entire or dorsal only part of the free cheeks of trilobites with a dorsal facial suture. As the authors of this paper decline to discuss the doublure of trilobites (there is a sole mention of the word in the MS, in a figure caption) and do not mention the olenelloid marginal suture, they give the reader no opportunity to assess support for this alternative.

      At times the paper reads as if the authors are suggesting that olenelloids, which had a marginal cephalic suture broadly akin to that in Limulus, actually lacked a suture that permitted anterior egression during molting. The authors are right to stress the origin of the dorsal cephalic suture in more derived trilobites as a character seemingly of taxonomic significance but lines such as 56 and 67 may be taken by the non-specialist to imply that olenelloids lacked a forward egression-permiting suture. There is a notable difference between not knowing whether sutures existed (a condition apparently quite common among soft-bodied artiopods) and the well-known marginal suture of olenelloids, but as the MS currently reads most readers will not understand this because it remains unexplained in the MS.

      With that in mind, it is also worth further stressing that the primary function of the dorsal sutures in those which have them is essentially similar to the olenelloid/limulid marginal suture mentioned above. It is notable that the course of this suture migrated dorsally up from the margin onto the dorsal shield and merged with the circumocular suture, but this innovation does not seem to have had an impact on its primary function - to permit molting by forward egression. Other trilobites completely surrendered the ability to molt by forward egression, and there are even examples of this occurring ontogenetically within species, suggesting a significant intraspecific shift in suture functionality and molting pattern. The authors mention some of this when questioning the unique origin of the dorsal facial suture of trilobites, although I don't understand their argument: why should the history of subsequent evolutionary modification of a character bear on whether its origin was unique in the group?

      The bottom line here is that for the ventral plates of A. serratum to be strict homologs of only the dorsal portion of the dorsal free cheeks, there would be no homolog of the trilobite doublure in A. serratum. The conventional view, in contrast, would be that the ventral plates are a homolog of the ventral doublure in all trilobites and ventral plates in artiopods. I do not think that this paper provides a convincing basis for preferring their interpretation, nor do I feel that it does an adequate job of explaining issues that are central to the subject.

      Concern 2. Varieties of dorsal sutures and the coexistence of dorsal and marginal sutures

      The authors do not clarify or discuss connections between the circumocular sutures (a form of dorsal suture that separates the visual surface from the rest of the dorsal shield) and the marginal suture that facilitates forward egression upon molting. Both structures can exist independently in the same animal - in olenelloids for example. Olenelloids had both a suture that facilitated forward egression in molting (their marginal suture) and a dorsal suture (their circumocular suture). The condition in trilobites with a dorsal facial suture is that these two independent sutures merged - the formerly marginal suture migrating up the dorsal pleural surface to become confluent with the circumocular suture. (There are also interesting examples of the expansion of the circumocular suture across the pleural fixigena.) The form of the dorsal facial suture has long figured in attempts at higher-level trilobite taxonomy, with a number of character states that commonly relate to the proximity of the eye to the margin of the cephalic shield. The form of the dorsal facial suture that they illustrate in Xanderella, which is barely a strip crossing the dorsal pleural surface linking marginal and circumocular suture, is comparable to that in the trilobites Loganopeltoides and Entomapsis but that is a rare condition in that clade as a whole. The paper would benefit from a clear discussion of these issues at the beginning - the dorsal facial suture that they are referring to is a merged circumcephalic suture and circumocular suture - it is not simply the presence of a molt-related suture on the dorsal side of the cephalon.

      Concern 3. Phylogenetics<br /> While I appreciate that the phylogenetic database is a little modified from those of other recent authors, still I was surprised not to find a character matrix in the supplementary information (unless it was included in some way I overlooked), which I would consider a basic requirement of any paper presenting phylogenetic trees - after all, there's no a space limit. It is not possible for a reviewer to understand the details of their arguments without seeing the character states and the matrix of state assignments.

      The section "phylogenetic analyses" provides a description of how tree topology changes depending on whether sutures are considered homologous or not using the now standard application of both parsimony and maximum likelihood approaches but, considering that the broader implications of this paper rest of the phylogenetic interpretation, I also found the absence of detailed discussion of the meaning and implications of these trees to be surprising, because I anticipated that this was the main reason for conducting these analysis. The trees are presented and briefly described but not considered in detail. I am troubled by "Circles indicate presence of cephalic ecdysial sutures" because it seems that in "independent origin of sutures" trilobites are considered to have two origins (brown color dot) of cephalic ecdysial sutures - this may be further evidence that the team does not appreciate that olenelloids have cephalic ecdysial sutures, as the basal condition in all trilobites. Perhaps I'm misunderstanding their views, but from what's presented it's not possible to know that. Similarly, in the "sutures homologous" analyses why would there be two independent green dots for both Acanthomeridion and Trilobita, rather than at the base of the clade containing them both, as cephalic ecdysial sutures are basal to both of them? Here again, we appear to see evidence that the team considers dorsal facial sutures and cephalic ecdysial sutures to be synonymous - which is incorrect.

      This point aside, and at a minimum, that team needs to do a more thorough job of characterizing and considering the variety of conditions of dorsal sutures among artiopods, their relationships to the marginal suture and to the circumocular suture, the number, and form of their branches, etc.

    2. Reviewer #1 (Public Review):

      Summary:<br /> Du et al. report 16 new well-preserved specimens of atiopodan arthropods from the Chengjiang biota, which demonstrate both dorsal and ventral anatomies of a potential new taxon of antipodeans that are closely related to trilobites. Authors assigned their specimens to Acanthomeridion serratum and proposed A. anacanthus as a junior subjective synonym of Acanthomeridion serratum. Critically, the presence of ventral plates (interpreted as cephalic liberigenae), together with phylogenic results, lead authors to conclude that the cephalic sutures originated multiple times within the Artiopoda.

      Strengths:<br /> New specimens are highly qualified and informative. The morphology of the dorsal exoskeleton, except for the supposed free cheek, was well illustrated and described in detail, which provides a wealth of information for taxonomic and phylogenic analyses.

      Weaknesses:<br /> The weaknesses of this work are obvious in a number of aspects. Technically, ventral morphology is less well revealed and is poorly illustrated. Additional diagrams are necessary to show the trunk appendages and suture lines. Taxonomically, I am not convinced by the authors' placement. The specimens are markedly different from either Acanthomeridion serratum Hou et al. 1989 or A. anacanthus Hou et al. 2017. The ontogenetic description is extremely weak and the morpholical continuity is not established. Geometric and morphometric analyses might be helpful to resolve the taxonomic and ontogenic uncertainties. I am confused by the author's description of the free cheek (libragena) and ventral plate. Are they the same object? How do they connect with other parts of the cephalic shield, e.g. hypostome, and fixgena? Critically, the homology of cephalic slits (eye slits, eye notch, dorsal suture, facial suture) is not extensively discussed either morphologically or functionally. Finally, the authors claimed that phylogenic results support two separate origins rather than a deep origin. However, the results in Figure 4 can explain a deep homology of the cephalic suture at molecular level and multiple co-options within the Atiopoda.

    3. Reviewer #3 (Public Review):

      Summary: Well-illustrated new material is documented for Acanthomeridion, a formerly incompletely known Cambrian arthropod. The formerly known facial sutures are shown to be associated with ventral plates that the authors very reasonably homologise with the free cheeks of trilobites. A slight update of a phylogenetic dataset developed by Du et al, then refined slightly by Chen et al, then by Schmidt et al, and again here, permits another attempt to optimise the number of origins of dorsal ecdysial sutures in trilobites and their relatives.

      Strengths: Documentation of an ontogenetic series makes a sound case that the proposed diagnostic characters of a second species of Acanthomeridion are variations within a single species. New microtomographic data shed some light on appendage morphology that was not formerly known. The new data on ventral plates and their association with the ecdysial sutures are valuable in underpinning homologies with trilobites.

      Weaknesses: The main conclusion remains clouded in ambiguity because of a poorly resolved Bayesian consensus and is consistent with work led by the lead author in 2019 (thus compromising the novelty of the findings). The Bayesian trees being majority rules consensus trees, optimising characters onto them (Figure 7b, d) is problematic. Optimising on a consensus tree can produce spurious optimisations that inflate tree length or distort other metrics of fit. Line 264 refers to at least three independent origins of cephalic sutures in artiopodans but the fully resolved Figure 7c requires only two origins. We can't say how many origins are required by Figures 7b and 7d.

      The question of how many times dorsal ecdysial sutures evolved in Artiopoda was addressed by Hou et al (2017), who first documented the facial sutures of Acanthomeridion and optimised them onto a phylogeny to infer multiple origins, as well as in a paper led by the lead author in Cladistics in 2019. Du et al. (2019) presented a phylogeny based on an earlier version of the current dataset wherein they discussed how many times sutures evolved or were lost based on their presence in Zhiwenia/Protosutura, Acanthomeridion, and Trilobita. To their credit, the authors acknowledge this (lines 62-65). The answer here is slightly different (because some topologies unite Acanthomeridion and trilobites).

      The following points are not meant to be "Weaknesses" but rather are refinements:

      I recommend changing the title of the paper from "cephalic sutures" to "dorsal ecdysial sutures" to be more precise about the character that is being tracked evolutionarily. Lots of arthropods have cephalic sutures (e.g., the ventral marginal suture of xiphosurans; the Y-shaped dorsomedian ecdysial line in insects). The text might also be updated to change other instances of "cephalic sutures" to a more precise wording.

      The authors have provided (but not explicitly identified) support values for nodes in their Bayesian trees but not in their parsimony ones. Please do the jackknife or bootstrap for the parsimony analyses and make it clear that the Bayesian values are posterior probabilities.

      In line 65 or somewhere else, it might be noted that a single origin of the dorsal facial sutures in trilobites has itself been called into question. Jell (2003) proposed that separate lineages of Eutrilobita evolved their facial sutures independently from separate sister groups within Olenellina.

      I have provided minor typographic or terminological corrections to the authors in a list of recommendations that may not be publicly available.

    1. Reviewer #1 (Public Review):

      This paper consists of a comprehensive analysis of the malaria parasite Plasmodium falciparum during its development in erythrocytes, using expansion microscopy. The authors used general dyes to stain membranes or proteins and a set of specific markers to label diverse cellular structures of the parasite, with a particular focus on the centriolar plaque.

      This is by nature a purely descriptive study, providing remarkable images with great details on subcellular structures such as the centriolar plaque, the basal complex, the cytostome and rhoptries. The work is extremely well performed and the images are beautiful. This study confirms a number of previous observations and illustrates the strength of expansion microscopy, an affordable and adaptable sample preparation method that will undoubtedly become standard in the field.

      This study provides a valuable resource that can serve as a reference dataset for the analysis of P. falciparum and other apicomplexan parasites.

    2. Reviewer #2 (Public Review):

      In this work the authors describe the shape and interconnectedness of intracellular structures of malaria blood stage parasites by taking advantage of expansion microscopy. Compared to previous microscopy work with these parasites, the strength of this paper lies in the increased resolution and the fact that the NHE ester highlights protein densities. Together with the BodipyC membrane staining, this results in data that is somewhere in between EM and standard fluorescence microscopy: it has higher resolution than standard fluorescence microscopy and provides some points of reference of different cellular structures due to the NHE ester/BodipyC.

      This study makes many interesting and useful observations and although it is somewhat "old school descriptory" in its presentation, researchers working in many different areas will find something of interest here. This ranges from mitosis, to organisation and distribution of major cellular structures, endocytosis and invasion, overall providing a rich and interesting resource. The results section is long but by taking the space to explain everything in detail, it has the advantage that it clearly transpires how things were done and on how many cells a conclusion is based on. Further the authors often also included a brief interpretation of their findings with a very open assessment what it does and what it does not show, highlighting interesting questions left by the data.

      Overall this is a very nice and useful paper that will be of interest to many, particularly those working on nuclear division, cytokinesis, endocytosis or invasion in malaria parasites. The spatiotemporal arrangement and interconnection of subcellular structures will also give a framework for specific functional studies.

    3. Reviewer #3 (Public Review):

      In their study the authors analyze the localization of multiple organelles and subcellular structure of blood stage malaria parasites with unprecedented detail. They use a 3D super-resolution imaging technique that has gained popularity in the protozoan field, ultrastructure expansion microscopy. Building on markers and labels established in the field they generate an appealing collection of images for all stages of the intraerythrocytic developmental stages of asexual blood stage parasites with some focus on nuclear division and cell segmentation stages.

      The authors have made a very good effort to address all the comments raised by the reviewers providing more clarity to the manuscript and appropriate interpretations of their results. Particularly the sharing of their image data in the Dryad repository adds significant value to their work.

    1. Reviewer #1 (Public Review):

      In this study, the authors offer a fresh perspective on how visual working memory operates. They delve into the link between anticipating future events and retaining previous visual information in memory. To achieve this, the authors build upon their recent series of experiments that investigated the interplay between gaze biases and visual working memory. In this study, they introduce an innovative twist to their fundamental task. Specifically, they disentangle the location where information is initially stored from the location where it will be tested in the future. Participants are tasked with learning a novel rule that dictates how the initial storage location relates to the eventual test location. The authors leverage participants' gaze patterns as an indicator of memory selection. Intriguingly, they observe that microsaccades are directed toward both the past encoding location and the anticipated future test location. This observation is noteworthy for several reasons. Firstly, participants' gaze is biased towards the past encoding location, even though that location lacks relevance to the memory test. Secondly, there's a simultaneous occurrence of an increased gaze bias towards both the past and future locations. To explore this temporal aspect further, the authors conduct a compelling analysis that reveals the joint consideration of past and future locations during memory maintenance. Notably, microsaccades biased towards the future test location also exhibit a bias towards the past encoding location. In summary, the authors present an innovative perspective on the adaptable nature of visual working memory. They illustrate how information relevant to the future is integrated with past information to guide behavior.

      This short manuscript presents one experiment with straightforward analyses, clear visualizations, and a convincing interpretation. For their analysis, the authors focus on a single time window in the experimental trial (i.e., 0-1000 ms after retro cue onset). While this time window is most straightforward for the purpose of their study, other time windows are similarly interesting for characterizing the joint consideration of past and future information in memory. First, assessing the gaze biases in the delay period following the cue offset would allow the authors to determine whether the gaze bias towards the future location is sustained throughout the entire interval before the memory test onset. Presumably, the gaze bias towards the past location may not resurface during this delay period, but it is unclear how the bias towards the future location develops in that time window. Also, the disappearance of the retro cue constitutes a visual transient that may leave traces on the gaze biases which speaks again for assessing gaze biases also in the delay period following the cue offset.

      Moreover, assessing the gaze bias before retro-cue onset allows the authors to further characterize the observed gaze biases in their study. More specifically, the authors could determine whether the future location is considered already during memory encoding and the subsequent delay period (i.e., before the onset of the retro cue). In a trial, participants encode two oriented gratings presented at opposite locations. The future rule indicates the test locations relative to the encoding locations. In their example (Figure 1a), the test locations are shifted clockwise relative to the encoding location. Thus, there are two pairs of relevant locations (each pair consists of one stimulus location and one potential test location) facing each other at opposite locations and therefore forming an axis (in the illustration the axis would go from bottom left to top right). As the future rule is already known to the participants before trial onset it is possible that participants use that information already during encoding. This could be tested by assessing whether more microsaccades are directed along the relevant axis as compared to the orthogonal axis. The authors should assess whether such a gaze bias exists already before retro cue onset and discuss the theoretical consequences for their main conclusions (e.g., is the future location only jointly used if the test location is implicitly revealed by the retro cue).

    2. Reviewer #2 (Public Review):

      Summary:<br /> The manuscript by Liu et al. reports a task that is designed to examine the extent to which "past" and "future" information is encoded in working memory that combines a retro cue with rules that indicate the location of an upcoming test probe. An analysis of microsaccades on a fine temporal scale shows the extent to which shifts of attention track the location of the location of the encoded item (past) and the location of the future item (test probe). The location of the encoded grating of the test probe was always on orthogonal axes (horizontal, vertical) so that biases in microsaccades could be used to track shifts of attention to one or the other axis (or mixtures of the two). The overall goal here was then to (1) create a methodology that could tease apart memory for the past and future, respectively, (2) to look at the time-course attention to past/future, and (3) to test the extent to which microsaccades might jointly encode past and future memoranda. Finally, some remarks are made about the plausibility of various accounts of working memory encoding/maintenance based on the examination of these time courses.

      Strengths:<br /> This research has several notable strengths. It has a clear statement of its aims, is lucidly presented, and uses a clever experimental design that neatly orthogonalizes "past" and "future" as operationalized by the authors. Figure 1b-d shows fairly clearly that saccade directions have an early peak (around 300ms) for the past and a "ramping" up of saccades moving in the forward direction. This seems to be a nice demonstration the method can measure shifts of attention at a fine temporal resolution and differentiate past from future-oriented saccades due to the orthogonal cue approach. The second analysis shown in Figure 2, reveals a dependency in saccade direction such that saccades toward the probe future were more likely also to be toward the encoded location than away from the encoded direction. This suggests saccades are jointly biased by both locations "in memory".

      Weaknesses:<br /> 1. The "central contribution" (as the authors characterize it) is that "the brain simultaneously retains the copy of both past and future-relevant locations in working memory, and (re)activates each during mnemonic selection", and that: "... while it is not surprising that the future location is considered, it is far less trivial that both past and future attributes would be retained and (re)activated together. This is our central contribution." However, to succeed at the task, participants must retain the content (grating orientation, past) and probe location (future) in working memory during the delay period. It is true that the location of the grating is functionally irrelevant once the cue is shown, but if we assume that features of a visual object are bound in memory, it is not surprising that location information of the encoded object would bias processing as indicated by microsaccades. Here the authors claim that joint representation of past and future is "far less trivial", this needs to be evaluaed from the standpoint of prior empirical data on memory decay in such circumstances, or some reference to the time-course of the "unbinding" of features in an encoded object.

      2. The authors refer to "future" and "past" information in working memory and this makes sense at a surface level. However, once the retrocue is revealed, the "rule" is retrieved from long-term memory, and the feature (e.g. right/left, top/bottom) is maintained in memory like any other item representation. Consider the classic test of digit span. The digits are presented and then recalled. Are the digits of the past or future? The authors might say that one cannot know, because past and future are perfectly confounded. An alternative view is that some information in working memory is relevant and some is irrelevant. In the digit span task, all the digits are relevant. Relevant information is relevant precisely because it is thought be necessary in the future. Irrelevant information is irrelevant precisely because it is not thought to be needed in the immediate future. In the current study, the orientation of the grating is relevant, but its location is irrelevant; and the location of the test probe is also relevant.

      3. It is not clear how the authors interpret the "joint representation" of past and future. Put aside "future" and "past" for a moment. If there are two elements in memory, both of which are associated with spatial bindings, the attentional focus might be a spatial average of the associated spatial indices. One might also view this as an interference effect, such that the location of the encoded location attracts spatial attention since it has not been fully deleted/removed from working memory. Again, for the impact of the encoded location to be exactly zero after the retrieval cue, requires zero interference or instantaneous decay of the bound location information. It would be helpful for the authors to expand their discussion to further explain how the results fit within a broader theoretical framework and how it fits with empirical data on how quickly an irrelevant feature of an object can be deleted from working memory.

    3. Reviewer #3 (Public Review):

      This study utilizes saccade metrics to explore, what the authors term the "past and future" of working memory. The study features an original design: in each trial, two pairs of stimuli are presented, first a vertical pair and then a horizontal one. Between these two pairs comes the cue that points the participant to one target of the first pair and another of the second pair. The task is to compare the two cued targets. The design is novel and original but it can be split into two known tasks - the first is a classic working memory task (a post-cue informs participants which of two memorized items is the target), which the authors have used before; and the second is a classic spatial attention task (a pre-cue signal that attention should be oriented left or right), which was used by numerous other studies in the past. The combination of these two tasks in one design is novel and important, as it enables the examination of the dynamics and overlapping processes of these tasks, and this has a lot of merit. However, each task separately is not new. There are quite a few studies on working memory and microsaccades and many on spatial attention and microsaccades. I am concerned that the interpretation of "past vs. future" could mislead readers to think that this is a new field of research, when in fact it is the (nice) extension of an existing one. Since there are so many studies that examined pre-cues and post-cues relative to microsaccades, I expected the interpretation here to rely more heavily on the existing knowledge base in this field. I believe this would have provided a better context of these findings, which are not only on "past" vs. "future" but also on "working memory" vs. "spatial attention".

    1. Joint Public Review:

      This study examined the role of parvalbumin (PV) cells in the rodent ventromedial prefrontal cortex (vmPFC) in active avoidance behavior. Using behavior combined with fiber photometry and optogenetics, the results indicate that prefrontal parvalbumin (PV) neurons play a permissive role in acquiring and performing signaled active avoidance learning. Notably, parvalbumin neurons suppress conditional freezing, enabling subjects to acquire the instrumental avoidance contingency and its subsequent performance. These findings advance our understanding of how the prefrontal cortex supports aversively motivated instrumental behavior and may provide insight into both stress vulnerability and resilience processes.

      Strengths

      All reviewers noted that the paper is well-written and compelling. The experiments themselves were well-designed using state-of-the-art methods and impressive and rigorous analyses. The reviewers appreciated that the authors included multiple controls to demonstrate that the uncovered prefrontal mechanism is selective for the initiation of operant behavior under aversive circumstances, rather than a role for cue offset in triggering changes in PV neuron activity, and for a nonspecific role in movement initiation. The results are all consistent with a conceptual model in which vmPFC PV neurons inhibit freezing to enable avoidance movements

      Weaknesses

      In general, no substantive weaknesses were noted. Minor weaknesses were noted across two areas, noted below.

      Additional Discussion Points

      1. There is not much exploration of potential mechanisms, i.e., the impact of PV neuron activity on the broader circuit. Additionally, the study exclusively focuses on PV cells and does not explore the role of other prefrontal populations, particularly those known to respond to cue-evoked fear states. The discussion should consider how PV activity might impact the broader circuit and whether the present findings are specific to PV cells or applicable to other interneuron subtypes.

      2. There is some discordance between changes in neural activity and behavior. For example, in Figure 4C, the relationship between PV neuron activity and movement emerges almost immediately during learning, but successful active avoidance emerges much more gradually. Why is this?

      3. vmPFC was defined here as including the infralimbic (IL) and dorsal peduncular (DP) regions. While the role of IL has been frequently characterized for motivated behavior, relatively few studies have examined DP. Perhaps the authors are just being cautious, given the challenges involved in the viral targeting of the IL region without leakage to nearby regions such as DP. But since the optical fibers were positioned above the IL region, it is possible that DP did not contribute much to either the fiber photometry signals or the effects of the optogenetic manipulations. Perhaps DP should be completely omitted, which is more consistent with the definitions of vmPFC in the field.

      4. In the Discussion, the authors should consider why PV cells exhibit increased activity during both movement initiation and successful chamber crossing during avoidance. While the functional contribution of the PV signal during movement initiation was tested with optogenetic inhibition, some discussion on the possible role of the additional PV signal during chamber crossing is of interest readers who are intrigued by the signaling of two events. Is the chamber crossing signal related to successful avoidance or learned safety (e.g., see Sangha, Diehl, Bergstrom, Drew 2020)?

      5. The primary conclusion here that PV cells control the fear response should be considered within the context of prior findings by the Herry laboratory. Courtin et al (2014) demonstrated a select role of prefrontal PV cells in the regulation of fear states, accomplished through their control over prefrontal output to the basolateral amygdala. The observations in this paper, which used both ChR2 and Arch-T to address the impact of vmPFC PV activity on reactive behavior, are highly relevant to issues raised both in the Introduction and Discussion.

      Additional analyses

      1. As avoidance trials progress (particularly on days 2 and 3), do PFC PV responses attenuate? That is, does continued unreinforced tone presentations lead to reduced reliance of PV cell-mediated suppression in order for successful avoidance to occur?

      2. In Figure 3D, it would be very informative and further support the claim of "no role for movement during reward" if the response of these cells during the "initiation of movement during reward-approach" was shown (similar to Figure 1F for threat avoidance).

    1. Reviewer #1 (Public Review):

      The authors' primary research question revolves around the inquiry of "how far in advance semantic information might become available from parafoveal preview." In contrast to prior studies, the current research seeks to achieve a breakthrough in terms of timing by employing innovative technology. They mention in the manuscript that "most of these studies have been limited to measuring parafoveal preview from fixations to an immediately adjacent word... We tackle these core issues using a new technique that combines the use of frequency tagging and the measurement of magnetoencephalography (MEG)-based signals." However, the argumentation for how this new technology constitutes a breakthrough is not sufficiently substantiated. Specifically, there are two aspects that require further clarification. Firstly, the authors should clarify the importance of investigating the timing of semantic integration in their research question. They need to justify why previous studies focusing on the preview effect during fixations to an immediately adjacent word cannot address their specific inquiry about "how far in advance semantic information might become available from parafoveal preview," which requires examining parafoveal processing (POF). Secondly, in terms of the research methodology, the authors should provide a more comprehensive explanation of the advantages offered by MEG technology in the observation of the timing of semantic integration compared to the techniques employed in prior research. Indeed, the authors have overlooked some rather significant studies in this area. For instance, the research conducted by Antúnez, Milligan, Hernández-Cabrera, Barber, & Schotter in 2022 addresses the same research question mentioned in the current study and employs a similar experimental design. Importantly, they utilize a natural reading paradigm with synchronized ERP and eye-tracking recordings. Collectively, these studies, along with the series of prior research studies employing ERP techniques and RSVP paradigms discussed by the authors in their manuscript, provide ample evidence that semantic information becomes available and integrated from words before fixation occurs. Therefore, the authors should provide a more comprehensive citation of relevant research and delve deeper into explaining the potential contributions of their chosen technology to this field.

      Further, the authors emphasize semantic integration in their observed results but overlook the intricate relationship between access, priming, and integration. This assertion appears overly confident. Despite using low-constraint sentences and low-predicted targets (lines 439-441), differences between congruent and incongruent conditions may be influenced by word-level factors. For instance, in the first coherent sentence, such as "Last night, my lazy brother came to the party one minute before it was over" (line 1049), replacing the keyword "brother" with an incongruent word could create an incoherent sentence, possibly due to semantic violation, relation mismatch with "lazy," or prediction error related to animate objects. A similar consideration applies to the second example sentence, "Lily says this blue jacket will be a big fashion trend this fall" (line 1050), where the effect might result from a discrepancy between "blue" and an incongruent word. However, the authors do not provide incongruent sentences to substantiate their claims. I recommend that the authors discuss alternative explanations and potentially control for confounding factors before asserting that their results unequivocally reflect semantic integration. My intention is not to dispute the semantic integration interpretation but to stress the necessity for stronger evidence to support this assertion.

    2. Reviewer #2 (Public Review):

      This MEG study used co-registered eye-tracking and Rapid Invisible Frequency Tagging (RIFT) to track the effects of semantic parafoveal preview during natural sentence reading. Unpredictable target words could either be congruent or incongruent with sentence context. This modulated the RIFT response already while participants were fixating on the preceding word. This indicates that the semantic congruency of the upcoming word modulates visual attention demands already in parafoveal preview.<br /> The quest for semantic parafoveal preview in natural reading has attracted a lot of attention in recent years, especially with the development of co-registered EEG and MEG. Evidence from dynamic neuroimaging methods using innovative paradigms as in this study is important for this debate.

      Major points:<br /> 1) The authors frame their study in terms of "congruency with sentence context". However, it is the congruency between adjective-noun pairs that determines congruency (e.g. "blue brother" vs "blue jacket", and examples p. 16 and appendix). This is confirmed by Suppl Figure 1, which shows a significantly larger likelihood of refixations to the pre-target word for incongruent sentences, probably because the pre-target word is most diagnostic for the congruency of the target word. The authors discuss some possibilities as to why there is variability in parafoveal preview effects in the literature. It is more likely to see effects for this simple and local congruency, rather than congruency that requires an integration and comprehension of the full sentence. I'm not sure whether the authors really needed to present their stimuli in a full-sentence context to obtain these effects. This should be explicitly discussed and also mentioned in the introduction (or even the abstract).

      2) The authors used MEG and provided a source estimate for the tagging response (Figure 2), which unsurprisingly is in the visual cortex. The most important results are presented at the sensor level. This does not add information about the brain sources of the congruency effect, as the RIFT response probably reflects top-down effects on visual attention etc. Was it necessary to use MEG? Would EEG have produced the same results? In terms of sensitivity, EEG is better than MEG as it is more sensitive to radial and deeper sources. This should be mentioned in the discussion and/or methods section.

      3) The earliest semantic preview effects occurred around 100ms after fixating the pre-target word (discussed around l. 323). This means that at this stage the brain must have processed the pre-target and the target word and integrated their meanings (at some level). Even in the single-word literature, semantic effects at 100 ms are provocatively early. Even studies that tried to determine the earliest semantic effects arrived at around 200 ms (e.g. (https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3382728/, https://psycnet.apa.org/record/2013-17451-002). The present results need to be discussed in a bit more detail in the context of the visual word recognition literature.

      4) As in previous EEG/MEG studies, the authors found a neural but no behavioural preview effect. As before, this raises the question of whether the observed effect is really "critical" for sentence comprehension. The authors provide a correlation analysis with reading speed, but this does not allow causal conclusions: Some people may simply read slowly and therefore pay more attention and get a larger preview response. Some readers may hurry and therefore not pay attention and not get a preview response. In order to address this, one would have to control for reading speed and show an effect of RIFT response on comprehension performance (or vice versa, with a task that is not close to ceiling performance). The last sentence of the discussion is currently not justified by the results.

      5) L. 577f.: ICA components were selected by visual inspection. I would strongly recommend including EOG in future recordings when the control of eye movements is critical.

      6) The authors mention "saccade planning" a few times. I would suggest looking at the SWIFT model of eye movement control, which is less mechanistic than the dominant EZ-Reader model (https://psycnet.apa.org/record/2005-13637-003). It may be useful for the framing of the study and interpretation of the results (e.g. second paragraph of discussion).

    1. Reviewer #1 (Public Review):

      Summary:<br /> This study focuses on the role of GABA in semantic memory and its neuroplasticity. The researchers stimulated the left ATL and control site (vertex) using cTBS, measured changes in GABA before and after stimulation using MRS, and measured changes in BOLD signals during semantic and control tasks using fMRI. They analyzed the effects of stimulation on GABA, BOLD, and behavioral data, as well as the correlation between GABA changes and BOLD changes caused by the stimulation. The authors also analyzed the relationship between individual differences in GABA levels and behavioral performance in the semantic task. They found that cTBS stimulation led to increased GABA levels and decreased BOLD activity in the ATL, and these two changes were highly correlated. However, cTBS stimulation did not significantly change participants' behavioral performance on the semantic task, although behavioral changes in the control task were found after stimulation. Individual levels of GABA were significantly correlated with individuals' accuracy on the semantic task, and the inverted U-shaped (quadratic) function provides a better fit than the linear relationship. The authors argued that the results support the view that GABAergic inhibition can sharpen activated distributed semantic representations. They also claimed that the results revealed, for the first time, a non-linear, inverted-U-shape relationship between GABA levels in the ATL and semantic function, by explaining individual differences in semantic task performance and cTBS responsiveness

      Strengths:<br /> The findings of the research regarding the increase of GABA and decrease of BOLD caused by cTBS, as well as the correlation between the two, appear to be reliable. This should be valuable for understanding the biological effects of cTBS.

      Weaknesses:<br /> Regarding the behavioral effects of GABA on semantic tasks, especially its impact on neuroplasticity, the results presented in the article are inadequate to support the claims made by the authors. There are three aspects of results related to this: 1) the effects of cTBS stimulation on behavior, 2) the positive correlation between GABA levels and semantic task accuracy, and 3) the nonlinear relationship between GABA levels and semantic task accuracy. Among these three pieces of evidence, the clearest one is the positive correlation between GABA levels and semantic task accuracy. However, it is important to note that this correlation already exists before the stimulation, and there are no results supporting that it can be modulated by the stimulation. In fact, cTBS significantly increases GABA levels but does not significantly improve performance on semantic tasks. According to the authors' interpretation of the results in Table 1, cTBS stimulation may have masked the practice effects that were supposed to occur. In other words, the stimulation decreased rather than enhanced participants' behavioral performance on the semantic task.

      The stimulation effect on behavioral performance could potentially be explained by the nonlinear relationship between GABA and performance on semantic tasks proposed by the authors. However, the current results are also insufficient to support the authors' hypothesis of an inverted U-shaped curve. Firstly, in Figure 3C and Figure 3D, the last one-third of the inverted U-shaped curve does not have any data points. In other words, as the GABA level increases the accuracy of the behavior first rises and then remains at a high level. This pattern of results may be due to the ceiling effect of the behavioral task's accuracy, rather than an inverted U-shaped ATL GABA function in semantic memory. Second, the article does not provide sufficient evidence to support the existence of an optimal level of GABA in the ATL. Fortunately, this can be tested with additional data analysis. The authors can estimate, based on pre-stimulus data from individuals, the optimal level of GABA for semantic functioning. They can then examine two expectations: first, participants with pre-stimulus GABA levels below the optimal level should show improved behavioral performance after stimulation-induced GABA elevation; second, participants with pre-stimulus GABA levels above the optimal level should exhibit a decline in behavioral performance after stimulation-induced GABA elevation. Alternatively, the authors can categorize participants into groups based on whether their behavioral performance improves or declines after stimulation, and compare the pre- and post-stimulus GABA levels between the two groups. If the improvement group shows significantly lower pre-stimulus GABA levels compared to the decline group, and both groups exhibit an increase in GABA levels after stimulation, this would also provide some support for the authors' hypothesis.

      Another issue in this study is the confounding of simulation effects and practice effects. According to the results, there is a significant improvement in performance after the simulation, at least in the control task, which the authors suggest may reflect a practice effect. The authors argue that the results in Table 1 suggest a similar practice effect in the semantic task, but it is masked by the simulation of the ATL. However, since no significant effects were found in the ANOVA analysis of the semantic task, it is actually difficult to draw a conclusion. This potential confound increases the risk in data analysis and interpretation. Specifically, for Figure 3D, if practice effects are taken into account, the data before and after the simulation should not be analyzed together.

    2. Reviewer #2 (Public Review):

      Summary:<br /> The authors combined inhibitory neurostimulation (continuous theta-burst stimulation, cTBS) with subsequent MRI measurements to investigate the impact of inhibition of the left anterior temporal lobe (ATL) on task-related activity and performance during a semantic task and link stimulation-induced changes to the neurochemical level by including MR spectroscopy (MRS). cTBS effects in the ATL were compared with a control site in the vertex. The authors found that relative to stimulation of the vertex, cTBS significantly increased the local GABA concentration in the ATL. cTBS also decreased task-related semantic activity in the ATL and potentially delayed semantic task performance by hindering a practice effect from pre to post. Finally, pooled data from their previous MRS study suggest an inverted U-shape between GABA concentration and behavioral performance. These results help to better understand the neuromodulatory effects of non-invasive brain stimulation on task performance.

      Strengths:<br /> Multimodal assessment of neurostimulation effects on the behavioral, neurochemical, and neural levels. In particular, the link between GABA modulation and behavior is timely and potentially interesting.

      Weaknesses:<br /> The analyses are not sound. Some of the effects are very weak and not all conclusions are supported by the data since some of the comparisons are not justified. There is some redundancy with a previous paper by the same authors, so the novelty and contribution to the field are overall limited. A network approach might help here.

    3. Reviewer #3 (Public Review):

      Summary:<br /> The authors used cTBS TMS, magnetic resonance spectroscopy (MRS), and functional magnetic resonance imaging (fMRI) as the main methods of investigation. Their data show that cTBS modulates GABA concentration and task-dependent BOLD in the ATL, whereby greater GABA increase following ATL cTBS showed greater reductions in BOLD changes in ATL. This effect was also reflected in the performance of the behavioural task response times, which did not subsume to practice effects after AL cTBS as opposed to the associated control site and control task. This is in line with their first hypothesis. The data further indicates that regional GABA concentrations in the ATL play a crucial role in semantic memory because individuals with higher (but not excessive) GABA concentrations in the ATLs performed better on the semantic task. This is in line with their second prediction. Finally, the authors conducted additional analyses to explore the mechanistic link between ATL inhibitory GABAergic action and semantic task performance. They show that this link is best captured by an inverted U-shaped function as a result of a quadratic linear regression model. Fitting this model to their data indicates that increasing GABA levels led to better task performance as long as they were not excessively low or excessively high. This was first tested as a relationship between GABA levels in the ATL and semantic task performance; then the same analyses were performed on the pre and post-cTBS TMS stimulation data, showing the same pattern. These results are in line with the conclusions of the authors.

      Strengths:<br /> I thoroughly enjoyed reading the manuscript and appreciate its contribution to the field of the role of the ATL in semantic processing, especially given the efforts to overcome the immense challenges of investigating ATL function by neuroscientific methods such as MRS, fMRI & TMS. The main strengths are summarised as follows:

      • The work is methodologically rigorous and dwells on complex and complementary multimethod approaches implemented to inform about ATL function in semantic memory as reflected in changes in regional GABA concentrations. Although the authors previously demonstrated a negative relationship between increased GABA levels and BOLD signal changes during semantic processing, the unique contribution of this work lies within evidence on the effects of cTBS TMS over the ATL given by direct observations of GABA concentration changes and further exploring inter-individual variability in ATL neuroplasticity and consequent semantic task performance.

      • Another major asset of the present study is implementing a quadratic regression model to provide insights into the non-linear relationship between inhibitory GABAergic activity within the ATLs and semantic cognition, which improves with increasing GABA levels but only as long as GABA levels are not extremely high or low. Based on this finding, the authors further pinpoint the role of inter-individual differences in GABA levels and cTBS TMS responsiveness, which is a novel explanation not previously considered (according to my best knowledge) in research investigating the effect of TMS on ATLs.

      • There are also many examples of good research practice throughout the manuscript, such as the explicitly stated exploratory analyses, calculation of TMS electric fields, using ATL optimised dual echo fRMI, links to open source resources, and a part of data replicates a previous study by Jung et. al (2017).

      Weaknesses:<br /> • Research on the role of neurotransmitters in semantic memory is still very rare and therefore the manuscript would benefit from more context on how GABA contributes to individual differences in cognition/behaviour and more justification on why the focus is on semantic memory. A recommendation to the authors is to highlight and explain in more depth the particular gaps in evidence in this regard.

      • The focus across the experiments is on the left ATL; how do the authors justify this decision? Highlighting the justification for this methodological decision will be important, especially given that a substantial body of evidence suggests that the ATL should be involved in semantics bilaterally (e.g. Hoffman & Lambon Ralph, 2018; Lambon Ralph et al., 2009; Rice et al., 2017; Rice, Hoffman, et al., 2015; Rice, Ralph, et al., 2015; Visser et al., 2010).

      • When describing the results, (Pg. 11; lines 233-243), the authors first show that the higher the BOLD signal intensity in ATL as a response to the semantic task, the lower the GABA concentration. Then, they state that individuals with higher GABA concentrations in the ATL perform the semantic task better. Although it becomes clearer with the exploratory analysis described later, at this point, the results seem rather contradictory and make the reader question the following: if increased GABA leads to less task-induced ATL activation, why at this point increased GABA also leads to facilitating and not inhibiting semantic task performance? It would be beneficial to acknowledge this contradiction and explain how the following analyses will address this discrepancy.

      • There is an inconsistency in reporting behavioural outcomes from the performance on the semantic task. While experiment 1 (cTBS modulates regional GANA concentrations and task-related BOLD signal changes in the ATL) reports the effects of cTBS TMS on response times, experiment 2 (Regional GABA concentrations in the ATL play a crucial role in semantic memory) and experiment 3 (The inverted U-shaped function of ATL GABA concentration in semantic processing) report results on accuracy. For full transparency, the manuscript would benefit from reporting all results (either in the main text or supplementary materials) and providing further explanations on why only one or the other outcome is sensitive to the experimental manipulations across the three experiments.

      Overall, the most notable impact of this work is the contribution to a better understanding of individual differences in semantic behaviour and the potential to guide therapeutic interventions to restore semantic abilities in neurological populations. While I appreciate that this is certainly the case, I would be curious to read more about how this could be achieved.

    1. Reviewer #1 (Public Review):

      Summary: This article explores the role of Ecdysone in regulating female sexual receptivity in Drosophila. The researchers found that PTTH, throughout its role as a positive regulator of ecdysone production, negatively affects the receptivity of adult virgin females. Indeed, loss of larval PTTH before metamorphosis significantly increases female receptivity right after adult eclosion and also later. However, during metamorphic neurodevelopment, Ecdysone, primarily through its receptor EcR-A, is required to properly develop the P1 neurons since its silencing led to morphological changes associated with a reduction in adult female receptivity. Nonetheless, the result shown in this manuscript sheds light on how Ecdysone plays a dual role in female adult receptivity, inhibiting it during larval development and enhancing it during metamorphic development. Unfortunately, this dual and opposite effect in two temporally different developmental stages has not been highlighted or explained.

      Strengths: This paper exhibits multiple strengths in its approach, employing a well-structured experimental methodology that combines genetic manipulations, behavioral assays, and molecular analysis to explore the impact of Ecdysone on regulating virgin female receptivity in Drosophila. The study provides clear and substantial findings, highlighting that removing PTTH, a positive Ecdysone regulator, increases virgin female receptivity. Additionally, the research expands into the temporal necessity of PTTH and Ecdysone function during development.

      Weaknesses:<br /> There are two important caveats with the data that are reflecting a weakness:

      1-Contradictory Effects of Ecdysone and PTTH: One notable weakness in the data is the contrasting effects observed between Ecdysone and its positive regulator PTTH. PTTH loss of function increases female receptivity, while ecdysone loss of function reduces it. Given that PTTH positively regulates Ecdysone, one would expect that the loss of function of both would result in a similar phenotype or at least a consistent directional change.

      2- Discordant Temporal Requirements for Ecdysone and PTTH: Another weakness lies in the different temporal requirements for Ecdysone and PTTH. The data from the manuscript suggest that PTTH is necessary during the larval stage, as shown in Figure 2 E-G, while Ecdysone is required during the pupal stage, as indicated in Figure 5 I-K. Ecdysone is a crucial developmental hormone with precisely regulated expression throughout development, exhibiting several peaks during both larval and pupal stages. PTTH is known to regulate Ecdysone during the larval stage, specifically by stimulating the kinetics of Ecdysone peaking at the wandering stage. However, it remains unclear whether pupal PTTH, expressed at higher levels during metamorphosis, can stimulate Ecdysone production during the pupal stage. Additionally, given the transient nature of the Ecdysone peak produced at wandering time, which disappears shortly before the end of the prepupal stage, it is challenging to infer that larval PTTH will regulate Ecdysone production during the pupal stage based on the current state of knowledge in the neuroendocrine field.

      Considering these two caveats, the results suggest that the authors are witnessing distinct temporal and directional effects of Ecdysone on virgin female receptivity.

    2. Reviewer #2 (Public Review):

      Summary: The authors tried to identify novel adult functions of the classical Drosophila juvenile-adult transition axis (i.e. ptth-ecdysone). Surprisingly, larval ptth-expressing neurons expressed the sex-specific doublesex gene, thus belonging to the sexual dimorphic circuit. Lack of ptth during late larval development caused enhanced female sexual receptivity, an effect rescued by supplying ecdysone in the food. Among many other cellular players, pC1 neurons control receptivity by encoding the mating status of females. Interestingly, during metamorphosis, a subtype of pC1 neurons required Ecdysone Receptor A in order to regulate such female receptivity. A transcriptomic analysis using pC1-specific Ecdyone signaling down-regulation gives some hints of possible downstream mechanisms.

      Strengths: the manuscript showed solid genetic evidence that lack of ptth during development caused enhanced copulation rate in female flies, which includes ptth mutant rescue experiments by over-expressing ptth as well as by adding ecdysone-supplemented food. They also present elegant data dissecting the temporal requirements of ptth-expressing neurons by shifting animals from non-permissive to permissive temperatures, in order to inactivate neuronal function (although not exclusively ptth function). By combining different drivers together with a EcR-A RNAi line authors also identified the Ecdysone receptor requirements of a particular subtype of pC1 neurons during metamorphosis. Convincing live calcium imaging showed no apparent effect of EcR-A in neural activity, although some effect on morphology is uncovered. Finally, bulk RNAseq shows differential gene expression after EcR-A down-regulation.

      Weaknesses: the paper has three main weaknesses. The first one refers to temporal requirements of ptth and ecdysone signaling. Whereas ptth is necessary during larval development, the ecdysone effect appears during pupal development. ptth induces ecdysone synthesis during larval development but there is no published evidence about a similar role for ptth during pupal stages. Furthermore, larval and pupal ecdysone functions are different (triggering metamorphosis vs tissue remodeling). The second caveat is the fact that ptth and ecdysone loss-of-function experiments render opposite effects (enhancing and decreasing copulation rates, respectively). The most plausible explanation is that both functions are independent of each other, also suggested by differential temporal requirements. Finally, in order to identify the effect in the transcriptional response of down-regulating EcR-A in a very small population of neurons, a scRNAseq study should have been performed instead of bulk RNAseq.

      In summary, despite the authors providing convincing evidence that ptth and ecdysone signaling pathways are involved in female receptivity, the main claim that ptth regulates this process through ecdysone is not supported by results. More likely, they'd rather be independent processes.

    3. Reviewer #3 (Public Review):

      Summary:<br /> This manuscript shows that mutations that disable the gene encoding the PTTH gene cause an increase in female receptivity (they mate more quickly), a phenotype that can be reversed by feeding these mutants the molting hormone, 20-hydoxyecdysone (20E). The use of an inducible system reveals that inhibition or activation of PTTH neurons during the larval stages increases and decreases female receptivity, respectively, suggesting that PTTH is required during the larval stages to affect the receptivity of the (adult) female fly. Showing that these neurons express the sex-determining gene dsx leads the authors to show that interfering with 20E actions in pC1 neurons, which are dsx-positive neurons known to regulate female receptivity, reduces female receptivity and increases the arborization pattern of pC1 neurons. The work concludes by showing that targeted knockdown of EcRA in pC1 neurons causes 527 genes to be differentially expressed in the brains of female flies, of which 123 passed a false discovery rate cutoff of 0.01; interestingly, the gene showing the greatest down-regulation was the gene encoding dopamine beta-monooxygenase.

      Stengths<br /> This is an interesting piece of work, which may shed light on the basis for the observation noted previously that flies lacking PTTH neurons show reproductive defects ("... females show reduced fecundity"; McBrayer, 2007; DOI 10.1016/j.devcel.2007.11.003).

      Weaknesses:<br /> There are some results whose interpretation seem ambiguous and findings whose causal relationship is implied but not demonstrated.<br /> 1- At some level, the findings reported here are not at all surprising. Since 20E regulates the profound changes that occur in the central nervous system (CNS) during metamorphosis, it is not surprising that PTTH would play a role in this process. Although animals lacking PTTH (rather paradoxically) live to adulthood, they do show greatly extended larval instars and a corresponding great delay in the 20E rise that signals the start of metamorphosis. For this reason, concluding that PTTH plays a SPECIFIC role in regulating female receptivity seems a little misleading, since the metamorphic remodeling of the entire CNS is likely altered in PTTH mutants. Since these mutants produce overall normal (albeit larger--due to their prolonged larval stages) adults, these alterations are likely to be subtle. Courtship has been reported as one defect expressed by animals lacking PTTH neurons, but this behavior may stand out because reduced fertility and increased male-male courtship (McBrayer, 2007) would be noticeable defects to researchers handling these flies. By contrast, detecting defects in other behaviors (e.g., optomotor responses, learning and memory, sleep, etc) would require closer examination. For this reason, I would ask the authors to temper their statement that PTTH is SPECIFICALLY involved in regulating female receptivity.<br /> 2- The link between PTTH and the role of pC1 neurons in regulating female receptivity is not clear. Again, since 20E controls the metamorphic changes that occur in the CNS, it is not surprising that 20E would regulate the arborization of pC1 neurons. And since these neurons have been implicated in female receptivity, it would therefore be expected that altering 20E signaling in pC1 neurons would affect this phenotype. However, this does not mean that the defects in female receptivity expressed by PTTH mutants are due to defects in pC1 arborization. For this, the authors would at least have to show that PTTH mutants show the changes in pC1 arborization shown in Fig. 6. And even then the most that could be said is that the changes observed in these neurons "may contribute" to the observed behavioral changes. Indeed, the changes observed in female receptivity may be caused by PTTH/20E actions on different neurons.<br /> 3- Some of the results need commenting on, or refining, or revising:<br /> a- For some assays PTTH behaves sometimes like a recessive gene and at other times like a semi-dominant, and yet at others like a dominant gene. For instance, in Fig. 1D-G, PTTH[-]/+ flies behave like wildtype (D), express an intermediate phenotype (E-F), or behave like the mutant (G). This may all be correct but merits some comment.<br /> b- Some of the conclusions are overstated. i) Although Fig. 2E-G does show that silencing the PTTH neurons during the larval stages affects copulation rate (E) the strength of the conclusion is tempered by the behavior of one of the controls (tub-GAL80[ts]/+, UAS-Kir2.1/+) in panels F and G, where it behaves essentially the same as the experimental group (and quite differently from the PTTH-GAL4/+ control; blue line).(Incidentally, the corresponding copulation latency should also be shown for these data.). ii) For Fig. 5I-K, the conclusion stated is that "Knock-down of EcR-A during pupal stage significantly decreased the copulation rate." Although strictly correct, the problem is that panel J is the only one for which the behavior of the control lacking the RNAi is not the same as that of the experimental group. Thus, it could just be that when the experiment was done at the pupal stage is the only situation when the controls were both different from the experimental. Again, the results shown in J are strictly speaking correct but the statement is too definitive given the behavior of one of the controls in panels I and K. Note also that panel F shows that the UAS-RNAi control causes a massive decrease in female fertility, yet no mention is made of this fact.

    1. Joint Public Review:

      Summary:<br /> Given the cost of producing action potentials and transmitting them along axons, it has always seemed a bit strange that there are synaptic failures: when a spike arrives at a synapse, about half the time nothing happens. One explanation comes from a Bayesian inference perspective: because of noise and limited information, the best a synapse can do is compute a probability distribution over its true weight; to communicate the resulting uncertainty it samples from that distribution. In this view, failures are a means of sampling from a synapse's probability distribution. Here the authors offer another explanation: energy efficiency. In this view, synaptic parameters (mean and variance of the synaptic weights) are adapted to perform some task while penalising small variances, which, the authors show, are energetically expensive.

      The authors show both numerically and analytically the strong link between those two frameworks. In particular, both frameworks predict that (a) synaptic variance should decrease when the input firing rate increases and (b) the learning rate should increase when the weight variances increase. Both predictions have some experimental support.

      Finally, the authors relate the cost of small variance to the cost used in variational Bayesian inference. Intriguingly, the biophysical cost provides a lower bound on the variational inference cost. This is intellectually satisfying, as it answers a "why" question: why would evolution evolve to produce the kind of costs seen in the brain?

      Strengths:<br /> 1. The paper is very well written and the arguments are clearly presented. The tight link between the Bayesian inference and energy efficiency perspectives is elegant and well-supported, both with numerical simulations as well as with analytical arguments.

      2. A key component of the paper is the derivation of the reliability cost as a function of different biophysical mechanisms (calcium efflux, vesicle membrane, actin, and trafficking). Independent of the proposed mapping between the Bayesian inference perspective and the energy efficiency perspective, those reliability costs (expressed as power-law relationships) will be important for further studies on synaptic energetics.

      3. The extended appendices, which are generally easy to read, provide additional mathematical insight.

      Weaknesses:<br /> 1. The authors face a technical challenge (which they acknowledge): they use two numbers (mean and variance) to characterize synaptic variability, whereas in the brain there are three numbers (number of vesicles, release probability, and quantal size). Turning biological constraints into constraints on the variance, as is done in the paper, seems somewhat arbitrary. This by no means invalidates the results, but it means that future experimental tests of their model will be somewhat nuanced.

      2. The prediction that the learning rate should increase with variability relies on an optimization scheme in which the learning rate is scaled by the inverse of the magnitude of the gradients (Eq. 7). This seems like an extra assumption; the energy efficiency framework by itself does not predict that the learning rate should increase with variability. Further work will be needed to disentangle the assumption about the optimization scheme from the energy efficiency framework.

    1. Reviewer #1 (Public Review):

      Summary:<br /> In this paper, Chen et al. identified a role for the circadian photoreceptor CRYPTOCHROME (cry) in promoting wakefulness under short photoperiods. This research is potentially important as hypersomnolence is often seen in patients suffering from SAD during winter times. The mechanisms underlying these sleep effects are poorly known.

      Strengths:<br /> The authors clearly demonstrated that mutations in cry lead to elevated sleep under 4:20 Light-Dark (LD) cycles. Furthermore, using RNAi, they identified GABAergic neurons as a primary site of cry action to promote wakefulness under short photoperiods. They then provide genetic and pharmacological evidence demonstrating that cry acts on GABAergic transmission to modulate sleep under such conditions.

      Weaknesses:<br /> The authors then went on to identify the neuronal location of this cry action on sleep. This is where this reviewer is much more circumspect about the data provided. The authors hypothesize that the l-LNvs which are known to be arousal-promoting may be involved in the phenotypes they are observing. To investigate this, they undertook several imaging and genetic experiments.

      Major concerns:<br /> 1. Figure 2 A-B: The authors show that knocking down cry expression in GABAergic neurons mimics the sleep increase seen in cryb mutants under short photoperiod. However, they do not provide any other sleep parameters such as sleep bout numbers, sleep bout duration, and more importantly waking activity measurements. This is an essential parameter that is needed to rule out paralysis and/or motor defects as the cause of increased "sleep". Any experiments looking at sleep need to include these parameters.

      2. For all Figures displaying immunostaining and imaging data the resolution of the images is quite poor. This makes it difficult to assess whether the authors' conclusions are supported by the data or not.

      3. In Figure 4-S1A it appears that the syt-GFP signal driven by Gad1-GAL4 is colabeling the l-LNvs. This would imply that the l-LNvs are GABAergic. The authors suggest that this experiment suggests that l-LNvs receive input from GABAergic neurons. I am not sure the data presented support this.

      4. In Figure 4-S1B. The GRASP experiment is not very convincing. The resolution of the image is quite poor. In addition, the authors used Pdf-LexA to express the post t-GRASP construct in l-LNvs, but Pdf-LexA also labels the s-LNvs, so it is possible that the GRASP signal the authors observe is coming from the s-LNvs and not the l-LNvs. The authors could use a l-LNvs specific tool to do this experiment and remove any doubts. Altogether this reviewer is not convinced that the data presented supports the conclusion "All in all, these results demonstrate that GABAergic neurons project to the l-LNvs and form synaptic connections." (Line 176). In addition, the authors could have downregulated the expression of Rdl specifically in l-LNvs to support their conclusions. The data they are providing supports a role for RDL but does not prove that RDL is involved in l-LNvs.

      5. In Figures 4 A and C: it appears that GABA is expressed in the l-LNvs. Is this correct? Can the authors clarify this? Maybe the authors could do an experiment where they co-label using Gad1-GAL4 and Pdf-LexA to clearly demonstrate that l-LNvs are not GABAergic. Also, the choice of colors could be better. It is very difficult to see what GABA is and what is PDF.

      6. Figure 4G: Pdf-GAL4 expresses in both s-LNvs and l-LNvs. So, in this experiment, the authors are silencing both groups, not only the l-LNvs. Why not use a l-LNvs specific tool?

      7. Figure 4H-I: The C929-GAL4 driver expresses in many peptidergic neurons. This makes the interpretation of these data difficult. The effects could be due to peptidergic cells being different than the l-LNvs. Why not use a more specific l-LNvs specific tool? I am also confused as to why some experiments used Pdf-GAL4 and some others used C929-GAL4 in a view to specifically manipulate l-LNvs? This is confusing since both drivers are not specific to the l-LNvs.

      8. Figure 5-S1B: Why does the pdf-GAL80 construct not block the sleep increase seen when reducing expression of cry in Gad1-GAL4 neurons? This suggests that there are GABAergic neurons that are not PDF expressing involved in the cry-mediated effect on sleep under short photoperiods.

      In conclusion, it is not clear that the authors demonstrated that they are looking at a cry-mediated effect on GABA in s-LNvs resulting in a modulation of the activity of the l-LNvs. Better images and more-suited genetic experiments could be used to address this.

    2. Reviewer #2 (Public Review):

      Summary:<br /> The sleep patterns of animals are adaptable, with shorter sleep durations in the winter and longer sleep durations in the summer. Chen and colleagues conducted a study using Drosophila (fruit flies) and discovered that a circadian photoreceptor called cryptochrome (cry) plays a role in reducing sleep duration during day/night cycles resembling winter conditions. They also found that cry functions in specific GABAergic circadian pacemaker cells known as s-LNvs inhibit these neurons, thereby promoting wakefulness in the animals in the winter. They also identified l-LNvs, known as arousal-promoting cells, as the downstream neurons.

      Strengths:<br /> Detailed mapping of the neural circuits cry acts to mediate the shortened sleep in winter-like day/night cycles.

      Weaknesses:<br /> The supporting evidence for s-LNvs being GABAergic neurons is not particularly strong. Additionally, there is a lack of direct evidence regarding changes in neural activity for s-LNvs and l-LNvs under varying day/night cycles, as well as in cry mutant flies.

    3. Reviewer #3 (Public Review):

      Summary:<br /> In humans, short photoperiods are associated with hypersomnolence. The mechanisms underlying these effects are, however, unknown. Chen et al. use the fly Drosophila to determine the mechanisms regulating sleep under short photoperiods. They find that mutations in the circadian photoreceptor cryptochrome (cry) increase sleep specifically under short photoperiods (e.g. 4h light : 20 h dark). They go on to show that cry is required in GABAergic neurons. Further, they suggest that the relevant subset of GABAergic neurons are the well-studied small ventral lateral neurons that they suggest inhibit the arousal-promoting large ventral neurons via GABA signalling.

      Strengths:<br /> Genetic analysis to show that cryptochrome (but not other core clock genes) mediates the increase in sleep in short photoperiods, and circuit analysis to localise cry function to GABAergic neurons.

      Weaknesses:<br /> The authors' conclusion that the sLNvs are GABAergic is not well supported by the data. Better immunostaining experiments and perhaps more specific genetic driver lines would help with this point (details below).

      1. The sLNvs are well known as a key component of the circadian network. The finding that they are GABAergic would if true, be of great interest to the community. However, the data presented in support of this conclusion are not convincing. Much of the confocal images are of insufficient resolution to evaluate the paper's claims. The Anti-GABA immunostaining in Fig 4 and 5 seem to have a high background, and the GRASP experiments in Fig 4 supplement 1 low signal.

      Transcriptomic datasets are available for the components of the circadian network (e.g. PMID 33438579, and PMID 19966839). It would be of interest to determine if transcripts for GAD or other GABA synthesis/transport components were detected in sLNvs. Further, there are also more specific driver lines for GAD, and the lLNvs, sLNVs that could be used.

      2. The authors' model posits that in short photoperiods, cry functions to suppress GABA secretion from sLNvs thereby disinhibiting the lNVs. In Fig 4I they find that activating the lLNvs (and other peptidergic cells) by c929>NaChBac in a cryb background reduces sleep compared to activating lLNVs in a wild-type background. It's not clear how this follows from the model. A similar trend is observable in Fig 4H with TRP-mediated activation of lNVs, although it is not clear from the figure if the difference b/w cryb vs wild-type background is significant.

    1. Reviewer #1 (Public Review):

      Summary:<br /> The study by Klug et al. investigated the pathway specificity of corticostriatal projections, focusing on two cortical regions. Using a G-deleted rabies system in D1-Cre and A2a-Cre mice to retrogradely deliver channelrhodopsin to cortical inputs, the authors found that M1 and MCC inputs to direct and indirect pathway spiny projection neurons (SPNs) are both partially segregated and asymmetrically overlapping. In general, corticostriatal inputs that target indirect pathway SPNs are likely to also target direct pathway SPNs, while inputs targeting direct pathway SPNs are less likely to also target indirect pathway SPNs. Such asymmetric overlap of corticostriatal inputs has important implications for how the cortex itself may determine striatal output. Indeed, the authors provide behavioral evidence that optogenetic activation of M1 or MCC cortical neurons that send axons to either direct or indirect pathway SPNs can have opposite effects on locomotion and different effects on action sequence execution. The conclusions of this study add to our understanding of how cortical activity may influence striatal output and offer important new clues about basal ganglia function.

      The conceptual conclusions of the manuscript are supported by the data, but the details of the magnitude of afferent overlap and causal role of asymmetric corticostriatal inputs on behavioral outcomes were not yet fully resolved.

      After virally labeling either direct pathway (D1) or indirect pathway (D2) SPNs to optogenetically tag pathway-specific cortical inputs, the authors report that a much larger number of "non-starter" D2-SPNs from D2-SPN labeled mice responded to optogenetic stimulation in slices than "non-starter" D1 SPNs from D1-SPN labeled mice did. Without knowing the relative number of D1 or D2 SPN starters used to label cortical inputs, it is difficult to interpret the exact meaning of the lower number of responsive D2-SPNs in D1 labeled mice (where only ~63% of D1-SPNs themselves respond) compared to the relatively higher number of responsive D1-SPNs (and D2-SPNs) in D2 labeled mice. While relative differences in connectivity certainly suggest that some amount of asymmetric overlap of inputs exists, differences in infection efficiency and ensuing differences in detection sensitivity in slice experiments make determining the degree of asymmetry problematic.

      It is also unclear if retrograde labeling of D1-SPN- vs D2-SPN- targeting afferents labels the same densities of cortical neurons. This gets to the point of specificity in the behavioral experiments. If the target-based labeling strategies used to introduce channelrhodopsin into specific SPN afferents label significantly different numbers of cortical neurons, might the difference in the relative numbers of optogenetically activated cortical neurons itself lead to behavioral differences?

      In general, the manuscript would also benefit from more clarity about the statistical comparisons that were made and sample sizes used to reach their conclusions.

    2. Reviewer #2 (Public Review):

      Summary:<br /> Klug et al. use monosynaptic rabies tracing of inputs to D1- vs D2-SPNs in the striatum to study how separate populations of cortical neurons project to D1- and D2-SPNs. They use rabies to express ChR2, then patch D1-or D2-SPNs to measure synaptic input. They report that cortical neurons labeled as D1-SPN-projecting preferentially project to D1-SPNs over D2-SPNs. In contrast, cortical neurons labeled as D2-SPN-projecting project equally to D1- and D2-SPNs. They go on to conduct pathway-specific behavioral stimulation experiments. They compare direct optogenetic stimulation of D1- or D2-SPNs to stimulation of MCC inputs to DMS and M1 inputs to DLS. In three different behavioral assays (open field, intra-cranial self-stimulation, and a fixed ratio 8 task), they show that stimulating MCC or M1 cortical inputs to D1-SPNs is similar to D1-SPN stimulation, but that stimulating MCC or M1 cortical inputs to D2-SPNs does not recapitulate the effects of D2-SPN stimulation (presumably because both D1- and D2-SPNs are being activated by these cortical inputs).

      Strengths:<br /> Showing these same effects in three distinct behaviors is strong. Overall, the functional verification of the consequences of the anatomy is very nice to see. It is a good choice to patch only from mCherry-negative non-starter cells in the striatum.

      Weaknesses:<br /> One limitation is that all inputs to SPNs are expressing ChR2, so they cannot distinguish between different cortical subregions during patching experiments. Their results could arise because the same innervation patterns are repeated in many cortical subregions or because some subregions have preferential D1-SPN input while others do not. There are also some caveats with respect to the efficacy of rabies tracing. Although they only patch non-starter cells in the striatum, only 63% of D1-SPNs receive input from D1-SPN-projecting cortical neurons. It's hard to say whether this is "high" or "low," but one question is how far from the starter cell region they are patching. Without this spatial indication of where the cells that are being patched are relative to the starter population, it is difficult to interpret if the cells being patched are receiving cortical inputs from the same neurons that are projecting to the starter population. Convergence of cortical inputs onto SPNs may vary with distance from the starter cell region quite dramatically, as other mapping studies of corticostriatal inputs have shown specialized local input regions can be defined based on cortical input patterns (Hintiryan et al., Nat Neurosci, 2016, Hunnicutt et al., eLife 2016, Peters et al., Nature, 2021). A caveat for the optogenetic behavioral experiments is that these optogenetic experiments did not include fluorophore-only controls. Another point of confusion is that other studies (Cui et al, J Neurosci, 2021) have reported that stimulation of D1-SPNs in DLS inhibits rather than promotes movement.

    3. Reviewer #3 (Public Review):

      In the manuscript by Klug and colleagues, the investigators use a rabies virus-based methodology to explore potential differences in connectivity from cortical inputs to the dorsal striatum. They report that the connectivity from cortical inputs onto D1 and D2 MSNs differs in terms of their projections onto the opposing cell type, and use these data to infer that there are differences in cross-talk between cortical cells that project to D1 vs. D2 MSNs. Overall, this manuscript adds to the overall body of work indicating that there are differential functions of different striatal pathways which likely arise at least in part by differences in connectivity that have been difficult to resolve due to difficulty in isolating pathways within striatal connectivity and several interesting and provocative observations were reported. Several different methodologies are used, with partially convergent results, to support their main points.

      However, I have significant technical concerns about the manuscript as presented that make it difficult for me to interpret the results of the experiments. My comments are below.

      Major:<br /> There is generally a large caveat to the rabies studies performed here, which is that both TVA and the ChR2-expressing rabies virus have the same fluorophore. It is thus essentially impossible to determine how many starter cells there are, what the efficiency of tracing is, and which part of the striatum is being sampled in any given experiment. This is a major caveat given the spatial topography of the cortico-striatal projections. Furthermore, the authors make a point in the introduction about previous studies not having explored absolute numbers of inputs, yet this is not at all controlled in this study. It could be that their rabies virus simply replicates better in D1-MSNs than D2-MSNs. No quantifications are done, and these possibilities do not appear to have been considered. Without a greater standardization of the rabies experiments across conditions, it is difficult to interpret the results.

      The authors claim using a few current clamp optical stimulation experiments that the cortical cells are healthy, but this result was far from comprehensive. For example, membrane resistance, capacitance, general excitability curves, etc are not reported. In Figure S2, some of the conditions look quite different (e.g., S2B, input D2-record D2, the method used yields quite different results that the authors write off as not different). Furthermore, these experiments do not consider the likely sickness and death that occurs in starter cells, as has been reported elsewhere. The health of cells in the circuit is overall a substantial concern that alone could invalidate a large portion, if not all, of the behavioral results. This is a major confound given those neurons are thought to play critical roles in the behaviors being studied. This is a major reason why first-generation rabies viruses have not been used in combination with behavior, but this significant caveat does not appear to have been considered, and controls e.g., uninfected animals, infected with AAV helpers, etc, were not included.

      The overall purity (e.g., EnvA pseudotyping efficiency) of the RABV prep is not shown. If there was a virus that was not well EnvA-pseudotyped and thus could directly infect cortical (or other) inputs, it would degrade specificity.

      While most of the study focuses on the cortical inputs, in slice recordings, inputs from the thalamus are not considered, yet likely contribute to the observed results. Related to this, in in vivo optogenetic experiments, technically, if the thalamic or other inputs to the dorsal striatum project to the cortex, their method will not only target cortical neurons but also terminals of other excitatory inputs. If this cannot be ruled it, stating that the authors are able to selectively activate the cortical inputs to one or the other population should be toned down.

      The statements about specificity of connectivity are not well-founded. It may be that in the specific case where they are assessing outside of the area of injections, their conclusions may hold (e.g., excitatory inputs onto D2s have more inputs onto D1s than vice versa). However, how this relates to the actual site of injection is not clear. At face value, if such a connectivity exists, it would suggest that D1-MSNs receive substantially more overall excitatory inputs than D2s. It is thus possible that this observation would not hold over other spatial intervals. This was not explored and thus the conclusions are over-generalized. e.g., the distance from the area of red cells in the striatum to recordings was not quantified, what constituted a high level of cortical labeling was not quantified, etc. Without more rigorous quantification of what was being done, it is difficult to interpret the results.

      The results in figure 3 are not well controlled. The authors show contrasting effects of optogenetic stimulation of D1-MSNs and D2-MSNs in the DMS and DLS, results which are largely consistent with the canon of basal ganglia function. However, when stimulating cortical inputs, stimulating the inputs from D1-MSNs gives the expected results (increased locomotion) while stimulating putative inputs to D2-MSNs had no effect. This is not the same as showing a decrease in locomotion - showing no effect here is not possible to interpret.

      In light of their circuit model, the result showing that inputs to D2-MSNs drive ICSS is confusing. How can the authors account for the fact that these cells are not locomotor-activating, stimulation of their putative downstream cells (D2-MSNs) does not drive ICSS, yet the cortical inputs drive ICSS? Is the idea that these inputs somehow also drive D1s? If this is the case, how do D2s get activated, if all of the cortical inputs tested net activate D1s and not D2s? Same with the results in figure 4 - the inputs and putative downstream cells do not have the same effects. Given the potential caveats of differences in viral efficiency, spatial location of injections, and cellular toxicity, I cannot interpret these experiments.

    1. Reviewer #1 (Public Review):

      Summary:<br /> Previous work in humans and non-human animals suggests that during offline periods following learning, the brain replays newly acquired information in a sequential manner. The present study uses a MEG-based decoding approach to investigate the nature of replay/reactivation during a cued recall task directly following a learning session, where human participants are trained on a new sequence of 10 visual images embedded in a graph structure. During retrieval, participants are then cued with two items from the learned sequence, and neural evidence is obtained for the simultaneous or sequential reactivation of future sequence items. The authors find evidence for both sequential and clustered (i.e., simultaneous) reactivation. Replicating previous work by Wimmer et al. (2020), low-performing participants tend to show sequential, temporally segregated reactivation of future items, whereas high-performing participants show more clustered reactivation. Adding to previous work, the authors show that an image's reactivation strength varies depending on its proximity to the retrieval cue within the graph structure.

      Strengths:<br /> As the authors point out, work on memory reactivation has largely been limited to the retrieval of single associations. Given the sequential nature of our real-life experiences, there is clearly value in extending this work to structured, sequential information. State-of-the-art decoding approaches for MEG are used to characterize the strength and timing of item reactivation. The manuscript is very well written with helpful and informative figures in the main sections. The task includes an extensive localizer with 50 repetitions per image, allowing for stable training of the decoders and the inclusion of several sanity checks demonstrating that on-screen items can be decoded with high accuracy.

      Weaknesses:<br /> Of major concern, the experiment is not optimally designed for analysis of the retrieval task phase, where only 4 min of recording time and a single presentation of each cue item are available for the analyses of sequential and non-sequential reactivation. The authors could consider including data from the (final) learning blocks in their analysis. These blocks follow the same trial structure as the retrieval task, and apart from adding more data points could also reveal important insights regarding a possible shift from sequential to clustered reactivation as learning of the graph structure progresses.

      On a more conceptual note, the main narrative of the manuscript implies that sequential and clustered reactivation are mutually exclusive, such that a single participant would show either one or the other type. With the analytic methods used here, however, it seems possible to observe both types of reactivation. For example, the observation that mean reactivation strength (across the entire trial, or in a given time window of interest) varies with graph distance does not exclude the possibility that this reactivation is also sequential. In fact, the approach of defining one peak time window of reactivation may be biased towards simultaneous, graded reactivation. It would be helpful if the authors could clarify this conceptual point. A strong claim that the two types of reactivation are mutually exclusive would need to be supported by further evidence, for instance, a metric contrasting sequenceness vs clusteredness.

      On the same point, the non-sequential reactivation analyses often use a time window of peak decodability that appears to be determined based on the average reactivation of all future items, irrespective of graph distance. In a sequential forward cascade of reactivations, it seems reasonable to assume that the reactivation of near items would peak earlier than the reactivation of far items. The manuscript would be strengthened by showing the "raw" timecourses of item decodability at different graph distances, clearly demonstrating their peak reactivation times.

    2. Reviewer #2 (Public Review):

      Summary:<br /> The authors investigate replay (defined as sequential reactivation) and clustered reactivation during retrieval of an abstract cognitive map. Replay and clustered reactivation were analysed based on MEG recordings combined with a decoding approach. While the authors state to find evidence for both, replay and clustered reactivation during retrieval, replay was exclusively present in low performers. Further, the authors show that reactivation strength declined with an increasing graph distance.

      Strengths:<br /> The paper raises interesting research questions, i.e., replay vs. clustered reactivation and how that supports retrieval of cognitive maps. The paper is well-written, well-structured, and easy to follow. The methodological approach is convincing and definitely suited to address the proposed research questions.

      The paper is a great combination between replicating previous findings (Wimmer et al. 2020) with a new experimental approach but at the same time presenting novel findings (reactivation strength declines as a function of graph distance).<br /> What I also want to positively highlight is their transparency. They pre-registered this study but with a focus on a different part of the data and outlined this explicitly in the paper.

      The paper has very interesting, individual findings but there are some shortcomings.

      Weaknesses:<br /> Even though the individual findings are interesting, it is not easy to grasp how they are related. For example, the authors show that replay is present in low but not in high performers with the assumption that high performers tend to simultaneously reactivate items. But then, the authors do not investigate clustered reactivation (= simultaneous reactivation) as a function of performance (due to ceiling effects for most participants).

      Unfortunately, the evidence for clustered reactivation is not well supported by the analysis approach and the observed evidence. The analysis approach still holds the possibility of replay driving the observed clustered reactivation effect.

      A third shortcoming is that at least some analyses are underpowered (very low number of trials, n = ~10, and for some analyses, very low number of participants, n = 14). In both cases (low trial number and low participant number) the n could be increased by including the learning part in the analyses as well. It is not clear to me why the authors restricted their analyses to the retrieval period only (especially given that participants also have to retrieve during learning).

    1. Reviewer #1 (Public Review):

      Motoneurons constitute the final common pathway linking central impulse traffic to behavior, and neurophysiology faces an urgent need for methods to record their activity at high resolution and scale in intact animals during natural movement. In this consortium manuscript, Chung et al. introduce high-density electrode arrays on a flexible substrate that can be implanted into muscle, enabling the isolation of multiple motor units during movement. They then demonstrate these arrays can produce high-quality recordings in a wide range of species, muscles, and tasks. The methods are explained clearly, and the claims are justified by the data. While technical details on the arrays have been published previously, the main significance of this manuscript is the application of this new technology to different muscles and animal species during naturalistic behaviors. Overall, we feel the manuscript will be of significant interest to researchers in motor systems and muscle physiology.

      The authors have thoroughly addressed all our original comments, and we have no further concerns.

    2. Reviewer #2 (Public Review):

      This work provides a novel design of implantable and high-density EMG electrodes to study muscle physiology and neuromotor control at the level of individual motor units. Current methods of recording EMG using intramuscular fine-wire electrodes do not allow for isolation of motor units and are limited by the muscle size and the type of behavior used in the study. The authors of myomatrix arrays had set out to overcome these challenges in EMG recording and provided compelling evidence to support the usefulness of the new technology.

      Strengths:<br /> • They presented convincing examples of EMG recordings with high signal quality using this new technology from a wide array of animal species, muscles, and behavior.<br /> • The design included suture holes and pull-on tabs that facilitate implantation and ensure stable recordings over months.<br /> • Clear presentation of specifics of the fabrication and implantation, recording methods used, and data analysis

      I am satisfied with the authors' response to my previous concerns on the weaknesses of the study.

    1. Reviewer #1 (Public Review):

      Authors propose mathematical methods for inferring evolutionary parameters of interest from bulk/single cell sequencing data in healthy tissue and hematopoiesis. Authors attempt to go beyond previous models by including three phases of human development: early development, growth and maintenance, and mature phase. Introductory figures (1 and 2) provide the connection to previous analytical results (based on power laws), while figure 3 denotes the role of sampling effects, and figure 4 provides a real-world example.

      This approach dovetails nicely with previous literature, providing clear insight into when previous theoretical results are valid and when they break down. Much of the previous literature is devoted to bulk sequencing, leading the authors to investigate the role of (sub)-sampling due to single cell data, where mutation burden and mutation rate distributions are easily recapitulated. Although not strongly emphasized in the manuscript, sub-sampling does increase noise leading to differences between population and sample distributions. From my view, these results provide an important contribution to the literature and are able to nicely describe and make inferences in a single cell HSC data set.

    2. Reviewer #2 (Public Review):

      Summary: The authors provide a nice summary on the possibility to study genetic heterogeneity and how to measure the dynamics of stem cells. By combining single cell and bulk sequencing analyses, they aim to use a stochastic process and inform on different aspects of genetic heterogeneity.

      Strengths: Well designed study and strong methods.

    1. Reviewer #1 (Public Review):

      Summary:<br /> Dormancy/diapause/hibernation (depending on how the terms are defined) is a key life history strategy that allows the temporal escape from unfavorable conditions. Although environmental conditions do play a major role in inducing and terminating dormancy (authors call this energy limitation hypothesis), the authors test a mutually non-exclusive hypothesis (life-history hypothesis) that sex-specific selection pressures, at least to some extent, would further shape the timing of these life-history events. Authors use a metanalytic approach to collect data (mainly on rodents) on various life-history traits to test trade-offs among these traits between sexes and how they affect entry and termination of dormancy.

      Strengths:<br /> I found the theoretical background in the Introduction quite interesting, to the point and the arguments were well-placed. How sex-specific selection pressures would drive entry and termination of diapause in insects (e.g. protandry), especially in temperate butterflies, is very well investigated. Authors attempt to extend these ideas to endotherms and trying to find general patterns across ectotherms and endotherms is particularly exciting. This work and similar evidence could make a great contribution to the life-history theory, specifically understanding factors that drive the regulation of life cycle timing.

      Weaknesses:<br /> 1. I felt that including 'ectotherms' in the title is a bit misleading as there is hardly (in fact any?) any data presented on ectotherms. Also, most of the focus of the discussion is heavily mammal (rodent) focussed. I believe saying endotherms in the title as well is a bit misleading as the data is mammal-focused.

      2. I think more information needs to be provided early on to make readers aware of the diversity of animals included in the study and their geographic distribution. Are they mostly temperate or tropical? What is the span of the latitude as day length can have a major influence on dormancy timings? I think it is important to point out that data is more rodent-centric. Along the line of this point, is there a reason why the extensively studied species like the Red Deer or Soay Sheep and other well-studied temperate mammals did not make it into the list?

      3. Isn't the term 'energy limitation hypothesis' which is used throughout the manuscript a bit endotherm-centric? Especially if the goal is to draw generalities across ectotherms and endotherms. Moreover, climate (e.g. interaction of photoperiod and temperature in temperatures) most often induces or terminates diapause/dormancy in ectotherms so I am not sure if saying 'energy limitation hypothesis' is general enough.

      4. Since for some species, the data is averaged across studies to get species-level trait estimates, is there a scope to examine within population differences (e.g. across latitudes)? This may further strengthen the evidence and rule out the possibility of the environment, especially the length of the breeding season, affecting the timing of emergence and immergence.

      5. Although the authors are looking at the broader patterns, I felt like the overall ecology of the species (habitat, tropical or temperate, number of broods, etc.) is overlooked and could act as confounding factors.

      6. I strongly think the data analysis part needs more clarity. As of now, it is difficult for me to visualize all the fitted models (despite Table 1), and the large number of life-history traits adds to this complexity. I would recommend explicitly writing down all the models in the text. Also, the Table doesn't make it clear whether interaction was allowed between the predictors or not. More information on how PGLS were fitted needs to be provided in the main text which is in the supplementary right now. I kept wondering if the authors have fit multiple models, for example, with different correlation structures or by choosing different values of lambda parameter. And, in addition to PGLS, authors are also fitting linear regressions. Can you explain clearly in the text why was this done?

      7. Figure 2 is unclear, and I do not understand how these three regression lines were computed. Please provide more details.

    2. Reviewer #2 (Public Review):

      Summary:<br /> An article with lots of interesting ideas and questions regarding the evolution of timing of dormancy, emphasizing mammalian hibernation but also including ectotherms. The authors compare selective forces of constraints due to energy availability versus predator avoidance and requirements and consequences of reproduction in a review of between and within species (sex) differences in the seasonal timing of entry and exit from dormancy.

      Strengths:<br /> The multispecies approach including endotherms and ectotherms is ambitious. This review is rich with ideas if not in convincing conclusions.

      Weaknesses:<br /> The differences between physiological requirements for gameatogenesis between sexes that affect the timing of heterothermy and the need for euthermy during mammalian hibernator are significant issues that underlie but are under-discussed, in this contrast of selective pressures that determine seasonal timing of dormancy. Some additional discussion of the effects of rapid climate change on between and within species phenologies of dormancy would have been interesting.

    1. Reviewer #1 (Public Review):

      Summary:<br /> The authors were trying to understand the relationship between the development of large trunks and longirrostrine mandibles in bunodont proboscideans of Miocene, and how it reflects the variation in diet patterns.

      Strengths:<br /> The study is very well supported, written, and illustrated, with plenty of supplementary material. The findings are highly significant for the understanding of the diversification of bunodont proboscideans in Asia during Miocene, as well as explaining the cranial/jaw disparity of fossil lineages. This work elucidates the diversification of paleobiological aspects of fossil proboscideans and their evolutionary response to open environments in the Neogene using several methods. The authors included all Asian bunodont proboscideans with long mandibles and I suggest that they should use the expression "bunodont proboscideans" instead of gomphotheres.

      Weaknesses:<br /> I believe that the only weakness is the lack of discussion comparing their results with the development of gigantism and long limbs in proboscideans from the same epoch.

    2. Reviewer #2 (Public Review):

      This study focuses on the eco-morphology, the feeding behaviors, and the co-evolution of feeding organs of longirostrine gomphotheres (Amebelodontidae, Choerolophodontidae, and Gomphotheriidae) which are characterised by their distinctive mandible and mandible tusk morphologies. They also have different evolutionary stages of food acquisition organs which may have co-evolve with extremely elongated mandibular symphysis and tusks. Although these three longirostrine gomphothere families were widely distributed in Northern China in the Early-Middle Miocene, the relative abundances and the distribution of these groups were different through time as a result of the climatic changes and ecosysytems.

      These three groups have different feeding behaviors indicated by different mandibular symphysis and tusk morphologies. Additionally, they have different evolutionary stages of trunks which are reflected by the narial region morphology. To be able to construct the feeding behavior and the relation between the mandible and the trunk of early elephantiformes, the authors examined the crania and mandibles of these three groups from the Early and Middle Miocene of northern China from three different museums and also made different analyses.

      The analyses made in the study are:<br /> 1. Finite Element (FE) analysis: They conducted two kinds of tests: the distal forces test, and the twig-cutting test. With the distal forces test, advantageous and disadvantageous mechanical performances under distal vertical and horizontal external forces of each group are established. With the twig-cutting test, a cylindrical twig model of orthotropic elastoplasity was posed in three directions to the distal end of the mandibular task to calculate the sum of the equivalent plastic strain (SEPS). It is indicated that all three groups have different mandible specializations for cutting plants.

      2. Phylogenetic reconstruction: These groups have different narial region morphology, and in connection with this, have different stages of trunk evolution. The phylogenetic tree shows the degree of specialization of the narial morphology. And narial region evolutionary level is correlated with that of character-combine in relation to horizontal cutting. In the trilophodont longirostrine gomphotheres, co-evolution between the narial region and horizontal cutting behaviour is strongly suggested.

      3. Enamel isotopes analysis: The results of stable isotope analysis indicate an open environment with a diverse range of habitats and that the niches of these groups overlapped without obvious differentiation.

      The analysis shows that different eco-adaptations have led to the diverse mandibular morphology and open-land grazing has driven the development of trunk-specific functions and loss of the long mandible. This conclusion has been achieved with evidence on palaecological reconstruction, the reconstruction of feeding behaviors, and the examination of mandibular and narial region morphology from the detailed analysis during the study.

      All of the analyses are explained in detail in the supplementary files. The 3D models and movies in the supplementary files are detailed and understandable and explain the conclusion. The conclusions of the study are well supported by data.

    1. Reviewer #1 (Public Review):

      Summary:<br /> This study presents careful biochemical experiments to understand the relationship between LRRK2 GTP hydrolysis parameters and LRRK2 kinase activity. The authors report that incubation of LRRK2 with ATP increases the KM for GTP and decreases the kcat. From this, they suppose an autophosphorylation process is responsible for enzyme inhibition. LRRK2 T1343A showed no change, consistent with it needing to be phosphorylated to explain the changes in G-domain properties. The authors propose that phosphorylation of T1343 inhibits kinase activity and influences monomer-dimer transitions.

      Strengths:<br /> The strengths of the work are the very careful biochemical analyses and the interesting result for wild-type LRRK2.

      Weaknesses:<br /> A major unexplained weakness is why the mutant T1343A starts out with so much lower activity--it should be the same as wild-type, non-phosphorylated protein. Also, if a monomer-dimer transition is involved, it should be either all or nothing. Other approaches would add confidence to the findings.

    2. Reviewer #2 (Public Review):

      This study addresses the catalytic activity of a Ras-like ROC GTPase domain of LRRK2 kinase, a Ser/Thr kinase linked to Parkinson's disease (PD). The enzyme is associated with gain-of-function variants that hyper-phosphorylate substrate Rab GTPases. However, the link between the regulatory ROC domain and activation of the kinase domain is not well understood.

      It is within this context that the authors detail the kinetics of the ROC GTPase domain of pathogenic variants of LRRK2, in comparison to the WT enzyme. Their data suggest that LRRK2 kinase activity negatively regulates the ROC GTPase activity and that PD variants of LRRK2 have differential effects on the Km and catalytic efficiency of GTP hydrolysis.

      Based on mutagenesis, kinetics, and biophysical experiments, the authors suggest a model in which autophosphorylation shifts the equilibrium toward monomeric LRRK2 (locked GTP state of ROC). The authors further conclude that T1343 is a crucial regulatory site, located in the P-loop of the ROC domain, which is necessary for the negative feedback mechanism. Unfortunately, the data do not support this hypothesis, and further experiments are required to confirm this model for the regulation of LRRK2 activity.

      Specific comments are below:

      - Although a couple of papers are cited, the rationale for focusing on the T1343 site is not evident to readers. It should be clarified that this locus, and perhaps other similar loci in the wider ROCO family, are likely important for direct interactions with the GTP molecule.

      - Similar to the above, readers are kept in the dark about auto-phosphorylation and its effects on the monomer/dimer equilibrium. This is a critical aspect of this manuscript and a major conceptual finding that the authors are making from their data. However, the idea that auto-phosphorylation is (likely) to shift the monomer/dimer equilibrium toward monomer, thereby inactivating the enzyme, is not presented until page 6, AFTER describing much of their kinetics data. This is very confusing to readers, as it is difficult to understand the meaning of the data without a conceptual framework. If the model for the LRRK2 function is that dimerization is necessary for the phosphorylation of substrates, then this idea should be presented early in the introduction, and perhaps also in the abstract. If there are caveats, then they should be discussed before data are presented. A clear literature trail and the current accepted (or consensus) mechanism for LRRK2 activity is necessary to better understand the context for these data.

      - Following on the above concepts, I find it interesting that the authors mention monomeric cyotosolic states, and kinase-active oligomers (dimers??), with citations. Again here, it would be useful to be more precise. Are dimers (oligomers?) only formed at the membrane? That would suggest mechanisms involving lipid or membrane-attached protein interactions. Also, what do the authors mean by oligomers? Are there more than dimers found localized to the membrane?

      - Fig 5 is a key part of their findings, regarding the auto-phosphorylation induced monomer formation of LRRK2. From these two bar graphs, the authors state unequivocally that the 'monomer/dimer equilibrium is abolished', and therefore, that the underlying mechanism might be increased monomerization (through maintenance of a GTP-locked state). My view is that the authors should temper these conclusions with caveats. One is that there are still plenty of dimers in the auto-phosphorylated WT, and also in the T1343A mutant. Why is that the case? Can the authors explain why only perhaps a 10% shift is sufficient? Secondly, the T1343A mutant appears to have fewer overall dimers to begin with, so it appears to readers that 'abolition' is mainly due to different levels prior to ATP treatment at 30 deg. I feel these various issues need to be clarified in a revised manuscript, with additional supporting data. Finally, on a minor note, I presume that there are no statistically significant differences between the two sets of bar graphs on the right panel. It would be wise to place 'n.s.' above the graphs for readers, and in the figure legend, so readers are not confused.

      - Figure 6B, Westerns of phosphorylation, the lanes are not identified and it is unclear what these data mean.

    1. Reviewer #1 (Public Review):

      Summary:<br /> The authors use truncations, fragments, and HCN2/4 chimeras to narrow down the interaction and regulatory domains for LRMP inhibition of cAMP-dependent shifts in the voltage dependence of activation of HCN4 channels. They identify the N-terminal domain of HCN4 as a binding domain for LRMP, and highlight two residues in the C-linker as critical for the regulatory effect. Notably, whereas HCN2 is normally insensitive to LRMP, putting the N-terminus and 5 additional C-linker and S5 residues from HCN4 into HCN2 confers LRMP regulation in HCN2.

      Strengths:<br /> The work is excellent, the paper well written, and the data convincingly support the conclusions which shed new light on the interaction and mechanism for LRMP regulation of HCN4, as well as identifying critical differences that explain why LRMP does not regulate other isoforms such as HCN2.

    2. Reviewer #2 (Public Review):

      Summary:<br /> HCN-4 isoform is found primarily in the sino-atrial node where it contributes to the pacemaking activity. LRMP is an accessory subunit that prevents cAMP-dependent potentiation of HCN4 isoform but does not have any effect on HCN2 regulation. In this study, the authors combine electrophysiology, FRET with standard molecular genetics to determine the molecular mechanism of LRMP action on HCN4 activity. Their study shows that parts of N- and C-termini along with specific residues in C-linker and S5 of HCN4 are crucial for mediating LRMP action on these channels. Furthermore, they show that the initial 224 residues of LRMP are sufficient to account for most of the activity. In my view, the highlight of this study is Fig. 7 which recapitulates LRMP modulation on HCN2-HCN4 chimera. Overall, this study is an excellent example of using time-tested methods to probe the molecular mechanisms of regulation of channel function by an accessory subunit.

      Weaknesses:<br /> 1. Figure 5A- I am a bit confused with this figure and perhaps it needs better labeling. When it states Citrine, does it mean just free Citrine, and "LRMP 1-230" means LRMP fused to Citrine which is an "LF" construct? Why not simply call it "LF"? If there is no Citrine fused to "LRMP 1-230", this figure would not make sense to me.

      2. Related to the above point- Why is there very little FRET between NF and LRMP 1-230? The FRET distance range is 2-8 nm which is quite large. To observe baseline FRET for this construct more explanation is required. Even if one assumes that about 100 amino are completely disordered (not extended) polymers, I think you would still expect significant FRET.

      3. Unless I missed this, have all the Cerulean and Citrine constructs been tested for functional activity?

    3. Reviewer #3 (Public Review):

      Summary:<br /> Using patch clamp electrophysiology and Förster resonance energy transfer (FRET), Peters and co-workers showed that the disordered N-terminus of both LRMP and HCN4 are necessary for LRMP to interact with HCN4 and inhibit the cAMP-dependent potentiation of channel opening. Strikingly, they identified two HCN4-specific residues, P545 and T547 in the C-linker of HCN4, that are close in proximity to the cAMP transduction centre (elbow Clinker, S4/S5-linker, HCND) and account for the LRMP effect.

      Strengths:<br /> Based on these data, the authors propose a mechanism in which LRMP specifically binds to HCN4 via its isotype-specific N-terminal sequence and thus prevents the cAMP transduction mechanism by acting at the interface between the elbow Clinker, the S4S5-linker, the HCND.

      Weaknesses:<br /> Although the work is interesting, there are some discrepancies between data that need to be addressed.

      1. I suggest inserting in Table 1 and in the text, the Δ shift values (+cAMP; + LRMP; +cAMP/LRMP). This will help readers.

      2. Figure 1 is not clear, the distribution of values is anomalously high. For instance, in 1B the distribution of values of V1/2 in the presence of cAMP goes from - 85 to -115. I agree that in the absence of cAMP, HCN4 in HEK293 cells shows some variability in V1/2 values, that nonetheless cannot be so wide (here the variability spans sometimes even 30 mV) and usually disappears with cAMP (here not).

      This problem is spread throughout the manuscript, and the measured mean effects are indeed always at the limit of statistical significance. Why so? Is this a problem with the analysis, or with the recordings?

      There are several other problems with Figure 1 and in all figures of the manuscript: the Y scale is very narrow while the mean values are marked with large square boxes. Moreover, the exemplary activation curve of Figure 1A is not representative of the mean values reported in Figure 1B, and the values of 1B are different from those reported in Table 1.

      On this ground, it is difficult to judge the conclusions and it would also greatly help if exemplary current traces would be also shown.

      3. "....HCN4-P545A/T547F was insensitive to LRMP (Figs. 6B and 6C; Table 1), indicating that the unique HCN4 C-linker is necessary for regulation by LRMP. Thus, LRMP appears to regulate HCN4 by altering the interactions between the C-linker, S4-S5 linker, and N-terminus at the cAMP transduction centre."

      Although this is an interesting theory, there are no data supporting it. Indeed, P545 and T547 at the tip of the C-linker elbow (fig 6A) are crucial for LRMP effect, but these two residues are not involved in the cAMP transduction centre (interface between HCND, S4S5 linker, and Clinker elbow), at least for the data accumulated till now in the literature. Indeed, the hypothesis that LRMP somehow inhibits the cAMP transduction mechanism of HCN4 given the fact that the two necessary residues P545 and T547 are close to the cAMP transduction centre, remains to be proven.

      Moreover, I suggest analysing the putative role of P545 and T547 in light of the available HCN4 structures. In particular, T547 (elbow) points towards the underlying shoulder of the adjacent subunit and, therefore, is in a key position for the cAMP transduction mechanism. The presence of bulky hydrophobic residues (very different nature compared to T) in the equivalent position of HCN1 and HCN2 also favours this hypothesis. In this light, it will be also interesting to see whether a single T547F mutation is sufficient to prevent the LRMP effect.

    1. Reviewer #1 (Public Review):

      Summary:<br /> As the scientific community identifies increasing numbers of genetic variants that cause rare human diseases, a challenge is how the field can most quickly identify pharmacological interventions to address known deficits. The authors point out that defining phenotypic outcomes required for drug screen assays is often challenging, and emphasize how invertebrate models can be used for quick ID of compounds that may address genetic deficits. A major contribution of this work is to establish a framework for potential intervention drug screening based on quantitative imaging of morphology and mobility behavior, using methods that the authors show can define subtle phenotypes in a high proportion of disease gene knockout mutants.

      Overall, the work constitutes an elegant combination of previously developed high-volume imaging with highly detailed quantitative phenotyping (and some paring down to specific phenotypes) to establish proof of principle on how the combined applications can contribute to screens for compounds that may address specific genetic deficits, which can suggest both mechanism and therapy.

      In brief, the authors selected 25 genes for which loss of function is implicated in human neuro-muscular disease and engineered deletions in the corresponding C. elegans homologs. The authors then imaged morphological features and behaviors prior to, during, and after blue light stimuli, quantitating features, and clustering outcomes as they elegantly developed previously (PMID 35322206; 30171234; 30201839). In doing so, phenotypes in 23/25 tested mutants could be separated enough to distinguish WT from mutant and half of those with adequate robustness to permit high-throughput screens, an outcome that supports the utility of general efforts to ID phenotypes in C. elegans disease orthologs using this approach. A detailed discussion of 4 ciliopathy gene defects, and NACLN-related channelopathy mutants reveals both expected and novel phenotypes, validating the basic approach to modeling vetted targets and underscoring that quantitative imaging approaches reiterate known biology. The authors then screened a library of nearly 750 FDA-approved drugs for the capacity to shift the unc-80 NACLN channel-disrupted phenotype closer to the wild type. Top "mover" compound move outcome in the experimental outcome space; and also reveal how "side effects" can be evaluated to prioritize compounds that confer the fewest changes of other parameters away from the center.

      Strengths:<br /> Although the imaging and data analysis approaches have been reported and the screen is limited in scope and intervention exposure, it is important that the authors strongly combine individual approach elements to demonstrate how quantitative imaging phenotypes can be integrated with C. elegans genetics to accelerate the identification of potential modulators of disease (easily extendable to other goals). Generation of deletion alleles and documentation of their associated phenotypes (available in supplemental data) provide potentially useful reagents/data to the field. The capacity to identify "over-shooting" of compound applications with suggestions for scale back and to sort efficacious interventions to minimize other changes to behavioral and physical profiles is a strong contribution.

      Weaknesses:<br /> The work does not have major weaknesses, although it may be possible to expand the discussion to increase utility in the field:

      1) Increased discussion of the challenges and limitations of the approach may enhance successful adaptation application in the field.

      --It is quite possible that morphological and behavioral phenotypes have nothing to do with disease mechanisms and rather reflect secondary outcomes, such that positive hits will address "off-target" consequences.

      --The deletion approach is adequately justified in the text, but the authors may make the point somewhere that screening target outcomes might be enhanced by the inclusion of engineered alleles that match the human disease condition. Their work on sod-1 alleles (PMID 35322206) might be noted in this discussion.

      --Drug testing here involved a strikingly brief exposure to a compound, which holds implications for how a given drug might engage in adult animals. The authors might comment more extensively on extended treatments that include earlier life or more extended targeting. The assumption is that administering different exposure periods and durations, but if the authors are aware as to whether there are challenges associated with more prolonged applications, larger scale etc. it would be useful to note them.

      2) More justification of the shift to only a few target parameters for judging compound effectiveness.<br /> -In the screen in Figure 4D and text around 313, 3 selected core features of the unc-80 mutant (fraction that blue-light pause, speed, and curvature) were used to avoid the high replicate requirements to identify subtle phenotypes. Although this strategy was successful as reported in Figure 5, the pared-down approach seems a bit at odds with the emphasis on the range of features that can be compared mutant/wt with the author's powerful image analysis. Adding details about the reduced statistical power upon multiple comparisons, with a concrete example calculated, might help interested scientists better assess how to apply this tool in experimental design.

      3) More development of the side-effect concept. The side effects analysis is interesting and potentially powerful. Prioritization of an intervention because of minimal perturbation of other phenotypes might be better documented and discussed a bit further; how reliably does the metric of low side effects correlate with drug effectiveness?

    2. Reviewer #2 (Public Review):

      Summary and strengths:

      O'Brien et al. present a compelling strategy to both understand rare disease that could have a neuronal focus and discover drugs for repurposing that can affect rare disease phenotypes. Using C. elegans, they optimize the Brown lab worm tracker and Tierpsy analysis platform to look at the movement behaviors of 25 knockout strains. These gene knockouts were chosen based on a process to identify human orthologs that could underlie rare diseases. I found the manuscript interesting and a powerful approach to making genotype-phenotype connections using C. elegans. Given the rate at which rare Mendelian diseases are found and candidate genes suggested, human geneticists need to consider orthologous approaches to understand the disease and seek treatments on a rapid time scale. This approach is one such way. Overall, I have a few minor suggestions and some specific edits.

      Weaknesses:<br /> (1) Throughout the text on figures, labels are nearly impossible to read. I had to zoom into the PDF to determine what the figure was showing. Please make text in all figures a minimum of 10-point font. Similarly, the Figure 2D point type is impossible to read. Points should be larger in all figures. Gene names should be in italics in all figures, following C. elegans convention.

      (2) I have a strong bias against the second point in Figure 1A. Sequencing of trios, cohorts, or individuals NEVER identifies causal genes in the disease. This technique proposes a candidate gene. Future experiments (oftentimes in model organisms) are required to make those connections to causality. Please edit this figure and parts of the text.

      (3) How were the high-confidence orthologs filtered from 767 to 543 (lines 128-131)? Also, the choice of the final list of 25 genes is not well justified. Please expand more about how these choices were made.

      (4) Figures 3 and 4, why show all 8289 features? It might be easier to understand and read if only the 256 Tierpsy features were plotted in the heat maps.

      (5) The unc-80 mutant screen is clever. In the feature space, it is likely better to focus on the 256 less-redundant Tierpsy features instead of just a number of features. It is unclear to me how many of these features are correlated and not providing more information. In other words, the "worsening" of less-redundant features is far more of a concern than the "worsening" of 1000 correlated features.

    3. Reviewer #3 (Public Review):

      In this study, O'Brien et al. address the need for scalable and cost-effective approaches to finding lead compounds for the treatment of the growing number of Mendelian diseases. They used state-of-the-art phenotypic screening based on an established high-dimensional phenotypic analysis pipeline in the nematode C. elegans.

      First, a panel of 25 C. elegans models was created by generating CRISPR/Cas9 knock-out lines for conserved human disease genes. These mutant strains underwent behavioral analysis using the group's published methodology. Clustering analysis revealed common features for genes likely operating in similar genetic pathways or biological functions. The study also presents results from a more focused examination of ciliopathy disease models.

      Subsequently, the study focuses on the NALCN channel gene family, comparing the phenotypes of mutants of nca-1, unc-77, and unc-80. This initial characterization identifies three behavioral parameters that exhibit significant differences from the wild type and could serve as indicators for pharmacological modulation.

      As a proof-of-concept, O'Brien et al. present a drug repurposing screen using an FDA-approved compound library, identifying two compounds capable of rescuing the behavioral phenotype in a model with UNC80 deficiency. The relatively short time and low cost associated with creating and phenotyping these strains suggest that high-throughput worm tracking could serve as a scalable approach for drug repurposing, addressing the multitude of Mendelian diseases. Interestingly, by measuring a wide range of behavioural parameters, this strategy also simultaneously reveals deleterious side effects of tested drugs that may confound the analysis.

      Considering the wealth of data generated in this study regarding important human disease genes, it is regrettable that the data is not actually made accessible. This diminishes the study's utility. It would have a far greater impact if an accessible and user-friendly online interface were established to facilitate data querying and feature extraction for specific mutants. This would empower researchers to compare their findings with the extensive dataset created here. Otherwise, one is left with a very limited set of exploitable data.

      Another technical limitation of the study is the use of single alleles. Large deletion alleles were generated by CRISPR/Cas9 gene editing. At first glance, this seems like a good idea because it limits the risk that background mutations, present in chemically-generated alleles, will affect behavioral parameters. However, these large deletions can also remove non-coding RNAs or other regulatory genetic elements, as found, for example, in introns. Therefore, it would be prudent to validate the behavioral effects by testing additional loss-of-function alleles produced through early stop codons or targeted deletion of key functional domains.

    1. Reviewer #1 (Public Review):

      De Seze et al. investigated the role of guanine exchange factors (GEFs) in controlling cell protrusion and retraction. In order to causally link protein activities to the switch between the opposing cell phenotypes, they employed optogenetic versions of GEFs which can be recruited to the plasma membrane upon light exposure and activate their downstream effectors. Particularly the RhoGEF PRG could elicit both protruding and retracting phenotypes. Interestingly, the phenotype depended on the basal expression level of the optoPRG. By assessing the activity of RhoA and Cdc42, the downstream effectors of PRG, the mechanism of this switch was elucidated: at low PRG levels, RhoA is predominantly activated and leads to cell retraction, whereas at high PRG levels, both RhoA and Cdc42 are activated but PRG also sequesters the active RhoA, therefore Cdc42 dominates and triggers cell protrusion. Finally, they create a minimal model that captures the key dynamics of this protein interaction network and the switch in cell behavior.

      The conclusions of this study are strongly supported by data. Perhaps the manuscript could include some further discussion to for example address the low number of cells (3 out of 90) that can be switched between protrusion and retraction by varying the frequency of the light pulses to activate opto-PRG. Also, the authors could further describe their "Cell finder" software solution that allows the identification of positive cells at low cell density, as this approach will be of interest for a wide range of applications.

    2. Reviewer #2 (Public Review):

      Summary:

      This manuscript builds from the interesting observation that local recruitment of the DHPH domain of the RhoGEF PRG can induce local retraction, protrusion, or neither. The authors convincingly show that these differential responses are tied to the level of expression of the PRG transgene. This response depends on the Rho-binding activity of the recruited PH domain and is associated with and requires (co?)-activation of Cdc42. This begs the question of why this switch in response occurs. They use a computational model to predict that the timing of protein recruitment can dictate the output of the response in cells expressing intermediate levels and found that, "While the majority of cells showed mixed phenotypes irrespectively of the activation pattern, in few cells (3 out of 90) we were able to alternate the phenotype between retraction and protrusion several times at different places of the cell by changing the frequency while keeping the same total integrated intensity (Figure 6F and Supp Movie)."

      Strengths:

      The experiments are well-performed and nicely documented. However, the molecular mechanism underlying the shift in response is not clear (or at least clearly described). In addition, it is not clear that a prediction that is observed in ~3% of cells should be interpreted as confirming a model, though the fit to the data in 6B is impressive.

      Overall, the main general biological significance of this work is that RhoGEF can have "off target effects". This finding is significant in that an orthologous GEF is widely used in optogenetic experiments in drosophila. It's possible that these findings may likewise involve phenotypes that reflect the (co-)activation of other Rho family GTPases.

      Weaknesses:

      The manuscript makes a number of untested assumptions and the underlying mechanism for this phenotypic shift is not clearly defined.

      This manuscript is missing a direct phenotypic comparison of control cells to complement that of cells expressing RhoGEF2-DHPH at "low levels" (the cells that would respond to optogenetic stimulation by retracting); and cells expressing RhoGEF2-DHPH at "high levels" (the cells that would respond to optogenetic stimulation by protruding). In other words, the authors should examine cell area, the distribution of actin and myosin, etc in all three groups of cells (akin to the time zero data from figures 3 and 5, with a negative control). For example, does the basal expression meaningfully affect the PRG low-expressing cells before activation e.g. ectopic stress fibers? This need not be an optogenetic experiment, the authors could express RhoGEF2DHPH without SspB (as in Fig 4G).

      Relatedly, the authors seem to assume ("recruitment of the same DH-PH domain of PRG at the membrane, in the same cell line, which means in the same biochemical environment." supplement) that the only difference between the high and low expressors are the level of expression. Given the chronic overexpression and the fact that the capacity for this phenotypic shift is not recruitment-dependent, this is not necessarily a safe assumption. The expression of this GEF could well induce e.g. gene expression changes.

      The third paragraph of the introduction, which begins with the sentence, "Yet, a large body of works on the regulation of GTPases has revealed a much more complex picture with numerous crosstalks and feedbacks allowing the fine spatiotemporal patterning of GTPase activities" is potentially confusing to readers. This paragraph suggests that an individual GTPase may have different functions whereas the evidence in this manuscript demonstrates, instead, that *a particular GEF* can have multiple activities because it can differentially activate two different GTPases depending on expression levels. It does not show that a particular GTPase has two distinct activities. The notion that a particular GEF can impact multiple GTPases is not particularly novel, though it is novel (to my knowledge) that the different activities depend on expression levels.

      These descriptions are not precise. What is the nature of the competition between RhoA and Cdc42? Is this competition for activation by the GEFs? Is it a competition between the phenotypic output resulting from the effectors of the GEFs? Is it competition from the optogenetic probe and Rho effectors and the Rho biosensors? In all likelihood, all of these effects are involved, but the authors should more precisely explain the underlying nature of this phenotypic switch. Some of these points are clarified in the supplement, but should also be explicit in the main text.

    1. Reviewer #1 (Public Review):

      Summary:

      Sex differences in the liver gene expression and function have previously been proposed to be caused by sex differences in the pattern growth hormone (GH) secretion by the pituitary, which are established by the effects of testicular hormones that act on the hypothalamus perinatally to masculinize control of pituitary GH secretion beginning at puberty and for the rest of the animal's life. The Waxman lab has previously implicated GH control of STAT5 as a critical event leading to a masculine pattern of gene expression. The present study separates male-biased regulatory sites associated with the male-biased genes into different classes based on their responsiveness to the cyclic male pattern of STAT5 activity, and investigates DNAse hypersensitivity sites (DHS) of different classes showing cyclic sex-bias or not. It further reports on the binding of transcription factors to STAT5-sensitive DHS, and involvement of specific histone marks at these sites. The study argues that STAT5 is the proximate factor regulating chromatin accessibility in about 1/3 of male-biased DHS that are sexually differentiated by GH secretion. The authors propose the pulsatile GH secretion as a novel proximate mechanism of regulating chromatin accessibility to cause sex differences.

      Strengths:

      The study offers new insight into the effects of hypophysectomy and injection of GH on different classes of sex-biased genes in mouse liver. The results support the general conclusion of the authors. Cyclic secretion of other hormones (for example, estrous secretion of estrogens and progesterone) are well known to cause sex differences in multiple organs in rodents, and it will be interesting to assess if these cyclic secretions induce similar changes in chromatin accessibility causing female tissue gene expression to differ from that of males.

      Weaknesses:

      The authors argue for two major mechanisms controlling sexual bias in liver gene expression, and analyze in depth one of these mechanisms. The focus is on the group of DHS (about 1/3 of all male-biased DHS) in which the sex bias is controlled by cyclic secretion of growth hormone (GH) in males, compared to static and low growth hormone in adult females. The sex difference in pituitary secretion of GH is induced by permanent effects of androgens acting on the hypothalamus perinatally. The manuscript study would be improved by further discussion of the mechanistic relationship between this class of sex-biased DHS and the other 2/3 of liver DHS that also show male-biased accessibility but whose chromatin does not respond directly to GH-stimulated STAT5. Previous studies, including those in the Waxman lab (PMIDs: 26959237, 18974276, 35396276) suggest castration of males or gonadectomy of both sexes eliminates most sex differences in mRNA expression in mouse liver, and/or that androgens such as DHT or testosterone administered in adulthood potentially reverses the effects of gonadectomy and/or masculinizes liver gene expression. It is not clear from the present discussion whether the GH/STAT5 cyclic effects to masculinize chromatin status require the presence of androgens in adulthood to masculinize pituitary GH secretion. Are there analyses of the present (or past) data that might provide evidence about a dual role for GH and androgen acting on the same genes? For example, are sex-biased DHS bound by androgen-dependent factors or show other signs of androgen sensitivity? Are histone marks associated with DHS regulated by androgens? Moreover, it would help if the authors indicate whether they believe that the "constitutive" static sex differences in the larger 2/3 set of male-biased DHS are the result of "constitutive" (but variable) action of testicular androgens in adulthood. Although the present study is nicely focused on the GH pulse-sensitive DHS, is there mechanistic overlap in sex-biasing mechanisms with the larger static class of sex-biased liver DHS?

    2. Reviewer #2 (Public Review):

      Summary:<br /> The present work addresses the mechanisms linking the sex-dependent temporal GH secretion patterns to the robust sex differences in chromatin accessibility and transcription factor binding that ultimately regulate sexually dimorphic liver gene expression. Using DNAseq analysis genomic sites hypersensitive to cleavage by DNase I, DNase hypersensitive sites [DHS] were studied in hepatocytes from male and female mice. DHS in the genome correspond to accessible chromatin regions and encompass key regulatory elements, including enhancers, promoters, insulators, and silencers, often flanked by specific histone modifications, and all of these players were described in different settings of GH action. Importantly, the dynamics of sex-dependent and independent chromatin accessibility linked to STAT5 binding were evaluated. For that purpose, hepatic samples from mice were divided into STAT high and STAT low binding by EMSA screening. With this information changes in DHS related to STAT binding were calculated in both sexes, giving an approximation of chromatin opening in response to STAT5, or alternatively to hypophsectomy, or a single GH pulse. More the 800 male-biased DHS (from a total of more than 70000 DHS) regions were identified in the STAT5 high groups, implying that the binding of a plasma GH pulse activates STAT5, and evokes a dynamic cycle of male liver chromatin opening and closing at sites that comprised 31% of all male-biased DHS. This proves that the pulsatility of plasma GH stimulation confers significant male bias in chromatin accessibility, and STAT5 binding at a fraction of the genomic sites linked to sex-biased liver gene expression and liver disease. As a proof of concept, authors show that a single physiological replacement dose or pulse of GH given to hypophysectomized mice recapitulate, within 30 min, the pulsatile re-opening of chromatin seen in pituitary-intact male mouse liver.

      In another male-biased DHS set (69% of male-biased DHS), chromatin accessibility was static, that is unchanged across the peaks and valleys of GH-induced liver STAT5 activity and mapped to a set of target genes and processes distinct though sometimes overlapping those of the dynamic male-biased DHS.

      In view of these distinct dynamic and static DHS in males, authors evaluated key epigenetic features distinguishing the dynamic STAT5-driven mechanism of chromatin opening from that of static male-biased DHS, which are constitutively open in the male liver but closed in the female liver. The analysis of histone marks enriched at each class of sex-biased DHS indicated exquisite differences in the epigenetic mechanisms that mediate sex-specific gene repression in each sex. For example, H3K27me3 and H3K9me3, two widely used repressive histone marks, are used in a unique way in each sex to enforce sex differences in chromatin states at sex-biased DHS.

      Finally, the work recapitulates and explains the classifications of sex dimorphic genes made in previous works. Sex-biased and pituitary hormone-dependent DHS act as regulatory elements with a positive enhancer potential, to induce or maintain gene expression in the intact liver by sustaining an open chromatin in the case of class I male-biased DHS and class I male-biased genes in the male liver. Contrariwise DHS may participate in the inhibition of gene expression by maintaining a closed chromatin state, as in the case of class II male-biased DHS and class II female-biased genes in male liver.

      These results as a whole present a complex mechanism by which GH regulates the sexual dimorphism of liver genes in order to cope with the metabolic needs of each sex. In a complete story, the information on chromatin accessibility, histone modification, and transcription factor binding was integrated to elucidate the complex patterns of transcriptional regulation, which is sexually dimorphic in the liver.

      Strengths:<br /> The work presents a novel insight into the fundamental underlying epigenetic mechanisms of sex-biased gene regulation.<br /> Results are supported by numerous Tables, and Supplementary Tables with the raw data, which present the advantage that they may be reanalyzed in the future to prove new hypotheses.

      Weaknesses<br /> It is a complicated work to analyze, even though the main messages are clearly conveyed.

    1. Reviewer #2 (Public Review):

      Although Trabid missense mutations are identified across a range of neurodevelopmental disorders, its role in neurodevelopment is not understood. Here the authors study two different patient mutations and implicate defects in its deubiquitylating activity and interactions with STRIPAK. Knockin mice for these mutations impaired trafficking of APC to microtubule plus ends, with consequent defects in neuronal growth cone and neurite outgrowth.

      The authors focus on R438W and A451V, two missense mutations seen in patients. Recombinant fragments showed R438W is nearly completely DUB-dead whereas A451V showed normal activity but failed to efficiently precipitate STRIPAK. Knockin of these mutations showed a partially penetrant reduced cortical neuronal and glial cell numbers and reduced TH+ neurons and their neuronal processes. Cell culture demonstrated that both DUB and STRIPAK-binding activities of Trabid are required for efficient deubiquitylation of APC in cells, and alter APC transport along neurites. APC-tdTomato fluorescent reporter mice crossed with the Trabid mutants confirmed these results. The results suggest that Trabid's mechanism of action is to suppress APC ubiquitylation to regulate its intracellular trafficking and neurite formation.

    2. Reviewer #1 (Public Review):

      In this work, Frank, Bergamasco, Mlodzianoski et al study two microcephaly-associated patient variants in TRABID to identify and characterize a previously unrecognized role of this deubiquitylation enzyme during neurodevelopment. The authors generate TRABID p.R438W and p.A451V knock in mice, which exhibit smaller neuronal and glial cell densities as well as motor deficits, phenotypes that are consistent with the congenital defects observed in the patients. Through in vitro and cellular immunoprecipitation assays, the authors demonstrate that the p.R438W variant impairs the K29- and K63-chain cleavage activity of TRABID, while the p.A451V variant reduces binding to the STRIPAK complex, a previously identified TRABID interactor with established functions in cytoskeletal organization and neural development. Ubiquitylation assays performed in HEK293T cells further reveal that the hypomorphic patient variants are deficient in deubiquitylating APC, a previously identified substrate of TRABID that has been shown to control the neuronal cortical cytoskeleton during neurite outgrowth. Ex vivo experiments provide evidence that axonal APC trafficking and neurite outgrowth is disturbed in differentiating neural progenitors isolated from mouse embryos carrying Trabid patient alleles. From these experiments the authors propose a model in which TRABID- and STRIPAK-dependent APC deubiquitylation regulates its axonal trafficking to ensure faithful neurite outgrowth and misregulation of this function leads to neurodevelopmental phenotypes in TRABID/ZRANB1 patients.

    1. Reviewer #1 (Public Review):

      Summary:

      Because of the role of membrane tension in the process, and that caveloae regulate membrane tension, the authors looked at the formation of TEMs in cells depleted of Caveolin1 and Cavin1 (PTRF): They found a higher propensity to form TEMs, spontaneously (a rare event) and after toxin treatment, in both Caveolin 1 and Cavin 1. They show that in both siRNA-Caveolin1 and siRNA-Cavin1 cells, the cytoplasm is thinner. They show that in siCaveolin1 only, the dynamics of opening are different, with notably much larger TEMs. From the dynamic model of opening, they predict that this should be due to a lower bending rigidity of the membrane. They measure the bending rigidity from Cell-generated Giant liposomes and find that the bending rigidity is reduced by approx. 50%.

      Strengths:

      They also nicely show that caveolin1 KO mice are more susceptible to death from infections with pathogens that create TEMs.

      Overall, the paper is well-conducted and nicely written. There are however a few details that should be addressed.

    2. Reviewer #2 (Public Review):

      Summary:

      The manuscript by Morel et al. aims to identify some potential mechano-regulators of transendothelial cell macro-aperture (TEM). Guided by the recognized role of caveolar invaginations in buffering the membrane tension of cells, the authors focused on caveolin-1 and associated regulator PTRF. They report a comprehensive in vitro work based on siRNA knockdown and optical imaging approach complemented with an in vivo work on mice, a biophysical assay allowing measurement of the mechanical properties of membranes, and a theoretical analysis inspired by soft matter physics.

      Strengths:

      The authors should be complimented for this multi-faceted and rigorous work. The accumulation of pieces of evidence collected from each type of approach makes the conclusion drawn by the authors very convincing, regarding the new role of cavolin-1 as an individual protein instead of the main molecular component of caveolae. On a personal note, I was very impressed by the quality of STORM images (Fig. 2) which are very illuminating and useful, in particular for validating some hypotheses of the theoretical analysis.

      Weaknesses:

      While this work pins down the key role of caveolin-, its mechanism remains to be further investigated. The hypotheses proposed by the authors in the discussions about the link between caveolin and lipids/cholesterol are very plausible though challenging. Even though we may feel slightly frustrated by the absence of data in this direction, the quality and merit of this paper remain.

      - The analogy with dewetting processes drawn to derive the theoretical model is very attractive. However, although part of the model has already been published several times by the same group of authors, the definition of the effective membrane rigidity of a plasma membrane including the underlying actin cortex, was very vague and confusing. Here, for the first time, thanks to the STORM analysis, the authors show that HUVECs intoxicated by ExoC3 exhibit a loose and defective cortex with a significantly increased mesh size. This argues in favor of the validity of Helfrich formalism in this context. Nonetheless, there remains a puzzle. Experimentally, several TEMs are visible within one cell. Theoretically, the authors consider a simultaneous opening of several pores and treat them in an additive manner. However, when one pore opens, the tension relaxes and should prevent the opening of subsequent pores. Yet, experimentally, as seen from the beautiful supplementary videos, several pores open one after the other. This would suggest that the tension is not homogeneous within an intoxicated cell or that equilibration times are long. One possibility is that some undegraded actin pieces of the actin cortex may form a barrier that somehow isolates one TEM from a neighboring one. Could the authors look back at their STORM data and check whether intoxicated cells do not exhibit a bimodal population of mesh sizes and possibly provide a mapping of mesh size at the scale of a cell? In particular, it is quite striking that while bending rigidity of the lipid membrane is expected to set the maximal size of the aperture, most TEMs are well delimited with actin rings before closing. Is it because the surrounding loose actin is pushed back by the rim of the aperture? Could the authors better explain why they do not consider actin as a player in TEM opening?

      - Instead of delegating to the discussion the possible link between caveolin and lipids as a mechanism for the enhanced bending rigidity provided by caveolin-1, it could be of interest for the readership to insert the attempted (and failed) experiments in the result section. For instance, did the authors try treatment with methyl-beta-cyclodextrin that extracts cholesterol (and disrupts caveolar and clathrin pits) but supposedly keeps the majority of the pool of individual caveolins at the membrane?

      - Tether pulling experiments on Plasma membrane spheres (PMS) are real tours de force and the results are quite convincing: a clear difference in bending rigidity is observed in controlled and caveolin knock-out PMS. However, one recurrent concern in these tether-pulling experiments is to be sure that the membrane pulled in the tether has the same composition as the one in the PMS body. The presence of the highly curved neck may impede or slow down membrane proteins from reaching the tether by convective or diffusive motion. Could the authors propose an experiment to demonstrate that caveolin-1 proteins are not restricted to the body of the PMS and can access to the nanometric tether?

    1. Reviewer #1 (Public Review):

      Summary:

      The authors develop a memory consolidation theory utilizing the recall quality in the short-term memory system to decide what to consolidate in the long-term memory (LTM). The theory is based on a set of previously proposed models identifying memories and synaptic weights (without neuronal activity) with an addition of the second set of weights responsible for long-term storage. The rigorous analysis and numerical experiments show that under some assumptions, the long-term system achieves a high signal-to-noise ratio, particularly much higher than concurrently learning or localized in the same synapses LTM.

      Strengths:

      The authors take on an important problem of designing robust memory consolidation that fits the numerous experimental observations and, to a large extent, they succeed. The proposed solution is general and generalized to multiple contexts. The mathematical treatment is solid and convincing.

      Weaknesses:

      The presented model seems to be tuned for learning repetitive events. However, single-shot learning, for example, under fear conditioning or if a presented stimulus is astonishing, seems to contradict the proposed framework. I would assume that part of the load could be taken by a reply system that could vigorously replay more surprising events, but it seems to still not exactly match the proposed scheme.

      For context, I would like to see the comparison/discussion of the wide range of models on synaptic tagging for consolidation by various types of signals. Notably, studies from Wulfram Gerstner's group (e.g., Brea, J., Clayton, N. S., & Gerstner, W. (2023). Computational models of episodic-like memory in food-caching birds. Nature Communications, 14(1); and studies on surprise).

      The models that are taken for comparison with the slow but otherwise identical to STM LTM could be incapable per design. Reducing the probability of switching independently of the previous presentation does not make the system "slow"; instead, it should integrate previous signals (and thus slowly remove independent noise).

      The usage of terms and streamlining of writing could be improved for better understanding.

    2. Reviewer #2 (Public Review):

      Summary:

      In the manuscript "Recall-Gated Consolidation: A Model for Learning and Memory in Neural Systems," the authors suggest a computational mechanism called recall-gated consolidation, which prioritizes the storage of previously experienced synaptic updates in memory. The authors investigate the mechanism with different types of learning problems including supervised learning, reinforcement learning, and unsupervised auto-associative memory. They rigorously analyse the general mechanism and provide valuable insights into its benefits.

      Strengths:

      The authors establish a general theoretical framework, which they translate into three concrete learning problems. For each, they define an individual mathematical formulation. Finally, they extensively analyse the suggested mechanism in terms of memory recall, consolidation dynamics, and learnable timescales.

      The presented model of recall-gated consolidation covers various aspects of synaptic plasticity, memory recall, and the influence of gating functions on memory storage and retrieval. The model's predictions align with observed spaced learning effects.

      The authors conduct simulations to validate the recall-gated consolidation model's predictions, and their simulated results align with theoretical predictions. These simulations demonstrate the model's advantages over consolidating any memory and showcase its potential application to various learning tasks.

      The suggestion of a novel consolidation mechanism provides a good starting point to investigate memory consolidation in diverse neural systems and may inspire artificial learning algorithms.

      Weaknesses:

      I appreciate that the authors devoted a specific section to the model's predictions, and point out how the model connects to experimental findings in various model organisms. However, the connection is rather weak and the model needs to make more specific predictions to be distinguishable from other theories of memory consolidation (e.g. those that the authors discuss) and verifiable by experimental data.

      While the article extensively discusses the strengths and advantages of the recall-gated consolidation model, it provides a limited discussion of potential limitations or shortcomings of the model, such as the missing feature of generalization, which is part of previous consolidation models. The model is not compared to other consolidation models in terms of performance and how much it increases the signal-to-noise ratio. It is only compared to a simple STM or a parallel LTM, which I understand to be essentially the same as the STM but with a different timescale (so not really an alternative consolidation model). It would be nice to compare the model to an actual or more sophisticated existing consolidation model to allow for a fairer comparison.

      The article is lengthy and dense and it could be clearer. Some sections are highly technical and may be challenging to follow. It could benefit from more concise summaries and visual aids to help convey key points.

    3. Reviewer #3 (Public Review):

      Summary:

      In their article "Theory of systems memory consolidation via recall-gated plasticity ", Jack Lindsey and Ashok Litwin-Kumar describe a new model for systems memory consolidation. Their idea is that a short-term memory acts not as a teacher for a long-term memory - as is common in most complementary learning systems - but as a selection module that determines which memories are eligible for long-term storage. The criterion for the consolidation of a given memory is a sufficient strength of recall in the short-term memory.

      The authors provide an in-depth analysis of the suggested mechanism. They demonstrate that it allows substantially higher SNRs than previous synaptic consolidation models, provide an extensive mathematical treatment of the suggested mechanism, show that the required recall strength can be computed in a biologically plausible way for three different learning paradigms, and illustrate how the mechanism can explain spaced training effects.

      Strengths:

      The suggested consolidation mechanism is novel and provides a very interesting alternative to the classical view of complementary learning systems. The analysis is thorough and convincing.

      Weaknesses:

      The main weakness of the paper is the equation of recall strength with the synaptic changes brought about by the presentation of a stimulus. In most models of learning, synaptic changes are driven by an error signal and hence cease once the task has been learned. The suggested consolidation mechanism would stop at that point, although recall is still fine. The authors should discuss other notions of recall strength that would allow memory consolidation to continue after the initial learning phase. Aside from that, I have only a few technical comments that I'm sure the authors can address with a reasonable amount of work.

    1. Reviewer #1 (Public Review):

      Summary:

      The goal of Pawel et al. is to provide a more rigorous and quantitative approach for judging whether or not an initial null finding (conventionally with p >= 0.05) has been replicated by a second similarly null finding. They discuss important objections to relying on the qualitative significant/non-significant dichotomy to make this judgement. They present two complementary methods (one frequentist and the other Bayesian) which provide a superior quantitative framework for assessing the replicability of null findings.

      Strengths:

      Clear presentation; illuminating examples drawn from the well-known Reproducibility Project: Cancer Biology data set; R-code that implements suggested analyses. Using both methods as suggested provides a superior procedure for judging the replicability of null findings.

      Weaknesses:

      The proposed frequentist and the Bayesian methods both rely on binary assessments of an original finding and its replication. I'm not sure if this is a weakness or is inherent to making binary decisions based on continuous data.

      For the frequentist method, a null finding is considered replicated if the original and replication 90% confidence intervals for the effects both fall within the equivalence range. According to this approach, a null finding would be considered replicated if p-values of both equivalences tests (original and replication) were, say, 0.049, whereas would not be considered replicated if, for example, the equivalence test of the original study had a p-value of 0.051 and the replication had a p-value of 0.001. Intuitively, the evidence for replication would seem to be stronger in the second instance. The recommended Bayesian approach similarly relies on a dichotomy (e.g. Bayes factor > 1).

    1. Reviewer #2 (Public Review):

      Summary:<br /> The authors report the results of QM/MM simulations and kinetic measurements for the phosphoryl-transfer step in adenylate kinase. The main assertion of the paper is that a wide transition state ensemble is a key concept in enzyme catalysis as a strategy to circumvent entropic barriers. This assertion is based on the observation of a "structurally wide" set of energetically equivalent configurations that lie along the reaction coordinate in QM/MM simulations, together with kinetic measurements that suggest a decrease in the entropy of activation.

      Strengths:<br /> The study combines theoretical calculations and supporting experiments.

      Weaknesses:<br /> The role(s) of entropy in enzyme catalysis has been discussed extensively in the literature, from the Circe effect proposed by Jencks and many other works. The current paper hypothesizes a "wide" transition state ensemble as a catalytic strategy and key concept in enzyme catalysis. Overall, it is not clear the degree to which this hypothesis is supported by the data. The reasons are as follows:

      1. Enzyme catalysis reflects a rate enhancement with respect to a baseline reaction in solution. In order to assert that something is part of a catalytic strategy of an enzyme, it would be necessary to demonstrate from simulations that the activation entropy for the baseline reaction is indeed greater and the transition state ensemble less "wide". Alternatively stated, when indicating there is a "wide transition state ensemble" for the enzyme system - one needs to indicate that is with respect to the non-enzymatic reaction. However, these simulations were not performed and the comparisons were not demonstrated.

      2. The observation of a "wide conformational ensemble" is not a quantitative measure of entropy. In order to make a meaningful computational prediction of the entropic contribution to the activation of free energy, one would need to perform free energy simulations over a range of temperatures (for the enzymatic and non-enzymatic systems). Such simulations were not performed, and the entropy of activation was thus not quantified by the computational predictions.

      3. The authors indicate that lid-opening, essential for product release, and not P-transfer is the rate-limiting step in the catalytic cycle and Mg2+ accelerates both steps. How is it certain that the kinetic measurements are reporting on the chemical steps of the reaction, and not other factors such as metal ion binding or conformational changes?

      4. The authors explore different starting states for the chemical steps of the reaction (e.g., different metal ion binding and protonation states), and conclude that the most reactive enzyme configuration is the one with the more favorable reaction-free energy barrier. However, it is not clear what is the probability of observing the system in these different states as a function of pH and metal ion concentration without performing appropriate pKa and metal ion binding calculations. This was not done, and hence these results seem somewhat inconclusive.

    2. Reviewer #1 (Public Review):

      Summary:<br /> This study investigated the phosphoryl transfer mechanism of the enzyme adenylate kinase, using SCC-DFTB quantum mechanical/molecular mechanical (QM/MM) simulations, along with kinetic studies exploring the temperature and pH dependence of the enzyme's activity, as well as the effects of various active site mutants. Based on a broad free energy landscape near the transition state, the authors proposed the existence of wide transition states (TS), characterized by the transferring phosphoryl group adopting a meta-phosphate-like geometry with asymmetric bond distances to the nucleophilic and leaving oxygens. In support of this finding, kinetic experiments were conducted with Ca2+ ions (instead of Mg2+) at different temperatures, which revealed a negative entropy of activation. Overall, in its present form, the manuscript has more weaknesses in terms of interpretation of the simulation results than strengths, which need to be addressed by the authors.

      There are several major concerns:

      First, the authors' claim that the catalytic mechanism of adenylate kinase (Adk) has not been previously studied by QM/MM free energy simulations is somewhat inaccurate. In fact, two different groups have previously investigated the catalytic mechanism of Adk. The first study, cited by the authors themselves, used the string method to determine the minimum free energy profile, but resulted in an unexpected intermediate; note that they obtained a minimum free energy profile, not a minimum energy profile. The second study (Ojedat-May et al., Biochemistry 2021 and Dulko-Smith et al., J Chem Inf Model 2023) overlaps substantially with the present study, but its main conclusions differ from those of the present study. Therefore, a thorough discussion comparing the results of these studies is needed.

      Second, the interpretation of the TS ensemble needs deeper scrutiny. In general, the TS is defined as the hypersurface separating the reactant and product states. Consequently, if a correct reaction coordinate is defined, trajectories initiated at the TS should have equal probabilities of reaching either the reactant or product state; if an approximate reaction coordinate, such as the distance difference used in this study, is used, recrossing may be introduced as a correction into the probabilities. Thus, in order to establish the presence of a wide TS region, it is necessary to characterize the TS ensemble through a commitment analysis across the TS region.

      The relatively flat free energy surface observed near TS in Figures 1c and 2a, may be attributed to the cleavage and formation of P-O bonds relative to the marginally stable phosphorane intermediate, as described in Zhou et al.'s work (Chem Rev 1998, 98:991). This scenario is clearly different from a wide TS ensemble concept. In addition, given the inherent similarity in reactivity of the two oxygens towards the phosphoryl atom, it is reasonable to expect a single TS as shown in Figure 1 - supplement 9, rather than two TSs with a marginally stable intermediate as shown in Figure 1c. Consequently, it remains uncertain whether the elongated P-O bonds observed near the TS and their asymmetry are realistic or potentially an artifact of the pulling/non-equilibrium MD simulations. Further validation in this regard is required.

      Third, there are several inconsistencies in the free energy results and their discussion. First, the data from Kerns et al. (Kerns, NSMB, 2015, 22:124) indicate that the ATP/AMP -> ADP/ADP reaction proceeds at a faster rate than the ADP/ADP -> ATP/AMP reaction, suggesting that the ADP/ADP state has a lower free energy (approximately -1.0 kcal/mol) compared to the ATP/ATP state. This contrasts with Figure 1c, which shows a higher free energy of 6.0 kcal/mol for the ATP/ADP state. This discrepancy needs to be discussed. Furthermore, the barrier for ATP/AMP -> ADP/ADP, calculated to be 20 kcal/mol for the fully charged state, exceeds the corresponding barrier for the monoprotonated state. This cautions against the conclusion that the fully charged state is the reactive state. In addition, the difference in the barrier for the no-Mg2+ system compared to the barriers with Mg2+ is substantially too large (21 kcal/mol from the calculation versus 7 kcal/mol from the experimental values). These inconsistencies raise questions as to their origins, whether they result from the use of the pulling/non-equilibrium MD simulation approach, which may yield unrealistic TS geometries, or from potential issues related to the convergence of the determined free energy values. To address this issue, a comparison of results obtained by umbrella sampling and similar methodologies is necessary.

    3. Reviewer #3 (Public Review):

      Summary:<br /> By conducting QM/MM free energy simulations, the authors aimed to characterize the mechanism and transition state for the phosphoryl transfer in adenylate kinase. The qualitative reliability of the QM/MM results has been supported by several interesting experimental kinetic studies. However, the interpretation of the QM/MM results is not well supported by the current calculations.

      Strengths:<br /> The QM/MM free energy simulations have been carefully conducted. The accuracy of the semi-empirical QM/MM results was further supported by DFT/MM calculations, as well as qualitatively by several experimental studies.

      Weaknesses:<br /> 1. One key issue is the definition of the transition state ensemble. The authors appear to define this by simply considering structures that lie within a given free energy range from the barrier. However, this is not the rigorous definition of transition state ensemble, which should be defined in terms of committor distribution. This is not simply an issue of semantics, since only a rigorous definition allows a fair comparison between different cases - such as the transition state in an enzyme vs in solution, or with and without the metal ion. For a chemical reaction in a complex environment, it is also possible that many other variables (in addition to the breaking and forming P-O bonds) should be considered when one measures the diversity in the conformational ensemble.

      2. While the experimental observation that the activation entropy differs significantly with and without the Ca2+ ion is interesting, it is difficult to connect this result with the "wide" transition state ensemble observed in the QM/MM simulations so far. Even without considering the definition of the transition state ensemble mentioned above, it is unlikely that a broader range of P-O distances would explain the substantial difference in the activation entropy measured in the experiment. Since the difference is sufficiently large, it should be possible to compute the value by repeating the free energy simulations at different temperatures, which would lead to a much more direct evaluation of the QM/MM model/result and the interpretation.

    1. Reviewer #1 (Public Review):

      Summary:<br /> Using a state-of-the-art image analysis pipeline the authors report that muscle cell hypertrophy in mice and humans occurs primarily through an increase in the number of myofibrils (myofibrillogenesis) and not myofibril hypertrophy.

      Strengths:<br /> A strength of the study is the development and validation of an automated image analysis pipeline to quantify myofibril size and abundance in mouse and human muscle cells. In addition to the pipeline, which requires relatively readily available microscopy equipment (an additional strength) is the development of a methodology to optimally prepare muscle samples for high-resolution imaging.

      Weaknesses:<br /> A weakness of the study was that only one time-point was assessed during hypertrophy. As mentioned by the authors, this precluded an assessment of the myofibril splitting mechanism. The second weakness was the criteria (aspect ratio of <2.5:1) used to identify a myofibril which excluded a significant number of myofibrils from analysis. How might the inclusion of these odd-shaped myofibrils impact the outcome of the study?

    2. Reviewer #2 (Public Review):

      Summary:<br /> In this work, the authors sought to 1) establish a method for measuring muscle fiber subcellular structure (myofibrils) using common, non-specialized laboratory techniques and equipment, and 2) use this method to provide evidence on whether loading-induced muscle fiber growth was the result of myofibril growth (of existing myofibrils) or myofbrillogenesis (creation of new myofibrils) in mice and humans. The latter is a fundamental question in the muscle field. The authors succeeded in their aims and provided useful methods for the muscle field and detailed insight into muscle fiber hypertrophy; specifically, that loading-induced muscle fiber hypertrophy may be driven mostly by myofibrillogenesis.

      Strengths:<br /> 1) The usage of murine and human samples to provide evidence on myofibril hypertrophy vs myofibrillogenesis.<br /> 2) A nice historical perspective on myofibrillogenesis in skeletal muscle.<br /> 3) The description of a useful and tractable IHC imaging method for the muscle biology field supported by extensive validation against electron microscopy.<br /> 4) Fundamental information on how myofiber hypertrophy ensues.

      Weaknesses:<br /> 1) The usage of young growing mice (8-10 weeks) versus adult mice (>4 months) in the murine mechanical overload experiments, as well as no consideration for biological sex. The former point is partly curtailed by the adult human data that is provided (male only). Still, the usage of adult mice would be preferable for these experiments given that maturational growth may somehow affect the outcomes. For the latter point, it is not clear whether male or female mice were used.

      2) Information on whether myofibrillogenesis is dependent on hypertrophy induced by loading, or just hypertrophy in general. To provide information on this, the authors could use, for instance, inducible Myostatin KO mice (a model where hypertrophy and force production are not always in lockstep) to see whether hypertrophy independent from load induces the same result as muscle loading regarding myofibrillogenesis.

      3) Limited information on Type 1 fiber hypertrophy. A "dual overload" model is used for the mouse where the soleus is also overloaded, but presumably, the soleus was too damaged to analyze. Exploring hypertrophy of murine Type 1 fibers using a different model (weight pulling, weighted wheel running, or forced treadmill running) would be a welcome addition.

    3. Reviewer #3 (Public Review):

      Summary:<br /> Radial muscle growth involves an increase in overall muscle cross-sectional area. For decades this process has been described as the splitting of myofibrils to produce more myofibrils during the growth process. However, a closer look at the original papers shows that the evidence underlying this description was incomplete. In this paper, the authors have developed a novel method using fluorescence microscopy to directly measure myofibril size and number. Using a mouse model of mechanical loading and a human model of resistance exercise they discovered that myofibrillogenesis is playing a key role in the radial growth of muscle fibers.

      Strengths:<br /> 1. Well-written and clear description of hypothesis, background, and experiments.<br /> 2. Compelling series of experiments.<br /> 3. Different approaches to test the hypothesis.<br /> 4. Rigorous study design.<br /> 5. Clear interpretation of results.<br /> 6. Novel findings that will be beneficial to the muscle biology field.<br /> 7. Innovative microscopy methods that should be widely available for use in other muscle biology labs.

      Weaknesses:<br /> Supplemental Figure 1 is not very clear.

    1. Reviewer #2 (Public Review):

      With the data presented in this manuscript, the authors help complete the set of high-resolution HER2-associated complex heterodimer structures as well as HER4 homodimer structures in the presence of NRG1b and BTC. Purification of HER2-HER4 heterodimers appears to be inherently challenging due to the propensity of HER4 to form homodimers. The authors have used an effective scheme to isolate these HER2-HER4 heterodimers and have employed graphene-oxide grid chemistry to presumably overcome the issues of low sample yield for solving cryo-EM structures of these complexes. The authors conclude HER2-HER4 heterodimers with either ligand are conformationally homogeneous relative to the HER4 homodimers. The HER2-HER4 heterodimers also appear to be better stabilized compared to other published HER2 heterodimers. The ability to model glycans in the context of HER4 homodimers is exciting to see and provides a strong rationale for the stability of these structures. Overall, the work is of great interest and the methods described in this work would benefit a wide variety of structural biology projects.

      Major comments-<br /> 1. The HER2-HER4 heterodimer with BTC appears to be the lowest resolution of the reported structures. Although the authors claim the overall structure is similar to the HER2-HER4 heterodimer with NRG1b, it is therefore unclear whether the lower resolution of the BTC is due to challenging data collection conditions, sample preparation, or conformational dynamics not discernible due to the lower resolution. The authors should minimally clarify where they see the possible issues arising for the lower resolution as this is a key aspect of the work.

      2. For all maps, authors should display Euler angle plots from their final refinements to assess the degree of preferred orientation. Judging by the sphericity, it appears all the structures, except HER2-HER4-BTC, have well-sampled projection distributions. However, a formal clarification would be useful to the reader.

      3. The authors should also include map-model FSCs to ascertain the quality of the map with respect to model building, as this is currently missing in the submission.

      Minor comments-<br /> 1. With respect to complex formation, is there a reason why HER2 expression is dramatically lower than HER4?

      2. Figures S1e authors should clarify if HER2 substitutions are VR alone or do these include GD substitutions as well. These should be suitably clarified in the main text.

      3. The validation reports for all 4 reported structures suggest the user-provided FSC-derived resolutions are different from those calculated by the deposition server. Are the masks deposited significantly different compared to the ones generated within cryoSPARC?

      4. For interpretation regarding activation through phosphorylation in Figure 2e, have the authors considered HER4 could homodimerize as well? It appears from the data presented in Figure 4 and S12 that the propensity to form homodimers is greater for HER4 than to heterodimerize with HER2, despite the VR/IQ substitutions. This also appears to be supported by the reasonable amount of signal for pERK in lanes with HER4-IQ alone in the presence of NRG1b. It is recommended that the authors comment on this possibility.

      5. In the following line, "NRG1b-induced phosphorylation of HER2, HER4, ERK and AKT was not notably affected by substitution of the HER4 dimerization arm to a GS-arm relative to wild type receptors", it is unclear what the authors mean by wild-type receptors? There is presently no wild-type HER2 and/or HER4 tested in this blot.

      6. Considering the asparagine residues can potentially mediate stabilization of HER2-HER4 dimers through glycosylation, the authors should include western blot data for receptor-activation for mutants where glycosylation can be disrupted. This could minimally instruct the reader on how functionally relevant the identified interactions like N576-N358 are.

    1. Reviewer #1 (Public Review):

      Summary:<br /> This work is an extension of the authors' earlier work published in Sci Adv in 2001, wherein the authors showed that DTD2 deacylates N-ethyl-D-aminoacyl-tRNAs arising from acetaldehyde toxicity. The authors in this study, investigate the role of archaeal/plant DTD2 in the deacylation/detoxification of D-Tyr-tRNATyr modified by multiple other aldehydes and methylglyoxal (produced by plants). Importantly, the authors take their biochemical observations to plants, to show that deletion of DTD2 gene from a model plant (Arabidopsis thaliana) makes them sensitive to the aldehyde supplementation in the media especially in the presence of D-Tyr. These conclusions are further supported by the observation that the model plant shows increased tolerance to the aldehyde stress when DTD2 is overproduced from the CaMV 35S promoter. The authors propose a model for the role of DTD2 in the evolution of land plants. Finally, the authors suggest that the transgenic crops carrying DTD2 may offer a strategy for stress-tolerant crop development. Overall, the authors present a convincing story, and the data are supportive of the central theme of the story.

      Strengths:<br /> Data are novel and they provide a new perspective on the role of DTD2, and propose possible use of the DTD2 lines in crop improvement.

      Weaknesses:<br /> (a) Data obtained from a single aminoacyl-tRNA (D-Tyr-tRNATyr) have been generalized to imply that what is relevant to this model substrate is true for all other D-aa-tRNAs (term modified aa-tRNAs has been used synonymously with the modified Tyr-tRNATyr). This is not a risk-free extrapolation. For example, the authors see that DTD2 removes modified D-Tyr from tRNATyr in a chain-length dependent manner of the modifier. Why do the authors believe that the length of the amino acid side chain will not matter in the activity of DTD2?<br /> (b) While the use of EFTu supports that the ternary complex formation by the elongation factor can resist modifications of L-Tyr-tRNATyr by the aldehydes or other agents, in the context of the present work on the role of DTD2 in plants, one would want to see the data using eEF1alpha. This is particularly relevant because there are likely to be differences in the way EFTu and eEF1alpha may protect aminoacyl-tRNAs (for example see description in the latter half of the article by Wolfson and Knight 2005, FEBS Letters 579, 3467-3472).

    2. Reviewer #2 (Public Review):

      In bacteria and mammals, metabolically generated aldehydes become toxic at high concentrations because they irreversibly modify the free amino group of various essential biological macromolecules. However, these aldehydes can be present in extremely high amounts in archaea and plants without causing major toxic side effects. This fact suggests that archaea and plants have evolved specialized mechanisms to prevent the harmful effects of aldehyde accumulation.

      In this study, the authors show that the plant enzyme DTD2, originating from archaea, functions as a D-aminoacyl-tRNA deacylase. This enzyme effectively removes stable D-aminoacyl adducts from tRNAs, enabling these molecules to be recycled for translation. Furthermore, they demonstrate that DTD2 serves as a broad detoxifier for various aldehydes in vivo, extending its function beyond acetaldehyde, as previously believed. Notably, the absence of DTD2 makes plants more susceptible to reactive aldehydes, while its overexpression offers protection against them. These findings underscore the physiological significance of this enzyme.

    1. Reviewer #1 (Public Review):

      The authors aimed to investigate if 2-hydroxybutyrate (2HB), a metabolite induced by exercise, influences physiological changes, particularly metabolic alterations post-exercise training. They treated young mice and cultured myoblasts with 2HB, conducted exercise tests, metabolomic profiling, gene expression analysis, and knockdown experiments to understand 2HB's mechanisms. Their findings indicate that 2HB enhances exercise tolerance, boosts branch chain amino acid (BCAA) enzyme gene expression in skeletal muscles, and increases oxidative capacity. They also highlight the role of SIRT4 in these effects. This study establishes 2HB, once considered a waste product, as a regulator of exercise-induced metabolic processes. The study's strength lies in its consistent results across in vitro, in vivo, and ex vivo analyses. The authors propose a mechanism in which 2HB inhibits BCAA breakdown, raises NAD+/NADH ratio, activates SIRT4, increases ADP ribosylation, and controls gene expression.

      However, some questions remain unclear based on these findings:

      This study focused on the effects of short-term exercise (1 or 5 bouts of treadmill running) and short-term 2HB treatment (1 or 4 days of treatment). Adaptations to exercise training typically occur progressively over an extended period. It's important to investigate the effects of long-term 2HB treatment and whether extended combined 2HB treatment and exercise training have independent, synergistic, or antagonistic effects.

      Exercise training leads to significant mitochondrial changes, including increased mitochondrial biogenesis in skeletal muscle. It would be valuable to compare the impact of 2HB treatment on mitochondrial content and oxidative capacity in treated mice to that in exercised mice.

      The authors demonstrate that 2-ketobutyrate (2KB) can serve as an oxidative fuel, suggesting a role for the intact BCAA catabolic pathway. However, it's puzzling that the knockout of BCKDHA, a subunit crucial for the second step of BCAA catabolism, did not result in changes in oxidative capacity in cultured myoblasts.

      Nevertheless, this innovative model of metabolic signaling during exercise will serve as a valuable reference for informing future.

    2. Reviewer #2 (Public Review):

      Summary:<br /> The manuscript entitled "A 2-HB-mediated feedback loop regulates muscular fatigue" by the Johnson group reports interesting findings with implications for the health benefits of exercise. The authors use a combination of metabolic/biochemical in vivo and in vitro assays to delineate a metabolic route triggered by 2-HB (a relatively stable metabolite induced by exercise in humans and mice) that controls branched-chain amino transferase enzymes and mitochondrial oxidative capacity. Mechanistically, the author shows that 2-HB is a direct inhibitor of BCAT enzymes that in turn control levels of SIRT4 activity and ADp-ribosylation in the nucleus targeting C/EBP transcription factor, affecting BCAA oxidation genes (see Fig 4i in the paper). Overall, these are interesting and novel observations and findings with relevance to human exercise, with the potential implication of using these metabolites to mimic exercise benefits, or conditions or muscular fatigue that occurs in different human chronic diseases including rheumatic diseases or long COVID.

      Weaknesses:<br /> There are several experiments/comments that will strengthen the manuscript-

      1- A final model in Figure 6 integrating the exercise/mechanistic findings, expanding on Fig 4i) will clarify the findings.

      2- In some of the graphs, statistics are missing (e.g Fig 6G).

      3- The conclusions on SIRT4 dependency should be carefully written, as it is likely that this is only one potential mechanism, further validation with mouse models would be necessary.

      4- One of the needed experiments to support the oxidative capacity effects that could be done in cultured cells, is the use of radiosotope metabolites including BCCAs to determine the ability to produce CO2. Alternatively or in combination metabolite flux using isotopes would be useful to strengthen the current results.

    1. Reviewer #1 (Public Review):

      Summary:<br /> The OSCA/TMEM63 channels have recently been identified as mechanosensitive channels. In a previous study, the authors found that OSCA subtypes (1, 2, and 3) respond differently to stretch and poke stimuli. For example, OSCA1.2 is activated by both poke and stretch, while OSCA3.1, responds strongly to stretch but poorly to poke stimuli. In this study, the authors use cryo-EM, mutagenesis, and electrophysiology to dissect the mechanistic determinants that underlie the channels' ability to respond to poke and stretch stimuli.

      The starting hypothesis of the study is that the mechanical activation of OSCA channels relies on the interactions between the protein and the lipid bilayer and that the differential responses to poke and stretch might stem from variations in the lipid-interacting regions of OSCA proteins. The authors specifically identify the amphipathic helix (AH), the fenestration, and the Beam Like Domain (BLD) as elements that might play a role in mechanosensing.

      The strength of this paper lies in the technically sound data - the structural work and electrophysiology are both very well done. For example, the authors produce a high-resolution OSCA3.1 structure which will be a useful tool for many future studies. Also, the study identifies several interesting mutants that seemingly uncouple the OSCA1.2 poke and stretch responses. These might be valuable in future studies of OSCA mechanosensation.

      However, the experimental approach employed by the authors to dissect the molecular mechanisms of poke and stretch falls short of enabling meaningful mechanistic conclusions. For example, we are left with several unanswered questions surrounding the role of AH and the fenestration lipids in mechanosensation: Is the AH really important for the poke response if mutating residues conserved between OSCA1.2 and OSCA3.1 disrupts the OSCA1.2 ability to respond to poke but mutating the OSCA1.2 AH to resemble that of OSCA3.1 results in no change to its "pokability"? Similar questions arise in response to the study of the fenestration-lining residues.

    2. Reviewer #2 (Public Review):

      Summary:<br /> Jojoa-Cruz et al. determined a high-resolution cryo-EM structure in the Arabidopsis thaliana (At) OSCA3.1 channel. Based on a structural comparison between OSCA3.1 and OSCA1.2 and the difference between these two paralogs in their mechanosensitivity to poking and membrane stretch, the authors performed structural-guided mutagenesis and tested the roles of three structural domains, including an amphipathic helix, a beam-like domain, and a lipid fenestration site at the pore domain, for mechanosensation of OSCA channels.

      Strengths:<br /> The authors successfully determined a structure of the AtOSCA3.1 channel reconstituted in lipid nanodiscs by cryo-EM to a high resolution of 2.6 Å. The high-resolution EM map enabled the authors to observe putative lipid EM densities at various sites where lipid molecules are associated with the channel. Overall, the structural data provides the information for comparison with other OSCA paralogs.

      In addition, the authors identified OSCA1.2 mutants that exhibit differential responses to mechanical stimulation by poking and membrane stretch (i.e., impaired response to poke assay but intact response to membrane stretch). This interesting behavior will be useful for further study on differentiating the mechanisms of OSCA activation by distinct mechanical stimuli.

      Major weakness:<br /> 1. The major weaknesses of this study are the mutagenesis design and the functional characterization of the three structural domains - an amphipathic helix (AH), a beam-like domain (BLD), and the fenestration site at the pore, in OSCA mechanosensation.

      1) First of all, it is confusing to the reviewer, whether the authors set out to test these structural domains as a direct sensor(s) of mechanical stimuli or as a coupling domain(s) for downstream channel opening and closing (gating). The data interpretations are vague in this regard as the authors tend to interpret the effects of mutations on the channel 'sensitivity' to different mechanical stimuli (poking or membrane stretch). The authors ought to dissect the molecular bases of sensing mechanical force and opening/closing (gating) the channel pore domain for the structural elements that they want to study.

      Furthermore, the authors relied on the functional discrepancies between OSCA1.2 (sensitive to both membrane poking and stretch) and OSCA3.1 (little or weak sensitivity to poking but sensitive to membrane stretch). But the experimental data presented in the study are not clear to address the mechanisms of channel activation by poking vs. by stretch, and why the channels behave differently.

      2) The reviewer questions if the "apparent threshold" of poke-induced membrane displacement and the threshold of membrane stretch are good measures of the change in the channel sensitivity to the different mechanical stimuli.

      3) Overall, the mutagenesis design in the various structural domains lacks logical coherence and the interpretation of the functional data is not sufficient to support the authors' hypothesis. Essentially the authors mutated several residues on the hotspot domains, observed some effects on the channel response to poking and membrane stretch, then interpreted the mutated residues/regions are critical for OSCA mechanosensation. Examples are as follows.

      In the section "Mutation of key residues in the amphipathic helix", the authors mutated W75 and L80, which are located on the N- and C-terminal of the AH in OSCA1.2, and mutated Pro in the OSCA1.2 AH to Arg at the equivalent position in OSCA3.1 AH. W75 and L80 are conserved between OSCA 1.2 and OSCA3.1. Mutations of W75 and/or L80 impaired OSCA1.2 activation by poking, but not by membrane stretch. In comparison, the wildtype OSCA3.1 which contains W and L at the equivalent position of its AH exhibits little or weak response to poking. The loss of response to poking in the OSCA1.2 W/L mutants does not indicate their roles in poking-induced activation.

      Besides, the P2R mutation on OSCA1.2 AH showed no effect on the channel activation by poking, suggesting Arg in OSCA3.1 AH is not responsible for its weak response to poking. Together the mutagenesis of W75, L80, and P2R on OSCA1.2 AH does not support the hypothesis of the role of AH involved in OSCA mechanosensation.

      In the section "Replacing the OSCA3.1 BLD in OSCA1.2", the authors replaced the BLD in OSCA 1.2 with that from OSCA3.1, and only observed slightly stronger displacement by poking stimuli. The authors still suggest that BLD "appears to play a role" in the channel sensitivity to poke despite the evidence not being strong.

      OSCA1.2 has four Lys residues in TM4 and TM6b at the pore fenestration site, which were shown to interact with the lipid phosphate head group, whereas two of the equivalent residues in OSCA3.1 are Ile. In the section "Substitution of potential lipid-interacting lysine residues", the authors made K435I/K536I double mutant for OSCA1.2 to mimic OSCA3.1 and observed poor response to poking but an intact response to stretch. Did the authors mutate the Ile residues in OSCA3.1 to Lys, and did the mutation confer channel sensitivity to poking stimuli resembling OSCA1.2? The reviewer thinks it is necessary to perform such an experiment, to thoroughly suggest the importance of the four Lys residues in lipid interaction for channel mechanoactivation.

    3. Reviewer #3 (Public Review):

      Summary:<br /> Jojoa-Cruz et al provide a new structure of At-OSCA3.1. The structure of OSCA 3.1 is similar to previous OSCA cryo-em structures of both OSCA3.1 and other homologues validating the new structure. Using the novel structure of OSCA3.1 as a guide they created several point mutations to investigate two different mechanosensitive modalities: poking and stretching. To investigate the ability of OSCA channels to gate in response to poking they created point mutations in OSCA1.2 to reduce sensitivity to poking based on the differences between the OSCA1.2 and 3.1 structures. Their results suggest that two separate regions are responsible for gating in response to poking and stretching.

      Strengths:<br /> Through a detailed structure-based analysis, the authors identified structural differences between OSCA3.1 and OSCA1.2. These subtle structural changes identify regions in the amphipathic helix and near the pore that are essential for the gating of OSCA1.2 in response to poking and stretching. The use of point mutations to understand how these regions are involved in mechanosensation clearly shows the role of these residues in mechanosensation.

      Weaknesses:<br /> In general, the point mutations selected all show significant alterations to the inherent mechanosensitive regions. This often suggests that any mutation would disrupt the function of the region, additional mutations that are similar in function to the WT channel would support the claims in the manuscript. Mutations in the amphipathic helix at W75 and L80 show reduced gating in response to poking stimuli. The gating observed occurs at poking depths similar to cellular rupture, the similarity in depths suggests that these mutations could be a complete loss of function. For example, a mutation to L80I or L80Q would show that the addition of the negative charge is responsible for this disruption not just a change in the steric space of the residue in an essential region.

    1. Reviewer #1 (Public Review):

      This manuscript by Tan et al is using cryo-electron tomography to investigate the structure of yeast nucleosomes both ex vivo (nuclear lysates) and in situ (lamellae and cryosections). The sheer number of experiments and results are astounding and comparable with an entire PhD thesis. However, as is always the case, it is hard to prove that something is not there. In this case, canonical nucleosomes. In their path to find the nucleosomes, the authors also stumble over new insights into nucleosome arrangement that indicates that the positions of the histones is more flexible than previously believed.

      Major strengths and weaknesses:

      Personally, I am not ready to agree with their conclusion that heterogenous non-canonical nucleosomes predominate in yeast cells, but this reviewer is not an expert in the field of nucleosomes and can't judge how well these results fit into previous results in the field. As a technological expert though, I think the authors have done everything possible to test that hypothesis with today's available methods. One can debate whether it is necessary to have 35 supplementary figures, but after working through them all, I see that the nature of the argument needs all that support, precisely because it is so hard to show what is not there. The massive amount of work that has gone into this manuscript and the state-of-the art nature of the technology should be warmly commended. I also think the authors have done a really great job with including all their results to the benefit of the scientific community. Yet, I am left with some questions and comments:

      Could the nucleosomes change into other shapes that were predetermined in situ? Could the authors expand on if there was a structure or two that was more common than the others of the classes they found? Or would this not have been found because of the template matching and later reference particle used?

      Could it simply be that the yeast nucleoplasm is differently structured than that of HeLa cells and it was harder to find nucleosomes by template matching in these cells? The authors argue against crowding in the discussion, but maybe it is just a nucleoplasm texture that side-tracks the programs?

      The title of the paper is not well reflected in the main figures. The title of Figure 2 says "Canonical nucleosomes are rare in wild-type cells", but that is not shown/quantified in that figure. Rare is comparison to what? I suggest adding a comparative view from the HeLa cells, like the text does in lines 195-199. A measure of nucleosomes detected per volume nucleoplasm would also facilitate a comparison.

      If the cell contains mostly non-canonical nucleosomes, are they really non-canonical? Maybe a change of language is required once this is somewhat sure (say, after line 303).

      The authors could explain more why they sometimes use conventional the 2D followed by 3D classification approach and sometimes "direct 3-D classification". Why, for example, do they do 2D followed by 3D in Figure S5A? This Figure could be considered a regular figure since it shows the main message of the paper.

      Figure 1: Why is there a gap in the middle of the nucleosome in panel B? The authors write that this is a higher resolution structure (18Å), but in the even higher resolution crystallography structure (3Å resolution), there is no gap in the middle.

    2. Reviewer #2 (Public Review):

      Nucleosome structures inside cells remain unclear. Tan et al. tackled this problem using cryo-ET and 3-D classification analysis of yeast cells. The authors found that the fraction of canonical nucleosomes in the cell could be less than 10% of total nucleosomes. The finding is consistent with the unstable property of yeast nucleosomes and the high proportion of the actively transcribed yeast genome. The authors made an important point in understanding chromatin structure in situ. Overall, the paper is well-written and informative to the chromatin/chromosome field.

    3. Reviewer #3 (Public Review):

      Several labs in the 1970s published fundamental work revealing that almost all eukaryotes organize their DNA into repeating units called nucleosomes, which form the chromatin fiber. Decades of elegant biochemical and structural work indicated a primarily octameric organization of the nucleosome with 2 copies of each histone H2A, H2B, H3 and H4, wrapping 147bp of DNA in a left handed toroid, to which linker histone would bind.

      This was true for most species studied (except, yeast lack linker histone) and was recapitulated in stunning detail by in vitro reconstitutions by salt dialysis or chaperone-mediated assembly of nucleosomes. Thus, these landmark studies set the stage for an exploding number of papers on the topic of chromatin in the past 45 years.

      An emerging counterpoint to the prevailing idea of static particles is that nucleosomes are much more dynamic and can undergo spontaneous transformation. Such dynamics could arise from intrinsic instability due to DNA structural deformation, specific histone variants or their mutations, post-translational histone modifications which weaken the main contacts, protein partners, and predominantly, from active processes like ATP-dependent chromatin remodeling, transcription, repair and replication.

      This paper is important because it tests this idea whole-scale, applying novel cryo-EM tomography tools to examine the state of chromatin in yeast lysates or cryo-sections. The experimental work is meticulously performed, with vast amount of data collected. The main findings are interpreted by the authors to suggest that majority of yeast nucleosomes lack a stable octameric conformation. The findings are not surprising in that alternative conformations of nucleosomes might exist in vivo, but rather in the sheer scale of such particles reported, relative to the traditional form expected from decades of biochemical, biophysical and structural data. Thus, it is likely that this work will be perceived as controversial. Nonetheless, we believe these kinds of tools represent an important advance for in situ analysis of chromatin. We also think the field should have the opportunity to carefully evaluate the data and assess whether the claims are supported, or consider what additional experiments could be done to further test the conceptual claims made. It is our hope that such work will spark thought-provoking debate in a collegial fashion, and lead to the development of exciting new tools which can interrogate native chromatin shape in vivo. Most importantly, it will be critical to assess biological implications associated with more dynamic - or static forms- of nucleosomes, the associated chromatin fiber, and its three-dimensional organization, for nuclear or mitotic function.

    4. Reviewer #1 (Public Review):

      This manuscript by Tan et al is using cryo-electron tomography to investigate the structure of yeast nucleosomes both ex vivo (nuclear lysates) and in situ (lamellae and cryosections). The sheer number of experiments and results are astounding and comparable with an entire PhD thesis. However, as is always the case, it is hard to prove that something is not there. In this case, canonical nucleosomes. In their path to find the nucleosomes, the authors also stumble over new insights into nucleosome arrangement that indicates that the positions of the histones is more flexible than previously believed.

      Major strengths and weaknesses:

      Personally, I am not ready to agree with their conclusion that heterogenous non-canonical nucleosomes predominate in yeast cells, but this reviewer is not an expert in the field of nucleosomes and can't judge how well these results fit into previous results in the field. As a technological expert though, I think the authors have done everything possible to test that hypothesis with today's available methods. One can debate whether it is necessary to have 35 supplementary figures, but after working through them all, I see that the nature of the argument needs all that support, precisely because it is so hard to show what is not there. The massive amount of work that has gone into this manuscript and the state-of-the art nature of the technology should be warmly commended. I also think the authors have done a really great job with including all their results to the benefit of the scientific community. Yet, I am left with some questions and comments:

      Could the nucleosomes change into other shapes that were predetermined in situ? Could the authors expand on if there was a structure or two that was more common than the others of the classes they found? Or would this not have been found because of the template matching and later reference particle used?

      Could it simply be that the yeast nucleoplasm is differently structured than that of HeLa cells and it was harder to find nucleosomes by template matching in these cells? The authors argue against crowding in the discussion, but maybe it is just a nucleoplasm texture that side-tracks the programs?

      The title of the paper is not well reflected in the main figures. The title of Figure 2 says "Canonical nucleosomes are rare in wild-type cells", but that is not shown/quantified in that figure. Rare is comparison to what? I suggest adding a comparative view from the HeLa cells, like the text does in lines 195-199. A measure of nucleosomes detected per volume nucleoplasm would also facilitate a comparison.

      If the cell contains mostly non-canonical nucleosomes, are they really non-canonical? Maybe a change of language is required once this is somewhat sure (say, after line 303).

      The authors could explain more why they sometimes use conventional the 2D followed by 3D classification approach and sometimes "direct 3-D classification". Why, for example, do they do 2D followed by 3D in Figure S5A? This Figure could be considered a regular figure since it shows the main message of the paper.

      Figure 1: Why is there a gap in the middle of the nucleosome in panel B? The authors write that this is a higher resolution structure (18Å), but in the even higher resolution crystallography structure (3Å resolution), there is no gap in the middle.

    5. Reviewer #2 (Public Review):

      Nucleosome structures inside cells remain unclear. Tan et al. tackled this problem using cryo-ET and 3-D classification analysis of yeast cells. The authors found that the fraction of canonical nucleosomes in the cell could be less than 10% of total nucleosomes. The finding is consistent with the unstable property of yeast nucleosomes and the high proportion of the actively transcribed yeast genome. The authors made an important point in understanding chromatin structure in situ. Overall, the paper is well-written and informative to the chromatin/chromosome field.

    6. Reviewer #3 (Public Review):

      Several labs in the 1970s published fundamental work revealing that almost all eukaryotes organize their DNA into repeating units called nucleosomes, which form the chromatin fiber. Decades of elegant biochemical and structural work indicated a primarily octameric organization of the nucleosome with 2 copies of each histone H2A, H2B, H3 and H4, wrapping 147bp of DNA in a left handed toroid, to which linker histone would bind.

      This was true for most species studied (except, yeast lack linker histone) and was recapitulated in stunning detail by in vitro reconstitutions by salt dialysis or chaperone-mediated assembly of nucleosomes. Thus, these landmark studies set the stage for an exploding number of papers on the topic of chromatin in the past 45 years.

      An emerging counterpoint to the prevailing idea of static particles is that nucleosomes are much more dynamic and can undergo spontaneous transformation. Such dynamics could arise from intrinsic instability due to DNA structural deformation, specific histone variants or their mutations, post-translational histone modifications which weaken the main contacts, protein partners, and predominantly, from active processes like ATP-dependent chromatin remodeling, transcription, repair and replication.

      This paper is important because it tests this idea whole-scale, applying novel cryo-EM tomography tools to examine the state of chromatin in yeast lysates or cryo-sections. The experimental work is meticulously performed, with vast amount of data collected. The main findings are interpreted by the authors to suggest that majority of yeast nucleosomes lack a stable octameric conformation. The findings are not surprising in that alternative conformations of nucleosomes might exist in vivo, but rather in the sheer scale of such particles reported, relative to the traditional form expected from decades of biochemical, biophysical and structural data. Thus, it is likely that this work will be perceived as controversial. Nonetheless, we believe these kinds of tools represent an important advance for in situ analysis of chromatin. We also think the field should have the opportunity to carefully evaluate the data and assess whether the claims are supported, or consider what additional experiments could be done to further test the conceptual claims made. It is our hope that such work will spark thought-provoking debate in a collegial fashion, and lead to the development of exciting new tools which can interrogate native chromatin shape in vivo. Most importantly, it will be critical to assess biological implications associated with more dynamic - or static forms- of nucleosomes, the associated chromatin fiber, and its three-dimensional organization, for nuclear or mitotic function.

    1. Reviewer #1 (Public Review):

      The goal of this study was to determine whether short (1 month) internships for biomedical science trainees (mostly graduate students but some post-docs) were beneficial for the trainees, their mentors, and internship hosts. Over a 5 year period, the outcomes of trainees who completed internships were compared with peers who did not. Both quantitative results in terms of survey responses and qualitative results obtained from discussion groups were provided. Overall, the data suggest that internships aid graduate students in multiple ways and do not harm progress on dissertation projects. 'Buy-in' from mentors and prospective mentors appeared to increase over time, and hosts also gained from the contributions of the interns even in a short time period. While the program also appeared valuable for post-doctoral trainees, it was less favorably considered by post-doc mentors.

      Strengths:

      The internship program that was examined here appears to have been very well designed in terms of availability to students, range of internship offerings, length of time away from PhD lab, and assessments.<br /> Having a built-in peer control group of graduate students who did not do internships was valuable for much of the quantitative analyses. However, as the authors acknowledge, those who did opt for internships are a self-selected group who may have character traits that would help them overcome the potential negative impacts of the internship.<br /> The quantitative data is convincing and addresses important considerations for all stakeholders.<br /> The manuscript is well-constructed to individually address the impact of the program on each set of stakeholders, while also showcasing areas of mutual benefit.<br /> The discussion of challenges and limitations, from the perspectives of participating stakeholders, program leaders, and also institutions, is comprehensive and very thoughtful.

      Weaknesses:

      The qualitative data that resulted from the 'focus groups' of faculty mentors was somewhat difficult to evaluate given the very limited number of participants (n=7).

      Overall, the data support the authors' conclusions with respect to the utility of internship programs for all stakeholders. As the authors note, the data relate to a specific program where internship length was defined, costs were covered by a grant or institutional funding, and there were multiple off-site internship hosts available. Thus, the results here may not replicate for other programs with different criteria.

      This work provides a valuable assessment of how relatively short internships can impact graduate students, both in terms of their graduate tenure and in their decision-making for careers post-graduation. As more graduate programs are heeding calls from funding agencies and professional societies to increase knowledge about, and familiarity with, multiple career paths beyond academia for PhD students, there is a need to evaluate the best ways to accomplish that goal. Hands-on internships are valuable across many spheres so it makes sense that they would be for life science graduates too. However, the fear that time-to-degree and/or productivity would be negatively impacted is important to acknowledge. By providing clear data that this is not the case, these investigators have increased the likelihood that internships could be considered by more institutions. The one big drawback, and one that the authors discuss at some length, is the funding model that could enable internship programs to be used more widely.

    2. Reviewer #2 (Public Review):

      Summary:

      The authors describe five-year outcomes of an internship program for graduate students and postdoctoral fellows at their institution spurred by pilot funding from an NIH BEST grant. They hypothesized that such a program would be beneficial to interns, internship hosts, and research advisors. The mixed methods study used surveys and focus groups to gather qualitative and quantitative data from the stakeholder groups, and the authors acknowledge the limitation that the study subjects were self-selected and also had research advisors who agreed to allow them to participate. Thus the generally favorable outcomes may not be applicable to students such as those who are struggling in the lab and/or lack career focus or supportive research advisors. Nonetheless, the overall findings support the hypothesis and also suggest additional benefits, including in some cases positive impact for the lab, improved communication between the intern and their research advisor, and an advantage for recruitment of students to the institution. The data refute one of the principal concerns of research advisors: that by taking students out of the lab, internships reduce individual and overall lab productivity. Students who did internships were significantly less likely to pursue postdoctoral fellowships before entering the biomedical workforce and were more likely to have science-related careers versus research careers than control students who did not do internships, although the study design cannot determine whether this was due to selection bias or to the internship.

      Strengths:

      1. The sample size is good (123 internships).

      2. The internship program is well described. Outcomes are clearly defined.

      3. Methods and statistical analyses appear to be appropriate (although I am not an expert in mixed methods).

      4. "Take-home" lessons for institutions considering implementing internship programs are clearly stated.

      Weaknesses:

      1. It is possible that interns, hosts, and research advisers with positive experiences were more likely to respond to surveys than those with negative experiences. The response rate and potential bias in responses should be discussed in the Results, not just given in a table legend in Methods.

      2. With regard to the biased selection of participants, do the authors know many subjects requested but were not permitted to do internships?

      3. While the authors mention internships in professional degree programs in fields such as law and business, some mention of internship practices in non-biomedical STEM PhD programs such as engineering or computer science would be helpful. Is biomedical science rediscovering lessons learned when it comes to internships?

      4. Figure 1 k, l - internships did not appear to change career goals, but are the 76% who agreed pre-internship the same individuals as the 75% who agreed post-internship? What percentage gave discordant responses?

      Appraisal:

      Overall the authors achieve their aims of describing outcomes of an internship program for graduate career development and offering lessons learned for other institutions seeking to create their own internship programs.

      Impact:

      The paper will be very useful for other institutions to dispel some of the concerns of research advisers about internships for PhD students (although not necessarily for postdoctoral fellows). In the long run, wider adoption of internships as part of PhD training will depend not only on faculty buy-in but also on the availability of resources and changes to the graduate school funding model so that such programs are not viewed as another "unfunded mandate" in graduate education. Perhaps the industry will be motivated to support internships by the positive outcomes for hosts reported in this paper. Additionally, NIH could allow a certain amount of F, T, or even RPG funds to be used to support internships for purposes of career development.